gem5 v21.0.1.0
#include <shader.hh>
Classes | |
| struct | ShaderStats |
Public Types | |
| enum | hsail_mode_e { SIMT, VECTOR_SCALAR } |
| typedef ShaderParams | Params |
Public Types inherited from ClockedObject | |
| using | Params = ClockedObjectParams |
| Parameters of ClockedObject. | |
Public Types inherited from SimObject | |
| typedef SimObjectParams | Params |
Public Member Functions | |
| GPUDispatcher & | dispatcher () |
| void | sampleLoad (const Tick accessTime) |
| void | sampleStore (const Tick accessTime) |
| void | sampleInstRoundTrip (std::vector< Tick > roundTripTime) |
| void | sampleLineRoundTrip (const std::map< Addr, std::vector< Tick >> &roundTripTime) |
| const ApertureRegister & | gpuVmApe () const |
| const ApertureRegister & | ldsApe () const |
| const ApertureRegister & | scratchApe () const |
| bool | isGpuVmApe (Addr addr) const |
| bool | isLdsApe (Addr addr) const |
| bool | isScratchApe (Addr addr) const |
| Addr | getScratchBase () |
| Addr | getHiddenPrivateBase () |
| void | initShHiddenPrivateBase (Addr queueBase, uint32_t offset) |
| Shader (const Params &p) | |
| ~Shader () | |
| virtual void | init () |
| init() is called after all C++ SimObjects have been created and all ports are connected. | |
| void | execScheduledAdds () |
| void | ScheduleAdd (int *val, Tick when, int x) |
| bool | processTimingPacket (PacketPtr pkt) |
| void | AccessMem (uint64_t address, void *ptr, uint32_t size, int cu_id, MemCmd cmd, bool suppress_func_errors) |
| void | ReadMem (uint64_t address, void *ptr, uint32_t sz, int cu_id) |
| void | ReadMem (uint64_t address, void *ptr, uint32_t sz, int cu_id, bool suppress_func_errors) |
| void | WriteMem (uint64_t address, void *ptr, uint32_t sz, int cu_id) |
| void | WriteMem (uint64_t address, void *ptr, uint32_t sz, int cu_id, bool suppress_func_errors) |
| void | doFunctionalAccess (const RequestPtr &req, MemCmd cmd, void *data, bool suppress_func_errors, int cu_id) |
| void | registerCU (int cu_id, ComputeUnit *compute_unit) |
| void | prepareInvalidate (HSAQueueEntry *task) |
| void | prepareFlush (GPUDynInstPtr gpuDynInst) |
| dispatcher/shader arranges flush requests to the CUs | |
| bool | dispatchWorkgroups (HSAQueueEntry *task) |
| Addr | mmap (int length) |
| void | functionalTLBAccess (PacketPtr pkt, int cu_id, BaseTLB::Mode mode) |
| void | updateContext (int cid) |
| void | notifyCuSleep () |
| void | incVectorInstSrcOperand (int num_operands) |
| void | incVectorInstDstOperand (int num_operands) |
Public Member Functions inherited from ClockedObject | |
| ClockedObject (const ClockedObjectParams &p) | |
| void | serialize (CheckpointOut &cp) const override |
| Serialize an object. | |
| void | unserialize (CheckpointIn &cp) override |
| Unserialize an object. | |
Public Member Functions inherited from SimObject | |
| const Params & | params () const |
| SimObject (const Params &p) | |
| virtual | ~SimObject () |
| virtual const std::string | name () const |
| virtual void | loadState (CheckpointIn &cp) |
| loadState() is called on each SimObject when restoring from a checkpoint. | |
| virtual void | initState () |
| initState() is called on each SimObject when not restoring from a checkpoint. | |
| virtual void | regProbePoints () |
| Register probe points for this object. | |
| virtual void | regProbeListeners () |
| Register probe listeners for this object. | |
| ProbeManager * | getProbeManager () |
| Get the probe manager for this object. | |
| virtual Port & | getPort (const std::string &if_name, PortID idx=InvalidPortID) |
| Get a port with a given name and index. | |
| virtual void | startup () |
| startup() is the final initialization call before simulation. | |
| DrainState | drain () override |
| Provide a default implementation of the drain interface for objects that don't need draining. | |
| virtual void | memWriteback () |
| Write back dirty buffers to memory using functional writes. | |
| virtual void | memInvalidate () |
| Invalidate the contents of memory buffers. | |
| void | serialize (CheckpointOut &cp) const override |
| Serialize an object. | |
| void | unserialize (CheckpointIn &cp) override |
| Unserialize an object. | |
Public Member Functions inherited from EventManager | |
| EventQueue * | eventQueue () const |
| void | schedule (Event &event, Tick when) |
| void | deschedule (Event &event) |
| void | reschedule (Event &event, Tick when, bool always=false) |
| void | schedule (Event *event, Tick when) |
| void | deschedule (Event *event) |
| void | reschedule (Event *event, Tick when, bool always=false) |
| void | wakeupEventQueue (Tick when=(Tick) -1) |
| This function is not needed by the usual gem5 event loop but may be necessary in derived EventQueues which host gem5 on other schedulers. | |
| void | setCurTick (Tick newVal) |
| EventManager (EventManager &em) | |
| Event manager manages events in the event queue. | |
| EventManager (EventManager *em) | |
| EventManager (EventQueue *eq) | |
Public Member Functions inherited from Serializable | |
| Serializable () | |
| virtual | ~Serializable () |
| void | serializeSection (CheckpointOut &cp, const char *name) const |
| Serialize an object into a new section. | |
| void | serializeSection (CheckpointOut &cp, const std::string &name) const |
| void | unserializeSection (CheckpointIn &cp, const char *name) |
| Unserialize a child object. | |
| void | unserializeSection (CheckpointIn &cp, const std::string &name) |
Public Member Functions inherited from Drainable | |
| DrainState | drainState () const |
| Return the current drain state of an object. | |
| virtual void | notifyFork () |
| Notify a child process of a fork. | |
Public Member Functions inherited from Stats::Group | |
| Group (Group *parent, const char *name=nullptr) | |
| Construct a new statistics group. | |
| virtual | ~Group () |
| virtual void | regStats () |
| Callback to set stat parameters. | |
| virtual void | resetStats () |
| Callback to reset stats. | |
| virtual void | preDumpStats () |
| Callback before stats are dumped. | |
| void | addStat (Stats::Info *info) |
| Register a stat with this group. | |
| const std::map< std::string, Group * > & | getStatGroups () const |
| Get all child groups associated with this object. | |
| const std::vector< Info * > & | getStats () const |
| Get all stats associated with this object. | |
| void | addStatGroup (const char *name, Group *block) |
| Add a stat block as a child of this block. | |
| const Info * | resolveStat (std::string name) const |
| Resolve a stat by its name within this group. | |
| void | mergeStatGroup (Group *block) |
| Merge the contents (stats & children) of a block to this block. | |
| Group ()=delete | |
| Group (const Group &)=delete | |
| Group & | operator= (const Group &)=delete |
Public Member Functions inherited from Clocked | |
| void | updateClockPeriod () |
| Update the tick to the current tick. | |
| Tick | clockEdge (Cycles cycles=Cycles(0)) const |
| Determine the tick when a cycle begins, by default the current one, but the argument also enables the caller to determine a future cycle. | |
| Cycles | curCycle () const |
| Determine the current cycle, corresponding to a tick aligned to a clock edge. | |
| Tick | nextCycle () const |
| Based on the clock of the object, determine the start tick of the first cycle that is at least one cycle in the future. | |
| uint64_t | frequency () const |
| Tick | clockPeriod () const |
| double | voltage () const |
| Cycles | ticksToCycles (Tick t) const |
| Tick | cyclesToTicks (Cycles c) const |
Public Attributes | |
| SimpleThread * | cpuThread |
| ThreadContext * | gpuTc |
| BaseCPU * | cpuPointer |
| EventFunctionWrapper | tickEvent |
| bool | timingSim |
| hsail_mode_e | hsail_mode |
| int | impl_kern_launch_acq |
| int | impl_kern_end_rel |
| int | coissue_return |
| int | trace_vgpr_all |
| int | n_cu |
| int | n_wf |
| int | globalMemSize |
| int | nextSchedCu |
| uint32_t | sa_n |
| std::vector< int * > | sa_val |
| std::vector< uint64_t > | sa_when |
| std::vector< int32_t > | sa_x |
| std::vector< ComputeUnit * > | cuList |
| GPUCommandProcessor & | gpuCmdProc |
| GPUDispatcher & | _dispatcher |
| int64_t | max_valu_insts |
| int64_t | total_valu_insts |
Public Attributes inherited from ClockedObject | |
| PowerState * | powerState |
Protected Attributes | |
| Shader::ShaderStats | stats |
Protected Attributes inherited from SimObject | |
| const SimObjectParams & | _params |
| Cached copy of the object parameters. | |
Protected Attributes inherited from EventManager | |
| EventQueue * | eventq |
| A pointer to this object's event queue. | |
Private Attributes | |
| ApertureRegister | _gpuVmApe |
| ApertureRegister | _ldsApe |
| ApertureRegister | _scratchApe |
| Addr | shHiddenPrivateBaseVmid |
| int | _activeCus |
| Tick | _lastInactiveTick |
Additional Inherited Members | |
Static Public Member Functions inherited from SimObject | |
| static void | serializeAll (CheckpointOut &cp) |
| Serialize all SimObjects in the system. | |
| static SimObject * | find (const char *name) |
| Find the SimObject with the given name and return a pointer to it. | |
Static Public Member Functions inherited from Serializable | |
| static const std::string & | currentSection () |
| Gets the fully-qualified name of the active section. | |
| static void | serializeAll (const std::string &cpt_dir) |
| Serializes all the SimObjects. | |
| static void | unserializeGlobals (CheckpointIn &cp) |
Protected Member Functions inherited from Drainable | |
| Drainable () | |
| virtual | ~Drainable () |
| virtual void | drainResume () |
| Resume execution after a successful drain. | |
| void | signalDrainDone () const |
| Signal that an object is drained. | |
Protected Member Functions inherited from Clocked | |
| Clocked (ClockDomain &clk_domain) | |
| Create a clocked object and set the clock domain based on the parameters. | |
| Clocked (Clocked &)=delete | |
| Clocked & | operator= (Clocked &)=delete |
| virtual | ~Clocked () |
| Virtual destructor due to inheritance. | |
| void | resetClock () const |
| Reset the object's clock using the current global tick value. | |
| virtual void | clockPeriodUpdated () |
| A hook subclasses can implement so they can do any extra work that's needed when the clock rate is changed. | |
| typedef ShaderParams Shader::Params |
| enum Shader::hsail_mode_e |
| Shader::Shader | ( | const Params & | p | ) |
Definition at line 55 of file shader.cc.
References execScheduledAdds().
| Shader::~Shader | ( | ) |
| void Shader::AccessMem | ( | uint64_t | address, |
| void * | ptr, | ||
| uint32_t | size, | ||
| int | cu_id, | ||
| MemCmd | cmd, | ||
| bool | suppress_func_errors | ||
| ) |
Definition at line 375 of file shader.cc.
References cuList, doFunctionalAccess(), and ChunkGenerator::done().
Referenced by ReadMem(), and WriteMem().
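The reference to ChunkGenerator::done() suggests that AccessMem() walks the requested range in pieces and hands each piece to doFunctionalAccess(). The following is a minimal standalone sketch of that kind of splitting, not the gem5 implementation: the function name splitAccess, the pair-based return type, and the power-of-two chunk size are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical helper: split [addr, addr + size) into pieces that never
// cross a boundary of `chunk` bytes. `chunk` must be a non-zero power of two.
inline std::vector<std::pair<std::uint64_t, std::uint32_t>>
splitAccess(std::uint64_t addr, std::uint32_t size, std::uint32_t chunk)
{
    assert(chunk != 0 && (chunk & (chunk - 1)) == 0);
    std::vector<std::pair<std::uint64_t, std::uint32_t>> pieces;
    while (size > 0) {
        // Bytes left before the next chunk boundary.
        const std::uint32_t room =
            chunk - static_cast<std::uint32_t>(addr & (chunk - 1));
        const std::uint32_t len = std::min(size, room);
        pieces.emplace_back(addr, len);
        addr += len;
        size -= len;
    }
    return pieces;
}
```

Each returned (address, length) pair would correspond to one functional access in this sketch.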
| GPUDispatcher & Shader::dispatcher | ( | ) |
Definition at line 98 of file shader.cc.
References _dispatcher.
Referenced by Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), and ComputeUnit::DataPort::recvTimingResp().
| bool Shader::dispatchWorkgroups | ( | HSAQueueEntry * | task | ) |
Definition at line 234 of file shader.cc.
References _activeCus, _lastInactiveTick, cuList, curTick(), HSAQueueEntry::dispComplete(), DPRINTF, HSAQueueEntry::globalWgId(), HSAQueueEntry::markWgDispatch(), n_cu, nextSchedCu, panic_if, Event::scheduled(), and tickEvent.
Referenced by GPUDispatcher::exec().
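The references to cuList, n_cu and nextSchedCu suggest a round-robin walk over the compute units. The sketch below illustrates that idea in isolation and is not the gem5 code: dispatchRoundRobin, freeSlots and pending are hypothetical stand-ins for the per-CU resource checks and the HSAQueueEntry dispatch state.

```cpp
#include <cassert>
#include <vector>

// Hypothetical sketch: visit every CU once, starting at nextCu, and hand one
// pending workgroup to each CU that still has a free slot. Returns true if
// anything was dispatched. nextCu is advanced so that the following call
// resumes where this one stopped.
inline bool
dispatchRoundRobin(std::vector<int> &freeSlots, int &nextCu, int &pending)
{
    const int nCu = static_cast<int>(freeSlots.size());
    assert(nCu > 0 && nextCu >= 0 && nextCu < nCu);
    bool dispatched = false;
    for (int visited = 0; visited < nCu; ++visited) {
        if (pending > 0 && freeSlots[nextCu] > 0) {
            --freeSlots[nextCu];
            --pending;
            dispatched = true;
        }
        nextCu = (nextCu + 1) % nCu;
    }
    return dispatched;
}
```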
| void Shader::doFunctionalAccess | ( | const RequestPtr & | req, |
| MemCmd | cmd, | ||
| void * | data, | ||
| bool | suppress_func_errors, | ||
| int | cu_id | ||
| ) |
Definition at line 283 of file shader.cc.
References cuList, data, Packet::dataStatic(), fatal, functionalTLBAccess(), BaseTLB::Read, MemCmd::ReadReq, Packet::req, roundDown(), Packet::setSuppressFuncError(), BaseTLB::Write, and MemCmd::WriteReq.
Referenced by AccessMem().
| void Shader::execScheduledAdds | ( | ) |
| void Shader::functionalTLBAccess | ( | PacketPtr | pkt, |
| int | cu_id, | ||
| BaseTLB::Mode | mode | ||
| ) |
Definition at line 426 of file shader.cc.
References cuList, gpuTc, ArmISA::mode, and Packet::senderState.
Referenced by doFunctionalAccess().
| Addr Shader::getHiddenPrivateBase | ( | ) |
inline |
Definition at line 166 of file shader.hh.
References shHiddenPrivateBaseVmid.
Referenced by GPUDynInst::resolveFlatSegment().
| Addr Shader::getScratchBase | ( | ) |
inline |
Definition at line 160 of file shader.hh.
References _scratchApe, and ApertureRegister::base.
Referenced by GPUDynInst::resolveFlatSegment().
| const ApertureRegister& Shader::gpuVmApe | ( | ) | const |
inline |
| void Shader::incVectorInstDstOperand | ( | int | num_operands | ) |
inline |
Definition at line 284 of file shader.hh.
References stats, and Shader::ShaderStats::vectorInstDstOperand.
Referenced by Wavefront::exec().
| void Shader::incVectorInstSrcOperand | ( | int | num_operands | ) |
inline |
Definition at line 278 of file shader.hh.
References stats, and Shader::ShaderStats::vectorInstSrcOperand.
Referenced by Wavefront::exec().
| void Shader::init | ( | ) |
virtual |
init() is called after all C++ SimObjects have been created and all ports are connected.
Initializations that are independent of unserialization but rely on a fully instantiated and connected SimObject graph should be done here.
Reimplemented from SimObject.
Definition at line 137 of file shader.cc.
References cpuPointer, BaseCPU::getContext(), and gpuTc.
| void Shader::initShHiddenPrivateBase | ( | Addr | queueBase, |
| uint32_t | offset | ||
| ) |
inline |
Definition at line 172 of file shader.hh.
References ArmISA::offset, panic_if, and shHiddenPrivateBaseVmid.
Referenced by Wavefront::initRegState().
| bool Shader::isGpuVmApe | ( | Addr | addr | ) | const |
inline |
Definition at line 135 of file shader.hh.
References _gpuVmApe, X86ISA::addr, ApertureRegister::base, and ApertureRegister::limit.
| bool Shader::isLdsApe | ( | Addr | addr | ) | const |
inline |
Definition at line 143 of file shader.hh.
References _ldsApe, X86ISA::addr, ApertureRegister::base, and ApertureRegister::limit.
Referenced by GPUDynInst::doApertureCheck().
| bool Shader::isScratchApe | ( | Addr | addr | ) | const |
inline |
Definition at line 151 of file shader.hh.
References _scratchApe, X86ISA::addr, ApertureRegister::base, and ApertureRegister::limit.
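The three aperture predicates each reference the base and limit fields of an ApertureRegister, which points to a simple range test. A minimal sketch of such a test follows; ApertureSketch and inAperture are hypothetical names, Addr is assumed to be a 64-bit unsigned integer, and the inclusive upper bound is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstdint>

// Stand-in for gem5's Addr type (assumed to be a 64-bit unsigned integer).
using Addr = std::uint64_t;

// Hypothetical mirror of ApertureRegister: only base and limit are modeled.
struct ApertureSketch
{
    Addr base;
    Addr limit;
};

// Returns true when addr lies in [base, limit]. Treating the limit as
// inclusive is an assumption of this sketch.
inline bool
inAperture(const ApertureSketch &ape, Addr addr)
{
    assert(ape.base <= ape.limit);
    return addr >= ape.base && addr <= ape.limit;
}
```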
| const ApertureRegister& Shader::ldsApe | ( | ) | const |
inline |
| Addr Shader::mmap | ( | int | length | ) |
Definition at line 104 of file shader.cc.
References Process::allocateMem(), DPRINTF, ThreadContext::getProcessPtr(), gpuTc, Process::memState, Process::mmapGrowsDown(), X86ISA::PageBytes, and roundUp().
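The references to roundUp() and X86ISA::PageBytes indicate that the requested length is rounded to a page multiple before memory is allocated. The sketch below shows only that rounding step. The names roundUpSketch, kPageBytesSketch and pagesFor are hypothetical, and the 4096-byte page size is an assumption.

```cpp
#include <cassert>
#include <cstdint>

// Assumed page size for this sketch (4 KiB).
constexpr std::uint64_t kPageBytesSketch = 4096;

// Round `value` up to the next multiple of `align`, where `align` is a
// non-zero power of two.
constexpr std::uint64_t
roundUpSketch(std::uint64_t value, std::uint64_t align)
{
    return (value + align - 1) & ~(align - 1);
}

// Number of whole pages needed to hold `length` bytes.
inline std::uint64_t
pagesFor(std::uint64_t length)
{
    assert(length > 0);
    return roundUpSketch(length, kPageBytesSketch) / kPageBytesSketch;
}
```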
| void Shader::notifyCuSleep | ( | ) |
Definition at line 516 of file shader.cc.
References _activeCus, _lastInactiveTick, cuList, curTick(), panic_if, Shader::ShaderStats::shaderActiveTicks, and stats.
Referenced by ComputeUnit::exec().
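Together with dispatchWorkgroups(), this function touches _activeCus, _lastInactiveTick and ShaderStats::shaderActiveTicks, which suggests that active time is accumulated over intervals in which at least one CU is awake. A minimal sketch of that bookkeeping, under that assumption, is given here. ActiveTickTracker, cuWake and cuSleep are hypothetical names and the logic is illustrative rather than copied from gem5.

```cpp
#include <cassert>
#include <cstdint>

using Tick = std::uint64_t;  // stand-in for gem5's Tick

// Hypothetical tracker: accumulates the ticks during which at least one
// compute unit is active.
struct ActiveTickTracker
{
    int activeCus = 0;
    Tick lastInactiveTick = 0;
    Tick activeTicks = 0;

    // A CU becomes active. If it is the first one, remember when the
    // active interval started.
    void
    cuWake(Tick now)
    {
        if (activeCus == 0)
            lastInactiveTick = now;
        ++activeCus;
    }

    // A CU goes to sleep. If it was the last active one, close the
    // interval and add its length to the total.
    void
    cuSleep(Tick now)
    {
        assert(activeCus > 0);
        --activeCus;
        if (activeCus == 0)
            activeTicks += now - lastInactiveTick;
    }
};
```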
| void Shader::prepareFlush | ( | GPUDynInstPtr | gpuDynInst | ) |
dispatcher/shader arranges flush requests to the CUs
Definition at line 221 of file shader.cc.
References _dispatcher, cuList, GPUDispatcher::getOutstandingWbs(), and GPUDispatcher::updateWbCounter().
Referenced by Gcn3ISA::Inst_SOPP__S_ENDPGM::execute().
| void Shader::prepareInvalidate | ( | HSAQueueEntry * | task | ) |
Definition at line 190 of file shader.cc.
References _dispatcher, cuList, HSAQueueEntry::dispatchId(), HSAQueueEntry::isInvStarted(), n_cu, HSAQueueEntry::outstandingInvs(), and GPUDispatcher::updateInvCounter().
Referenced by GPUDispatcher::exec().
| bool Shader::processTimingPacket | ( | PacketPtr | pkt | ) |
| void Shader::ReadMem | ( | uint64_t | address, |
| void * | ptr, | ||
| uint32_t | sz, | ||
| int | cu_id | ||
| ) |
Definition at line 393 of file shader.cc.
References AccessMem(), and MemCmd::ReadReq.
| void Shader::ReadMem | ( | uint64_t | address, |
| void * | ptr, | ||
| uint32_t | sz, | ||
| int | cu_id, | ||
| bool | suppress_func_errors | ||
| ) |
Definition at line 399 of file shader.cc.
References AccessMem(), and MemCmd::ReadReq.
| void Shader::registerCU | ( | int | cu_id, |
| ComputeUnit * | compute_unit | ||
| ) |
inline |
| void Shader::sampleInstRoundTrip | ( | std::vector< Tick > | roundTripTime | ) |
Definition at line 467 of file shader.cc.
References Shader::ShaderStats::gmEnqueueLatency, Shader::ShaderStats::gmToCompleteLatency, Shader::ShaderStats::initToCoalesceLatency, InstMemoryHopMax, Shader::ShaderStats::rubyNetworkLatency, Stats::DistBase< Derived, Stor >::sample(), stats, ArmISA::t1, ArmISA::t2, ArmISA::t3, ArmISA::t4, and ArmISA::t5.
Referenced by GlobalMemPipeline::exec().
| void Shader::sampleLineRoundTrip | ( | const std::map< Addr, std::vector< Tick >> & | roundTripTime | ) |
Definition at line 487 of file shader.cc.
References Shader::ShaderStats::cacheBlockRoundTrip, Shader::ShaderStats::coalsrLineAddresses, Stats::DistBase< Derived, Stor >::sample(), and stats.
Referenced by GlobalMemPipeline::exec().
| void Shader::sampleLoad | ( | const Tick | accessTime | ) |
Definition at line 460 of file shader.cc.
References Shader::ShaderStats::allLatencyDist, Shader::ShaderStats::loadLatencyDist, Stats::DistBase< Derived, Stor >::sample(), and stats.
Referenced by GlobalMemPipeline::exec().
| void Shader::sampleStore | ( | const Tick | accessTime | ) |
Definition at line 450 of file shader.cc.
References Shader::ShaderStats::allLatencyDist, Stats::DistBase< Derived, Stor >::sample(), stats, and Shader::ShaderStats::storeLatencyDist.
Referenced by GlobalMemPipeline::exec().
| void Shader::ScheduleAdd | ( | int * | val, |
| Tick | when, | ||
| int | x | ||
| ) |
Definition at line 356 of file shader.cc.
References curTick(), DPRINTF, EventManager::reschedule(), sa_n, sa_val, sa_when, sa_x, Event::scheduled(), tickEvent, X86ISA::val, Event::when(), and RiscvISA::x.
Referenced by LocalMemPipeline::exec(), ScalarMemPipeline::exec(), and GlobalMemPipeline::exec().
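ScheduleAdd() and execScheduledAdds() share the parallel vectors sa_val, sa_when and sa_x, with sa_n as the entry count, which suggests a list of deferred "add x to *val at tick when" operations. The sketch below models that idea in a standalone struct and is not the gem5 code. ScheduledAddsSketch is a hypothetical name, and the sketch treats `when` as an absolute tick, which may differ from how gem5 interprets the argument.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

using Tick = std::uint64_t;  // stand-in for gem5's Tick

// Hypothetical container holding deferred additions in parallel vectors.
struct ScheduledAddsSketch
{
    std::vector<int *> val;
    std::vector<Tick> when;
    std::vector<int> x;

    // Record that delta should be added to *v at tick w (absolute tick in
    // this sketch).
    void
    scheduleAdd(int *v, Tick w, int delta)
    {
        assert(v != nullptr);
        val.push_back(v);
        when.push_back(w);
        x.push_back(delta);
    }

    // Apply and drop every entry whose tick is less than or equal to now.
    void
    execScheduledAdds(Tick now)
    {
        std::size_t i = 0;
        while (i < when.size()) {
            if (when[i] <= now) {
                *val[i] += x[i];
                val.erase(val.begin() + i);
                when.erase(when.begin() + i);
                x.erase(x.begin() + i);
            } else {
                ++i;
            }
        }
    }
};
```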
| const ApertureRegister& Shader::scratchApe | ( | ) | const |
inline |
Definition at line 129 of file shader.hh.
References _scratchApe.
| void Shader::updateContext | ( | int | cid | ) |
Definition at line 152 of file shader.cc.
References cpuPointer, BaseCPU::getContext(), and gpuTc.
Referenced by GPUDispatcher::exec().
| void Shader::WriteMem | ( | uint64_t | address, |
| void * | ptr, | ||
| uint32_t | sz, | ||
| int | cu_id | ||
| ) |
Definition at line 407 of file shader.cc.
References AccessMem(), and MemCmd::WriteReq.
| void Shader::WriteMem | ( | uint64_t | address, |
| void * | ptr, | ||
| uint32_t | sz, | ||
| int | cu_id, | ||
| bool | suppress_func_errors | ||
| ) |
Definition at line 413 of file shader.cc.
References AccessMem(), and MemCmd::WriteReq.
| int Shader::_activeCus |
private |
Definition at line 96 of file shader.hh.
Referenced by dispatchWorkgroups(), and notifyCuSleep().
| GPUDispatcher& Shader::_dispatcher |
Definition at line 230 of file shader.hh.
Referenced by dispatcher(), prepareFlush(), and prepareInvalidate().
| ApertureRegister Shader::_gpuVmApe |
private |
Definition at line 90 of file shader.hh.
Referenced by gpuVmApe(), and isGpuVmApe().
| Tick Shader::_lastInactiveTick |
private |
Definition at line 99 of file shader.hh.
Referenced by dispatchWorkgroups(), and notifyCuSleep().
| ApertureRegister Shader::_ldsApe |
private |
Definition at line 91 of file shader.hh.
Referenced by isLdsApe(), and ldsApe().
| ApertureRegister Shader::_scratchApe |
private |
Definition at line 92 of file shader.hh.
Referenced by getScratchBase(), isScratchApe(), and scratchApe().
| int Shader::coissue_return |
Definition at line 202 of file shader.hh.
Referenced by LocalMemPipeline::exec(), ScalarMemPipeline::exec(), and GlobalMemPipeline::exec().
| BaseCPU* Shader::cpuPointer |
Definition at line 114 of file shader.hh.
Referenced by init(), and updateContext().
| SimpleThread* Shader::cpuThread |
| std::vector<ComputeUnit*> Shader::cuList |
Definition at line 227 of file shader.hh.
Referenced by AccessMem(), dispatchWorkgroups(), doFunctionalAccess(), functionalTLBAccess(), notifyCuSleep(), prepareFlush(), prepareInvalidate(), registerCU(), and ~Shader().
| int Shader::globalMemSize |
Definition at line 211 of file shader.hh.
Referenced by GlobalMemPipeline::init().
| GPUCommandProcessor& Shader::gpuCmdProc |
| ThreadContext* Shader::gpuTc |
Definition at line 113 of file shader.hh.
Referenced by functionalTLBAccess(), init(), FetchUnit::initiateFetch(), mmap(), ComputeUnit::sendRequest(), ComputeUnit::sendScalarRequest(), and updateContext().
| hsail_mode_e Shader::hsail_mode |
Definition at line 195 of file shader.hh.
Referenced by Wavefront::exec().
| int Shader::impl_kern_end_rel |
Definition at line 200 of file shader.hh.
Referenced by Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), and ComputeUnit::injectGlobalMemFence().
| int Shader::impl_kern_launch_acq |
Definition at line 198 of file shader.hh.
Referenced by GPUDispatcher::exec().
| int64_t Shader::max_valu_insts |
Definition at line 232 of file shader.hh.
Referenced by ComputeUnit::updateInstStats().
| int Shader::n_cu |
Definition at line 206 of file shader.hh.
Referenced by dispatchWorkgroups(), prepareInvalidate(), and ~Shader().
| int Shader::n_wf |
Definition at line 208 of file shader.hh.
Referenced by ComputeUnit::dispWorkgroup(), FetchUnit::exec(), ScoreboardCheckStage::exec(), ComputeUnit::hasDispResources(), FetchUnit::init(), ComputeUnit::isVectorAluIdle(), ComputeUnit::releaseWFsFromBarrier(), Wavefront::setStatus(), and ComputeUnit::~ComputeUnit().
| int Shader::nextSchedCu |
Definition at line 214 of file shader.hh.
Referenced by dispatchWorkgroups().
| uint32_t Shader::sa_n |
Definition at line 217 of file shader.hh.
Referenced by execScheduledAdds(), and ScheduleAdd().
| std::vector<int*> Shader::sa_val |
Definition at line 220 of file shader.hh.
Referenced by execScheduledAdds(), and ScheduleAdd().
| std::vector<uint64_t> Shader::sa_when |
Definition at line 222 of file shader.hh.
Referenced by execScheduledAdds(), and ScheduleAdd().
| std::vector<int32_t> Shader::sa_x |
Definition at line 224 of file shader.hh.
Referenced by execScheduledAdds(), and ScheduleAdd().
| Addr Shader::shHiddenPrivateBaseVmid |
private |
Definition at line 93 of file shader.hh.
Referenced by getHiddenPrivateBase(), and initShHiddenPrivateBase().
| Shader::ShaderStats Shader::stats |
protected |
| EventFunctionWrapper Shader::tickEvent |
Definition at line 191 of file shader.hh.
Referenced by dispatchWorkgroups(), execScheduledAdds(), and ScheduleAdd().
| bool Shader::timingSim |
Definition at line 194 of file shader.hh.
Referenced by FetchUnit::init(), and ComputeUnit::sendRequest().
| int64_t Shader::total_valu_insts |
Definition at line 233 of file shader.hh.
Referenced by ComputeUnit::updateInstStats().