gem5  v20.1.0.0
Public Types | Public Member Functions | Public Attributes | Private Member Functions | Private Attributes | List of all members
Wavefront Class Reference

#include <wavefront.hh>

Inheritance diagram for Wavefront:
SimObject EventManager Serializable Drainable Stats::Group

Public Types

enum  status_e {
  S_STOPPED, S_RETURNING, S_RUNNING, S_STALLED,
  S_WAITCNT, S_BARRIER
}
 
typedef WavefrontParams Params
 
- Public Types inherited from SimObject
typedef SimObjectParams Params
 

Public Member Functions

void freeResources ()
 
GPUDynInstPtr nextInstr ()
 
void setStatus (status_e newStatus)
 
status_e getStatus ()
 
void resizeRegFiles (int num_vregs, int num_sregs)
 
bool isGmInstruction (GPUDynInstPtr ii)
 
bool isLmInstruction (GPUDynInstPtr ii)
 
bool isOldestInstWaitcnt ()
 
bool isOldestInstGMem ()
 
bool isOldestInstLMem ()
 
bool isOldestInstPrivMem ()
 
bool isOldestInstFlatMem ()
 
bool isOldestInstVectorALU ()
 
bool isOldestInstScalarALU ()
 
bool isOldestInstScalarMem ()
 
bool isOldestInstBarrier ()
 
void computeActualWgSz (HSAQueueEntry *task)
 
void initRegState (HSAQueueEntry *task, int wgSizeInWorkItems)
 
 Wavefront (const Params *p)
 
 ~Wavefront ()
 
virtual void init ()
 init() is called after all C++ SimObjects have been created and all ports are connected. More...
 
void setParent (ComputeUnit *cu)
 
void validateRequestCounters ()
 
void start (uint64_t _wfDynId, uint64_t _base_ptr)
 
void exec ()
 
std::vector< int > reserveResources ()
 
bool stopFetch ()
 
void regStats ()
 Callback to set stat parameters. More...
 
Addr pc () const
 
void pc (Addr new_pc)
 
VectorMaskexecMask ()
 
bool execMask (int lane) const
 
void discardFetch ()
 
bool waitCntsSatisfied ()
 
void setWaitCnts (int vm_wait_cnt, int exp_wait_cnt, int lgkm_wait_cnt)
 
void clearWaitCnts ()
 
void incVMemInstsIssued ()
 
void incExpInstsIssued ()
 
void incLGKMInstsIssued ()
 
void decVMemInstsIssued ()
 
void decExpInstsIssued ()
 
void decLGKMInstsIssued ()
 
void freeRegisterFile ()
 Freeing VRF space. More...
 
TheGpuISA::GPUISA & gpuISA ()
 
void barrierId (int bar_id)
 
int barrierId () const
 
bool hasBarrier () const
 
void releaseBarrier ()
 
- Public Member Functions inherited from SimObject
const Paramsparams () const
 
 SimObject (const Params *_params)
 
virtual ~SimObject ()
 
virtual const std::string name () const
 
virtual void loadState (CheckpointIn &cp)
 loadState() is called on each SimObject when restoring from a checkpoint. More...
 
virtual void initState ()
 initState() is called on each SimObject when not restoring from a checkpoint. More...
 
virtual void regProbePoints ()
 Register probe points for this object. More...
 
virtual void regProbeListeners ()
 Register probe listeners for this object. More...
 
ProbeManagergetProbeManager ()
 Get the probe manager for this object. More...
 
virtual PortgetPort (const std::string &if_name, PortID idx=InvalidPortID)
 Get a port with a given name and index. More...
 
virtual void startup ()
 startup() is the final initialization call before simulation. More...
 
DrainState drain () override
 Provide a default implementation of the drain interface for objects that don't need draining. More...
 
virtual void memWriteback ()
 Write back dirty buffers to memory using functional writes. More...
 
virtual void memInvalidate ()
 Invalidate the contents of memory buffers. More...
 
void serialize (CheckpointOut &cp) const override
 Serialize an object. More...
 
void unserialize (CheckpointIn &cp) override
 Unserialize an object. More...
 
- Public Member Functions inherited from EventManager
EventQueueeventQueue () const
 
void schedule (Event &event, Tick when)
 
void deschedule (Event &event)
 
void reschedule (Event &event, Tick when, bool always=false)
 
void schedule (Event *event, Tick when)
 
void deschedule (Event *event)
 
void reschedule (Event *event, Tick when, bool always=false)
 
void wakeupEventQueue (Tick when=(Tick) -1)
 This function is not needed by the usual gem5 event loop but may be necessary in derived EventQueues which host gem5 on other schedulers. More...
 
void setCurTick (Tick newVal)
 
 EventManager (EventManager &em)
 Event manger manages events in the event queue. More...
 
 EventManager (EventManager *em)
 
 EventManager (EventQueue *eq)
 
- Public Member Functions inherited from Serializable
 Serializable ()
 
virtual ~Serializable ()
 
void serializeSection (CheckpointOut &cp, const char *name) const
 Serialize an object into a new section. More...
 
void serializeSection (CheckpointOut &cp, const std::string &name) const
 
void unserializeSection (CheckpointIn &cp, const char *name)
 Unserialize an a child object. More...
 
void unserializeSection (CheckpointIn &cp, const std::string &name)
 
- Public Member Functions inherited from Drainable
DrainState drainState () const
 Return the current drain state of an object. More...
 
virtual void notifyFork ()
 Notify a child process of a fork. More...
 
- Public Member Functions inherited from Stats::Group
 Group (Group *parent, const char *name=nullptr)
 Construct a new statistics group. More...
 
virtual ~Group ()
 
virtual void resetStats ()
 Callback to reset stats. More...
 
virtual void preDumpStats ()
 Callback before stats are dumped. More...
 
void addStat (Stats::Info *info)
 Register a stat with this group. More...
 
const std::map< std::string, Group * > & getStatGroups () const
 Get all child groups associated with this object. More...
 
const std::vector< Info * > & getStats () const
 Get all stats associated with this object. More...
 
void addStatGroup (const char *name, Group *block)
 Add a stat block as a child of this block. More...
 
const InforesolveStat (std::string name) const
 Resolve a stat by its name within this group. More...
 
 Group ()=delete
 
 Group (const Group &)=delete
 
Groupoperator= (const Group &)=delete
 

Public Attributes

const int wfSlotId
 
int kernId
 
const int simdId
 
int execUnitId
 
int flatLmUnitId
 
int flatGmUnitId
 
ComputeUnitcomputeUnit
 
int maxIbSize
 
std::deque< GPUDynInstPtrinstructionBuffer
 
bool pendingFetch
 
bool dropFetch
 
Tick lastNonIdleTick
 
int scalarAlu
 
int scalarAluGlobalIdx
 
int globalMem
 
int localMem
 
int scalarMem
 
uint32_t maxVgprs
 
uint32_t maxSgprs
 
std::vector< AddrlastAddr
 
std::vector< uint32_t > workItemId [3]
 
std::vector< uint32_t > workItemFlatId
 
uint32_t workGroupId [3]
 
uint32_t workGroupSz [3]
 
uint32_t gridSz [3]
 
uint32_t wgId
 
uint32_t wgSz
 
uint32_t actualWgSz [3]
 
uint32_t actualWgSzTotal
 
uint32_t wfId
 
uint32_t maxDynWaveId
 
uint32_t dispatchId
 
int outstandingReqs
 
int outstandingReqsWrGm
 
int outstandingReqsWrLm
 
int outstandingReqsRdGm
 
int outstandingReqsRdLm
 
int scalarOutstandingReqsRdGm
 
int scalarOutstandingReqsWrGm
 
int rdLmReqsInPipe
 
int rdGmReqsInPipe
 
int wrLmReqsInPipe
 
int wrGmReqsInPipe
 
int scalarRdGmReqsInPipe
 
int scalarWrGmReqsInPipe
 
int memTraceBusy
 
uint64_t lastTrace
 
int reservedVectorRegs
 
int reservedScalarRegs
 
uint32_t startVgprIndex
 
uint32_t startSgprIndex
 
std::vector< uint32_t > oldVgpr
 
uint32_t oldVgprId
 
uint64_t oldVgprTcnt
 
std::vector< uint64_t > oldDgpr
 
uint32_t oldDgprId
 
uint64_t oldDgprTcnt
 
VectorMask initMask
 
LdsChunkldsChunk
 
uint64_t wfDynId
 
Stats::Scalar numInstrExecuted
 
Stats::Scalar schCycles
 
Stats::Scalar schStalls
 
Stats::Scalar schRfAccessStalls
 
Stats::Scalar schResourceStalls
 
Stats::Scalar schOpdNrdyStalls
 
Stats::Scalar schLdsArbStalls
 
Stats::Scalar numTimesBlockedDueWAXDependencies
 
Stats::Scalar numTimesBlockedDueRAWDependencies
 
uint64_t lastInstExec
 
Stats::Distribution vecRawDistance
 
std::unordered_map< int, uint64_t > rawDist
 
Stats::Distribution readsPerWrite
 
std::vector< int > vecReads
 
uint8_t * context
 

Private Member Functions

void reserveGmResource (GPUDynInstPtr ii)
 
void reserveLmResource (GPUDynInstPtr ii)
 

Private Attributes

TheGpuISA::GPUISA _gpuISA
 
int vmWaitCnt
 the following are used for waitcnt instructions vmWaitCnt: once set, we wait for the oustanding number of vector mem instructions to be at, or below vmWaitCnt. More...
 
int expWaitCnt
 
int lgkmWaitCnt
 
int vmemInstsIssued
 
int expInstsIssued
 
int lgkmInstsIssued
 
status_e status
 
Addr _pc
 
VectorMask _execMask
 
int barId
 

Additional Inherited Members

- Static Public Member Functions inherited from SimObject
static void serializeAll (CheckpointOut &cp)
 Serialize all SimObjects in the system. More...
 
static SimObjectfind (const char *name)
 Find the SimObject with the given name and return a pointer to it. More...
 
- Static Public Member Functions inherited from Serializable
static const std::string & currentSection ()
 Gets the fully-qualified name of the active section. More...
 
static void serializeAll (const std::string &cpt_dir)
 Serializes all the SimObjects. More...
 
static void unserializeGlobals (CheckpointIn &cp)
 
- Protected Member Functions inherited from Drainable
 Drainable ()
 
virtual ~Drainable ()
 
virtual void drainResume ()
 Resume execution after a successful drain. More...
 
void signalDrainDone () const
 Signal that an object is drained. More...
 
- Protected Attributes inherited from SimObject
const SimObjectParams * _params
 Cached copy of the object parameters. More...
 
- Protected Attributes inherited from EventManager
EventQueueeventq
 A pointer to this object's event queue. More...
 

Detailed Description

Definition at line 57 of file wavefront.hh.

Member Typedef Documentation

◆ Params

typedef WavefrontParams Wavefront::Params

Definition at line 275 of file wavefront.hh.

Member Enumeration Documentation

◆ status_e

Enumerator
S_STOPPED 
S_RETURNING 
S_RUNNING 
S_STALLED 
S_WAITCNT 

wavefront has unsatisfied wait counts

while in this state the WF will only execute if the oldest instruction is the waitcnt. while in S_WAITCNT, the wavefront will not be ready until all of its waitcnts have been satisfied. the scoreboard ready() function will check the status of the waitcnts whenever the WF is in S_WAITCNT, and once they are satisfied, it will resume normal operation.

S_BARRIER 

WF is stalled at a barrier.

Definition at line 60 of file wavefront.hh.

Constructor & Destructor Documentation

◆ Wavefront()

Wavefront::Wavefront ( const Params p)

◆ ~Wavefront()

Wavefront::~Wavefront ( )

Definition at line 586 of file wavefront.cc.

Member Function Documentation

◆ barrierId() [1/2]

int Wavefront::barrierId ( ) const

Definition at line 1445 of file wavefront.cc.

References barId.

◆ barrierId() [2/2]

void Wavefront::barrierId ( int  bar_id)

◆ clearWaitCnts()

void Wavefront::clearWaitCnts ( )

Definition at line 1336 of file wavefront.cc.

References expWaitCnt, lgkmWaitCnt, S_RUNNING, status, and vmWaitCnt.

Referenced by waitCntsSatisfied().

◆ computeActualWgSz()

void Wavefront::computeActualWgSz ( HSAQueueEntry task)

◆ decExpInstsIssued()

void Wavefront::decExpInstsIssued ( )

Definition at line 1374 of file wavefront.cc.

References expInstsIssued.

◆ decLGKMInstsIssued()

void Wavefront::decLGKMInstsIssued ( )

◆ decVMemInstsIssued()

void Wavefront::decVMemInstsIssued ( )

◆ discardFetch()

void Wavefront::discardFetch ( )

clear the fetch buffer for this wave in order to remove any stale inst data

Definition at line 1235 of file wavefront.cc.

References computeUnit, dropFetch, ComputeUnit::fetchStage, FetchStage::fetchUnit(), FetchUnit::flushBuf(), instructionBuffer, pendingFetch, simdId, and wfSlotId.

Referenced by exec().

◆ exec()

void Wavefront::exec ( )

if this wave is in S_WAITCNT state, then it should enter exec() precisely one time before the waitcnts are satisfied, in order to execute the waitcnt instruction itself thus we assert that the waitcnt is the oldest instruction. if we enter exec() with active waitcnts, and we're not executing the waitcnt instruction, something must be wrong

we return here to avoid spurious errors related to flat insts and their address segment resolution.

Definition at line 921 of file wavefront.cc.

References _gpuISA, ComputeUnit::activeLanesPerGMemInstrDist, ComputeUnit::activeLanesPerLMemInstrDist, computeUnit, ComputeUnit::controlFlowDivergenceDist, ComputeUnit::cu_id, Clocked::cyclesToTicks(), ComputeUnit::deleteFromPipeMap(), discardFetch(), DPRINTF, execMask(), ComputeUnit::execRateDist, fatal, ComputeUnit::getTokenManager(), Shader::hsail_mode, ArmISA::i, ComputeUnit::instCyclesLdsPerSimd, ComputeUnit::instCyclesScMemPerSimd, ComputeUnit::instCyclesVMemPerSimd, ComputeUnit::instExecPerSimd, ComputeUnit::instInterleave, instructionBuffer, isGmInstruction(), isLmInstruction(), isOldestInstWaitcnt(), ComputeUnit::issuePeriod, ComputeUnit::lastExecCycle, lastInstExec, ArmISA::n, Stats::none, numInstrExecuted, ComputeUnit::numInstrExecuted, ComputeUnit::numVecOpsExecuted, ComputeUnit::numVecOpsExecutedF16, ComputeUnit::numVecOpsExecutedF32, ComputeUnit::numVecOpsExecutedF64, ComputeUnit::numVecOpsExecutedFMA16, ComputeUnit::numVecOpsExecutedFMA32, ComputeUnit::numVecOpsExecutedFMA64, ComputeUnit::numVecOpsExecutedMAC16, ComputeUnit::numVecOpsExecutedMAC32, ComputeUnit::numVecOpsExecutedMAC64, ComputeUnit::numVecOpsExecutedMAD16, ComputeUnit::numVecOpsExecutedMAD32, ComputeUnit::numVecOpsExecutedMAD64, ComputeUnit::numVecOpsExecutedTwoOpFP, panic, pc(), rawDist, readsPerWrite, TokenManager::recvTokens(), S_RETURNING, S_STALLED, S_STOPPED, S_WAITCNT, Stats::DistBase< Derived, Stor >::sample(), scalarAlu, ComputeUnit::scalarALUs, ComputeUnit::scalarMemUnit, WaitClass::set(), ComputeUnit::shader, simdId, Shader::SIMT, ComputeUnit::srf, ComputeUnit::srf_scm_bus_latency, ComputeUnit::srfToScalarMemPipeBus, status, ComputeUnit::totalCycles, ComputeUnit::updateInstStats(), Stats::ScalarBase< Derived, Stor >::value(), vecRawDistance, vecReads, ComputeUnit::vectorALUs, ComputeUnit::vectorGlobalMemUnit, Shader::vectorInstDstOperand, Shader::vectorInstSrcOperand, ComputeUnit::vectorSharedMemUnit, ComputeUnit::vrf, ComputeUnit::vrf_gm_bus_latency, ComputeUnit::vrf_lm_bus_latency, ComputeUnit::vrfToGlobalMemPipeBus, ComputeUnit::vrfToLocalMemPipeBus, wfDynId, and wfSlotId.

Referenced by ExecStage::exec().

◆ execMask() [1/2]

VectorMask & Wavefront::execMask ( )

Definition at line 1398 of file wavefront.cc.

References _execMask.

Referenced by Gcn3ISA::Inst_DS::calcAddr(), exec(), Gcn3ISA::Inst_SOP1__S_AND_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_OR_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_XOR_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_ANDN2_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_ORN2_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_NAND_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_NOR_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOP1__S_XNOR_SAVEEXEC_B64::execute(), Gcn3ISA::Inst_SOPP__S_CBRANCH_EXECZ::execute(), Gcn3ISA::Inst_SOPP__S_CBRANCH_EXECNZ::execute(), Gcn3ISA::Inst_VOP2__V_CNDMASK_B32::execute(), Gcn3ISA::Inst_VOP2__V_ADD_F32::execute(), Gcn3ISA::Inst_VOP2__V_SUB_F32::execute(), Gcn3ISA::Inst_VOP2__V_SUBREV_F32::execute(), Gcn3ISA::Inst_VOP2__V_MUL_LEGACY_F32::execute(), Gcn3ISA::Inst_VOP2__V_MUL_F32::execute(), Gcn3ISA::Inst_VOP2__V_MUL_I32_I24::execute(), Gcn3ISA::Inst_VOP2__V_MUL_HI_I32_I24::execute(), Gcn3ISA::Inst_VOP2__V_MUL_U32_U24::execute(), Gcn3ISA::Inst_VOP2__V_MUL_HI_U32_U24::execute(), Gcn3ISA::Inst_VOP2__V_MIN_F32::execute(), Gcn3ISA::Inst_VOP2__V_MAX_F32::execute(), Gcn3ISA::Inst_VOP2__V_MIN_I32::execute(), Gcn3ISA::Inst_VOP2__V_MAX_I32::execute(), Gcn3ISA::Inst_VOP2__V_MIN_U32::execute(), Gcn3ISA::Inst_VOP2__V_MAX_U32::execute(), Gcn3ISA::Inst_VOP2__V_LSHRREV_B32::execute(), Gcn3ISA::Inst_VOP2__V_ASHRREV_I32::execute(), Gcn3ISA::Inst_VOP2__V_LSHLREV_B32::execute(), Gcn3ISA::Inst_VOP2__V_AND_B32::execute(), Gcn3ISA::Inst_VOP2__V_OR_B32::execute(), Gcn3ISA::Inst_VOP2__V_XOR_B32::execute(), Gcn3ISA::Inst_VOP2__V_MAC_F32::execute(), Gcn3ISA::Inst_VOP2__V_MADMK_F32::execute(), Gcn3ISA::Inst_VOP2__V_MADAK_F32::execute(), Gcn3ISA::Inst_VOP2__V_ADD_U32::execute(), Gcn3ISA::Inst_VOP2__V_SUB_U32::execute(), Gcn3ISA::Inst_VOP2__V_SUBREV_U32::execute(), Gcn3ISA::Inst_VOP2__V_ADDC_U32::execute(), Gcn3ISA::Inst_VOP2__V_SUBB_U32::execute(), Gcn3ISA::Inst_VOP2__V_SUBBREV_U32::execute(), Gcn3ISA::Inst_VOP2__V_ADD_U16::execute(), Gcn3ISA::Inst_VOP2__V_SUB_U16::execute(), Gcn3ISA::Inst_VOP2__V_SUBREV_U16::execute(), Gcn3ISA::Inst_VOP2__V_MUL_LO_U16::execute(), Gcn3ISA::Inst_VOP2__V_LSHLREV_B16::execute(), Gcn3ISA::Inst_VOP2__V_LSHRREV_B16::execute(), Gcn3ISA::Inst_VOP2__V_ASHRREV_I16::execute(), Gcn3ISA::Inst_VOP2__V_MAX_U16::execute(), Gcn3ISA::Inst_VOP2__V_MAX_I16::execute(), Gcn3ISA::Inst_VOP2__V_MIN_U16::execute(), Gcn3ISA::Inst_VOP2__V_MIN_I16::execute(), Gcn3ISA::Inst_VOP1__V_MOV_B32::execute(), Gcn3ISA::Inst_VOP1__V_READFIRSTLANE_B32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_I32_F64::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F64_I32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_I32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_U32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_U32_F32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_I32_F32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_RPI_I32_F32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_FLR_I32_F32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_F64::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F64_F32::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_UBYTE0::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_UBYTE1::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_UBYTE2::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F32_UBYTE3::execute(), Gcn3ISA::Inst_VOP1__V_CVT_U32_F64::execute(), Gcn3ISA::Inst_VOP1__V_CVT_F64_U32::execute(), Gcn3ISA::Inst_VOP1__V_TRUNC_F64::execute(), Gcn3ISA::Inst_VOP1__V_CEIL_F64::execute(), Gcn3ISA::Inst_VOP1__V_RNDNE_F64::execute(), Gcn3ISA::Inst_VOP1__V_FLOOR_F64::execute(), Gcn3ISA::Inst_VOP1__V_FRACT_F32::execute(), Gcn3ISA::Inst_VOP1__V_TRUNC_F32::execute(), Gcn3ISA::Inst_VOP1__V_CEIL_F32::execute(), Gcn3ISA::Inst_VOP1__V_RNDNE_F32::execute(), Gcn3ISA::Inst_VOP1__V_FLOOR_F32::execute(), Gcn3ISA::Inst_VOP1__V_EXP_F32::execute(), Gcn3ISA::Inst_VOP1__V_LOG_F32::execute(), Gcn3ISA::Inst_VOP1__V_RCP_F32::execute(), Gcn3ISA::Inst_VOP1__V_RCP_IFLAG_F32::execute(), Gcn3ISA::Inst_VOP1__V_RSQ_F32::execute(), Gcn3ISA::Inst_VOP1__V_RCP_F64::execute(), Gcn3ISA::Inst_VOP1__V_RSQ_F64::execute(), Gcn3ISA::Inst_VOP1__V_SQRT_F32::execute(), Gcn3ISA::Inst_VOP1__V_SQRT_F64::execute(), Gcn3ISA::Inst_VOP1__V_SIN_F32::execute(), Gcn3ISA::Inst_VOP1__V_COS_F32::execute(), Gcn3ISA::Inst_VOP1__V_NOT_B32::execute(), Gcn3ISA::Inst_VOP1__V_BFREV_B32::execute(), Gcn3ISA::Inst_VOP1__V_FFBH_U32::execute(), Gcn3ISA::Inst_VOP1__V_FFBL_B32::execute(), Gcn3ISA::Inst_VOP1__V_FFBH_I32::execute(), Gcn3ISA::Inst_VOP1__V_FREXP_EXP_I32_F64::execute(), Gcn3ISA::Inst_VOP1__V_FREXP_MANT_F64::execute(), Gcn3ISA::Inst_VOP1__V_FRACT_F64::execute(), Gcn3ISA::Inst_VOP1__V_FREXP_EXP_I32_F32::execute(), Gcn3ISA::Inst_VOP1__V_FREXP_MANT_F32::execute(), Gcn3ISA::Inst_VOP1__V_EXP_LEGACY_F32::execute(), Gcn3ISA::Inst_VOP1__V_LOG_LEGACY_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_CLASS_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_CLASS_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_CLASS_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_CLASS_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LG_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_O_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_U_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NGE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NLG_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NGT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NLE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NEQ_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NLT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_TRU_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LG_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_O_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_U_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NGE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NLG_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NGT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NLE_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NEQ_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NLT_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_TRU_F32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LG_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_O_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_U_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NGE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NLG_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NGT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NLE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NEQ_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NLT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_TRU_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LG_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_O_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_U_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NGE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NLG_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NGT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NLE_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NEQ_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NLT_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_TRU_F64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NE_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_T_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NE_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_T_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NE_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_T_I16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NE_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_T_U16::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NE_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_T_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NE_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_T_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NE_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_T_I32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NE_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_T_U32::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NE_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_T_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_F_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LT_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_EQ_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_LE_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GT_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_NE_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_GE_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMP_T_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NE_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_T_I64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_F_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LT_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_EQ_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_LE_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GT_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_NE_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_GE_U64::execute(), Gcn3ISA::Inst_VOPC__V_CMPX_T_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_CLASS_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_CLASS_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_CLASS_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_CLASS_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_TRU_F16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_F16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_TRU_F16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LG_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_O_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_U_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NGE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NLG_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NGT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NLE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NEQ_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NLT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_TRU_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LG_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_O_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_U_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NGE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NLG_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NGT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NLE_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NEQ_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NLT_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_TRU_F32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LG_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_O_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_U_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NGE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NLG_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NGT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NLE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NEQ_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NLT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_TRU_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LG_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_O_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_U_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NGE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NLG_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NGT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NLE_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NEQ_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NLT_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_TRU_F64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NE_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_T_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NE_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_T_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NE_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_T_I16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NE_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_T_U16::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NE_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_T_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NE_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_T_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NE_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_T_I32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NE_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_T_U32::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NE_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_T_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_F_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LT_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_EQ_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_LE_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GT_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_NE_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_GE_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMP_T_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NE_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_T_I64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_F_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LT_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_EQ_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_LE_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GT_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_NE_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_GE_U64::execute(), Gcn3ISA::Inst_VOP3__V_CMPX_T_U64::execute(), Gcn3ISA::Inst_VOP3__V_CNDMASK_B32::execute(), Gcn3ISA::Inst_VOP3__V_ADD_F32::execute(), Gcn3ISA::Inst_VOP3__V_SUB_F32::execute(), Gcn3ISA::Inst_VOP3__V_SUBREV_F32::execute(), Gcn3ISA::Inst_VOP3__V_MUL_LEGACY_F32::execute(), Gcn3ISA::Inst_VOP3__V_MUL_F32::execute(), Gcn3ISA::Inst_VOP3__V_MUL_I32_I24::execute(), Gcn3ISA::Inst_VOP3__V_MUL_HI_I32_I24::execute(), Gcn3ISA::Inst_VOP3__V_MUL_U32_U24::execute(), Gcn3ISA::Inst_VOP3__V_MUL_HI_U32_U24::execute(), Gcn3ISA::Inst_VOP3__V_MIN_F32::execute(), Gcn3ISA::Inst_VOP3__V_MAX_F32::execute(), Gcn3ISA::Inst_VOP3__V_MIN_I32::execute(), Gcn3ISA::Inst_VOP3__V_MAX_I32::execute(), Gcn3ISA::Inst_VOP3__V_MIN_U32::execute(), Gcn3ISA::Inst_VOP3__V_MAX_U32::execute(), Gcn3ISA::Inst_VOP3__V_LSHRREV_B32::execute(), Gcn3ISA::Inst_VOP3__V_ASHRREV_I32::execute(), Gcn3ISA::Inst_VOP3__V_LSHLREV_B32::execute(), Gcn3ISA::Inst_VOP3__V_AND_B32::execute(), Gcn3ISA::Inst_VOP3__V_OR_B32::execute(), Gcn3ISA::Inst_VOP3__V_XOR_B32::execute(), Gcn3ISA::Inst_VOP3__V_MAC_F32::execute(), Gcn3ISA::Inst_VOP3__V_ADD_U32::execute(), Gcn3ISA::Inst_VOP3__V_SUB_U32::execute(), Gcn3ISA::Inst_VOP3__V_SUBREV_U32::execute(), Gcn3ISA::Inst_VOP3__V_ADDC_U32::execute(), Gcn3ISA::Inst_VOP3__V_SUBB_U32::execute(), Gcn3ISA::Inst_VOP3__V_SUBBREV_U32::execute(), Gcn3ISA::Inst_VOP3__V_ADD_U16::execute(), Gcn3ISA::Inst_VOP3__V_SUB_U16::execute(), Gcn3ISA::Inst_VOP3__V_SUBREV_U16::execute(), Gcn3ISA::Inst_VOP3__V_MUL_LO_U16::execute(), Gcn3ISA::Inst_VOP3__V_LSHLREV_B16::execute(), Gcn3ISA::Inst_VOP3__V_LSHRREV_B16::execute(), Gcn3ISA::Inst_VOP3__V_ASHRREV_I16::execute(), Gcn3ISA::Inst_VOP3__V_MAX_U16::execute(), Gcn3ISA::Inst_VOP3__V_MAX_I16::execute(), Gcn3ISA::Inst_VOP3__V_MIN_U16::execute(), Gcn3ISA::Inst_VOP3__V_MIN_I16::execute(), Gcn3ISA::Inst_VOP3__V_MOV_B32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_I32_F64::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F64_I32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_I32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_U32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_U32_F32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_I32_F32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_RPI_I32_F32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_FLR_I32_F32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_F64::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F64_F32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_UBYTE0::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_UBYTE1::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_UBYTE2::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F32_UBYTE3::execute(), Gcn3ISA::Inst_VOP3__V_CVT_U32_F64::execute(), Gcn3ISA::Inst_VOP3__V_CVT_F64_U32::execute(), Gcn3ISA::Inst_VOP3__V_TRUNC_F64::execute(), Gcn3ISA::Inst_VOP3__V_CEIL_F64::execute(), Gcn3ISA::Inst_VOP3__V_RNDNE_F64::execute(), Gcn3ISA::Inst_VOP3__V_FLOOR_F64::execute(), Gcn3ISA::Inst_VOP3__V_FRACT_F32::execute(), Gcn3ISA::Inst_VOP3__V_TRUNC_F32::execute(), Gcn3ISA::Inst_VOP3__V_CEIL_F32::execute(), Gcn3ISA::Inst_VOP3__V_RNDNE_F32::execute(), Gcn3ISA::Inst_VOP3__V_FLOOR_F32::execute(), Gcn3ISA::Inst_VOP3__V_EXP_F32::execute(), Gcn3ISA::Inst_VOP3__V_LOG_F32::execute(), Gcn3ISA::Inst_VOP3__V_RCP_F32::execute(), Gcn3ISA::Inst_VOP3__V_RCP_IFLAG_F32::execute(), Gcn3ISA::Inst_VOP3__V_RSQ_F32::execute(), Gcn3ISA::Inst_VOP3__V_RCP_F64::execute(), Gcn3ISA::Inst_VOP3__V_RSQ_F64::execute(), Gcn3ISA::Inst_VOP3__V_SQRT_F32::execute(), Gcn3ISA::Inst_VOP3__V_SQRT_F64::execute(), Gcn3ISA::Inst_VOP3__V_SIN_F32::execute(), Gcn3ISA::Inst_VOP3__V_COS_F32::execute(), Gcn3ISA::Inst_VOP3__V_NOT_B32::execute(), Gcn3ISA::Inst_VOP3__V_BFREV_B32::execute(), Gcn3ISA::Inst_VOP3__V_FFBH_U32::execute(), Gcn3ISA::Inst_VOP3__V_FFBL_B32::execute(), Gcn3ISA::Inst_VOP3__V_FFBH_I32::execute(), Gcn3ISA::Inst_VOP3__V_FREXP_EXP_I32_F64::execute(), Gcn3ISA::Inst_VOP3__V_FREXP_MANT_F64::execute(), Gcn3ISA::Inst_VOP3__V_FRACT_F64::execute(), Gcn3ISA::Inst_VOP3__V_FREXP_EXP_I32_F32::execute(), Gcn3ISA::Inst_VOP3__V_FREXP_MANT_F32::execute(), Gcn3ISA::Inst_VOP3__V_EXP_LEGACY_F32::execute(), Gcn3ISA::Inst_VOP3__V_LOG_LEGACY_F32::execute(), Gcn3ISA::Inst_VOP3__V_MAD_LEGACY_F32::execute(), Gcn3ISA::Inst_VOP3__V_MAD_F32::execute(), Gcn3ISA::Inst_VOP3__V_MAD_I32_I24::execute(), Gcn3ISA::Inst_VOP3__V_MAD_U32_U24::execute(), Gcn3ISA::Inst_VOP3__V_BFE_U32::execute(), Gcn3ISA::Inst_VOP3__V_BFE_I32::execute(), Gcn3ISA::Inst_VOP3__V_BFI_B32::execute(), Gcn3ISA::Inst_VOP3__V_FMA_F32::execute(), Gcn3ISA::Inst_VOP3__V_FMA_F64::execute(), Gcn3ISA::Inst_VOP3__V_LERP_U8::execute(), Gcn3ISA::Inst_VOP3__V_ALIGNBIT_B32::execute(), Gcn3ISA::Inst_VOP3__V_ALIGNBYTE_B32::execute(), Gcn3ISA::Inst_VOP3__V_MIN3_F32::execute(), Gcn3ISA::Inst_VOP3__V_MIN3_I32::execute(), Gcn3ISA::Inst_VOP3__V_MIN3_U32::execute(), Gcn3ISA::Inst_VOP3__V_MAX3_F32::execute(), Gcn3ISA::Inst_VOP3__V_MAX3_I32::execute(), Gcn3ISA::Inst_VOP3__V_MAX3_U32::execute(), Gcn3ISA::Inst_VOP3__V_MED3_F32::execute(), Gcn3ISA::Inst_VOP3__V_MED3_I32::execute(), Gcn3ISA::Inst_VOP3__V_MED3_U32::execute(), Gcn3ISA::Inst_VOP3__V_SAD_U8::execute(), Gcn3ISA::Inst_VOP3__V_SAD_HI_U8::execute(), Gcn3ISA::Inst_VOP3__V_SAD_U16::execute(), Gcn3ISA::Inst_VOP3__V_SAD_U32::execute(), Gcn3ISA::Inst_VOP3__V_CVT_PK_U8_F32::execute(), Gcn3ISA::Inst_VOP3__V_DIV_FIXUP_F32::execute(), Gcn3ISA::Inst_VOP3__V_DIV_FIXUP_F64::execute(), Gcn3ISA::Inst_VOP3__V_DIV_SCALE_F32::execute(), Gcn3ISA::Inst_VOP3__V_DIV_SCALE_F64::execute(), Gcn3ISA::Inst_VOP3__V_DIV_FMAS_F32::execute(), Gcn3ISA::Inst_VOP3__V_DIV_FMAS_F64::execute(), Gcn3ISA::Inst_VOP3__V_MAD_U64_U32::execute(), Gcn3ISA::Inst_VOP3__V_MAD_I64_I32::execute(), Gcn3ISA::Inst_VOP3__V_MAD_U16::execute(), Gcn3ISA::Inst_VOP3__V_MAD_I16::execute(), Gcn3ISA::Inst_VOP3__V_PERM_B32::execute(), Gcn3ISA::Inst_VOP3__V_ADD_F64::execute(), Gcn3ISA::Inst_VOP3__V_MUL_F64::execute(), Gcn3ISA::Inst_VOP3__V_MIN_F64::execute(), Gcn3ISA::Inst_VOP3__V_MAX_F64::execute(), Gcn3ISA::Inst_VOP3__V_LDEXP_F64::execute(), Gcn3ISA::Inst_VOP3__V_MUL_LO_U32::execute(), Gcn3ISA::Inst_VOP3__V_MUL_HI_U32::execute(), Gcn3ISA::Inst_VOP3__V_MUL_HI_I32::execute(), Gcn3ISA::Inst_VOP3__V_LDEXP_F32::execute(), Gcn3ISA::Inst_VOP3__V_BCNT_U32_B32::execute(), Gcn3ISA::Inst_VOP3__V_MBCNT_LO_U32_B32::execute(), Gcn3ISA::Inst_VOP3__V_MBCNT_HI_U32_B32::execute(), Gcn3ISA::Inst_VOP3__V_LSHLREV_B64::execute(), Gcn3ISA::Inst_VOP3__V_LSHRREV_B64::execute(), Gcn3ISA::Inst_VOP3__V_ASHRREV_I64::execute(), Gcn3ISA::Inst_VOP3__V_BFM_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B8::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B16::execute(), Gcn3ISA::Inst_DS__DS_READ_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_READ_U8::execute(), Gcn3ISA::Inst_DS__DS_READ_U16::execute(), Gcn3ISA::Inst_DS__DS_PERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_BPERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B64::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B64::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_UBYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_USHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_BYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_SHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1_VOL::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_UBYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_USHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_BYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_SHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC_X2::execute(), VectorRegisterFile::printReg(), Gcn3ISA::VecOperand< DataType, Const, NumDwords >::write(), and Gcn3ISA::ScalarOperand< DataType, Const, sizeof(DataType)/sizeof(VecElemU32) >::write().

◆ execMask() [2/2]

bool Wavefront::execMask ( int  lane) const

Definition at line 1404 of file wavefront.cc.

References _execMask.

◆ freeRegisterFile()

void Wavefront::freeRegisterFile ( )

◆ freeResources()

void Wavefront::freeResources ( )

Definition at line 800 of file wavefront.cc.

References execUnitId.

Referenced by ExecStage::exec().

◆ getStatus()

status_e Wavefront::getStatus ( )
inline

◆ gpuISA()

TheGpuISA::GPUISA& Wavefront::gpuISA ( )
inline

Definition at line 318 of file wavefront.hh.

References _gpuISA.

◆ hasBarrier()

bool Wavefront::hasBarrier ( ) const

◆ incExpInstsIssued()

void Wavefront::incExpInstsIssued ( )

Definition at line 1356 of file wavefront.cc.

References expInstsIssued.

◆ incLGKMInstsIssued()

void Wavefront::incLGKMInstsIssued ( )

Definition at line 1362 of file wavefront.cc.

References lgkmInstsIssued.

Referenced by ScheduleStage::exec().

◆ incVMemInstsIssued()

void Wavefront::incVMemInstsIssued ( )

Definition at line 1350 of file wavefront.cc.

References vmemInstsIssued.

Referenced by ScheduleStage::exec().

◆ init()

void Wavefront::init ( )
virtual

init() is called after all C++ SimObjects have been created and all ports are connected.

Initializations that are independent of unserialization but rely on a fully instantiated and connected SimObject graph should be done here.

Reimplemented from SimObject.

Definition at line 175 of file wavefront.cc.

References computeUnit, globalMem, localMem, ComputeUnit::mapWaveToGlobalMem(), ComputeUnit::mapWaveToLocalMem(), ComputeUnit::mapWaveToScalarAlu(), ComputeUnit::mapWaveToScalarAluGlobalIdx(), ComputeUnit::mapWaveToScalarMem(), reservedScalarRegs, reservedVectorRegs, scalarAlu, scalarAluGlobalIdx, scalarMem, startSgprIndex, and startVgprIndex.

◆ initRegState()

void Wavefront::initRegState ( HSAQueueEntry task,
int  wgSizeInWorkItems 
)

Since flat scratch init is needed for this kernel, this kernel is going to have flat memory instructions and we need to initialize the hidden private base for this queue. scratch_resource_descriptor[0] has this queue's scratch base address. scratch_backing_memory_location has the offset to this queue's scratch base address from the SH_HIDDEN_PRIVATE_BASE_VMID. Ideally, we only require this queue's scratch base address for address calculation (stored in scratch_resource_descriptor[0]). But that address calculation shoule be done by first finding the queue's scratch base address using the calculation "SH_HIDDEN_PRIVATE_BASE_VMID + offset". So, we initialize SH_HIDDEN_PRIVATE_BASE_VMID.

For more details see: http://rocm-documentation.readthedocs.io/en/latest/ ROCm_Compiler_SDK/ROCm-Native-ISA.html#flat-scratch

https://github.com/ROCm-Developer-Tools/ ROCm-ComputeABI-Doc/blob/master/AMDGPU-ABI.md #flat-addressing

the compute_tmpring_size_wavesize specifies the number of kB allocated per wavefront, hence the multiplication by 1024.

to get the per wavefront offset into the scratch memory, we also multiply this by the wfId. the wfId stored in the Wavefront class, however, is the wave ID within the WG, whereas here we need the global WFID because the scratch space will be divided amongst all waves in the kernel. to get the global ID we multiply the WGID by the WG size, then add the WFID of the wave within its WG.

Definition at line 190 of file wavefront.cc.

References HSAQueueEntry::amdQueue, _amd_queue_s::compute_tmpring_size_wavesize, computeUnit, ComputeUnit::cu_id, DispatchPtr, divCeil(), DPRINTF, fatal, FlatScratchInit, HSAQueueEntry::gridSize(), GridWorkgroupCountX, GridWorkgroupCountY, GridWorkgroupCountZ, HSAQueueEntry::hostAMDQueueAddr, HSAQueueEntry::hostDispPktAddr(), Shader::initShHiddenPrivateBase(), HSAQueueEntry::kernargAddr(), KernargSegPtr, RegisterManager::mapSgpr(), RegisterManager::mapVgpr(), NumScalarInitFields, NumVectorInitFields, PrivateSegBuf, PrivSegWaveByteOffset, QueuePtr, rawDist, ComputeUnit::registerManager, _amd_queue_s::scratch_backing_memory_location, _amd_queue_s::scratch_resource_descriptor, _amd_queue_s::scratch_workitem_byte_size, HSAQueueEntry::sgprBitEnabled(), ComputeUnit::shader, simdId, ComputeUnit::srf, HSAQueueEntry::vgprBitEnabled(), ComputeUnit::vrf, wfDynId, wfId, ComputeUnit::wfSize(), wfSlotId, wgId, HSAQueueEntry::wgSize(), wgSz, workGroupId, WorkgroupIdX, WorkgroupIdY, WorkgroupIdZ, WorkgroupInfo, workItemId, WorkitemIdX, WorkitemIdY, and WorkitemIdZ.

◆ isGmInstruction()

bool Wavefront::isGmInstruction ( GPUDynInstPtr  ii)

Definition at line 639 of file wavefront.cc.

Referenced by exec().

◆ isLmInstruction()

bool Wavefront::isLmInstruction ( GPUDynInstPtr  ii)

Definition at line 650 of file wavefront.cc.

Referenced by exec().

◆ isOldestInstBarrier()

bool Wavefront::isOldestInstBarrier ( )

Definition at line 708 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstFlatMem()

bool Wavefront::isOldestInstFlatMem ( )

Definition at line 773 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstGMem()

bool Wavefront::isOldestInstGMem ( )

Definition at line 721 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstLMem()

bool Wavefront::isOldestInstLMem ( )

Definition at line 747 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstPrivMem()

bool Wavefront::isOldestInstPrivMem ( )

Definition at line 760 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstScalarALU()

bool Wavefront::isOldestInstScalarALU ( )

Definition at line 678 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstScalarMem()

bool Wavefront::isOldestInstScalarMem ( )

Definition at line 734 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstVectorALU()

bool Wavefront::isOldestInstVectorALU ( )

Definition at line 693 of file wavefront.cc.

References instructionBuffer, S_STOPPED, and status.

◆ isOldestInstWaitcnt()

bool Wavefront::isOldestInstWaitcnt ( )

Definition at line 661 of file wavefront.cc.

References instructionBuffer.

Referenced by ScheduleStage::addToSchList(), and exec().

◆ nextInstr()

GPUDynInstPtr Wavefront::nextInstr ( )

Definition at line 1216 of file wavefront.cc.

References computeUnit, instructionBuffer, and ComputeUnit::pipeMap.

Referenced by ScoreboardCheckStage::exec().

◆ pc() [1/2]

Addr Wavefront::pc ( ) const

◆ pc() [2/2]

void Wavefront::pc ( Addr  new_pc)

Definition at line 1392 of file wavefront.cc.

References _pc.

◆ regStats()

void Wavefront::regStats ( )
virtual

Callback to set stat parameters.

This callback is typically used for complex stats (e.g., distributions) that need parameters in addition to a name and a description. Stat names and descriptions should typically be set from the constructor usingo from the constructor using the ADD_STAT macro.

Reimplemented from Stats::Group.

Definition at line 106 of file wavefront.cc.

References Stats::DataWrap< Derived, InfoProxyType >::desc(), Stats::Distribution::init(), SimObject::name(), Stats::DataWrap< Derived, InfoProxyType >::name(), numInstrExecuted, numTimesBlockedDueRAWDependencies, numTimesBlockedDueWAXDependencies, readsPerWrite, Stats::Group::regStats(), schCycles, schLdsArbStalls, schOpdNrdyStalls, schResourceStalls, schRfAccessStalls, schStalls, and vecRawDistance.

◆ releaseBarrier()

void Wavefront::releaseBarrier ( )

Definition at line 1457 of file wavefront.cc.

References barId, and WFBarrier::InvalidID.

Referenced by Gcn3ISA::Inst_SOPP__S_ENDPGM::execute().

◆ reserveGmResource()

void Wavefront::reserveGmResource ( GPUDynInstPtr  ii)
private

◆ reserveLmResource()

void Wavefront::reserveLmResource ( GPUDynInstPtr  ii)
private

Definition at line 849 of file wavefront.cc.

References execUnitId, fatal_if, localMem, panic, rdLmReqsInPipe, and wrLmReqsInPipe.

Referenced by reserveResources().

◆ reserveResources()

std::vector< int > Wavefront::reserveResources ( )

◆ resizeRegFiles()

void Wavefront::resizeRegFiles ( int  num_vregs,
int  num_sregs 
)

Definition at line 580 of file wavefront.cc.

References maxSgprs, and maxVgprs.

◆ setParent()

void Wavefront::setParent ( ComputeUnit cu)
inline

Definition at line 281 of file wavefront.hh.

References computeUnit.

◆ setStatus()

void Wavefront::setStatus ( status_e  newStatus)

◆ setWaitCnts()

void Wavefront::setWaitCnts ( int  vm_wait_cnt,
int  exp_wait_cnt,
int  lgkm_wait_cnt 
)

prior waitcnts should be satisfied, at which time the WF resets them back to -1, indicating they are no longer active

if the instruction encoding indicates a waitcnt of 0xf, that means the waitcnt is not being used

Definition at line 1291 of file wavefront.cc.

References expWaitCnt, lgkmWaitCnt, S_WAITCNT, status, and vmWaitCnt.

◆ start()

void Wavefront::start ( uint64_t  _wfDynId,
uint64_t  _base_ptr 
)

Definition at line 628 of file wavefront.cc.

References _pc, maxVgprs, S_RUNNING, status, vecReads, and wfDynId.

◆ stopFetch()

bool Wavefront::stopFetch ( )

Definition at line 786 of file wavefront.cc.

References instructionBuffer.

Referenced by FetchUnit::exec().

◆ validateRequestCounters()

void Wavefront::validateRequestCounters ( )

Definition at line 805 of file wavefront.cc.

References outstandingReqs, panic_if, rdGmReqsInPipe, rdLmReqsInPipe, simdId, wfDynId, wfSlotId, wrGmReqsInPipe, and wrLmReqsInPipe.

Referenced by Gcn3ISA::Inst_DS__DS_WRITE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B8::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B16::execute(), Gcn3ISA::Inst_DS__DS_READ_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_READ_U8::execute(), Gcn3ISA::Inst_DS__DS_READ_U16::execute(), Gcn3ISA::Inst_DS__DS_SWIZZLE_B32::execute(), Gcn3ISA::Inst_DS__DS_PERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_BPERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B64::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B64::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_UBYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_USHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_BYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_SHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1_VOL::execute(), and Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX2::execute().

◆ waitCntsSatisfied()

bool Wavefront::waitCntsSatisfied ( )

If we reach here, that means an s_waitcnt instruction was executed and the waitcnts are set by the execute method. Check if waitcnts are satisfied.

Definition at line 1248 of file wavefront.cc.

References clearWaitCnts(), expInstsIssued, expWaitCnt, lgkmInstsIssued, lgkmWaitCnt, vmemInstsIssued, and vmWaitCnt.

Member Data Documentation

◆ _execMask

VectorMask Wavefront::_execMask
private

Definition at line 358 of file wavefront.hh.

Referenced by execMask(), and Wavefront().

◆ _gpuISA

TheGpuISA::GPUISA Wavefront::_gpuISA
private

Definition at line 329 of file wavefront.hh.

Referenced by exec(), and gpuISA().

◆ _pc

Addr Wavefront::_pc
private

Definition at line 357 of file wavefront.hh.

Referenced by pc(), and start().

◆ actualWgSz

uint32_t Wavefront::actualWgSz[3]

Definition at line 155 of file wavefront.hh.

Referenced by computeActualWgSz().

◆ actualWgSzTotal

uint32_t Wavefront::actualWgSzTotal

Definition at line 156 of file wavefront.hh.

Referenced by computeActualWgSz().

◆ barId

int Wavefront::barId
private

Definition at line 359 of file wavefront.hh.

Referenced by barrierId(), hasBarrier(), and releaseBarrier().

◆ computeUnit

ComputeUnit* Wavefront::computeUnit

◆ context

uint8_t* Wavefront::context

Definition at line 273 of file wavefront.hh.

◆ dispatchId

uint32_t Wavefront::dispatchId

Definition at line 161 of file wavefront.hh.

Referenced by Gcn3ISA::Inst_SOPP__S_ENDPGM::execute().

◆ dropFetch

bool Wavefront::dropFetch

◆ execUnitId

int Wavefront::execUnitId

Definition at line 95 of file wavefront.hh.

Referenced by Gcn3ISA::Inst_SMEM__S_LOAD_DWORD::execute(), Gcn3ISA::Inst_SMEM__S_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_SMEM__S_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_SMEM__S_LOAD_DWORDX8::execute(), Gcn3ISA::Inst_SMEM__S_LOAD_DWORDX16::execute(), Gcn3ISA::Inst_SMEM__S_BUFFER_LOAD_DWORD::execute(), Gcn3ISA::Inst_SMEM__S_BUFFER_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_SMEM__S_BUFFER_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_SMEM__S_BUFFER_LOAD_DWORDX8::execute(), Gcn3ISA::Inst_SMEM__S_BUFFER_LOAD_DWORDX16::execute(), Gcn3ISA::Inst_SMEM__S_STORE_DWORD::execute(), Gcn3ISA::Inst_SMEM__S_STORE_DWORDX2::execute(), Gcn3ISA::Inst_SMEM__S_STORE_DWORDX4::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B8::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B16::execute(), Gcn3ISA::Inst_DS__DS_READ_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_READ_U8::execute(), Gcn3ISA::Inst_DS__DS_READ_U16::execute(), Gcn3ISA::Inst_DS__DS_SWIZZLE_B32::execute(), Gcn3ISA::Inst_DS__DS_PERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_BPERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B64::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B64::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_UBYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_USHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_BYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_SHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1_VOL::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_UBYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_USHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_BYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_SHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC_X2::execute(), freeResources(), reserveGmResource(), reserveLmResource(), reserveResources(), GPUDynInst::resolveFlatSegment(), and Wavefront().

◆ expInstsIssued

int Wavefront::expInstsIssued
private

Definition at line 354 of file wavefront.hh.

Referenced by decExpInstsIssued(), incExpInstsIssued(), and waitCntsSatisfied().

◆ expWaitCnt

int Wavefront::expWaitCnt
private

Definition at line 351 of file wavefront.hh.

Referenced by clearWaitCnts(), setWaitCnts(), and waitCntsSatisfied().

◆ flatGmUnitId

int Wavefront::flatGmUnitId

Definition at line 97 of file wavefront.hh.

Referenced by reserveResources(), and GPUDynInst::resolveFlatSegment().

◆ flatLmUnitId

int Wavefront::flatLmUnitId

Definition at line 96 of file wavefront.hh.

Referenced by reserveResources(), and GPUDynInst::resolveFlatSegment().

◆ globalMem

int Wavefront::globalMem

Definition at line 119 of file wavefront.hh.

Referenced by init(), reserveGmResource(), and ScheduleStage::reserveResources().

◆ gridSz

uint32_t Wavefront::gridSz[3]

Definition at line 151 of file wavefront.hh.

Referenced by computeActualWgSz().

◆ initMask

VectorMask Wavefront::initMask

Definition at line 211 of file wavefront.hh.

◆ instructionBuffer

std::deque<GPUDynInstPtr> Wavefront::instructionBuffer

◆ kernId

int Wavefront::kernId

◆ lastAddr

std::vector<Addr> Wavefront::lastAddr

Definition at line 145 of file wavefront.hh.

Referenced by Wavefront().

◆ lastInstExec

uint64_t Wavefront::lastInstExec

Definition at line 254 of file wavefront.hh.

Referenced by exec(), Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), and Wavefront().

◆ lastNonIdleTick

Tick Wavefront::lastNonIdleTick

Definition at line 107 of file wavefront.hh.

Referenced by setStatus(), and Wavefront().

◆ lastTrace

uint64_t Wavefront::lastTrace

Definition at line 184 of file wavefront.hh.

Referenced by Wavefront().

◆ ldsChunk

LdsChunk* Wavefront::ldsChunk

◆ lgkmInstsIssued

int Wavefront::lgkmInstsIssued
private

Definition at line 355 of file wavefront.hh.

Referenced by decLGKMInstsIssued(), incLGKMInstsIssued(), and waitCntsSatisfied().

◆ lgkmWaitCnt

int Wavefront::lgkmWaitCnt
private

Definition at line 352 of file wavefront.hh.

Referenced by clearWaitCnts(), setWaitCnts(), and waitCntsSatisfied().

◆ localMem

int Wavefront::localMem

◆ maxDynWaveId

uint32_t Wavefront::maxDynWaveId

Definition at line 160 of file wavefront.hh.

◆ maxIbSize

int Wavefront::maxIbSize

Definition at line 100 of file wavefront.hh.

Referenced by FetchUnit::FetchBufDesc::allocateBuf().

◆ maxSgprs

uint32_t Wavefront::maxSgprs

Definition at line 126 of file wavefront.hh.

Referenced by resizeRegFiles(), GPUDynInst::resolveFlatSegment(), and Wavefront().

◆ maxVgprs

uint32_t Wavefront::maxVgprs

Definition at line 124 of file wavefront.hh.

Referenced by freeRegisterFile(), resizeRegFiles(), start(), and Wavefront().

◆ memTraceBusy

int Wavefront::memTraceBusy

Definition at line 183 of file wavefront.hh.

Referenced by Wavefront().

◆ numInstrExecuted

Stats::Scalar Wavefront::numInstrExecuted

Definition at line 224 of file wavefront.hh.

Referenced by exec(), and regStats().

◆ numTimesBlockedDueRAWDependencies

Stats::Scalar Wavefront::numTimesBlockedDueRAWDependencies

Definition at line 251 of file wavefront.hh.

Referenced by regStats().

◆ numTimesBlockedDueWAXDependencies

Stats::Scalar Wavefront::numTimesBlockedDueWAXDependencies

Definition at line 248 of file wavefront.hh.

Referenced by regStats().

◆ oldDgpr

std::vector<uint64_t> Wavefront::oldDgpr

Definition at line 204 of file wavefront.hh.

Referenced by Wavefront().

◆ oldDgprId

uint32_t Wavefront::oldDgprId

Definition at line 206 of file wavefront.hh.

◆ oldDgprTcnt

uint64_t Wavefront::oldDgprTcnt

Definition at line 208 of file wavefront.hh.

Referenced by Wavefront().

◆ oldVgpr

std::vector<uint32_t> Wavefront::oldVgpr

Definition at line 197 of file wavefront.hh.

Referenced by Wavefront().

◆ oldVgprId

uint32_t Wavefront::oldVgprId

Definition at line 199 of file wavefront.hh.

◆ oldVgprTcnt

uint64_t Wavefront::oldVgprTcnt

Definition at line 201 of file wavefront.hh.

Referenced by Wavefront().

◆ outstandingReqs

int Wavefront::outstandingReqs

Definition at line 163 of file wavefront.hh.

Referenced by Gcn3ISA::Inst_DS__DS_WRITE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B8::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B16::execute(), Gcn3ISA::Inst_DS__DS_READ_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_READ_U8::execute(), Gcn3ISA::Inst_DS__DS_READ_U16::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B64::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B64::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_UBYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_USHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_BYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_SHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX4::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_WBINVL1_VOL::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX2::execute(), validateRequestCounters(), and Wavefront().

◆ outstandingReqsRdGm

int Wavefront::outstandingReqsRdGm

◆ outstandingReqsRdLm

int Wavefront::outstandingReqsRdLm

◆ outstandingReqsWrGm

int Wavefront::outstandingReqsWrGm

◆ outstandingReqsWrLm

int Wavefront::outstandingReqsWrLm

◆ pendingFetch

bool Wavefront::pendingFetch

◆ rawDist

std::unordered_map<int,uint64_t> Wavefront::rawDist

Definition at line 261 of file wavefront.hh.

Referenced by exec(), Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), initRegState(), and Wavefront().

◆ rdGmReqsInPipe

int Wavefront::rdGmReqsInPipe

◆ rdLmReqsInPipe

int Wavefront::rdLmReqsInPipe

Definition at line 176 of file wavefront.hh.

Referenced by ScheduleStage::dispatchReady(), Gcn3ISA::Inst_DS__DS_READ_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2_B32::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_READ_U8::execute(), Gcn3ISA::Inst_DS__DS_READ_U16::execute(), Gcn3ISA::Inst_DS__DS_SWIZZLE_B32::execute(), Gcn3ISA::Inst_DS__DS_PERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_BPERMUTE_B32::execute(), Gcn3ISA::Inst_DS__DS_READ_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2_B64::execute(), Gcn3ISA::Inst_DS__DS_READ2ST64_B64::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_UBYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_USHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_UBYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_USHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC_X2::execute(), reserveLmResource(), GPUDynInst::resolveFlatSegment(), validateRequestCounters(), and Wavefront().

◆ readsPerWrite

Stats::Distribution Wavefront::readsPerWrite

Definition at line 265 of file wavefront.hh.

Referenced by exec(), Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), and regStats().

◆ reservedScalarRegs

int Wavefront::reservedScalarRegs

◆ reservedVectorRegs

int Wavefront::reservedVectorRegs

Definition at line 186 of file wavefront.hh.

Referenced by freeRegisterFile(), init(), and Wavefront().

◆ scalarAlu

int Wavefront::scalarAlu

Definition at line 114 of file wavefront.hh.

Referenced by ScheduleStage::dispatchReady(), exec(), and init().

◆ scalarAluGlobalIdx

int Wavefront::scalarAluGlobalIdx

Definition at line 118 of file wavefront.hh.

Referenced by init(), and reserveResources().

◆ scalarMem

int Wavefront::scalarMem

Definition at line 121 of file wavefront.hh.

Referenced by init(), and reserveGmResource().

◆ scalarOutstandingReqsRdGm

int Wavefront::scalarOutstandingReqsRdGm

◆ scalarOutstandingReqsWrGm

int Wavefront::scalarOutstandingReqsWrGm

◆ scalarRdGmReqsInPipe

int Wavefront::scalarRdGmReqsInPipe

◆ scalarWrGmReqsInPipe

int Wavefront::scalarWrGmReqsInPipe

◆ schCycles

Stats::Scalar Wavefront::schCycles

◆ schLdsArbStalls

Stats::Scalar Wavefront::schLdsArbStalls

Definition at line 244 of file wavefront.hh.

Referenced by regStats().

◆ schOpdNrdyStalls

Stats::Scalar Wavefront::schOpdNrdyStalls

Definition at line 240 of file wavefront.hh.

Referenced by ScheduleStage::checkRfOperandReadComplete(), and regStats().

◆ schResourceStalls

Stats::Scalar Wavefront::schResourceStalls

Definition at line 238 of file wavefront.hh.

Referenced by regStats().

◆ schRfAccessStalls

Stats::Scalar Wavefront::schRfAccessStalls

◆ schStalls

Stats::Scalar Wavefront::schStalls

◆ simdId

const int Wavefront::simdId

Definition at line 92 of file wavefront.hh.

Referenced by ScheduleStage::addToSchList(), ScheduleStage::checkRfOperandReadComplete(), discardFetch(), ScheduleStage::dispatchReady(), ExecStage::dumpDispList(), ScoreboardCheckStage::exec(), ExecStage::exec(), exec(), Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), Gcn3ISA::Inst_SOPP__S_BARRIER::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_UBYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_USHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_LOAD_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC_X2::execute(), FetchUnit::fetch(), FetchStage::fetch(), freeRegisterFile(), GPUDynInst::GPUDynInst(), FetchUnit::initiateFetch(), initRegState(), VectorRegisterFile::printReg(), ScalarRegisterFile::printReg(), FetchStage::processFetchReturn(), FetchUnit::processFetchReturn(), Gcn3ISA::VecOperand< DataType, Const, NumDwords >::read(), Gcn3ISA::ScalarOperand< DataType, Const, sizeof(DataType)/sizeof(VecElemU32) >::read(), ScheduleStage::reserveResources(), reserveResources(), ScheduleStage::schedRfWrites(), validateRequestCounters(), Gcn3ISA::VecOperand< DataType, Const, NumDwords >::write(), and Gcn3ISA::ScalarOperand< DataType, Const, sizeof(DataType)/sizeof(VecElemU32) >::write().

◆ startSgprIndex

uint32_t Wavefront::startSgprIndex

Definition at line 194 of file wavefront.hh.

Referenced by init(), and Wavefront().

◆ startVgprIndex

uint32_t Wavefront::startVgprIndex

Definition at line 191 of file wavefront.hh.

Referenced by freeRegisterFile(), init(), and Wavefront().

◆ status

status_e Wavefront::status
private

◆ vecRawDistance

Stats::Distribution Wavefront::vecRawDistance

Definition at line 258 of file wavefront.hh.

Referenced by exec(), and regStats().

◆ vecReads

std::vector<int> Wavefront::vecReads

Definition at line 268 of file wavefront.hh.

Referenced by exec(), Gcn3ISA::Inst_SOPP__S_ENDPGM::execute(), start(), and Wavefront().

◆ vmemInstsIssued

int Wavefront::vmemInstsIssued
private

Definition at line 353 of file wavefront.hh.

Referenced by decVMemInstsIssued(), incVMemInstsIssued(), and waitCntsSatisfied().

◆ vmWaitCnt

int Wavefront::vmWaitCnt
private

the following are used for waitcnt instructions vmWaitCnt: once set, we wait for the oustanding number of vector mem instructions to be at, or below vmWaitCnt.

expWaitCnt: once set, we wait for the outstanding number outstanding VM writes or EXP insts to be at, or below expWaitCnt.

lgkmWaitCnt: once set, we wait for the oustanding number of LDS, GDS, scalar memory, and message instructions to be at, or below lgkmCount. we currently do not support GDS/message ops.

Definition at line 350 of file wavefront.hh.

Referenced by clearWaitCnts(), setWaitCnts(), and waitCntsSatisfied().

◆ wfDynId

uint64_t Wavefront::wfDynId

◆ wfId

uint32_t Wavefront::wfId

Definition at line 159 of file wavefront.hh.

Referenced by initRegState().

◆ wfSlotId

const int Wavefront::wfSlotId

◆ wgId

uint32_t Wavefront::wgId

◆ wgSz

uint32_t Wavefront::wgSz

Definition at line 153 of file wavefront.hh.

Referenced by initRegState().

◆ workGroupId

uint32_t Wavefront::workGroupId[3]

Definition at line 149 of file wavefront.hh.

Referenced by initRegState().

◆ workGroupSz

uint32_t Wavefront::workGroupSz[3]

Definition at line 150 of file wavefront.hh.

Referenced by computeActualWgSz().

◆ workItemFlatId

std::vector<uint32_t> Wavefront::workItemFlatId

Definition at line 147 of file wavefront.hh.

Referenced by Wavefront().

◆ workItemId

std::vector<uint32_t> Wavefront::workItemId[3]

Definition at line 146 of file wavefront.hh.

Referenced by initRegState(), and Wavefront().

◆ wrGmReqsInPipe

int Wavefront::wrGmReqsInPipe

◆ wrLmReqsInPipe

int Wavefront::wrLmReqsInPipe

Definition at line 178 of file wavefront.hh.

Referenced by ScheduleStage::dispatchReady(), Gcn3ISA::Inst_DS__DS_WRITE_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE2ST64_B32::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B8::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B16::execute(), Gcn3ISA::Inst_DS__DS_WRITE_B64::execute(), Gcn3ISA::Inst_DS__DS_WRITE2_B64::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_BYTE::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_SHORT::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORD::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX2::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX3::execute(), Gcn3ISA::Inst_MUBUF__BUFFER_STORE_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_BYTE::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_SHORT::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORD::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX2::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX3::execute(), Gcn3ISA::Inst_FLAT__FLAT_STORE_DWORDX4::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_CMPSWAP_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_ADD_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_SUB_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_INC_X2::execute(), Gcn3ISA::Inst_FLAT__FLAT_ATOMIC_DEC_X2::execute(), reserveLmResource(), GPUDynInst::resolveFlatSegment(), validateRequestCounters(), and Wavefront().


The documentation for this class was generated from the following files:

Generated on Wed Sep 30 2020 14:02:34 for gem5 by doxygen 1.8.17