Open pixelmager opened 5 years ago
V_SQRT_F32 should not be 4 cycles, but 16 - the same as V_RSQ_F32 https://github.com/GPUOpen-Tools/RGA/blob/f3f9539759c22e1a01bf184397ff089e95638d86/RadeonGPUAnalyzerBackend/Emulator/Parser/VOPInstruction.h#L180
Maybe check that the other instructions marked with "// guess" are accurate?
Thanks for reporting this. We need to update these estimates to reflect some architectural changes that were introduced in RDNA. We would do our best to get to it in one of our upcoming releases.
V_SQRT_F32 should not be 4 cycles, but 16 - the same as V_RSQ_F32 https://github.com/GPUOpen-Tools/RGA/blob/f3f9539759c22e1a01bf184397ff089e95638d86/RadeonGPUAnalyzerBackend/Emulator/Parser/VOPInstruction.h#L180
Maybe check that the other instructions marked with "// guess" are accurate?