xinyi-li7 closed this issue 2 years ago
Do you have a simplified version of your code? I am not able to reproduce either of the issues locally.
I am running on r465 and CUDA 11.3.
Hi Zi Yan, I just found it was my own programming error, made when creating the simplified version. Sorry for bothering you.
When I'm operating on an FP64 instruction, I need to read the value in the destination register. Since the registers are 32-bit, I need to pass `reg_num` and `reg_num+1` into the inject function. However, I noticed something wrong in the second register, so I did a naive test.

[...] `nvbit_add_call_arg_reg_val`. [...] `nvbit_add_call_arg_const_val32`, and read them through `nvbit_read_reg` in the inject function, it will print true or false alternately.

Thus, my questions are:

[...] `val_low` should be equal to `val_hi` since they are from the same register; however, in my test, they are different.

Do you know what the problem is here? I would very much appreciate it if you could help me figure out this issue!
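For reference, here is a minimal host-side sketch of how the two 32-bit halves of an FP64 value fit together once both have been read. It assumes the usual register-pair layout, where `reg_num` holds the low 32 bits and `reg_num+1` holds the high 32 bits. The helper names `combine_fp64`, `split_fp64`, and `self_check` are hypothetical and not part of NVBit, and no NVBit calls are shown.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

static_assert(sizeof(double) == sizeof(uint64_t), "FP64 must be 64 bits wide");

// Rebuild an FP64 value from two 32-bit register values.
// Assumption: val_low comes from reg_num, val_hi from reg_num+1.
double combine_fp64(uint32_t val_low, uint32_t val_hi) {
    uint64_t bits = (static_cast<uint64_t>(val_hi) << 32) | val_low;
    double out;
    std::memcpy(&out, &bits, sizeof(out));  // bit copy, no numeric conversion
    return out;
}

// Inverse helper: split an FP64 value into its low and high 32-bit words.
void split_fp64(double value, uint32_t &val_low, uint32_t &val_hi) {
    uint64_t bits;
    std::memcpy(&bits, &value, sizeof(bits));
    val_low = static_cast<uint32_t>(bits & 0xFFFFFFFFu);
    val_hi = static_cast<uint32_t>(bits >> 32);
}

// Round-trip check: splitting and recombining must give back the same value.
void self_check() {
    uint32_t lo = 0, hi = 0;
    split_fp64(3.141592653589793, lo, hi);
    assert(combine_fp64(lo, hi) == 3.141592653589793);
}
```

With this layout, the two halves of one FP64 value normally differ from each other. For `1.5`, for example, the low word is `0x00000000` and the high word is `0x3FF80000`.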