Tencent / TPAT

TensorRT Plugin Autogen Tool
Apache License 2.0

Output of the one-hot plugin is wrong #24

Open wugoukanle opened 1 year ago

wugoukanle commented 1 year ago

For one-hot, the TensorFlow result and the TensorRT result do not match: the TensorRT result is all zeros.

wugoukanle commented 1 year ago

It took me a long time to locate the error. I finally found that the `-arch=sm_75` setting in the Makefile does not fit my device; `-arch=sm_70` is the correct value for a V100.
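For anyone hitting the same mismatch, here is a minimal sketch of deriving the `-arch` value from the GPU's compute capability instead of hard-coding it. The helper names `arch_flag_from_compute_cap` and `query_compute_cap` are hypothetical and not part of TPAT. The `compute_cap` query field of `nvidia-smi` is assumed to be available, which is only the case with reasonably recent drivers.

```python
import subprocess


def arch_flag_from_compute_cap(compute_cap: str) -> str:
    """Turn a compute capability string such as "7.0" into "-arch=sm_70"."""
    major, minor = compute_cap.strip().split(".")
    return f"-arch=sm_{int(major)}{int(minor)}"


def query_compute_cap(gpu_index: int = 0) -> str:
    """Ask nvidia-smi for the compute capability of one GPU.

    Assumption: the installed driver supports the `compute_cap` query field.
    Older drivers do not, in which case the value has to be looked up manually.
    """
    out = subprocess.check_output(
        [
            "nvidia-smi",
            f"--id={gpu_index}",
            "--query-gpu=compute_cap",
            "--format=csv,noheader",
        ],
        text=True,
    )
    return out.strip().splitlines()[0]
```

On a machine with a GPU, `arch_flag_from_compute_cap(query_compute_cap())` gives the flag to put in the Makefile. A V100 reports compute capability 7.0 and a T4 reports 7.5, which matches the `sm_70` versus `sm_75` difference described above.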

buptqq commented 1 year ago

This is our oversight. We will add a configuration option for 'sm_xx' to the Makefile. Thank you for your feedback.