[bugfix] upcast input tensor to fp64 for cpu reference

FlagOpen / FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.

Apache License 2.0

Closed by StrongSpoon 1 month ago

StrongSpoon commented 1 month ago

OP Test

Bug Fix

Upcast the input tensor to fp64 for the CPU reference, so that the weight_norm and upsample_bicubic2d_aa fp16 tests can run in CPU mode.
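A minimal sketch of the idea, for illustration only. The helper name `to_reference`, its `to_cpu` parameter, and the norm used as the reference op are assumptions made for this example and are not taken from the actual diff. The fp16 input is moved to the CPU and upcast to fp64 before the reference op runs, then the result is cast back for comparison.

```python
import torch


def to_reference(inp, upcast=False, to_cpu=True):
    """Hypothetical helper: prepare an input tensor for the reference op."""
    if inp is None:
        return None
    ref = inp.to("cpu") if to_cpu else inp
    if upcast:
        # Run the reference in fp64 so the CPU path never sees fp16 inputs.
        ref = ref.to(torch.float64)
    return ref


# fp16 test input (created in fp32 first, then cast down)
x = torch.randn(4, 8).to(torch.float16)

# Reference input: on CPU, in fp64
ref_x = to_reference(x, upcast=True)

# Stand-in reference op: a per-row L2 norm computed in fp64
ref_out = torch.linalg.vector_norm(ref_x, dim=1)

# Cast back to the test dtype before comparing against the op under test
ref_out_fp16 = ref_out.to(x.dtype)
```

With `upcast=False` the reference would stay in fp16 on the CPU, which is the case this change avoids for the two tests named above.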

iclementine commented 1 month ago

I wonder why the original upcast value for these tests was False. I thought non-upcasting only applied to some integer-related operations?