FlagOpen / FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.
Apache License 2.0
347 stars 49 forks

[bugfix] upcast input tensor to fp64 for cpu reference #238

Closed StrongSpoon closed 1 month ago

StrongSpoon commented 1 month ago

PR Category

OP Test

Type of Change

Bug Fix

Description

Upcast the input tensor to fp64 for the CPU reference, so that weight_norm and upsample_bicubic2d_aa can run their fp16 tests in CPU mode.
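A minimal sketch of the idea, not the code from this PR: the helper name `to_cpu_reference` and its `upcast` parameter are hypothetical stand-ins for whatever the test utilities actually use. It assumes the reference path moves the tensor to the CPU and, when requested, widens floating-point inputs to fp64 so the reference op never has to run an fp16 kernel on the CPU. Leaving integer tensors untouched is also an assumption made for this illustration.

```python
import torch


def to_cpu_reference(inp, upcast=False):
    """Hypothetical helper: build the reference-side copy of a test input.

    Moves the tensor to the CPU and, if `upcast` is set, widens
    floating-point data to float64. Integer tensors keep their dtype
    (an assumption of this sketch).
    """
    if inp is None:
        return None
    ref = inp.to("cpu")
    if upcast and ref.is_floating_point():
        ref = ref.to(torch.float64)
    return ref


# fp16 test input; created as fp32 first and then cast down.
x = torch.randn(4, 8).to(torch.float16)

# The reference copy is fp64 on the CPU, so the reference op runs in fp64.
ref_x = to_cpu_reference(x, upcast=True)
ref_out = ref_x.sum(dim=-1)

# Integer inputs pass through with their dtype unchanged.
idx = torch.arange(4)
ref_idx = to_cpu_reference(idx, upcast=True)
```

Widening fp16 to fp64 is lossless, so the reference sees exactly the same input values as the operator under test; only the precision of the reference computation changes.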

Issue

Progress

Performance

iclementine commented 1 month ago

I wonder why the original upcast value for these tests was False? I thought non-upcasting only applied to some integer-related operations.