nod-ai / sharktank

SHARK Inference Modeling and Serving
Apache License 2.0
7 stars 9 forks source link

[punet] Evaluate rescale efficiency #75

Open stellaraccident opened 3 weeks ago

stellaraccident commented 3 weeks ago

Currently for the punet model, we are doing scale math on all layers in f32 because some are sensitive to precision. We should do an experiment to see if there is an overall benefit to switching to lower precision or a different way of doing this. If so, we can spend some time on the simulator attempting to make it work.