pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch
https://pytorch.org/executorch/
Other
2.24k stars 374 forks source link

Android: Tensor type is not very friendly to BFloat16 #6571

Open JamePeng opened 1 month ago

JamePeng commented 1 month ago

🐛 Describe the bug

After https://github.com/pytorch/executorch/issues/6284#issuecomment-2423431020 patch that original UTF-8 invalid character issue had fixed,there is a new issue in tensor type support.

Llama3.2 1B/3B Model BF16 dtype A large number of the following logcat logs will appear during the session, and the output is truncated or not displayed E 00:00:02.486051 executorch:tensor_util.h:482] Check failed (t.scalar_type() == dtype): Expected to find Half type, but tensor has type BFloat16 E 00:00:02.496141 executorch:tensor_util.h:482] Check failed (t.scalar_type() == dtype): Expected to find Float type, but tensor has type BFloat16

Llama3.2 1B/3B SpinQuant Model FP32 dtype Conversations and emoji can be output completely, without truncation or non-display, and logcat does not show the above errors

Versions

master version

JacobSzwejbka commented 1 month ago

cc @kirklandsign