Llama3.2 1B/3B Model BF16 dtype
A large number of the following logcat logs will appear during the session, and the output is truncated or not displayed
E 00:00:02.486051 executorch:tensor_util.h:482] Check failed (t.scalar_type() == dtype): Expected to find Half type, but tensor has type BFloat16E 00:00:02.496141 executorch:tensor_util.h:482] Check failed (t.scalar_type() == dtype): Expected to find Float type, but tensor has type BFloat16
Llama3.2 1B/3B SpinQuant Model FP32 dtype
Conversations and emoji can be output completely, without truncation or non-display, and logcat does not show the above errors
🐛 Describe the bug
After https://github.com/pytorch/executorch/issues/6284#issuecomment-2423431020 patch that original UTF-8 invalid character issue had fixed,there is a new issue in tensor type support.
Llama3.2 1B/3B Model BF16 dtype A large number of the following logcat logs will appear during the session, and the output is truncated or not displayed
E 00:00:02.486051 executorch:tensor_util.h:482] Check failed (t.scalar_type() == dtype): Expected to find Half type, but tensor has type BFloat16
E 00:00:02.496141 executorch:tensor_util.h:482] Check failed (t.scalar_type() == dtype): Expected to find Float type, but tensor has type BFloat16
Llama3.2 1B/3B SpinQuant Model FP32 dtype Conversations and emoji can be output completely, without truncation or non-display, and logcat does not show the above errors
Versions
master version