Open qiyuangong opened 7 months ago
Currently the problem is found in Qwen-7b-chat and Qwen-14b-chat, while it works well on Chatglm2-6b and Llama2-7b. Looks like Qwen depends some other packages. The problem can be solved by running pip install transformers_stream_generator einops tiktoken
.
Transform int4 cannot find
transformers_stream_generator einops tiktoken