Closed YQ-Wang closed 1 year ago
I was able to unblock myself by taking reference from this commit: https://github.com/ray-project/aviary/commit/2569f2d9e50ee27e4828d205ac9d75820c738da9#diff-aad8e2d49d52b24d24c8cdb142d7088c240998b37187401f0ea7fec0791ccd90. Close this issue for now. Thanks!
I tried to serve
TheBloke--Llama-2-70B-chat-GPTQ
model with Aviary 0.2.0 and have the following error:RuntimeError: weight model.layers.0.self_attn.q_proj.weight does not exist
. This seems to be an issue https://github.com/huggingface/text-generation-inference/issues/500 with TGI.I noticed that the issue in Aviary to run TheBloke--Llama-2-70B-chat-GPTQ model is marked as resolved. Can you share any suggestions?
cc @Yard1