Open hummingbird2030 opened 7 months ago
I also meet the problem. Is there any solution?
I also meet the problem. And I fixed the problem.
There is an empty tensor in "model.safetensors". I'm not sure why this weight is being saved, but please check it, delete the file, and try running it again. Below is the code I used to check the safetensor:
from safetensors import safe_open
path = "model.safetensors"
tensors = {}
with safe_open(path, framework="pt", device="cpu") as f:
for key in f.keys():
tensors[key] = f.get_tensor(key)
print(tensors)
I also meet the problem. Is there any solution?
I fixed it by deleting the 'model.safetensors'. Hope to help you.
This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!
Your current environment
How would you like to use vllm
I want to run inference of a llama2 which is trained by myself. I use the following code to save the trained model: '''
'''
And then I get following trained model files: When I tried to load the model with
model = LLM(model={my_model_dir}, download_dir={my_model_dir}, dtype=args.dtype, tensor_parallel_size=args.world_size,)
I got the following AssertionError:This is a problem when I use the code of https://github.com/AkariAsai/self-rag. But when I use the model trained by the author of self-rag, this problem didn't appear. Maybe there's something wrong with my method of saving the model. If you can provide me some help, it will help me a lot.