Closed Davido111200 closed 8 months ago
hi @Davido111200 , I checked the latest version (v0.3.8) and found it works. Could you send me your inference code?
from angle_emb import AnglE
model = AnglE.from_pretrained('NousResearch/Llama-2-7b-hf', pretrained_lora_path'SeanLee97/angle-llama-7b-nli-v2').cuda()
model.encode({'text': 'hello world'})
Hi @SeanLee97 . Thanks for your reply. My inference code looks almost exactly like yours, but I did not transfer the model to CUDA. It seems to me that is the only problem.
Hi Sean,
Thanks for the amazing work! I notice that there might be a small bug in newer versions of the code resulting in a device error when using angle-llama to get embeddings. I downgrade the version to 0.3.0 and the problem disappears.
To reproduce the error, simply execute the code given in Angle-llama instructions
Could you take a quick look at the problem? Thanks !