unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
https://unsloth.ai
Apache License 2.0

Example with VLLM #755

Open vjagannath786 opened 1 month ago

vjagannath786 commented 1 month ago

It would be nice to have some inference examples, for instance with vLLM, so that we know exactly how model inference works.

AnthonyDipilato commented 1 month ago

Inference in vLLM is the same as for any other model; they have examples in their documentation: https://docs.vllm.ai/en/latest/getting_started/quickstart.html
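For reference, a minimal sketch of offline batched generation along the lines of the vLLM quickstart. It assumes the fine-tuned model has already been exported to a directory in Hugging Face format; the path `./merged_model`, the prompt template, and both helper names are hypothetical and only for illustration.

```python
def build_prompts(questions):
    """Wrap raw questions in a simple instruction template (hypothetical format)."""
    return [f"### Question:\n{q}\n\n### Answer:\n" for q in questions]


def generate_with_vllm(model_path, prompts, max_tokens=64):
    """Run batched generation with vLLM and return one completion per prompt."""
    # Imported lazily so this module still loads on machines without vLLM.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model_path)  # local directory or Hugging Face model ID
    params = SamplingParams(temperature=0.0, max_tokens=max_tokens)  # greedy decoding
    outputs = llm.generate(prompts, params)
    # Each result holds a list of candidate completions; keep the first one's text.
    return [out.outputs[0].text for out in outputs]


# Usage (needs a GPU and `pip install vllm`; the model path is an assumption):
# prompts = build_prompts(["What is the capital of France?"])
# completions = generate_with_vllm("./merged_model", prompts)
```

The prompt template should match whatever format was used during fine-tuning; the one above is only a placeholder.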

danielhanchen commented 1 month ago

Yep vLLM has some cool docs - would adding it into an Unsloth notebook be helpful?

vjagannath786 commented 1 month ago

> Yep vLLM has some cool docs - would adding it into an Unsloth notebook be helpful?

Yes, it would be helpful. I was trying to solve classification tasks that output logits, and was experimenting with this notebook: https://github.com/timothelaborie/text_classification_scripts/blob/main/unsloth_classification.ipynb. It wasn't working, and I had to do fast inference with vLLM, so that is where I am stuck.

I would be glad if some examples were added for vLLM and classification tasks.
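For the classification case, one possible approach (not taken from the linked notebook) is to generate a single token and compare the log-probabilities of the candidate label tokens. The sketch below covers only the post-processing step: it assumes a dictionary mapping each label to its log-probability has already been collected, and the function names are hypothetical.

```python
import math


def label_probabilities(label_logprobs):
    """Renormalize log-probabilities over the candidate labels only (softmax)."""
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(label_logprobs.values())
    exps = {label: math.exp(lp - top) for label, lp in label_logprobs.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


def predict_label(label_logprobs):
    """Return the label with the highest renormalized probability."""
    probs = label_probabilities(label_logprobs)
    return max(probs, key=probs.get)


# Example input: log-probabilities of two label tokens at the first generated position.
scores = {"positive": math.log(0.6), "negative": math.log(0.2)}
probs = label_probabilities(scores)
prediction = predict_label(scores)
```

With vLLM, such values can likely be obtained by setting `logprobs` and `max_tokens=1` in `SamplingParams` and reading the first generated position; the exact structure of the returned log-probabilities differs between vLLM versions, so check the documentation for the installed release.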

danielhanchen commented 1 month ago

OK, will try to add some vLLM examples!