mozilla-ai / lm-buddy

Your buddy in the (L)LM space.
Apache License 2.0
64 stars 3 forks source link

Adding information about evaluating quantized models with GPUs #65

Closed veekaybee closed 7 months ago

veekaybee commented 7 months ago

What's changing

When you run Ray evaluation job on a quantized model, it requires GPUs, per the bitsandbytes quantization environment validation checker.

We can make this clearer in the documentation for example evaluation runs.

How to test it

Check documenation

Related Jira Ticket

Addressing this issue: https://github.com/mozilla-ai/lm-buddy/issues/50

Additional notes for reviewers

Also added a config file for a quantized HF model eval run.