-
Hi! I would like to use QLoRA to "pretrain" a model and wanted to ask whether that is possible. Around the time QLoRA was released, I heard that a 'raw' mode did not exist yet.
For example, l…
-
Hello,
Thanks so much for providing such a resource so that we can all leverage the latest developments in AI on different platforms.
I was able to use your example to run a job to fine-tune Llama-…
-
Thank you for building this - very interested in trying it. In my hands, when I try to export the model to a .bin I get the following error - is this something simple / user error?
(MacOS Ventura …
-
It seems that the AutoGPTQ quantization module is not able to access the CUDA extension.
The previous ingest problem is solved by
pip install git+https://github.com/Keith-Hon/bitsandbytes-windows.g…
-
Would that be relevant to the scope of this project? Adding a couple of kinds of task examples could improve its general capabilities, for instance:
Longer responses
GPT-4 Generated Response…
-
Iterate on the SFT-8 dataset mixes to create pretraining and final SFT mixes for SFT-9. This requires investigating the quality and usefulness of the datasets. Community input welcome below. See the `…
-
Hello, I have a few questions about OctoCoder.
For this part in the paper:
> For instruction tuning our models, we select 5,000 random samples from COMMITPACKFT across the 6 programming languages…
-
# Expected Behavior
After clicking "AutoGenerate Memory", the full summary should be saved in memory
# Current Behavior
After clicking "AutoGenerate Memory", only "[Summary: Certainly!]" is s…
-
### Question
When I run the command below, I get a size mismatch.
CUDA_VISIBLE_DEVICES=1 python -m llava.serve.cli --model-path liuhaotian/llava-v1.5-13b-lora --model-base liuhaotian/**vicuna-13b-v1.5** --image…
-
This issue now tracks the implementation of various evaluation methods and workflows for LLMs.
Evaluations:
- [x] G-Eval
- [ ] PingPong
- [ ] InfiniteBench
- [ ] Ruler
- [ ] MMLU
- [ ] M…