-
- This issue focuses on the technical courses we take about LLM, we'll put the paper part in
https://github.com/xp1632/DFKI_working_log/issues/70
---
1. **ChainForge** https://chainforge.ai/ …
-
Is it possible to finetune VILA through hugging face with a custom image dataset? I don't see any documentation about this.
-
Related issues:
- #6
- #25
- #47
- #80
- #84
- #99
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
[2024-09-17 10:58:53,418] [INFO] [real_accelerator.py:203:get_accelerator] Setting ds_accelerator to cuda…
-
**Describe the bug**
I am getting the following error while attempting to run deepspeed-chat step 3 with the actor model CarperAI/openai_summarize_tldr_sft (gpt-j 6B) and critic model CarperAI/openai…
-
Hi @jackboyla,
Thanks for your work!
I was wondering if you plan to extend the dataset to other languages (French, Spanish, German, for example), to build a multilingual models.
Regards,
…
-
### ❓ The question
Hi, I am wondering if you can provide your config file for finetuning on the Tulu V2 dataset? It would be helpful for reproducing the finetuning results. In addition, have you tr…
-
Great work. I guess the sft dataset can affect the performance of model. Do you make a supervised finetuning using your high-quality data on Falcon-40B? Thanks a lot.
-
I would like to ask how I can train to achieve better results. I used unpaired images for training, but the results were not very good. Could you advise on how to improve the training and which models…
-
There are three steps in the entire pipeline. But the help messages when things go wrong can be hard to interpret.
For all steps, it suggests me to use --gradient_checkpointing to resolve the out …