-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
None
### Reproduction
None
### Expected behavior
None
### Others
None
-
### System Info
```transformers >= 4.43.0```
### Who can help?
@zucchini-nlp
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An …
-
### Describe the issue
Hi, when I use my own dataset, roughly 50w data, DDP training with 8 A100 80G, the training hangs and gives the following error:
```
[E ProcessGroupNCCL.cpp:828] [Rank 1] Wat…
-
Hi!
I tried to combine the inference instruction you provided and follow the inference code from the hf tutorials in
https://colab.research.google.com/drive/1dTdro-k7NFqRgGq5-TlGHM-6k2sYQhXp#scrollT…
-
I tune llava-next ckpt on my custom dataset, the loss is normal for the first 20 iters, but becomes nan from the 30th iter. I have trained the same data in the official llava code and did not encounte…
-
conda activate swift
CUDA_VISIBLE_DEVICES=0,1,2,3 swift sft --model_type llava1_6-mistral-7b-instruct --dataset dataset/abc.jsonl \
Command that i am using, dont know whats wrong in it
…
-
### System Info
- `transformers` version: 4.39.1
- Platform: Linux-5.19.0-051900rc6-generic-x86_64-with-glibc2.35
- Python version: 3.9.18
- Huggingface_hub version: 0.21.1
- Safetensors version:…
-
I don't see the part of RAG-ICL during reasoning, is this part of the code not yet available.
-
### System Info
text-generation-launcher --env
```
Runtime environment:
Target: x86_64-unknown-linux-gnu
Cargo version: 1.75.0
Commit sha: 2d0a7173d4891e7cd5f9b77f8e0987b82a339e51
Docker label:…
-
Subscribe to this issue and stay notified about new [daily trending repos in Python](https://github.com/trending/python?since=daily)!