-
## Description:
In AutoGluon's multimodal framework, Distributed Data Parallel (DDP) is the primary strategy employed for leveraging multiple GPUs across most problem types. A known limitation of D…
-
Hello, I would like to fine-tune or train TinyChart to improve its summarization skills. I have the impression that it doesn't capture all the data during summarization, whereas it does during data co…
-
Hi,
I'm studying CrewAI. I tried to create a crew that writes documentation for some code, and when I tried the new training feature I got this error:
Traceback (most recent call last):
File "/home/well…
-
Will it be possible in the future for you to coordinate with the [`torchtune`](https://pytorch.org/blog/torchtune-fine-tune-llms/) project so that we are able to use A for xyz and B for ikj?
We've …
-
When using ZeRO-2, the gradients hold roughly 9B parameters during the gradient update, far more than the 7B model size.
```
╭───────────────────── Traceback (most recent call last) ──────────────────────╮
│ /export/App/training_platform/PinoModel/applications/VisualGLM/vis…
-
### System Info
Name: trl
Version: 0.13.0.dev0
Name: transformers
Version: 4.46.2
Python 3.11.10
### Information
- [X] The official example scripts
- [X] My own modified scripts
### Tasks
- […
-
### Describe the feature
PPO training needs to keep four models in memory at the same time. The original implementation keeps the reward, actor, critic, and initial models in VRAM simultaneously.
…
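To put a rough number on the memory pressure described above, here is a minimal back-of-the-envelope sketch. The 7B parameter count and fp16 storage are assumptions picked for illustration, not figures from this issue, and `ppo_weight_memory_gib` is a hypothetical helper. It counts weights only and ignores gradients, optimizer state, and activations.

```python
def ppo_weight_memory_gib(params_per_model: int,
                          bytes_per_param: int = 2,
                          n_models: int = 4) -> float:
    """Weights-only memory in GiB for n_models model copies resident at once."""
    return n_models * params_per_model * bytes_per_param / 2**30


# Assumed for illustration: four 7B-parameter models stored in fp16
# (2 bytes per parameter).
all_resident = ppo_weight_memory_gib(7_000_000_000)
# Same models, but only one kept in VRAM at any moment (the rest offloaded).
one_resident = ppo_weight_memory_gib(7_000_000_000, n_models=1)

print(f"all four resident: {all_resident:.1f} GiB")
print(f"one at a time:     {one_resident:.1f} GiB")
```

Under these assumptions, keeping only the model that is currently in use on the GPU cuts the weight footprint to a quarter, at the cost of host-to-device transfers each time the active model changes.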
-
- [ ] [blog/starcoder2.md at main · huggingface/blog](https://github.com/huggingface/blog/blob/main/starcoder2.md?plain=1)
# blog/starcoder2.md at main · huggingface/blog
---
## StarCoder…
-
### 🐛 Describe the bug
Hello,
I tried to run the example for the new G-Retriever model in Colab: https://github.com/pyg-team/pytorch_geometric/blob/master/examples/llm/g_retriever.py.
…
-
In the demos I’ve seen of Leon AI, it appeared rather slow. I have no idea whether this was a hardware limitation or whether there were inefficiencies that could be improved. [GPT4All](https://github.c…