-
I tested the latest master branch with the HF trainer. The llama2 example runs correctly, but when I run the qwen example, it hangs when saving to memory.
env:
one node with 8 GPUs
training scripts:
https://github.com…
-
### What happened?
I'm getting an error when starting fine-tuning after executing update.sh.
A code review showed that there is a typo in StableDiffusionBaseDataLoader.py:
```
encode_prompt = Encod…
-
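### The following items must be checked before submitting
- [X] Make sure you are using the latest code from the repository (git pull); some issues have already been resolved and fixed.
- [X] I have read the [project documentation](https://github.com/ymcui/Chinese-Mixtral/wiki) and the [FAQ section](https://github.com/ymcui/Chinese-Mixtral/wiki/常见问题) and have searched the Issues for this problem…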
-
Hi,
I'm trying to run the tutorial, but it fails during `vae.train()` with the message `Trainer.__init__() got an unexpected keyword argument 'use_gpu'`
```
TypeError …
-
I've encountered a bug:
```
stats = ppo_trainer.step(question_tensors, response_tensors, rewards)
File "/opt/conda/lib/python3.10/contextlib.py", line 79, in inner
return func(*args, **kwds)…
-
## 🚀 Feature
Cross-validation is a crucial model validation technique for assessing how a model generalizes to new data.
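
To make the request concrete, here is a minimal sketch of the k-fold index splitting that cross-validation relies on, in plain Python. The helper name `k_fold_indices` and its parameters are hypothetical and not part of any existing API in this project; it only illustrates the idea that each sample is held out for validation exactly once.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_idx, val_idx) index lists for k-fold cross-validation.

    Hypothetical helper for illustration only.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    # Shuffle once with a fixed seed so the folds are reproducible.
    random.Random(seed).shuffle(indices)
    # Spread the remainder so fold sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, val_idx
        start += size


# Example: 10 samples split into 3 folds.
folds = list(k_fold_indices(n_samples=10, k=3))
```

A trainer with built-in support would presumably fit one model per fold on `train_idx` and evaluate it on `val_idx`, then aggregate the per-fold metrics.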
### Motivation
Research papers usually require cross-validation. Fro…
-
**Describe the bug**
> A clear and concise description of what the bug is.
Traceback (most recent call last):
File "run.py", line 122, in main
trainer.train()
File "C:\ProgramData\anacond…
-
### System Info
------- nvidia-smi output ----------
Tesla T4
NVIDIA-SMI 545.23.08
Driver Version: 545.23.08
CUDA Version: 12.3
-------- hostnamectl output --------
U…
-
I am comparing integration methods using the `IntegrateLayers()` function and am running into an error when trying to use the scVI method. The other methods (harmony, rpca and cca) I have tried work f…
-
### 🐛 Describe the bug
I've been trying to implement quantization-aware training on a convnextSmall model. It errors out saying RuntimeError: Failed running call_module self_loss_fn(*(FakeTensor(Fa…