-
训练脚本
#!/bin/bash
export CUDA_DEVICE_MAX_CONNECTIONS=1
DIR=`pwd`
export MODEL="/workspace/model_weight/internlm-xcomposer2-vl-7b"
export DATA="data.txt"
GPUS_PER_NODE=8
NNODES=1
NODE_RANK=0…
-
hello.
for something like learning to rank models, having the ability to score the vectors with a custom function (namely a custom ml model) is crucial.
with this ability, the model can be trained o…
-
Hi, I use grad_cache to train my model, but it seems very slow, I want to konw is this normal?
Does using grad cache generally affect the training speed?
-
### Describe the bug
when I run the script train_dreambooth_lora_flux.py. It raise ValueError: unexpected save model: . something bug in save_model_hook?
![Uploading image.png…]()
### Reproducti…
-
### Root Cause
The root cause is due to recent transformers update [to resolve high CPU usage for large quantized models](https://github.com/huggingface/transformers/pull/33154).
- what the PR…
-
## question
I would like to check the excellence of MiniCPM-V and try fine tuning. However, I have a question because the computer resource usage is strange during fine tuning.
In particular, GPU ut…
-
### This issue is for a: (mark with an `x`)
```
- [x] bug report -> please search issues before submitting
- [ ] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior …
-
# Key Objectives Table
Skill | Mastered (Y/N) | Rank (1-4) | Ratio | Notes
-- | -- | -- | -- | --
Laptop Verification or Cloud Workspace | [Y] | 0 | 0.0 | verification by Linux commands notebook
…
-
### Is your feature request related to a problem? Please describe.
As pointed out recently in other places, some (me) might say our focus system is horribly out of whack, and a way to improve our s…
-
This is an excellent piece of work, but I am unable to reproduce the results presented in the paper. After three days of communication with the author and repeatedly modifying the code, I still cannot…