-
Traceback (most recent call last):
File "train.py", line 277, in <module>
batch_loss_n, pred = solver.optimize(index+1,epoch)
File "/home/jayakumar/MSMDFF-NET-main/utils/frame_work_general.py", lin…
-
DeepSpeed version: 0.15.2
Transformers: 4.45.2
Training command: deepspeed GOT/train/train_GOT.py --deepspeed zero_config/zero2.json --model_name_or_path /home/GOT-OCR2.0/GOT-OCR-2.0-master/GOT_weights --…
-
### 🎟️ Parent Task (Ticket Number)
BO-42
### 🌳 Branch Name (Branch)
main
### 📝 Details (Description)
Implement a batch scheduler for starting and ending trips.
### ✅ Checklist (Tasks)
- [ ]
-
**Is your feature request related to a problem? Please describe.**
The current AnimateDiffSDXLPipeline supports neither a single ControlNet nor multiple ControlNets.
I've been working on this task for …
-
### Reminder
- [x] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-5.4.0-152-generic-x86_64-with-glibc2.35
- Python…
-
MySQL Input configuration
```toml
[input]
type = "mysql"
mode = "batch"
[input.config]
nr-scanner = 10
table-scan-batch = 10000
batch-per-second-limit = 10
max-full-dump-count…
-
**Problem:**
Polling for changes in state isn't scaling well, especially when running into API rate limits.
**Solution:**
Subscribe to changes for all Azure services used in the implementation of…
-
**Describe the bug**
1. I am training a Mixtral 8x7B model. After 270 training steps it hangs; after 30 minutes NCCL times out and the process is killed.
Inva…
-
### System Info
Hi,
I'm having trouble reproducing the numbers NVIDIA claims in the table here: https://nvidia.github.io/TensorRT-LLM/performance/perf-overview.html#throughput-measurements
System Im…
-
On version `v3.0.75`:
While indexing Slack with the Slack connector, the following error is raised:
```
[2024-06-10 15:36:50,541: INFO/MainProcess] Task check_for_document…