batch-scheduler Search Results

1000+ results
for batch-scheduler

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

lshqqytiger/stable-diffusion-webui-amdgpu #503

Ultimate SD upscale - Cannot set version_counter for inferen…

### Checklist - [ ] The issue exists after disabling all extensions - [ ] The issue exists on a clean installation of webui - [ ] The issue is caused by an extension, but I believe it is caused by a …

nicodem09 updated 2 months ago
26
wycloveinfall/MSMDFF-NET #3

Till 197 epoches =>Epoches 197, learning rate = 0.00000002, …

Traceback (most recent call last): File "train.py", line 277, in batch_loss_n, pred = solver.optimize(index+1,epoch) File "/home/jayakumar/MSMDFF-NET-main/utils/frame_work_general.py", lin…

Jayku88 updated 4 months ago
3
tarantool/tarantool #5544

Develop kind of an async work scheduler for internal yieldin…

The problem: some systems have async work to do, which may yield. They don't want or simply can't do the work right away. For example, can be called via FFI, or want to collect a batch of such request…

Gerold103 updated 4 weeks ago
3
pytorch/xla #4083

Custom learning rate scheduler affects TPU performance

## ❓ Questions and Help I have trained my transformer model once on a single GPU and once using a multi-core TPU. In both cases a batchsize of 256 is used (times 8 for the TPU). My training results…

DanielRoeder1 updated 2 years ago
4
kohya-ss/sd-scripts #567

returned non-zero exit status 3221225477 problem with Kohya …

Can the problem be that I have GTX 1050 ti 4 GB? (playing with options to lower VRAM usage does not help), When I play with settings I get the same thing but the last thing changes to returned non-zer…

Garano11 updated 1 month ago
10
hiyouga/LLaMA-Factory #5484

请问DPO训练的时候有什么注意事项吗？我训练出来效果很差。

### Reminder - [X] I have read the README and searched the existing issues. ### System Info 训练命令： llamafactory-cli train \ --stage dpo \ --do_train \ --finetuning_type full \ …

zlh-source updated 3 weeks ago
9
JohnRomanelis/SPVD #8

IndexError: max(): Expected reduction dim 0 to have non-zero…

Dear, Thank you for the great work! I am running your code for point cloud completion and get such an error when inferencing. I did some dedug, and realized using the code below, after self.update_…

jianchaoci updated 1 week ago
11
Ucas-HaoranWei/GOT-OCR2.0 #134

deepspeed 和 transformers 精度对不上，

Deepspeed 软件版本： 0.15.2 Transformers: 4.45.2 训练命令： deepspeed GOT/train/train_GOT.py --deepspeed zero_config/zero2.json --model_name_or_path /home/GOT-OCR2.0/GOT-OCR-2.0-master/GOT_weights --…

Gaopeng-Bai updated 1 month ago
1
cj-mills/pytorch-yolox-object-detection-tutorial-code #5

AssertionError: Loss is NaN or infinite at epoch 0, batch 71…

I get it at different batches in the first epoch, not always the same. But around 70-80% progress iof the first batch it seems. ``` ----------------------------------------------------------------…

agoransson updated 1 month ago
2
gradio-app/gradio #9378

Hangs at loading shards then get a OOM error.

### Describe the bug I've gone through all the steps to install Sora and the last step of running gradio/app.py it fails about 2/3 of the way. It hangs on loading shards at 0% and then get the follow…

blacknoon updated 2 months ago
1

上一页 1...19 20 21 22 23 24 25...100 下一页

1000+ results for batch-scheduler

1000+ results
for batch-scheduler