bsz Search Results - Githubissues

1000+ results
for bsz


ylacombe/musicgen-dreamboothing #5

Incorporating input_values for Audio-Prompted Audio Generati…

I want to train the musicgen model (instead of the musicgen melody model) for audio-prompted audio continuation/generation tasks. According to my interpretation of the code provided below, it appears that `…

LiuZH-19 updated 4 months ago
1
carpedm20/ENAS-pytorch #48

Cifar10 CNN problem

1. utils.py", line 150, nbatch = data.size(0) // bsz AttributeError: 'DataLoader' object has no attribute 'size' -> nbatch = len (data.size) 2. image.py line 9, if args == 'cifar' -> if args …

aaron851113 updated 4 years ago
1
Nealcly/templateNER #14

Custom Label Problem

When I train the model with custom labels, the training code works well. However, Adapting Inference.py code to my custom trained model does not work. I change the Inference.ipynb code to adapt my …

savasy updated 2 years ago
4
jzhang38/TinyLlama #173

Why FSDP not DDP?

Could I kindly inquire as to why, given the relatively small size of the tinyllama model, the Strategy was made to utilize FSDP (Fully Sharded Data Parallel) instead of DDP (Distributed Data Parallel)…

noforit updated 2 months ago
1
pytorch/pytorch #57495

Please remove this assertion - it triggers on valid use case…

https://github.com/pytorch/pytorch/blob/4cb534f92ef6f5b2ec99109b0329f93a859ae831/aten/src/ATen/native/cpu/IndexKernel.cpp#L430 OR https://github.com/pytorch/pytorch/blob/d68ad3cb1e28fc464dd40dd…

ARozental updated 3 years ago
1
HandH1998/QQQ #15

Condition to achieve linear speedup?

I tested latency of QuantLinear forward with various sizes of input and feature sizes. But for token counts from 1 to 1024, I cannot see any speedup compared to AWQ W4A16 kernel and the results were …

jiwonsong-dev updated 1 week ago
18
OpenMOSS/MOSS #80

AttributeError: 'NoneType' object has no attribute 'deepspee…

Traceback (most recent call last): File "finetune_moss.py", line 303, in train(args) File "finetune_moss.py", line 175, in train accelerator.state.deepspeed_plugin.deepspeed_config['t…

HDRBgg updated 1 year ago
7
Link-Li/Balanced-DataParallel #15

The original version works, but this version keeps raising IndexError: tuple index out of range

bsz = inputs[0].size(self.dim) IndexError: tuple index out of range The original version is written like this: model = DataParallel(model, device_ids=[int(i) for i in args.device.split(',')]) Following this version's instructions, I wrote: model = BalancedDataPara…

sctm002 updated 1 year ago
1
CLUEbenchmark/CLUE #124

Performance issue in baselines/models/xlnet/data_utils.py

Hello! I've found a performance issue in baselines/models/xlnet/data_utils.py: `dataset = dataset.batch(bsz_per_core, drop_remainder=True)`[(here)](https://github.com/CLUEbenchmark/CLUE/blob/5b1d19dc8…

DLPerf updated 3 years ago
2
DRSY/EMO #4

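Distributed multi-node training: loss becomes negative after 300 steps

I integrated emo into my own code and train with bf16. During training the loss turns negative. What could be the cause? loss: -2.334918e-02, loss_cur_dp: -2.334918e-02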

jiaruipeng1994 updated 11 months ago
3
