-
Implement a minimum risk trainer as described in http://arxiv.org/abs/1512.02433.
- Sampling is done by computing the top-k translations (beam search is already implemented)
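The expected-risk objective from that paper can be sketched as follows. This is an illustrative, stand-alone version, not the repo's actual code: the function name, the use of NumPy, and the per-candidate cost (e.g. 1 − sentence BLEU) are assumptions; the candidate set is assumed to come from the existing beam search.

```python
import numpy as np

def mrt_loss(log_probs, costs, alpha=0.005):
    """Expected risk over a candidate set (minimum risk training).

    log_probs: model log-probabilities of the k candidate translations
    costs:     per-candidate risk, e.g. 1 - sentence BLEU vs. the reference
    alpha:     smoothness hyperparameter from the paper; Q(y) ∝ P(y)^alpha
    """
    scaled = alpha * np.asarray(log_probs, dtype=float)
    scaled -= scaled.max()          # subtract max for numerical stability
    q = np.exp(scaled)
    q /= q.sum()                    # renormalized distribution Q over the k candidates
    return float(np.dot(q, costs))  # expected risk E_Q[cost]

# k = 3 candidates returned by beam search, with hypothetical costs
loss = mrt_loss([-2.0, -5.0, -9.0], [0.1, 0.4, 0.9])
```

In a real trainer this scalar would be computed from differentiable log-probabilities so gradients flow back through Q; the NumPy version only shows the arithmetic.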
-
As a user of vocascan, I'd like the ability to import and export vocabularies as CSV files.
I can imagine that many people start learning by creating tables and already have them, and wo…
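Such a round trip could look roughly like the sketch below, using Python's csv module; the two-column layout (foreign word, translation) and the function names are hypothetical, not vocascan's actual schema.

```python
import csv
import io

def export_vocab(pairs, fh):
    """Write (foreign, translation) pairs as CSV with a header row."""
    writer = csv.writer(fh)
    writer.writerow(["foreign", "translation"])
    writer.writerows(pairs)

def import_vocab(fh):
    """Read (foreign, translation) pairs back, skipping the header row."""
    reader = csv.reader(fh)
    next(reader)
    return [(row[0], row[1]) for row in reader]

# Round trip through an in-memory buffer
buf = io.StringIO()
export_vocab([("chien", "dog"), ("chat", "cat")], buf)
buf.seek(0)
pairs = import_vocab(buf)
```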
-
Hello,
I want to build a speech translation model for an Arabic dialect using the fairseq toolkit, so I started by preparing my data to train my model. For preparing the data for training I have in…
-
Hello, so I was fine-tuning a Llama-2 model with Unsloth using a tokenizer of my own. It has an extended vocabulary of around 48,000 tokens in total; the tokenizer is compatible and checks have been mad…
-
Thanks for making the repo public. I am new to machine translation and your repo seems promising to me. Could you please explain how to go about training and evaluating the model on my datasets?
-
To better plan and get an overview of the tasks still open for implementing the Pushshift Reddit benchmark, I ask you to go through the following functions with regard to the Pushshift Reddit benchmark…
-
Running on a 32 GB card with clip left at its default of 200, it overflows even with batch-size=1. Unbelievable.
Then I found that the clip step in preprocess doesn't actually clip anything: the longest sentence turns out to be 1146 chars, so the overflow is probably to be expected.
I modified the code and am pasting it below; anyone who hits the same problem can use it as a reference.
```
import argparse
import numpy as np
import os
imp…
```
-
I was training the skip-thoughts model with my own data, following the README in the training directory.
But I encountered a TypeError, and I'm not familiar with Theano, so I don't know what to do about it.
Her…
-
==((====))== Unsloth: Fast Llama patching release 2024.4
\\ /| GPU: NVIDIA GeForce RTX 2060 SUPER. Max memory: 7.785 GB. Platform = Linux.
O^O/ \_/ \ Pytorch: 2.3.0. CUDA = 7.5. CUDA Too…
-
Running with the command: python finetune.py --datasets_path /Fengshenbang-LM/fengshen/examples/finetune_taiyi_stable_diffusion/demo_dataset --datasets_type txt --model_path models/Taiyi-Stable-Diffusion-1B-Chinese-E…