-
I've tried to fine-tune the llm4decompile-6.7b model on my dataset, and the result is impressive.
My dataset looks like the following format:
```
{'instruction': 'MY_CUSTOMIZE_QUESTION', 'input': '',…
```
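For reference, here is a minimal sketch of what one full record in this alpaca-style layout might look like and how it could be loaded. The 'output' field name and all values are my own placeholder assumptions, not from the actual dataset:
```python
# Hypothetical sketch of a single training record in the
# instruction / input / output layout hinted at above.
# Field names beyond 'instruction' and 'input', and all values,
# are placeholder assumptions.
import json
from datasets import load_dataset

record = {
    "instruction": "MY_CUSTOMIZE_QUESTION",  # e.g. the prompt/question text
    "input": "",                             # optional extra context, empty here
    "output": "MY_EXPECTED_ANSWER",          # assumed target field
}

# Write one record to a JSONL file and load it back with `datasets`.
with open("my_dataset.jsonl", "w") as f:
    f.write(json.dumps(record) + "\n")

ds = load_dataset("json", data_files="my_dataset.jsonl", split="train")
print(ds[0])
```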
-
The error output from running it is as follows:
###infering###
((), (), (), ()) tensor([0, 0, 0, 0], device='cuda:0')
Traceback (most recent call last):
File "/home/pxc/Cornucopia-LLaMA-Fin-Chinese/infer.py", line 168, in
main…
-
NOTE: if this is not a bug report, please use the [GitHub Discussions](https://github.com/facebookresearch/esm/discussions) for support questions (How do I do X?), feature requests, ideas, showcasing …
-
I'm fine-tuning with the transformers Trainer class, and every time it reaches the eval step it throws an error. The message is as follows:
AttributeError: Caught AttributeError in replica 1 on device 1.
Original Traceback (most recent call last):
File "/home/uos/miniconda3/envs/l…
-
I tried to fine-tune the 13B model with a 3090 (24 GB RAM). The training started and a progress bar was shown; however, I got an error saying 'maximum recursion depth exceeded' after 100 steps…
-
Hi, great work! Thanks for sharing!
When I trained with the released code, after feeding in the weights and data as provided in the README, I encountered an error as follows:
![1710749903540](https://github…
-
Hi Scott,
Thanks for the video, it was very helpful.
Can you please help me with this error I am facing?
I am using Colab Pro with an A100 GPU and high RAM, as you mentioned.
But I am facing this ERR…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports…
-
I saw the hardware requirements for training chat-llama:
13B to 20B → 8x Nvidia A100 (80 GB)
But check this article from HF where they show how to do it with a single 4090:
https://huggingface.co…
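As far as I understand, the single-GPU recipe in articles like that one is QLoRA-style: load the base model in 4-bit and train small LoRA adapters with PEFT. A rough sketch under that assumption (the checkpoint name and hyperparameters are illustrative, not taken from the article):
```python
# Sketch, assuming a QLoRA-style setup: 4-bit base weights + LoRA adapters.
# Checkpoint name and hyperparameters are placeholder assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base = "meta-llama/Llama-2-13b-hf"      # placeholder 13B checkpoint

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(
    base,
    quantization_config=bnb_config,     # base weights stay frozen in 4-bit
    device_map="auto",
)

model = prepare_model_for_kbit_training(model)
lora = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()      # only the adapter weights are trained
```
Since only the small LoRA adapters need gradients and optimizer state while the quantized base stays frozen, fitting a 13B model on a single 24 GB card becomes plausible.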
-
I trained a vicuna-13b-1.1 LoRA in 4-bit.
Now I'm trying to merge it for running generations, but it fails with the following error:
```
python3.11/site-packages/peft/tuners/lora.py", line 352, in merge_an…
```
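If this is the usual failure mode (trying to merge LoRA deltas into base weights that are still 4-bit quantized, which older peft versions cannot do), the workaround I've seen is to reload the base model in fp16, attach the adapter, then merge and save. A sketch; all paths and names below are placeholders:
```python
# Sketch of the common workaround, assuming the merge fails because the
# base weights are still 4-bit quantized: reload the base in fp16,
# attach the trained adapter, merge, and save a plain checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_path = "path/to/vicuna-13b-v1.1"      # placeholder base checkpoint
adapter_path = "path/to/lora-adapter"      # placeholder adapter directory

base = AutoModelForCausalLM.from_pretrained(
    base_path,
    torch_dtype=torch.float16,             # fp16 weights this time, not 4-bit
)
model = PeftModel.from_pretrained(base, adapter_path)

merged = model.merge_and_unload()          # folds the LoRA deltas into the base weights
merged.save_pretrained("vicuna-13b-merged")
AutoTokenizer.from_pretrained(base_path).save_pretrained("vicuna-13b-merged")
```
I believe more recent peft releases handle merging of quantized layers better, so upgrading peft may also be worth trying, but the fp16 reload path is the one I've seen work reliably.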