-
Traceback (most recent call last):
  File "/data/ChatGLM-Tuning/finetune.py", line 117, in <module>
    main()
  File "/data/ChatGLM-Tuning/finetune.py", line 110, in main
    trainer.train()
  File "/ro…
-
Hello. Thank you for sharing such great work. I am trying to run the samples in inference.py. The instruction-tuned model worked perfectly. However, the in-context example for the pretrained model did not…
-
### 🐛 Bug
Today, when attempting to upload a LoRA-trained Llama 3.1 70B model (the first time I've trained Llama 3.1), I hit the following error during the eLoRA merge. Note that I used the `cpu_shard` method to u…
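For reference, the excerpt does not show the exact merge command, and `cpu_shard`/eLoRA belong to the reporter's tooling. Below is only a minimal sketch of the generic peft merge-and-save flow; the model id, adapter path, and shard size are illustrative:
```
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Hypothetical ids/paths for illustration only; by default the model loads on CPU.
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.1-70B",
    torch_dtype=torch.bfloat16,
)
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")
merged = model.merge_and_unload()   # fold the LoRA weights into the base weights
merged.save_pretrained("path/to/merged-model",
                       safe_serialization=True,
                       max_shard_size="5GB")
```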
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is there an existing ans…
-
We are hosting a model on SageMaker, and today we observed the following error in our logs while the model was being relaunched on the instance:
```
ImportError: cannot import name 'Conversation' fr…
-
For Mistral-7B-v0.1 at a 20% compression ratio, the PPL after pruning is {'wikitext2': 245.2660781818917}; the pruned PPL for Llama-3-8B is similarly bad.
Could you tell me how to reproduce the resul…
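For context on how such a number is usually produced (the repo's exact evaluation script isn't shown in this excerpt), here is a minimal sketch of the common fixed-window wikitext-2 perplexity protocol; the model id stands in for the pruned checkpoint:
```
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-v0.1"   # substitute the pruned checkpoint here
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
).eval()

# Concatenate the test split and score it in non-overlapping 2048-token windows.
test = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
ids = tok("\n\n".join(test["text"]), return_tensors="pt").input_ids
seqlen, nlls = 2048, []
for i in range(0, ids.size(1) - seqlen, seqlen):
    chunk = ids[:, i : i + seqlen].to(model.device)
    with torch.no_grad():
        loss = model(chunk, labels=chunk).loss   # mean token NLL in this window
    nlls.append(loss.float() * seqlen)
ppl = torch.exp(torch.stack(nlls).sum() / (len(nlls) * seqlen))
print({"wikitext2": ppl.item()})
```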
-
### Describe the bug
**Problem:**
When the "officially provided" example (see https://github.com/huggingface/diffusers/blob/a785992c1d6fcb1ff66f8a0d68d09a0a81b909e8/src/diffusers/pipelines/ledits_pp…
-
Dear Author,
Thanks for your great projects.
I was trying to evaluate the model both without tuning and with tuning. I was wondering whether we can run the evaluation with the original (untuned) model.
Also, if I want to…
-
In my training script, I set the **per_device_train_batch_size = 4** in the TrainingArguments.
But the **train_batch_size** in the **trainer_state.json** of each checkpoint is **2**.
When I tried …
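As a side note (not stated in the excerpt above): in the Hugging Face Trainer, the `train_batch_size` recorded in `trainer_state.json` is the batch size the Trainer actually ended up using, which can differ from `per_device_train_batch_size`, for example when `auto_find_batch_size=True` backs off after an out-of-memory error. A minimal sketch of comparing the two, with hypothetical paths:
```
import json
from transformers import TrainingArguments

args = TrainingArguments(output_dir="out", per_device_train_batch_size=4)
print("configured per-device batch size:", args.per_device_train_batch_size)

# Hypothetical checkpoint path; read back what the Trainer actually used.
with open("out/checkpoint-500/trainer_state.json") as f:
    state = json.load(f)
print("recorded train_batch_size:", state.get("train_batch_size"))
```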
-
# URL
- https://arxiv.org/abs/2305.14314
# Affiliations
- Tim Dettmers, N/A
- Artidoro Pagnoni, N/A
- Ari Holtzman, N/A
- Luke Zettlemoyer, N/A
# Abstract
- We present QLoRA, an efficient fi…
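The abstract is truncated above. For orientation only, here is a minimal sketch of a QLoRA-style setup using the usual bitsandbytes/peft stack; the base model id and LoRA hyperparameters below are illustrative, not the paper's exact recipe:
```
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization with double quantization; compute runs in bfloat16.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "meta-llama/Llama-2-7b-hf"   # illustrative base model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)

# LoRA adapters are trained on top of the frozen, 4-bit-quantized base weights.
lora = LoraConfig(
    r=64, lora_alpha=16, lora_dropout=0.05, task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()
```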