peft Search Results - Githubissues

1000+ results
for peft

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

UKPLab/sentence-transformers #3050

Gradual slowdown of training in bigger batch sizes

I am facing a very weird issue here. ### Issue - The training speed slows down with time for batch sizes 64 and 128. For batch size 32 it seems to be staying fairly constant. - The tensorboard g…

sidharthg-couture updated 1 week ago
7
huggingface/peft #2194

[BUG] Issue with using `rank_pattern` and `alpha_pattern` to…

### System Info peft==0.13.2 ### Who can help? @BenjaminBossan ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An officially supported task in t…

sirluk updated 1 week ago
2
huggingface/trl #2250

OOM when unwrap_model_for_generation

### System Info torch==2.4.0 transformers==4.43.4 trl==0.9.6 tokenizers==0.19.1 accelerate==0.32.0 peft==0.12.0 datasets==2.20.0 deepspeed==0.15.0 bitsandbytes==0.43.3 sentencepiece==0.2.0 …

hlnchen updated 1 week ago
4
unslothai/unsloth #1240

why is unsloth thinking I'm doing multi gpu optimization whe…

code ```python ''' conda activate beyond_scale_2_unsloth ''' import torch from datasets import load_dataset from trl import SFTConfig, SFTTrainer from unsloth import FastLanguageModel from tr…

brando90 updated 2 weeks ago
3
pytorch/pytorch #130486

RuntimeError: NVML_SUCCESS == DriverAPI::get()->nvmlInit_v2_…

### 🐛 Describe the bug ''' checkpoint_path = './llama_relevance_results' training_args = transformers.TrainingArguments( #remove_unused_columns=False, # Whether or not to automatically r…

Zzv213 updated 3 weeks ago
4
furiosa-ai/ssm-peft #1

question: hyperparameters

Hi, Thanks for providing a public implementation for the experimental results of your paper. I am trying to reproduce the results, regarding hyperparameters in the paper it is stated (quote): `…

puigde updated 2 days ago
4
OpenGVLab/LLaMA-Adapter #97

Support for llama-2 70B

Would it work natively or we need to train new adapters?

qizzzh updated 1 year ago
7
keras-team/keras-hub #1831

📢 KerasNLP is now KerasHub 📢

## tl;dr - We have consolidated KerasNLP and KerasCV into a new **KerasHub** package. - We have renamed the `keras-nlp` GitHub repository to `keras-hub`. - **All existing usages will continue to …

mattdangerw updated 2 weeks ago
1
X-PLUG/mPLUG-DocOwl #82

problem when finetuning DocOwl1.5-Omni

I have the following error when finetune the DocOwl1.5-Omni. It always raises error when index is 10. Please help!!! ``` File "/opt/conda/envs/mplug_owl2/lib/python3.10/site-packages/deepspeed/run…

lmydian1014 updated 2 months ago
5
LLaVA-VL/LLaVA-NeXT #129

Missing transformer_engine, cannot run video demo

I'm always missing the `transformer_engine` package after running `pip install -e ".[train]"` and attempt to run the demo ``` bash scripts/video/demo/video_demo.sh lmms-lab/LLaVA-NeXT-Video-32B-Qwen…

tsaiJN updated 3 months ago
1

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for peft

1000+ results
for peft