-
### Willingness to contribute
No. I cannot contribute this feature at this time.
### Proposal Summary
This feature request proposes to add support for logging FullyShardedDataParallel models …
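For context, below is a rough sketch of the kind of manual workaround such a feature would replace. It is not MLflow's proposed API; the helper name and arguments are assumptions. The idea is to gather a full (unsharded) state dict from the FSDP-wrapped model on rank 0 and log it through the existing `mlflow.pytorch.log_model` call.

```python
# Hypothetical workaround sketch, not the proposed MLflow API.
import mlflow.pytorch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp import StateDictType, FullStateDictConfig


def log_fsdp_model(fsdp_model, template_model, artifact_path="model"):
    """Hypothetical helper: `template_model` is an unwrapped copy of the same architecture."""
    cfg = FullStateDictConfig(offload_to_cpu=True, rank0_only=True)
    # Temporarily switch the FSDP module to produce a full (unsharded) state dict.
    with FSDP.state_dict_type(fsdp_model, StateDictType.FULL_STATE_DICT, cfg):
        state_dict = fsdp_model.state_dict()
    if dist.get_rank() == 0:
        # Load the gathered weights into the plain module and log that with the existing API.
        template_model.load_state_dict(state_dict)
        mlflow.pytorch.log_model(template_model, artifact_path)
```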
-
Dear author,
Thanks for your great work!
I found that vocab_size=152064 in the checkpoint's config.json, and the lm_head module has the same size.
However, when I print len(tokenizer), it is 151657.
This ca…
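For reference, a minimal sketch of how the two numbers can be compared (the checkpoint id below is a placeholder, not necessarily the exact model in question):

```python
from transformers import AutoConfig, AutoTokenizer

model_id = "Qwen/Qwen2-7B"  # placeholder checkpoint id
config = AutoConfig.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

print(config.vocab_size)   # size of the embedding / lm_head matrices, e.g. 152064
print(len(tokenizer))      # number of tokens the tokenizer actually defines, e.g. 151657
# The gap is typically padding of the embedding matrix plus reserved, unused token slots;
# ids >= len(tokenizer) are normally never produced by the tokenizer.
```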
-
### Bug description
I was able to fine-tune an 8B LLM using the Hugging Face training framework with PEFT + DeepSpeed stage 2 under fp16 precision (mixed-precision training). Recently I wanted to change…
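For reference, a minimal sketch of the DeepSpeed/precision part of that setup (the PEFT adapter configuration is attached to the model separately and is omitted here; the output directory is a placeholder):

```python
from transformers import TrainingArguments

# ZeRO stage 2 with fp16 mixed precision, matching the setup described above.
ds_config = {
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "train_micro_batch_size_per_gpu": "auto",
    "gradient_accumulation_steps": "auto",
}

training_args = TrainingArguments(
    output_dir="outputs",    # placeholder
    fp16=True,               # must agree with the DeepSpeed fp16 setting
    deepspeed=ds_config,     # a dict or a path to a JSON config is accepted here
)
```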
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### W…
-
Hi doc-builder team!
Thanks for your great library!
I am trying to use your library to build docs for my own project, but I am facing some difficulties.
I have created a file structure for docs s…
-
I trained the PEFT model on my dataset using the file finetune.py.
There is no difference between using and not using PEFT in the interface, so training with finetune.py does not seem to work.
I am v…
-
Hello,
the fine-tuning process completed successfully; however, when I try to run the inference separately by loading the model with the following code:
```
import torch
from transformers import AutoModelForCausalLM, Bits…
```
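For comparison, here is a minimal, self-contained sketch of loading a base model plus a saved LoRA adapter for inference. The checkpoint name and adapter path are placeholders, and 4-bit loading via `BitsAndBytesConfig` is assumed from the truncated import above.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

bnb_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)

# Load the base checkpoint, then attach the fine-tuned LoRA weights on top of it.
base = AutoModelForCausalLM.from_pretrained(
    "base-model", quantization_config=bnb_config, device_map="auto"  # placeholder names
)
model = PeftModel.from_pretrained(base, "./adapter")  # placeholder adapter path
model.eval()

tokenizer = AutoTokenizer.from_pretrained("base-model")
inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```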
-
**Describe the bug**
```
from cosyvoice.cli.cosyvoice import CosyVoice
from cosyvoice.utils.file_utils import load_wav
import torchaudio

cosyvoice = CosyVoice('pretrained_models/CosyVoice-300M')
```
…
-
- [ ] automatic hyperparameter search
- [ ] support for more RLHF methods
- [ ] support for more models
- [ ] multimodal support
- [ ] auto-parallelization
- [ ] better dispatcher and monitoring
-
## Description
### Regression Test for Loss, Memory, Throughput
Comparisons of loss, memory, and throughput for Full-FT and PEFT:
- QLoRA: status quo on the switch to `torch_dtype=float16` (see the loading sketch below) (Referenc…
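As a reference point, a minimal sketch of the QLoRA loading path with the `torch_dtype=float16` switch made explicit (the checkpoint name is a placeholder, not a specific model from the comparison):

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,  # compute dtype used inside the 4-bit layers
)

model = AutoModelForCausalLM.from_pretrained(
    "base-model",                          # placeholder checkpoint
    quantization_config=bnb_config,
    torch_dtype=torch.float16,             # the switch under test in the regression run
)
```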