-
Thanks for your excellent work and code. I have some questions about the code.
1. What is the expected behavior when ``stacking == True`` and ``zero_padding == True`` at the same time?
https://github.com/ATP-1010/FederatedLLM/blob/8efe72d1a7c7…
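For context: two common ways to combine LoRA factors of different ranks across clients are zero-padding every factor to a common rank before averaging, and stacking factors along the rank dimension. Below is a minimal NumPy sketch of both; the function names, shapes, and semantics are my own illustration and may not match what these flags actually do in this repo.

```python
import numpy as np

def zero_pad_aggregate(client_As, client_Bs, weights):
    """Pad heterogeneous-rank LoRA factors to the max rank, then average.
    client_As[i]: (r_i, in_features); client_Bs[i]: (out_features, r_i).
    Caveat: averaging A and B separately makes B_agg @ A_agg contain
    cross-client terms, a known drawback of the padding approach."""
    r_max = max(A.shape[0] for A in client_As)
    A_agg = np.zeros((r_max, client_As[0].shape[1]))
    B_agg = np.zeros((client_Bs[0].shape[0], r_max))
    for A, B, w in zip(client_As, client_Bs, weights):
        r = A.shape[0]
        A_agg[:r] += w * A        # rows r..r_max stay zero
        B_agg[:, :r] += w * B     # cols r..r_max stay zero
    return A_agg, B_agg

def stack_aggregate(client_As, client_Bs, weights):
    """Stack factors along the rank axis instead; here
    B_stacked @ A_stacked == sum_i w_i * (B_i @ A_i) exactly."""
    A_stacked = np.concatenate(client_As, axis=0)
    B_stacked = np.concatenate([w * B for B, w in zip(client_Bs, weights)], axis=1)
    return A_stacked, B_stacked
```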
-
- [ ] [DeepSeek-V2: A Strong, Economical, and Efficient MoE LLM of 236B total parameters](https://github.com/deepseek-ai/DeepSeek-V2)
# DeepSeek-V2: A Strong, Economical, and Efficient MoE LLM of 236B total parameters
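Since the headline here is the MoE architecture, a reminder of the general top-k routing mechanism may be useful. This is a generic PyTorch sketch, not DeepSeek-V2's actual implementation (which layers further refinements, such as shared experts, on top):

```python
import torch
import torch.nn.functional as F

def moe_forward(x, gate, experts, k=2):
    """Generic top-k MoE routing.
    x: (tokens, d_model); gate: nn.Linear(d_model, n_experts);
    experts: list of per-expert feed-forward modules."""
    logits = gate(x)                                    # (tokens, n_experts)
    probs = F.softmax(logits, dim=-1)
    topk_p, topk_i = probs.topk(k, dim=-1)              # route each token to k experts
    topk_p = topk_p / topk_p.sum(dim=-1, keepdim=True)  # renormalize over chosen experts
    out = torch.zeros_like(x)
    for e, expert in enumerate(experts):
        token_idx, slot = (topk_i == e).nonzero(as_tuple=True)
        if token_idx.numel():
            out[token_idx] += topk_p[token_idx, slot].unsqueeze(-1) * expert(x[token_idx])
    return out

# usage (toy):
# d, n = 16, 4
# gate = torch.nn.Linear(d, n)
# experts = [torch.nn.Sequential(torch.nn.Linear(d, 4 * d), torch.nn.GELU(),
#                                torch.nn.Linear(4 * d, d)) for _ in range(n)]
# y = moe_forward(torch.randn(8, d), gate, experts, k=2)
```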
-
https://lightning.ai/pages/community/finetuning-falcon-efficiently/
-
# URL
- https://arxiv.org/abs/2306.09782
# Affiliations
- Kai Lv, N/A
- Yuqing Yang, N/A
- Tengxiao Liu, N/A
- Qinghui Gao, N/A
- Qipeng Guo, N/A
- Xipeng Qiu, N/A
# Abstract
- Large Lan…
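This appears to be the LOMO paper (full-parameter fine-tuning with limited resources). Its central idea is to fuse gradient computation with the parameter update, so the gradients of all parameters never need to be held in memory at once. A rough sketch of that idea using PyTorch tensor hooks; the real implementation handles memory, gradient clipping, and mixed precision far more carefully:

```python
import torch

def attach_fused_sgd_hooks(model, lr=1e-3):
    """LOMO-style fused update sketch: apply the SGD step inside each
    parameter's gradient hook, so full-model gradients never coexist in
    memory. (Illustrative only; the paper discusses why updating
    mid-backward is acceptable in their SGD setting.)"""
    def make_hook(p):
        def hook(grad):
            with torch.no_grad():
                p.add_(grad, alpha=-lr)       # update immediately...
            return torch.zeros_like(grad)     # ...and replace the grad that
                                              # would otherwise be accumulated
        return hook
    for p in model.parameters():
        if p.requires_grad:
            p.register_hook(make_hook(p))

# usage: after attaching hooks, a plain loss.backward() performs the update
# as gradients stream in; no optimizer.step() is needed.
```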
-
**Command**: `tune run lora_finetune_single_device --config llama3_1/8B_lora_single_device`
**Output**:
```
INFO:torchtune.utils._logging:Running LoRAFinetuneRecipeSingleDevice with resolved config:…
```
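For reference, the LoRA layers this recipe trains compute a low-rank delta on top of a frozen base weight, y = W0·x + (α/r)·B·A·x. A minimal generic sketch of that formulation (standard LoRA, not torchtune's internal module):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Generic LoRA layer: y = W0 x + (alpha / r) * B(A(x)), with W0 frozen."""
    def __init__(self, base: nn.Linear, r=8, alpha=16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False          # freeze the pretrained weight
        self.A = nn.Linear(base.in_features, r, bias=False)
        self.B = nn.Linear(r, base.out_features, bias=False)
        nn.init.zeros_(self.B.weight)        # start with a zero delta
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + self.scaling * self.B(self.A(x))

# usage: layer = LoRALinear(nn.Linear(4096, 4096), r=8, alpha=16)
```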
-
Hi @NielsRogge
I have fine-tuned my PaliGemma model on custom data for an image-to-JSON use case, but at inference time some key values come out wrong (e.g., 3000 is extracted as 9000), so to get the data is corr…
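One way to narrow such digit errors down before changing the training setup is a per-key error report over the eval set, to see whether mistakes concentrate on numeric fields. A hedged sketch (the helper name and record format below are illustrative):

```python
import json

def field_error_report(pred_jsons, gold_jsons):
    """Compare predicted vs. gold JSON records key by key to see which
    fields (e.g., numeric amounts) the model gets wrong most often."""
    errors = {}
    for pred, gold in zip(pred_jsons, gold_jsons):
        for key, gold_val in gold.items():
            if pred.get(key) != gold_val:
                errors.setdefault(key, []).append((pred.get(key), gold_val))
    return errors

# usage:
# report = field_error_report([json.loads(p) for p in predictions],
#                             [json.loads(g) for g in references])
# for key, pairs in report.items():
#     print(key, len(pairs), pairs[:3])
```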
-
Questions we want to answer:
- Should we use pre-trained embeddings or the whole model?
- Why or why not?
- What are some major fine-tuning strategies and what are their benefits and drawbacks? (A short sketch follows this list.)
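To make the first two questions concrete: with pre-trained transformers, the two extremes are feature extraction (freeze the pretrained encoder and train only the task head; cheap, and less prone to overfitting on small data) and full fine-tuning (everything trainable; better task fit, but more compute and a risk of catastrophic forgetting). A minimal sketch; the checkpoint name is just an example:

```python
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

# Strategy 1: feature extraction. Freeze the pretrained encoder; only the
# randomly initialized classification head receives gradient updates.
for p in model.base_model.parameters():
    p.requires_grad = False

# Strategy 2: full fine-tuning. Leave everything trainable (the default);
# typically use a small learning rate to avoid destroying pretrained features.
# for p in model.parameters():
#     p.requires_grad = True
```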
Rela…
-
Can we fine-tune Mistral on a custom dataset in the field of digital marketing/marketing communication?
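For context, domain-specific supervised fine-tuning for a model like Mistral typically starts from instruction/response pairs drawn from the domain, e.g. briefs, taglines, and campaign copy. A hedged sketch of such records (the file name and fields are illustrative, not a required schema):

```python
import json

# Illustrative instruction-tuning records for a marketing-domain dataset.
records = [
    {
        "instruction": "Write a product tagline for an eco-friendly water bottle.",
        "response": "Hydration that loves the planet back.",
    },
    {
        "instruction": "Summarize this campaign brief in two sentences.",
        "response": "...",
    },
]

# Write one JSON object per line, the format most SFT loaders accept.
with open("marketing_sft.jsonl", "w") as f:
    for r in records:
        f.write(json.dumps(r) + "\n")
```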
-
I want to run the [sft](https://github.com/huggingface/peft/tree/main/examples/sft) example, but I get some errors. Can you help me find the problem?
I run [run_peft_fsdp.sh](https://github.com/huggin…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.2.dev0
- Platform: Linux-5.15.0-1070-aws-x86_64-with-glibc2.31
- Python v…