-
### Question
Has anyone carried out pretraining with Mixtral 8×7B? When I run the pretraining script, a problem occurs, as shown in the figure below. I just added a llava_mixtral.py to the ll…
-
Hi, Table 20 shows prefix FT with 2 and 4 GPUs. How were those results obtained? I tried using `MODEL=facebook/opt-13b TASK=SST2 MODE=prefix LR=1e-5 NUM_GPU=8 bash finetune_fsdp.sh`, but got some errors…
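Presumably the 2- and 4-GPU rows come from the same command with `NUM_GPU` set accordingly; this is the sweep I had in mind, assuming `finetune_fsdp.sh` reads `NUM_GPU` for the FSDP world size (a guess on my part, not verified):

```python
import os
import subprocess

# Run the same prefix-FT recipe with 2 and then 4 GPUs, assuming the
# script picks up these environment variables (as in the README example).
for num_gpu in (2, 4):
    env = {**os.environ,
           "MODEL": "facebook/opt-13b",
           "TASK": "SST2",
           "MODE": "prefix",
           "LR": "1e-5",
           "NUM_GPU": str(num_gpu)}
    subprocess.run(["bash", "finetune_fsdp.sh"], env=env, check=True)
```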
-
## Environment Preparation
```bash
# Install ms-swift
pip install git+https://github.com/modelscope/swift.git#egg=ms-swift[llm]
# Install the latest transformers…
-
Today I updated the unsloth version for the first time, to 2024.8, and found a strange phenomenon: the fine-tuning results using the 2024.4 version were very good, but the fine-tuning results using…
-
I'm finding this repo to be a user-friendly, extensible, memory-efficient solution for training/fine-tuning models. However, when it comes to inference, there is a usability gap that could be solved b…
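For illustration, this is roughly the level of friction I'd hope for at inference time; a minimal sketch assuming the fine-tuned weights can be loaded through Hugging Face `transformers` (the checkpoint path is hypothetical, and the repo's actual formats may differ):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "path/to/finetuned-model"  # hypothetical local checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto")

# One call from prompt to text, with no training-side scaffolding required.
inputs = tokenizer("Hello, world!", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```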
-
Whitepaper: https://arxiv.org/pdf/2306.02707.pdf
Will be released here: https://aka.ms/orca-lm
Summary: https://www.youtube.com/watch?v=Dt_UNg7Mchg
-
Dear @salman-h-khan,
Thanks for your fantastic work on GeoChat; I am really interested in it, and the checkpoint you provided works for me.
However, when I tried to reproduce it as a beginner in the …
-
I'm noticing that with v0.3.2 my CPU is getting slaughtered. The UI revamp is worse than the previous iteration, with GPU offload now hidden on the "My Models" page, but even with all the layers assigned to the GPU …
-
### What happened?
`C:\Users\ArabTech\Desktop\5\LlamaCppExe>C:/Users/ArabTech/Desktop/5\LlamaCppExe/llama-cli -m C:/Users/ArabTech/Desktop/5/phi-3.5-mini-instruct-q4_k_m.gguf -p "Who is Napoleon Bonap…`
-
[LongLoRA](https://arxiv.org/abs/2309.12307) is "an efficient fine-tuning approach that extends the context sizes of pre-trained large language models". They propose fine-tuning a model with a sparse…
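A minimal sketch of the shifted-sparse-attention grouping the paper describes (the tensor layout and function name here are illustrative assumptions, not the authors' code):

```python
import torch

def shift_group(qkv: torch.Tensor, group_size: int) -> torch.Tensor:
    """Illustrative sketch of LongLoRA-style shifted sparse attention grouping.

    qkv: (batch, seq_len, num_heads, head_dim), an assumed layout. Attention
    is computed within contiguous groups of `group_size` tokens; half of the
    heads operate on groups shifted by half a group, so information flows
    across group boundaries.
    """
    b, s, h, d = qkv.shape
    assert s % group_size == 0, "sequence length must divide into groups"
    qkv = qkv.clone()
    # Shift the second half of the heads by half a group along the sequence.
    qkv[:, :, h // 2:] = qkv[:, :, h // 2:].roll(-group_size // 2, dims=1)
    # Fold the sequence into groups; attention then runs per group, costing
    # O(s * group_size) rather than O(s^2).
    return qkv.reshape(b * (s // group_size), group_size, h, d)
```

After the per-group attention, the shifted heads would be rolled back by `group_size // 2` to restore the original token order.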