-
Whitepaper: https://arxiv.org/pdf/2306.02707.pdf
Will be released here: https://aka.ms/orca-lm
Summary: https://www.youtube.com/watch?v=Dt_UNg7Mchg
-
```
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/mmior/.local/share/virtualenvs/json-descriptions-cLP9Kwl8/lib/python3.10/site-packages/litgpt/tokenizer.py", line 39, in …
```
-
Hi, I want to upload a fine-tuned Llama 3-instruct model to Ollama. I followed [this tutorial](https://docs.unsloth.ai/tutorials/how-to-finetune-llama-3-and-export-to-ollama) to do it, but it didn't generate the M…
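Not the Unsloth tooling itself, but a minimal fallback sketch: if the export step does not emit the file Ollama needs, you can write a bare Modelfile by hand and register the GGUF yourself. The GGUF filename and model name below are assumptions.
```python
# Fallback sketch (assumes the GGUF export succeeded and ollama is on PATH).
from pathlib import Path
import subprocess

gguf = Path("model-unsloth.Q8_0.gguf")  # assumed name of the exported file

# A bare Modelfile: a FROM line pointing at the local GGUF is enough to test.
Path("Modelfile").write_text(f"FROM ./{gguf.name}\n")

# Register the model locally; afterwards `ollama run llama3-ft` loads it.
subprocess.run(["ollama", "create", "llama3-ft", "-f", "Modelfile"], check=True)
```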
-
Thank you for your valuable contributions to the community; your work looks great! I came across this 70B model: Llama-3.1-Nemotron-70B-Instruct, and its benchmark results are impressive. Could you pl…
-
If we follow the script settings of long-llm, where the parameter num_train_epoch is set to 1, it gives a really significant improvement over the original model. However, if we change the parameter to…
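For reference, a minimal sketch of where this knob lives when the training loop is built on Hugging Face's Trainer (the long-llm script may wire it differently, and HF spells it num_train_epochs; every other value below is a placeholder):
```python
# Minimal sketch, not the long-llm script: the epoch count being compared
# above maps to num_train_epochs in Hugging Face's TrainingArguments.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    num_train_epochs=1,             # the setting that gave the big improvement
    per_device_train_batch_size=1,  # placeholder
    learning_rate=2e-5,             # placeholder
)
```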
-
Training LLaMA-13B-4bit on a single RTX 4090 with `finetune.py` (using PyTorch 2 beta, to support the requisite CUDA 11.8 for compute rev 8.9) finishes 3 epochs in only a minute:
```
=============…
```
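To confirm the same stack before running it, a small generic PyTorch probe (not part of finetune.py); the commented values are what an RTX 4090 on a CUDA 11.8 build should report:
```python
# Sanity-check the toolchain the run above relies on: a PyTorch 2.x build
# compiled against CUDA 11.8 and a GPU with compute capability 8.9.
import torch

print(torch.__version__)                     # expect a 2.x build
print(torch.version.cuda)                    # expect "11.8"
print(torch.cuda.get_device_capability(0))   # expect (8, 9) on an RTX 4090
```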
-
How to fine-tune VideoLLaMA2 chat models using QLoRA and LoRA.
```
...
--data_path datasets/custom_sft/custom.json
--data_folder datasets/custom_sft/
--pretrain_mm_mlp_adapter CONNECTOR_DOWNLOAD_PAT…
```
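A hedged sketch of what a record in custom.json often looks like in this family of repos (LLaVA-style conversations, with the video path resolved against --data_folder); the field names are assumptions, so check VideoLLaMA2's own docs for the authoritative schema:
```python
# Assumed LLaVA-style record layout for --data_path; verify against the repo.
import json
from pathlib import Path

out = Path("datasets/custom_sft")
out.mkdir(parents=True, exist_ok=True)

record = {
    "id": "clip_0001",
    "video": "clip_0001.mp4",  # resolved relative to --data_folder
    "conversations": [
        {"from": "human", "value": "<video>\nWhat happens in this clip?"},
        {"from": "gpt", "value": "A person pours coffee into a mug."},
    ],
}

(out / "custom.json").write_text(json.dumps([record], indent=2))
```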
-
### Request for Release of Pretrained NLLB-LLM2Vec Model
Hello Team,
Could you please release the pretrained NLLB-LLM2Vec models mentioned in your paper on "Self-Distillation for Model Stacking…
-
I installed everything from `requirements.txt`, but still get this error when I run `finetune.py`:
```
Traceback (most recent call last):
File "/work/pi_hzamani_umass_edu/zarifalam_umass_edu/fi…
-
### What is the issue?
No issues with any model that fits on a single 3090, but it seems to run out of memory when trying to distribute to the second 3090.
```
INFO [wmain] starting c++ runner | ti…
```
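A small diagnostic sketch (plain PyTorch, not part of Ollama) to see how much free memory each 3090 actually has before the runner tries to split the model:
```python
# Print free/total memory per GPU so an uneven split across the two 3090s
# is visible before the c++ runner runs out of memory.
import torch

for i in range(torch.cuda.device_count()):
    free, total = torch.cuda.mem_get_info(i)
    print(f"GPU {i}: {free / 2**30:.1f} GiB free of {total / 2**30:.1f} GiB")
```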