-
Hi, thanks for your great work. I ran into some errors when I tried to resume my LoRA finetuning.
```
09/04/2024 15:54:23 - INFO - accelerate.accelerator - Loading states from output/tulu_v2_dolly_…
```
-
Hi Unsloth!
I came across this interesting model on Reddit: https://www.reddit.com/r/LocalLLaMA/comments/1ez8rmu/llama31_just_got_ears_early_experiments/
It allows text and audio as input, and o…
-
Right now, when we finetune a LoRA on top of, e.g., Llama 3.1 8B Instruct, even if `model_name` is `meta-llama/Meta-Llama-3.1-8B-Instruct`, it gets resolved to `unsloth/meta-llama-3.1-8b-instruct-bnb-4bit…
-
### Your current environment
```text
The output of `python collect_env.py`
```
### How would you like to use vllm
Hi,
I want to attach a LoRA adapter using a docker command:
docker run --runtime nv…
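The command in the report is cut off; for context, a hypothetical sketch of what attaching a LoRA adapter to vLLM's OpenAI-compatible server via Docker can look like (the image tag, model name, adapter name, and paths below are placeholders, not taken from the report):

```shell
# Hypothetical sketch: serve a base model plus a LoRA adapter with the
# vLLM OpenAI-compatible server image. All names and paths are placeholders.
docker run --runtime nvidia --gpus all \
  -p 8000:8000 \
  -v ~/adapters:/adapters \
  vllm/vllm-openai:latest \
  --model meta-llama/Meta-Llama-3.1-8B-Instruct \
  --enable-lora \
  --lora-modules my-lora=/adapters/my-lora
```

With `--enable-lora`, the adapter registered via `--lora-modules` can then be requested by name (`my-lora`) in the `model` field of OpenAI-style API calls.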
-
config file:
```python
# Model
pretrained_model_name_or_path = '/data/llm/cache/Qwen2-7B-Instruct/'
use_varlen_attn = True

# Data
data_files = ['/workspace/xtuner/sft_openai.json']
prompt_template = …
```
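Since `data_files` points at an OpenAI-style chat file, a minimal sketch of what a record in `sft_openai.json` might look like (the field names assume the common OpenAI messages schema; the exact schema xtuner's loader expects may differ):

```python
import json

# Hypothetical one-record dataset in the OpenAI-style chat format often used
# for SFT data. The content and file layout here are illustrative only.
sample = {
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is the capital of France?"},
        {"role": "assistant", "content": "The capital of France is Paris."},
    ]
}

# Write the dataset to disk as a JSON list of conversation records.
with open("sft_openai.json", "w") as f:
    json.dump([sample], f, indent=2)
```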
-
### Which Cloudflare product(s) does this pertain to?
Wrangler
### What version(s) of the tool(s) are you using?
Wrangler 3.72.2
### What version of Node are you using?
16.15.1
### W…
-
Trying to finetune unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit. It gives me an error even with a batch size of 1 and a max sequence length of 2048.
I can see the example notebook runs on a Colab T4.
Unsloth: Fast Mistral pa…
-
Which one would you recommend to use to get the best possible performance?
-
I am trying to finetune on a custom dataset, in particular: https://huggingface.co/datasets/truthfulqa/truthful_qa
I haven't found any clear documentation, only partial docs explaining bits…
-
All of the models supported by torchtune currently have rather low context lengths (