-
# Description
I wrote an inference script like this:
```python
import torch
from PIL import Image
import sys
sys.path.append('./')
from llava.constants import IMAGE_TOKEN_INDEX, DEFAULT_IMAG…
-
### System Info
I am using a Tesla T4 (16 GB).
### Reproduction
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, BitsAndBytesConfig
base_model_id = "mistralai/Mistral-7B-…
```
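For context, the 4-bit quantization settings typically paired with a 16 GB T4 look like the sketch below. The values here are illustrative assumptions, not taken from the truncated report above:

```python
import torch
from transformers import BitsAndBytesConfig

# Assumed 4-bit NF4 setup; the T4 has no bfloat16 support,
# so float16 is used as the compute dtype here.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.float16,
)
```

This config is then passed as `quantization_config=bnb_config` to `AutoModelForCausalLM.from_pretrained`.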
-
### ❓ The question
Hi, I am wondering if you can provide your config file for finetuning on the Tulu V2 dataset? It would be helpful for reproducing the finetuning results. In addition, have you tr…
-
Hi @danielhanchen
I tried fine-tuning a different model, Nous Research Llama 3.1 8B Instruct, and I get an error.
Is it really necessary to make those other target modules trainable?
I want to tra…
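For reference, which modules LoRA touches is controlled by `target_modules` in PEFT's `LoraConfig`. A typical sketch for Llama-style models follows; the module names and hyperparameters are common defaults, not values confirmed by this issue, so check your model's actual layer names:

```python
from peft import LoraConfig

# Typical Llama-style attention projections; trimming this list
# reduces trainable parameters but can reduce quality.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```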
-
Do the stage_2 checkpoint weights you provide include the Q-Former weights, PEFT-tuned on {alpaca_gpt4_data.json and lava_instruct_150k.json}?
https://huggingface.co/Alpha-VLLM/LLaMA2-A…
-
Hi, thanks for your great repo.
I am trying to use this code to fine-tune llama2-7b on tulu-v2, and I find we always get the same loss curve even when I use different seeds. I guess this is because the d…
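One possible cause of identical loss curves is a hard-coded shuffle seed in the dataloader: the data order is then fixed regardless of the global seed. A minimal stdlib sketch of the effect (illustrative only, not the repo's actual dataloader):

```python
import random

def epoch_order(indices, shuffle_seed):
    """Return the data order an epoch would use for a given shuffle seed."""
    order = list(indices)
    random.Random(shuffle_seed).shuffle(order)  # the seed fixes the permutation
    return order

data = list(range(8))

# A hard-coded shuffle seed reproduces the same order on every run,
# even if other global seeds change.
run_a = epoch_order(data, shuffle_seed=42)
run_b = epoch_order(data, shuffle_seed=42)
assert run_a == run_b  # identical order -> identical loss curve
```

To get genuinely different runs, the shuffle seed itself (not just the global seed) has to vary.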
-
When training either the Llama 3 or 3.1 8B base model using the Llama 3 template for the conversation prompt format, it seems not to train with the correct tokens. It ends up producing text containing tokens…
-
When loading optimizer.pt, the keys don't match:
KeyError: 'base_model.model.model.layers.0.self_attn.q_proj.lora_A.default.weight'
The keys in the optimizer.pt state dict are the integers 0–255, not parameter names.
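One common cause of this kind of KeyError is a mismatch between saved LoRA parameter names and the names the current model expects, e.g. a missing PEFT adapter-name segment. A hypothetical remapping sketch (the key patterns are illustrative, not taken from the actual checkpoint):

```python
def remap_lora_keys(state_dict, adapter_name="default"):
    """Insert the PEFT adapter name into LoRA keys that lack it,
    e.g. '...lora_A.weight' -> '...lora_A.default.weight'."""
    remapped = {}
    for key, value in state_dict.items():
        for part in ("lora_A", "lora_B"):
            marker = f".{part}."
            if marker in key and f".{part}.{adapter_name}." not in key:
                key = key.replace(marker, f".{part}.{adapter_name}.")
        remapped[key] = value
    return remapped

# Usage: keys saved without the adapter name gain the '.default.' segment.
saved = {"base_model.model.model.layers.0.self_attn.q_proj.lora_A.weight": 1}
fixed = remap_lora_keys(saved)
assert "base_model.model.model.layers.0.self_attn.q_proj.lora_A.default.weight" in fixed
```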
-
I notice that running truthfulqa.sh requires "gpt_true_model_name" and "gpt_info_model_name", but it seems the original model is unavailable now.
-
Dear VideoLLaMA2 Maintainers,
I have been using your library and successfully fine-tuned models with LoRA and QLoRA on my own dataset. However, I noticed that the repository does not include code f…