-
- Paper name: From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
- ArXiv Link: https://arxiv.org/abs/2308.12032
To close this issue, open a …
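For context, the linked paper's core idea is an Instruction-Following Difficulty (IFD) score: the model's loss on the answer conditioned on the instruction, divided by its loss on the answer alone, with high-IFD ("cherry") samples selected for tuning. A rough sketch of that ratio is below; the helper name and prompt handling are illustrative, not the authors' released code:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def ifd_score(model, tokenizer, instruction, answer, device="cpu"):
    """Instruction-Following Difficulty: loss(answer | instruction) / loss(answer).

    A ratio near or above 1 means the instruction barely helps the model
    predict the answer, i.e. the sample is hard and worth keeping.
    """
    def answer_loss(prefix):
        ids = tokenizer(prefix + answer, return_tensors="pt").input_ids.to(device)
        prefix_len = tokenizer(prefix, return_tensors="pt").input_ids.shape[1]
        labels = ids.clone()
        labels[:, :prefix_len] = -100  # ignore prompt tokens; score only the answer
        with torch.no_grad():
            return model(ids, labels=labels).loss.item()

    return answer_loss(instruction) / answer_loss("")

# Usage (model name is a placeholder):
# model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
# tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
# print(ifd_score(model, tokenizer, "Translate to French: hello", " bonjour"))
```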
-
I tried fine-tuning **Llama 2**, **Llama 3**, and even **Llama 3.1**, but my loss keeps fluctuating, decreasing and then increasing again, and I can't figure out why.
My dataset is in Alpaca format, like this:
```
[
  {
  …
```
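For reference, an Alpaca-format file is a JSON list of records with `instruction`, `input` (possibly empty), and `output` fields. A minimal sketch that builds one such record and sanity-checks the keys follows; all field values here are made up, and malformed records are a common cause of erratic loss curves:

```python
import json

# A minimal, illustrative Alpaca-format dataset: a JSON list of records,
# each with "instruction", "input" (may be empty), and "output".
sample = [
    {
        "instruction": "Summarize the following paragraph.",
        "input": "Large language models are trained on ...",
        "output": "A one-sentence summary of the paragraph.",
    },
]

# Sanity-check every record; missing or misspelled keys are a common
# cause of silently broken prompts and unstable training loss.
for i, rec in enumerate(sample):
    missing = {"instruction", "input", "output"} - rec.keys()
    if missing:
        raise ValueError(f"record {i} is missing fields: {missing}")

print(json.dumps(sample, indent=2, ensure_ascii=False))
```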
-
As I asked before, does it support what the title claims? Is it multimodal-in, multimodal-out (with multiple images)?
-
### Question
Hi~
Instruction tuning took 100 hours in my experiment. Is this normal?
We used 2 machines × 4 A100-40G (8 A100s in total) for fine-tuning, with the same dataset as the paper.
Due to the 40G…
-
Hi,
Thanks a lot for open-sourcing the code to fetch the FLAN dataset.
I noticed in the paper The Flan Collection: Designing Data and Methods for Effective Instruction Tuning (https://arxiv.or…
-
The fine-tuning code runs when I replace the base model with LLaMA-2.
I am aware that LLaMA and LLaMA-2 share the same configuration files and other associated components.
However, I would sti…
-
I am performing a mega-merge using LLaMA 3.2 3B, covering both the base model and a fine-tuned/instruction-tuned variant, with the DARE linear method. Following the successful completion of the initial merge, I encoun…
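For context, DARE linear merging randomly drops a fraction of each fine-tuned model's delta from the base (its task vector), rescales the survivors by 1 / (1 - drop_rate) to preserve the expectation, and linearly combines the results. A minimal sketch over plain state dicts; the function name, drop rate, and mixing weights are illustrative, not from the issue:

```python
import torch

def dare_linear_merge(base, fine_tuned, weights, drop_rate=0.9, seed=0):
    """DARE + linear merge over plain state dicts (illustrative, CPU-only).

    base:       {param_name: tensor} for the base model
    fine_tuned: list of state dicts with the same keys
    weights:    per-model mixing coefficients
    drop_rate:  fraction of each delta randomly zeroed; survivors are
                rescaled by 1 / (1 - drop_rate)
    """
    torch.manual_seed(seed)
    merged = {}
    for name, base_param in base.items():
        total_delta = torch.zeros_like(base_param)
        for ft, w in zip(fine_tuned, weights):
            delta = ft[name] - base_param                         # task vector
            keep = (torch.rand_like(delta) >= drop_rate).to(delta.dtype)
            total_delta += w * delta * keep / (1.0 - drop_rate)   # drop + rescale
        merged[name] = base_param + total_delta
    return merged

# Usage with two fine-tunes mixed 60/40 (toy tensors stand in for real weights):
base = {"w": torch.zeros(4)}
fts = [{"w": torch.ones(4)}, {"w": -torch.ones(4)}]
print(dare_linear_merge(base, fts, weights=[0.6, 0.4], drop_rate=0.5))
```

Merging tools such as mergekit expose this as a `dare_linear` merge method, so the sketch above is only meant to show what that option does under the hood.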
-
Thank you very much for this meaningful work. May I ask how much instruction-tuning data you used?
Also, have you investigated how much instruction-tuning data is actually sufficient?
Many thanks.
-
## Overview
Our previous stable release was cut on 2024-11-04: https://github.com/iree-org/iree/releases/tag/candidate-20241104.1068. We aim for roughly one stable release every 6 weeks, though in th…
-
### Question
I wonder about the performance when using Qwen2 as the LLM. Does it outperform the original LLaVA-v1.5?
By the way, are there any scripts for instruction tuning? I only found the scri…