-
[This website](https://dynomight.net/chess/) shows how recent LLMs have lost some chess-playing ability. There are two tie-ins to class:
1. He uses llama.cpp and grammars to enforce that models make…
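As a sketch of that technique: llama.cpp grammars are written in GBNF, and a grammar can restrict generation to strings that look like legal moves. The fragment below is an assumption for illustration — it allows only UCI-style moves such as `e2e4` or `e7e8q`, whereas the post may well constrain SAN notation instead:

```
root  ::= move
move  ::= file rank file rank promo?
file  ::= [a-h]
rank  ::= [1-8]
promo ::= [qrbn]
```

A grammar like this can be passed to the llama.cpp CLI via its grammar options so the sampler can only emit tokens consistent with the rules.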
-
Hi, I'm trying to download the video instruction tuning datasets used in VideoChat2, but the [link for WebVid](https://maxbain.com/webvid-dataset/) is not working. According to the [WebVid repo](https…
-
Does the "Multi-Task" in Multi-Task Instruction Fine-tuning refer to training VulLLM on Vulnerability Localization, Vulnerability Detection, and Vulnerability Interpretation? However, the prompts in the CodeLlama/StarCoder finetune.py don't seem to reflect Inte…
-
Hi unsloth team,
I am wondering how to enable `packing = True` when I need to train only on the `output` tokens of a text pair. This is a general use-case for instruction fine-tuning proble…
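For context, "train only on output tokens" usually means masking the prompt positions in the labels with `-100` (the index Hugging Face loss functions ignore), so the loss is computed on the response alone; the difficulty with packing is that this mask must be kept correct inside every packed segment. A minimal sketch of the masking step itself, with the tokenizer left abstract:

```python
IGNORE_INDEX = -100  # label value skipped by Hugging Face cross-entropy loss

def mask_prompt_labels(input_ids, prompt_len):
    """Build causal-LM labels that take loss only on the output tokens:
    the first `prompt_len` positions are set to IGNORE_INDEX."""
    labels = list(input_ids)
    for i in range(min(prompt_len, len(labels))):
        labels[i] = IGNORE_INDEX
    return labels
```

For example, `mask_prompt_labels([11, 12, 13, 14], prompt_len=2)` returns `[-100, -100, 13, 14]`, so gradients flow only through the last two tokens.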
-
**Is your feature request related to a problem? Please describe.**
Training and fine-tuning models often involve significant manual work, especially when experimenting with different hyperparameters …
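The kind of manual sweep this request wants automated can be sketched as a plain grid search; the hyperparameter names and the `train` callback here are illustrative, not taken from the project:

```python
from itertools import product

def grid_search(space, train):
    """Run `train` once per combination in `space` (a dict mapping
    hyperparameter name -> list of candidate values) and return the
    configuration with the highest score."""
    names = list(space)
    best_cfg, best_score = None, float("-inf")
    for values in product(*(space[n] for n in names)):
        cfg = dict(zip(names, values))
        score = train(cfg)  # user-supplied train-and-evaluate function
        if score > best_score:
            best_cfg, best_score = cfg, score
    return best_cfg, best_score
```

In practice the `train` callback would launch a fine-tuning run and return a validation metric; here it is just a stand-in for whatever the training harness exposes.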
-
Hello, I am currently using auto_scheduler to automatically tune a naive gemm operator. However, after the tuning is completed, I checked the corresponding assembly code and found that the registers r…
-
Either as monolingual or multilingual.
Relevant links:
- https://txt.cohere.com/aya-multilingual/
- https://huggingface.co/datasets/OpenAssistant/oasst1
-
In Section 2.5, the models are further fine-tuned on several open-source instruction-tuning datasets, which include the training sets of GSM8K and MATH.
I'm wondering, after this continued fine-tuning, are …
-
Dear Author,
Thank you for sharing your work on this project. I noticed that the repository currently doesn’t include the training code (train.py). I would greatly appreciate it if you could share …
-
Problem Description
This notebook demonstrates how to instruction-tune Seq2Seq models using Hugging Face transformers. Instruction tuning is a machine learning paradigm where a model is trained to foll…
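For a Seq2Seq model, instruction tuning typically means serializing the instruction (and any optional input) into the encoder text and using the response as the decoder target. A minimal formatting sketch — the template strings are an assumption, not the notebook's exact prompt:

```python
def format_example(instruction, input_text, response):
    """Build an (encoder input, decoder target) pair for seq2seq
    instruction tuning. The decoder predicts the full target, so no
    prompt masking is needed, unlike the causal-LM case."""
    if input_text:
        source = f"Instruction: {instruction}\nInput: {input_text}"
    else:
        source = f"Instruction: {instruction}"
    return source, response
```

The resulting pairs can then be tokenized and fed to a standard encoder-decoder training loop, with the source as model input and the response as labels.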