instruction-datasets Search Results

haotian-liu/LLaVA #1569

Why use plain text sharegpt datasets for instruction tuning

### Question ShareGPT is used for instruction fine-tuning, with the aim of inserting data from image independent pure text conversations into multiple rounds of image conversations, so that the model…

AshOneN updated 1 week ago

argilla-io/distilabel #751

[BUG] GPU utilization depends on targeted dataset size

**Describe the bug** Generating larger datasets with `LoadDataFromDicts` leads to underutilization of the GPU during the `TextGeneration` step. **To Reproduce** Setting `N_SAMPLES` to a smal…

fpreiss updated 1 week ago

huggingface/datasets #6982

cannot split dataset when using load_dataset

### Describe the bug when I use load_dataset methods to load mozilla-foundation/common_voice_7_0, it can successfully download and extracted the dataset but It cannot generating the arrow document,…

cybest0608 updated 9 hours ago

OpenGVLab/InternVL #306

Pretrain OCR datasets structure

> Traditional OCR datasets can be transformed into instruction-following datasets. For example, in the traditional OCR dataset, a data sample is an image with OCR ground truths. > > W…

toshiks updated 1 day ago

OpenGVLab/Ask-Anything #139

Instruction tuning with my own datasets

I am planning to fine-tune the VideoChat2 model with custom instruction data to enhance its performance on downstream tasks. I have a couple of questions regarding the pre-training data and the proces…

sonderzhang updated 3 months ago

Adamliu1/SNLP_GCW #90

Collect a list of all datasets used

Updated 2024-07-01. Datasets: - Used for evaluation: - MMLU: https://huggingface.co/datasets/hails/mmlu_no_train - ARC-Challenge: https://huggingface.co/datasets/allenai/ai2_arc - HellaSwag: h…

Willmish updated 3 hours ago

hamadichihaoui/BIRD #9

How to train my own datasets?

Hello, I wanted to express my gratitude for your work; it's been instrumental in my current project. However, I've encountered some confusion that I'm hoping you can shed light on. Now，I want to trai…

superY688 updated 2 weeks ago

datatime27/videos #8

How to parse the raw data (word-tracker)

Add instructions/tools to parse the data from the datasets into graphs and visual representations of the data shown in the video, e.g the sentiment analysis

Headedbranch225 updated 1 week ago

AChen-qaq/ProML #1

Instructions for getting datasets?

Great work - do you have any details on the exact datasets used and where to get them?

NtaylorOX updated 7 months ago

OpenGVLab/Ask-Anything #155

Any instructions for fine-tuning on custom datasets?

Thanks for your excellent work. I am curious if there are any instructions for fine-tuning video-llava on my own dataset?

2000ZRL updated 3 months ago

1000+ results for instruction-datasets

1000+ results
for instruction-datasets