hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
Apache License 2.0
458 stars 28 forks source link

[Question] Is the 6k dataset is a subset of 10k dataset. #7

Closed ChenMnZ closed 7 months ago

ChenMnZ commented 7 months ago

Hi,

Thanks for your interesting and great work. I want to know if the 6k dataset is a subset of 10k dataset.

VPeterV commented 7 months ago

Hi. Thanks for your interest. Yes, according to our iterative method, the 6K dataset is a subset of 10K dataset.