Smile-L closed this issue 11 months ago
Hi @Smile-L ,
Thanks for your attention.
The HuatuoGPT-sft-data-v1 dataset is currently shuffled, which makes it difficult to separate the distilled data from the rest. To address this, we will label the source of each sample in the future.
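Once each sample carries a source label, splitting the dataset could look roughly like the sketch below. The `source` field name and the values `distilled` and `real` are hypothetical placeholders, not part of the current release, and the sample records are made up for illustration.

```python
from collections import defaultdict


def split_by_source(records, key="source"):
    """Group records by their source label; unlabeled records go under 'unknown'."""
    groups = defaultdict(list)
    for record in records:
        groups[record.get(key, "unknown")].append(record)
    return dict(groups)


# Made-up records; the "source" field is an assumed future schema.
samples = [
    {"source": "distilled", "data": ["question 1", "answer 1"]},
    {"source": "real", "data": ["question 2", "answer 2"]},
    {"data": ["question 3", "answer 3"]},  # no label yet
]

groups = split_by_source(samples)
for name, items in groups.items():
    print(name, len(items))
```

Records without a label are kept under `unknown` rather than dropped, so nothing is silently lost while labeling is incomplete.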
Best, Junying
How can we tell the distilled data apart from the real-world data in the HuatuoGPT-sft-data-v1 dataset?