Hello, I wish to reproduce the StarChat training for educational purposes, but I see the dataset (HuggingFaceH4/oasst1_en) has been removed. Is there any way to download it?
If not, any suggestions for similar datasets? I want to use the current code (chat/train.py) with the least amount of friction.
Hello, I wish to reproduce the StarChat training for educational purposes, but I see the dataset (HuggingFaceH4/oasst1_en) has been removed. Is there any way to download it?
If not, any suggestions for similar datasets? I want to use the current code (chat/train.py) with the least amount of friction.