VikParuchuri / textbook_quality

Generate textbook-quality synthetic LLM pretraining data
MIT License
488 stars 50 forks source link