VikParuchuri / textbook_quality

Generate textbook-quality synthetic LLM pretraining data
MIT License
461 stars 46 forks source link

Fix lesson length issue #6

Closed VikParuchuri closed 9 months ago

VikParuchuri commented 9 months ago

Sometimes, lessons would generate that were very short. This patches the issue to generate to the length of the outline.