Yu-Fangxu / FoR

Flow of Reasoning: Efficient Training of LLM Policy with Diverse Thinking
MIT License
21 stars, 2 forks