lamm-mit / PRefLexOR

Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning
Apache License 2.0
20 stars, 3 forks