ZHZisZZ / weak-to-strong-search

Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
https://arxiv.org/abs/2405.19262
9 stars 1 forks source link