hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Apache License 2.0
3.28k stars 173 forks source link

Relevant paper #10

Open jadeCurl opened 3 days ago

jadeCurl commented 3 days ago

Hi,

Thank you for organizing and sharing such valuable resources! I would like to recommend a paper titled Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs for inclusion. In this paper, we perform reasoning using the Tree of Thought (ToT) approach and distill these reasoning processes into the LLM's Chain of Thought (CoT). This method serves as a form of self-reflection & self-improvement , effectively improving the model's ability to handle complex reasoning tasks.

I hope you might consider adding this paper to your list of recommendations. Thank you!