namin / llm-verified-with-monte-carlo-tree-search

LLM verified with Monte Carlo Tree Search
https://arxiv.org/abs/2402.08147
MIT License
210 stars 25 forks source link

Added instructions to README about running the DPO script on only one GPU. #36

Closed ChloeL19 closed 5 months ago