arXiv / html_feedback

Supports a student project developing a UI for feedback on arXiv articles rendered as HTML.
MIT License

/methodname #975

Status: Open. Opened by Emnolope 5 months ago.

Emnolope commented 5 months ago

Description

We present \methodname, a general reasoning and search method for tasks with outputs that can be decomposed into components. \methodname explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action space includes revision actions: \methodname may choose to revise part of its previous output rather than

(Optional:) Please add any files, screenshots, or other information here.

No response

(Required) What is this issue most closely related to? Select one.

Choose One

Internal issue ID

11c3a1e1-50c8-4191-b105-4accfb1204a8

Paper URL

https://arxiv.org/html/2404.05966v1

Browser

Chrome/122.0.0.0

Device Type

Android

html-feedback-bot[bot] commented 5 months ago

Location in document: undefined

Selected HTML:

evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action space includes revision actions: \methodname may choose to revise part of its previous output rather than continuing to build the rest of its output. Empirically, \methodname outperforms state-of-the-art reasoning methods across three challenging tasks: Story Outline Improvement (up to +30% interestingness), Mini-Crosswords Solving (up to +16% word success rate), and Constrained Generation (up to +10% concept coverage).

1 Introduction

github-actions[bot] commented 5 months ago

Hello @Emnolope, thanks for the issue report! We are reviewing your report and will address it as soon as possible.
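
For context on the report: the literal `\methodname` in the selected text suggests a custom macro that was left unexpanded in the HTML rendering. The paper's actual definition is not shown in this thread, so the following is only a minimal sketch of how such a macro is commonly defined in a LaTeX preamble. The replacement text `MethodName` is a placeholder assumption, not the paper's real method name.

```latex
\documentclass{article}

% Hypothetical definition: "MethodName" is a placeholder,
% not the name used in the paper under discussion.
\newcommand{\methodname}{MethodName}

\begin{document}
% The trailing {} keeps the space after the macro from being swallowed.
We present \methodname{}, a general reasoning and search method.
\end{document}
```

If a definition like this is missing or not picked up during conversion, the macro name can appear verbatim in the output, which would match what the reporter selected.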