Aidenzich / road-to-master

A repo to store our research footprint on AI
MIT License

SELF-REFINE: Iterative Refinement with Self-Feedback #47

Open Aidenzich opened 7 months ago

Aidenzich commented 7 months ago

https://arxiv.org/pdf/2303.17651.pdf

Aidenzich commented 7 months ago

Performance

Screenshot 2024-04-04 at 11 58 15 AM

Generic feedback, such as "improve the efficiency of the code", lacks the precision and direction that specific, actionable feedback provides.
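For context, here is a minimal sketch of the generate, feedback, refine loop that the paper's title describes. The names `self_refine`, `generate`, `feedback`, `refine`, and `is_done` are hypothetical, and the toy functions stand in for LLM calls (in the paper a single model plays all three roles). The stop check and the iteration cap are assumptions of this sketch, not the paper's exact settings.

```python
from typing import Callable, List


def self_refine(
    task: str,
    generate: Callable[[str], str],
    feedback: Callable[[str, str], str],
    refine: Callable[[str, str, str], str],
    is_done: Callable[[str], bool],
    max_iters: int = 4,
) -> List[str]:
    """Return every draft produced, from the initial output to the last one."""
    drafts = [generate(task)]
    for _ in range(max_iters):
        fb = feedback(task, drafts[-1])
        if is_done(fb):  # the feedback itself signals that no change is needed
            break
        drafts.append(refine(task, drafts[-1], fb))
    return drafts


# Toy stand-ins for the LLM calls (hypothetical, for illustration only).
def toy_generate(task: str) -> str:
    return "please make this sentence much shorter"


def toy_feedback(task: str, draft: str) -> str:
    # Specific, actionable feedback: says what is wrong and what to do about it.
    n_words = len(draft.split())
    if n_words <= 3:
        return "DONE"
    return f"Too long ({n_words} words): drop the last word."


def toy_refine(task: str, draft: str, fb: str) -> str:
    return " ".join(draft.split()[:-1])


drafts = self_refine(
    "Shorten the sentence to at most three words.",
    toy_generate,
    toy_feedback,
    toy_refine,
    is_done=lambda fb: fb == "DONE",
)
for i, draft in enumerate(drafts):
    print(i, draft)
```

A generic `toy_feedback` that only returned "make it better" would give `toy_refine` nothing to act on, which is the gap the sentence above points at.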

Aidenzich commented 7 months ago

Influence of the Iteration

Screenshot 2024-04-04 at 12 02 47 PM
Aidenzich commented 7 months ago

Appendix. GPT-4 Evaluation

Screenshot 2024-04-04 at 12 05 07 PM Screenshot 2024-04-04 at 12 05 35 PM Screenshot 2024-04-04 at 12 05 51 PM

These are essentially the same as what you'll see in the gpt-prompt-engineer repo and in the automatic evaluation metrics at my company. It appears that pairwise comparison is more useful than scoring a single output with an evaluation prompt or with a fine-tuned classifier.
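A rough sketch of what pairwise comparison can look like in code. Everything here is hypothetical: `pairwise_preference` and the `Judge` signature are made up for illustration, and `toy_judge` stands in for an LLM call such as GPT-4. Asking twice with the order swapped is an extra precaution against position bias that I am assuming here; it is not taken from the screenshots above.

```python
from typing import Callable, List, Tuple

# A judge sees the task and two candidate outputs, and answers "first" or "second".
Judge = Callable[[str, str, str], str]


def pairwise_preference(task: str, a: str, b: str, judge: Judge) -> str:
    """Ask the judge twice with the order swapped; return "a", "b", or "tie"."""
    v1 = judge(task, a, b)  # a is shown first
    v2 = judge(task, b, a)  # b is shown first
    a_wins = (v1 == "first") + (v2 == "second")
    if a_wins == 2:
        return "a"
    if a_wins == 0:
        return "b"
    return "tie"  # the two verdicts disagree, so the preference depends on order


def toy_judge(task: str, first: str, second: str) -> str:
    # Toy stand-in: prefers the shorter output and falls back to the first
    # position when lengths are equal, which mimics position bias.
    return "second" if len(second) < len(first) else "first"


examples: List[Tuple[str, str, str]] = [
    ("t1", "a long base answer", "short"),
    ("t2", "tiny", "a much longer refined answer"),
    ("t3", "same", "size"),
]
verdicts = [pairwise_preference(t, a, b, toy_judge) for t, a, b in examples]
print(verdicts)  # ['b', 'a', 'tie']
```

The third example shows why the swap matters: a judge that simply favors whatever comes first is recorded as a tie instead of a win.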