nagataka / Read-a-Paper

Survey

Deep reinforcement learning from human preferences #49

Open nagataka opened 1 year ago

nagataka commented 1 year ago

Summary

Link

Deep reinforcement learning from human preferences

Author/Institution

OpenAI/DeepMind

What is this

Comparison with previous research. What are the novelties/good points?

Key points

How did the authors demonstrate the effectiveness of the proposal?

Any discussions?

What should I read next?