issues
search
yamatokataoka
/
learning-from-human-preferences
Replication of Deep Reinforcement Learning from Human Preferences (Christiano et al, 2017).
MIT License
2
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
High-level Design
#12
yamatokataoka
closed
11 months ago
0
Enable numpy type checking
#11
yamatokataoka
closed
2 years ago
0
Automate release
#10
yamatokataoka
closed
2 years ago
2
set up rl-human-prefs
#9
yamatokataoka
closed
2 years ago
0
Release flow consideration
#8
yamatokataoka
closed
2 years ago
7
initial research
#7
yamatokataoka
opened
2 years ago
4
set up rl-human-prefs-ui
#6
yamatokataoka
closed
2 years ago
1
Set up rl human prefs api
#5
yamatokataoka
closed
2 years ago
0
set up rl-human-prefs
#4
yamatokataoka
closed
2 years ago
5
set up rl-human-prefs-api
#3
yamatokataoka
closed
2 years ago
5
add comparison page layout
#2
yamatokataoka
closed
3 years ago
0
Add comparison page
#1
yamatokataoka
closed
3 years ago
0