issues
search
openai
/
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
https://openai.com/blog/fine-tuning-gpt-2/
MIT License
1.24k
stars
164
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Permission Denied on Google Cloud Storage
#29
AlisonWen
opened
1 year ago
0
Bump certifi from 2019.9.11 to 2023.7.22
#28
dependabot[bot]
opened
1 year ago
0
Bump requests from 2.18.0 to 2.31.0
#27
dependabot[bot]
opened
1 year ago
0
Where to find the experiment comparation: Using the data of training reward model for fine-tuning without reinforcement learning.
#26
guotong1988
opened
1 year ago
0
remove reference to google storage
#25
karthik-rangarajan
closed
1 year ago
0
replace gs:// prefixes w/ MS URIs
#23
leondz
closed
1 year ago
6
What is the full link for gs://lm-human-preferences/
#22
guotong1988
opened
1 year ago
3
The link in readme is broke.
#21
guotong1988
opened
1 year ago
4
The updated link is possibly broken
#20
TristanThrush
closed
1 year ago
1
Bump certifi from 2019.9.11 to 2022.12.7
#19
dependabot[bot]
closed
1 year ago
1
Bump protobuf from 3.9.1 to 3.18.3
#18
dependabot[bot]
opened
2 years ago
0
Unable to access book and cnndm datasets
#17
aypan17
opened
2 years ago
6
Bump protobuf from 3.9.1 to 3.15.0
#16
dependabot[bot]
closed
2 years ago
1
Azure data path gives 404
#15
8enmann
closed
3 years ago
2
How to liberate the gpt2 from reference model?
#14
yananchen1989
opened
3 years ago
0
About the calculated returns for value loss
#13
yanghoonkim
opened
3 years ago
0
question related to the code
#12
yanghoonkim
closed
3 years ago
1
Bump rsa from 4.0 to 4.7
#11
dependabot[bot]
opened
3 years ago
0
Bump py from 1.8.0 to 1.10.0
#10
dependabot[bot]
opened
3 years ago
0
Bump rsa from 4.0 to 4.1
#9
dependabot[bot]
closed
3 years ago
1
Bump httplib2 from 0.13.1 to 0.19.0
#8
dependabot[bot]
opened
3 years ago
0
Trouble with accessing bucket / Google credentials
#7
alberg94
closed
4 years ago
1
Bump httplib2 from 0.13.1 to 0.18.0
#6
dependabot[bot]
closed
3 years ago
1
error when training a reward model
#5
Yuminzhou
opened
4 years ago
0
The installation steps doesn't work for me
#4
phaniram-sayapaneni
opened
5 years ago
0
Got an error that I can't trace
#3
mysterefrank
opened
5 years ago
1
Bump requests from 2.18.0 to 2.20.0
#2
dependabot[bot]
closed
1 year ago
1
PPO training
#1
mehdimashayekhi
closed
5 years ago
5