openai lm-human-preferences issues

openai / lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

https://openai.com/blog/fine-tuning-gpt-2/

MIT License

1.24k stars 164 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Permission Denied on Google Cloud Storage

#29 AlisonWen opened 1 year ago
0
Bump certifi from 2019.9.11 to 2023.7.22

#28 dependabot[bot] opened 1 year ago
0
Bump requests from 2.18.0 to 2.31.0

#27 dependabot[bot] opened 1 year ago
0
Where to find the experiment comparation: Using the data of training reward model for fine-tuning without reinforcement learning.

#26 guotong1988 opened 1 year ago
0
remove reference to google storage

#25 karthik-rangarajan closed 1 year ago
0
replace gs:// prefixes w/ MS URIs

#23 leondz closed 1 year ago
6
What is the full link for gs://lm-human-preferences/

#22 guotong1988 opened 1 year ago
3
The link in readme is broke.

#21 guotong1988 opened 1 year ago
4
The updated link is possibly broken

#20 TristanThrush closed 1 year ago
1
Bump certifi from 2019.9.11 to 2022.12.7

#19 dependabot[bot] closed 1 year ago
1
Bump protobuf from 3.9.1 to 3.18.3

#18 dependabot[bot] opened 2 years ago
0
Unable to access book and cnndm datasets

#17 aypan17 opened 2 years ago
6
Bump protobuf from 3.9.1 to 3.15.0

#16 dependabot[bot] closed 2 years ago
1
Azure data path gives 404

#15 8enmann closed 3 years ago
2
How to liberate the gpt2 from reference model?

#14 yananchen1989 opened 3 years ago
0
About the calculated returns for value loss

#13 yanghoonkim opened 3 years ago
0
question related to the code

#12 yanghoonkim closed 3 years ago
1
Bump rsa from 4.0 to 4.7

#11 dependabot[bot] opened 3 years ago
0
Bump py from 1.8.0 to 1.10.0

#10 dependabot[bot] opened 3 years ago
0
Bump rsa from 4.0 to 4.1

#9 dependabot[bot] closed 3 years ago
1
Bump httplib2 from 0.13.1 to 0.19.0

#8 dependabot[bot] opened 3 years ago
0
Trouble with accessing bucket / Google credentials

#7 alberg94 closed 4 years ago
1
Bump httplib2 from 0.13.1 to 0.18.0

#6 dependabot[bot] closed 3 years ago
1
error when training a reward model

#5 Yuminzhou opened 4 years ago
0
The installation steps doesn't work for me

#4 phaniram-sayapaneni opened 5 years ago
0
Got an error that I can't trace

#3 mysterefrank opened 5 years ago
1
Bump requests from 2.18.0 to 2.20.0

#2 dependabot[bot] closed 1 year ago
1
PPO training

#1 mehdimashayekhi closed 5 years ago
5