issues
search
voidful
/
TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
MIT License
545
stars
60
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix Reward Calculation in example/2022-12-10-textrl-elon-musk.ipynb
#28
Alanhsiu
closed
6 months ago
0
Fix Reward Calculation in `example/2022-12-10-textrl-elon-musk.ipynb`
#27
Alanhsiu
opened
6 months ago
1
Problems in the inference process
#26
ignorejjj
opened
11 months ago
0
unfreeze_layer_from_past parameter
#25
JhonDan1999
opened
1 year ago
4
Does the package support automatic multi-gpu?
#24
margarita-aicyd
closed
1 year ago
2
Reward policy agent environment is not training with Finetuned model
#23
harshs21
closed
1 year ago
1
Update dump.py
#22
Kongfha
closed
1 year ago
0
ValueError: Expected parameter logits
#21
josutk
closed
1 year ago
5
Text generation after period/full-stop (".")
#20
ansharora7
closed
1 year ago
0
Support for other PFRL Algorithms
#19
ansharora7
closed
1 year ago
2
Documentation on Methodology
#18
flyingabove
closed
1 year ago
1
Update interval
#17
debjitpaul
closed
1 year ago
3
Support for AutoModelForSeq2SeqLM
#16
janpf
closed
1 year ago
2
Are there any examples for T5 or Bart? Why T5 and bart give the same output before/after training?
#15
YuXiangLin1234
closed
1 year ago
2
token classification test
#14
hemangjoshi37a
opened
1 year ago
3
Text generation models generating repeated/duplicate text/sentences.
#13
tontan1998
closed
1 year ago
3
i get error when i use elon example
#12
wac81
closed
1 year ago
6
About the compare_sample
#11
jkwang93
closed
1 year ago
1
Backward compatibility
#10
Keith-Hon
closed
1 year ago
2
AssertionError
#9
Ulov888
closed
1 year ago
3
It needs a license
#8
cooljoseph1
closed
1 year ago
1
AttributeError: module 'numpy' has no attribute '_no_nep50_warning'
#7
GrahamboJangles
closed
1 year ago
1
AttributeError: 'MyRLEnv' object has no attribute 'num_envs'
#6
lucascassiano
closed
1 year ago
2
Could you give some examples to run the code?
#5
SusannaWull
closed
1 year ago
3
Errors may occur after changing the batchsize and update interval of the agent
#4
rongaoli
closed
1 year ago
5
'Model' object has no attribute 'lm_head'
#3
Mousumi44
closed
2 years ago
2
Restyle Add license scan report and status
#2
restyled-io[bot]
closed
2 years ago
0
Add license scan report and status
#1
fossabot
closed
2 years ago
0