voidful TextRL issues - Githubissues

voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

MIT License

545 stars 60 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Fix Reward Calculation in example/2022-12-10-textrl-elon-musk.ipynb

#28 Alanhsiu closed 6 months ago
0
Fix Reward Calculation in `example/2022-12-10-textrl-elon-musk.ipynb`

#27 Alanhsiu opened 6 months ago
1
Problems in the inference process

#26 ignorejjj opened 11 months ago
0
unfreeze_layer_from_past parameter

#25 JhonDan1999 opened 1 year ago
4
Does the package support automatic multi-gpu?

#24 margarita-aicyd closed 1 year ago
2
Reward policy agent environment is not training with Finetuned model

#23 harshs21 closed 1 year ago
1
Update dump.py

#22 Kongfha closed 1 year ago
0
ValueError: Expected parameter logits

#21 josutk closed 1 year ago
5
Text generation after period/full-stop (".")

#20 ansharora7 closed 1 year ago
0
Support for other PFRL Algorithms

#19 ansharora7 closed 1 year ago
2
Documentation on Methodology

#18 flyingabove closed 1 year ago
1
Update interval

#17 debjitpaul closed 1 year ago
3
Support for AutoModelForSeq2SeqLM

#16 janpf closed 1 year ago
2
Are there any examples for T5 or Bart? Why T5 and bart give the same output before/after training?

#15 YuXiangLin1234 closed 1 year ago
2
token classification test

#14 hemangjoshi37a opened 1 year ago
3
Text generation models generating repeated/duplicate text/sentences.

#13 tontan1998 closed 1 year ago
3
i get error when i use elon example

#12 wac81 closed 1 year ago
6
About the compare_sample

#11 jkwang93 closed 1 year ago
1
Backward compatibility

#10 Keith-Hon closed 1 year ago
2
AssertionError

#9 Ulov888 closed 1 year ago
3
It needs a license

#8 cooljoseph1 closed 1 year ago
1
AttributeError: module 'numpy' has no attribute '_no_nep50_warning'

#7 GrahamboJangles closed 1 year ago
1
AttributeError: 'MyRLEnv' object has no attribute 'num_envs'

#6 lucascassiano closed 1 year ago
2
Could you give some examples to run the code?

#5 SusannaWull closed 1 year ago
3
Errors may occur after changing the batchsize and update interval of the agent

#4 rongaoli closed 1 year ago
5
'Model' object has no attribute 'lm_head'

#3 Mousumi44 closed 2 years ago
2
Restyle Add license scan report and status

#2 restyled-io[bot] closed 2 years ago
0
Add license scan report and status

#1 fossabot closed 2 years ago
0