-
**Describe the bug**
Running an error
**Log output**
***** Evaluating perplexity, Epoch 0/1 *****
Traceback (most recent call last):
File "main.py", line 345, in
main()
File "main.py…
-
### 🐛 Describe the bug
When running the example code in this repo following https://wandb.ai/carperai/summarize_RLHF/reports/Implementing-RLHF-Learning-to-Summarize-with-trlX--VmlldzozMzAwODM2 the tr…
-
### 🐛 Describe the bug
Hi,
There is something that is slightly unclear to me in the **summarize_rlhf** code -
I see that the tokenizer used everywhere is the pretrained tokenizer of `EleutherAI/gpt…
-
Thank you very much for such a great work. When I run gpt2-sentiment.py (https://github.com/lvwerra/trl/blob/main/examples/sentiment/scripts/gpt2-sentiment.py#L151), I have a question I would like to …
-
*This is a high-priority request from a partner.*
Rosetta is a public API spec defined to be a common denominator for blockchain projects.
As per the discussion (find some pieced below), we are …
-
Hi,
I recently am working on a psychological project on estimating the model parameters using numpyro MCMC inference. However, I've found no tutorials within the numpyro documentation to guide me.…
-
Create an example showing reward modeling. This could use a synthetic reward source artificially limited, or the HHH Anthropic data (already on the Stability cluster).
More ideas for tasks: https://…
-
### 🐛 Describe the bug
0%| | 0/10000 [00:00
-
This istn something i think thats in dire need, i just think it would be dope.
I imagine it that u can select ur party composition and a Boss Template and the AI gives u a mitigation plane. It doe…
-
Hello, Antoxnxpod! I saw you liked my repositories. Do you want to create a new project together?