-
### Pitch
I suggest renaming "content warning" to "content notice" in the Mastodon frontend. This would make it clearer what the feature is actually used for by many people without changing the mean…
-
Dear team,
Hope you are well.
We would like to suggest a new event for the BBW website. We are not sure how this works here, so please advise. The content is below:
BAI Online Course DIGITAL ART, AI, AN…
-
See here: https://deepmind.com/blog/alphazero-shedding-new-light-grand-games-chess-shogi-and-go/
Not read yet...
-
Hi @zaiyan-x ,
I'm trying to run and reproduce your code for my project, but training takes quite a long time.
I checked the log file and found that most of the time is spent on eta optimiza…
-
**System information**
- Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): 16.04
- Mobil…
-
#108 is going to be an important place to start.
Twitter is the only context where I do not currently see a need for increased emphasis on self-curation, considering how that all appearan…
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/safe-rlhf/issues) and [Discussions](https://github.com/PKU-…
-
## Describe the bug
KL divergence calculation in KLPENPPOLoss is always zero, causing the contribution to the loss to be 0.
It seems that the w…
-
Hi, thanks for the great work. I encountered a problem when initializing the vLLM engine in PPO training. It seems that the program cannot find any available GPUs during initialization.
```
F…
-
I have been trying to record a reward on a track.
![Screenshot 2024-04-26 185119](https://github.com/trackmania-rl/tmrl/assets/86875051/c1e790c5-f80b-4c4c-a343-b228418a98a4)
![Screenshot 2024-04-26 185200]…