-
Here's a strange error I keep getting and I haven't seen that anyone has posted it yet. I'm running the Causal Language Model finetuning [script found on the optimum-neuron examples page ](https://git…
-
In the reinforcement learning process, there will be some fixed joints whose positions are uncertain, in order to achieve the migration from simulation to real environment, I would like to be able to …
-
Dear MuJoCo Support Team,
I am currently encountering a problem with model visualization in a reinforcement learning setup using MuJoCo. To facilitate training in randomized environments, I dynamic…
-
Comment below with questions or thoughts about the reading for this week's [workshop](https://github.com/uchicago-computation-workshop/Spring2020/tree/master/04-09_Bainbridge).
Please make your com…
-
Suppose there is a game, a grid 10 by 10 ,each position was placed a piece of gold with a randomly positive value , and an agent do the mining job on this grid. The rule is when the agent digs a posit…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/OmniSafeAI/omnisafe/issues) and [Discussions](https://github.com/OmniSafe…
-
**Note: italicized text below is include as an example and should be updated before submission. If you feel any section is not applicable to your request, please replace with 'N/A' rather than delete …
-
hi @stefanbschneider
Recently, I have finally completed the task of migrating `d-drl-coordination SB3` to the `rllib` version. After adding the curiosity module, I found that a similar success rate …
-
What is the size unit of the map in the current scene design?
What is the distance that the agent moves each time, and what is its unit?
Is the current map continuous or grid-designed?
Yu-zx updated
3 months ago
-
https://research.fb.com/facebook-open-sources-elf-opengo/
ELF OpenGo
ELF OpenGo is a reimplementation of AlphaGoZero / AlphaZero. It was trained on 2,000 GPUs over a two week period, and has achie…