muzero Search Results - Githubissues

397 results
for muzero

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

werner-duvaud/muzero-general #193

Policy target after MCTS should be in form of probabilities

This issue appears only in the implementation of continuous actions version of MuZero. When computing child visits, we need to divide by sum_visits in order to be in probabilities form. But, it …

2M-kotb updated 2 years ago
1
pytorch/pytorch #43250

Support for Multi-Categorical in torch.distributions

## 🚀 Feature Support for Multi-Categorical in torch.distributions ## Motivation As openai gym supports ``MultiDiscrete`` space, it would be nice if pytorch can support the corresponding…

youkaichao updated 1 year ago
6
suragnair/alpha-zero-general #247

Adoption for single player game

Any suggestions for changing the code such that we can adopt it for single player game in which the rules are available and the goal is to get the highest score? For example, `snake eating egg` game a…

vsahil updated 4 months ago
4
opendilab/LightZero #229

Hyperparameter of Muzero and reproducibility of the results

Hello, I am trying to reproduce the result of Muzero on Atari (I am using the MsPacmanNoFrameskip-v4 env as it's the one with the most published result on the original paper of [Muzero](https://arx…

marintoro updated 1 month ago
2
Farama-Foundation/Gymnasium #1125

[Question] Automated planning over simulators support

### Question Hi, I would like to know if Gymnasium supports the functionality of simulating actions in a given state. For example, the agent is in a state and wants to perform a simulation five steps…

MFaisalZaki updated 5 days ago
4
seungeunrho/minimalRL #11

Add new algorithms

It would be nice to add the following algorithms: - [ ] RAINBOW - [x] A2C (multiprocessing) I will submit a PR if I finish any of them.

rahulptel updated 3 years ago
7
sotetsuk/pgx #1115

Royal Game of Ur Environment

I would like to request Royal Game of Ur environment for Pgx. Royal Game of Ur is a simple race game with chance and perfect information but it has some distinct features. First, it might be an old…

Alian3785 updated 8 months ago
4
google/jax #21946

pmap race condition (?)

### Description I've got a self-play (not important) function that I would like to execute on the CPU, but I'm running into an issue where the execution mysteriously freezes on the 32nd iteration. No…

LeonEricsson updated 1 month ago
2
Tribler/tribler #6942

Web3 recommendations: balancing trust and relevance

- Cum Laude candidate [USA House of Representatives - INVESTIGATION OF COMPETITION IN DIGITAL MARKETS ](https://www.govinfo.gov/content/pkg/CPRT-117HPRT47832/pdf/CPRT-117HPRT47832.pdf) : ``` As p…

synctext updated 6 months ago
48
opendilab/LightZero #225

Minigrid environment

I was looking at the implementation of the minigrid environment as inspiration to create my own environment. I noticed that the correspoin efficient muzero config doesnt actually use the environment. …

Depresivna-ryza updated 1 month ago
1

上一页 1...4 5 6 7 8 9 10...40 下一页

397 results for muzero

397 results
for muzero