-
Hi Patrick,
For my next YT video I want to showcase Eqx for RL, so I'm using it to train an agent (with a policy gradient algorithm) on some gym environments.
The agent solves the environment eas…
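For reference, the training step is essentially the minimal REINFORCE-style sketch below (the gym rollout is omitted, and the 4-in/2-out CartPole-like sizes and hyperparameters are placeholders, not my exact setup):

```python
import jax
import jax.numpy as jnp
import equinox as eqx
import optax

key = jax.random.PRNGKey(0)
# CartPole-like placeholder sizes: 4-dim observations, 2 discrete actions.
policy = eqx.nn.MLP(in_size=4, out_size=2, width_size=64, depth=2, key=key)
optim = optax.adam(1e-3)
opt_state = optim.init(eqx.filter(policy, eqx.is_array))

def pg_loss(policy, obs, actions, returns):
    # REINFORCE: maximize E[log pi(a|s) * G] by minimizing its negative.
    logits = jax.vmap(policy)(obs)
    logp = jax.nn.log_softmax(logits)
    logp_a = jnp.take_along_axis(logp, actions[:, None], axis=1).squeeze(-1)
    return -jnp.mean(logp_a * returns)

@eqx.filter_jit
def train_step(policy, opt_state, obs, actions, returns):
    grads = eqx.filter_grad(pg_loss)(policy, obs, actions, returns)
    updates, opt_state = optim.update(grads, opt_state)
    policy = eqx.apply_updates(policy, updates)
    return policy, opt_state
```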
-
1e2e4bcb4761ba6107fb565982f6cc2b951cbeb5 introduced the version qualifier `numpy < 1.23` due to failing tests with jax, but I believe this has been resolved in recent jax versions and we no longer need …
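If it has indeed been fixed, the change would just be dropping the upper bound, along the lines of this hypothetical setup.py fragment (the surrounding names are illustrative, not the repo's actual file):

```python
# Hypothetical setup.py fragment: relax the pin once the jax-side
# test failures that motivated it are confirmed fixed.
install_requires = [
    "jax",
    "numpy",  # previously "numpy < 1.23"
]
```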
-
Hi,
I noticed that the PopArt layer is applied to the value head of the models in Meltingpot v2.0. I was able to implement it for IMPALA successfully, but when applying it to OPRE, there seems to be…
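For concreteness, the update I have in mind follows the standard output-preserving PopArt recipe (van Hasselt et al., 2016); this is a minimal standalone sketch rather than the Meltingpot code, and `beta` and the clipping bounds are placeholder choices:

```python
import jax.numpy as jnp
from typing import NamedTuple

class PopArtState(NamedTuple):
    mu: jnp.ndarray     # running first moment of the value targets
    nu: jnp.ndarray     # running second moment of the value targets
    sigma: jnp.ndarray  # scale, sqrt(nu - mu**2)

def popart_update(state, targets, w, b, beta=3e-4):
    # Update the running statistics of the value targets...
    mu = (1 - beta) * state.mu + beta * jnp.mean(targets)
    nu = (1 - beta) * state.nu + beta * jnp.mean(targets ** 2)
    sigma = jnp.sqrt(jnp.clip(nu - mu ** 2, 1e-4, 1e6))
    # ...and rescale the value head so its unnormalized output is preserved:
    # sigma_new * (w'x + b') + mu_new == sigma_old * (wx + b) + mu_old.
    w_new = w * state.sigma / sigma
    b_new = (state.sigma * b + state.mu - mu) / sigma
    return PopArtState(mu, nu, sigma), w_new, b_new
```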
-
[loss of a2c rl_losses.py](https://github.com/deepmind/open_spiel/blob/c3f8b538afd6223d450c0f74269937e76850cf33/open_spiel/python/algorithms/losses/rl_losses.py#L196)
I think the total loss should be…
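For comparison, the textbook A2C composition is the policy loss plus a weighted critic loss minus a weighted entropy bonus, something like this sketch (coefficient values are illustrative, not open_spiel's defaults):

```python
import jax
import jax.numpy as jnp

def a2c_total_loss(logits, actions, advantages, values, returns,
                   baseline_cost=0.5, entropy_cost=0.01):
    logp = jax.nn.log_softmax(logits)
    logp_a = jnp.take_along_axis(logp, actions[:, None], axis=1).squeeze(-1)
    # Actor: advantages are treated as constants w.r.t. the policy params.
    policy_loss = -jnp.mean(jax.lax.stop_gradient(advantages) * logp_a)
    # Critic: squared error against the (bootstrapped) returns.
    critic_loss = 0.5 * jnp.mean((returns - values) ** 2)
    # Entropy bonus encourages exploration, so it is subtracted.
    entropy = -jnp.mean(jnp.sum(jnp.exp(logp) * logp, axis=1))
    return policy_loss + baseline_cost * critic_loss - entropy_cost * entropy
```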
-
I followed the installation instructions; however, some things went wrong. This is an AMD computer running Windows, with Python 3.9.13.
`pip install dm-acme[tensorflow]` works well. But pip in…
-
I get this error when trying to import `rlax`, I think because `tree_multimap` has been deprecated?
```
ImportError: cannot import name 'tree_multimap' from 'jax.tree_util' (/home/rohanmehta/anaconda3/…
```
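`tree_multimap` was deprecated and then removed from jax; it was folded into `tree_map`, which now accepts multiple trees, so either pinning jax to an older version or upgrading rlax should resolve it. The replacement pattern:

```python
import jax

t1 = {"w": 1.0, "b": 2.0}
t2 = {"w": 10.0, "b": 20.0}

# Old (removed): jax.tree_util.tree_multimap(lambda a, b: a + b, t1, t2)
# New: tree_map takes the extra trees as positional arguments directly.
summed = jax.tree_util.tree_map(lambda a, b: a + b, t1, t2)
print(summed)  # {'b': 22.0, 'w': 11.0}
```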
-
In reinforcement learning, a target network is a common technique to assist off-policy value learning. In PyTorch-based implementations, `target_q_network = deepcopy(q_network)` could create a target ne…
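In JAX the same effect is simpler, since parameters are immutable pytrees; a minimal sketch (function names are mine):

```python
import jax
import jax.numpy as jnp

def make_target(online_params):
    # Arrays are immutable, so rebinding already acts as a snapshot;
    # tree_map(jnp.copy, ...) just makes the independence explicit.
    return jax.tree_util.tree_map(jnp.copy, online_params)

def polyak_update(target_params, online_params, tau=0.005):
    # Soft update: target <- (1 - tau) * target + tau * online.
    return jax.tree_util.tree_map(
        lambda t, o: (1.0 - tau) * t + tau * o, target_params, online_params)
```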
-
Curated Weibo content
-
Could we create a new subpackage (or find somewhere to put this) for native loss functions written in JAX? The issue is that when I try to use loss functions from, say, `sk_metrics`, the function will n…
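As a sketch of what I mean by "native", here is a binary log loss written purely in `jnp`, so it is differentiable and jit-compatible (unlike a NumPy-based metric such as sklearn's):

```python
import jax
import jax.numpy as jnp

def log_loss(y_true, logits):
    # Numerically stable binary cross-entropy with logits:
    # log(1 + exp(z)) - y * z == logaddexp(0, z) - y * z.
    return jnp.mean(jnp.logaddexp(0.0, logits) - y_true * logits)

grad_fn = jax.jit(jax.grad(log_loss, argnums=1))  # works, unlike a NumPy metric
```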
-
Hello,
I can't PR this for you because the docs are not in this repo, so I'll just open this issue.
At https://rlax.readthedocs.io/en/latest/api.html#id1, the section MPO Compute Weights and Temper…