a3c-lstm Search Results

160 results
for a3c-lstm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

PyLadiesBerlin/community-organisation #9

[EVENT] deeplearning study group

I'm proposing holding space for a study group on the dl course by hugging face (https://huggingface.co/deep-rl-course/unit1/introduction?fw=pt) I need to have it discussed in a c-base circle the circl…

marimeireles updated 2 months ago
7
clear-datacenter/plan #1

ML Spark

wanghaisheng updated 5 years ago
61
Kismuz/btgym #140

Support Tensorflow 2

##### Expected behaviour: Environment fully supports TF2 ##### Actual behaviour: Only 1.5 is supported for certain scripts I ran the [automatic upgrade](https://www.tensorflow.org/guide/upgra…

mcrowson updated 4 months ago
14
microsoft/CNTK #960

.NET Support

Issue #817 was closed with this response: > We fell a little behind. The Python bindings are done in SWIG. I think that can be quickly repurposed into .Net bindings, once done. So it shouldn't be too…

StevenGann updated 4 years ago
125
Stable-Baselines-Team/stable-baselines3-contrib #222

[Question] how to use "lstm_states" from rollout_buffer to r…

### ❓ Question Hi all! I hope to integrate RNN(LSTM/GRU) to off-policy algorithm(SAC and TD3) without multiprocessing like A3C.So I checked SB3-contrib code about recurrentPPO and [the recurrent…

DeepRowLie updated 9 months ago
2
fly51fly/aicoco #4

爱可可老师一周热门分享

fly51fly updated 4 years ago
99
fly51fly/aicoco #3

爱可可老师24小时热门分享

微博内容精选

fly51fly updated 5 months ago
1907
vwxyzjn/cleanrl #350

Reproduction of Muesli

## Problem Description [Muesli](https://arxiv.org/abs/2104.06159) is a next-generation policy gradient algorithm from DeepMind that performs exceptionally well. Notably, it can match MuZero’s SOTA …

vwxyzjn updated 10 months ago
23
ray-project/ray #9071

[rllib] State shapes incorrect using custom model (TorchMode…

### What is the problem? It seems that the states being passed to TorchModelV2 and TFModelV2 are incorrect, as the shapes don't seem to match up. Please see the stack traces below. Note that I am u…

msloma144 updated 1 year ago
6
deeplearning4j/deeplearning4j #5273

RL4J: Question: Multiple inputs in comp graph

_From @Ailanz on March 6, 2018 15:50_ Hi, I have a question. I want to train DDQN / A3C with comp graph that has multiple inputs. Half of the input will feed into LSTM while the other half feed into …

AlexDBlack updated 1 year ago
7

上一页 1...3 4 5 6 7 8 9...16 下一页

160 results for a3c-lstm

160 results
for a3c-lstm