-
https://arxiv.org/abs/1802.06070
# Abstract
- Learn **skills** by maximizing an information-theoretic objective with a maximum entropy policy
- Train a typical reinforcement learning task with the best **skill** after unsupervised…
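For quick reference, the paper's pseudo-reward log q(z|s) − log p(z) can be sketched as below. The linear discriminator and its weights are toy stand-ins for illustration, not the paper's actual architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

n_skills = 4  # skills z drawn from a uniform prior p(z)
log_p_z = np.log(1.0 / n_skills)

W = rng.normal(size=(3, n_skills))  # toy discriminator weights; normally learned

def discriminator_log_prob(state, skill):
    """Hypothetical discriminator q(z|s): a linear softmax classifier."""
    logits = state @ W
    logits -= logits.max()  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum())
    return log_probs[skill]

def diayn_reward(state, skill):
    # DIAYN pseudo-reward: log q(z|s) - log p(z).
    return discriminator_log_prob(state, skill) - log_p_z

s = rng.normal(size=3)
r = diayn_reward(s, skill=2)
```

The policy maximizing this reward is pushed toward states the discriminator can classify by skill, which is how skill diversity emerges.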
-
Hi there,
Thanks for sharing your repo; it's been a great help in exploring the field. I have a question I'm not sure of the answer to. In this implementation I believe you have implemented the V(s) funct…
-
Hi there! I'm trying to reproduce your code and found a small issue in the offline training setup. Hope it's helpful.
PYTHONPATH=./ python3 ./_sim_script_example_/ka.py instead of PYTHONPATH=./ pytho…
-
I have looked at several people's work on adding RNNs to reinforcement learning algorithms, but strangely, almost every implementation is different. So I would like to ask how you integrate L…
-
Hello,
I had a quick question about the form of the value function. Right now by default it is an action value function with a linear layer that receives the output of the decoder. I was wondering …
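For concreteness, the difference between the two head shapes being discussed can be sketched in plain NumPy. The feature dimension and weights here are hypothetical, not the repo's actual layers:

```python
import numpy as np

rng = np.random.default_rng(0)
feat_dim, n_actions = 8, 4
features = rng.normal(size=feat_dim)   # stand-in for the decoder output

# Action-value head: one linear output per discrete action, Q(s, .).
W_q = rng.normal(size=(feat_dim, n_actions))
q_values = features @ W_q              # shape (n_actions,)

# State-value head: a single linear output, V(s).
w_v = rng.normal(size=feat_dim)
v_value = features @ w_v               # scalar

# Under a greedy policy V(s) = max_a Q(s, a); a dedicated V head
# estimates that quantity directly instead of going through Q.
```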
-
Hi! I'm trying to implement DDPG as well, based on the paper [Continuous control with deep reinforcement learning](http://arxiv.org/pdf/1509.02971.pdf). Though without much success yet... So I was looking …
-
Hey VinF,
thanks for your work!
I have questions about the DDPG implementation in deer.
Patrick Emami recommends in http://pemami4911.github.io/blog/2016/08/21/ddpg-rl.html to use for the act…
-
In the PPO algorithm described here [https://arxiv.org/pdf/1707.06347.pdf],
experience is collected as (state, action, reward) tuples, [s1,a1,r1,s2,a2,r2,...,sn,an,rn], to train the
actor model and critic mode…
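The [s1,a1,r1,...,sn,an,rn] rollout format above can be sketched as follows. This is a minimal Monte-Carlo return computation for one trajectory; PPO's actual generalized advantage estimation and clipped surrogate objective are omitted:

```python
import numpy as np

def discounted_returns(rewards, gamma=0.99):
    """Compute returns G_t = r_t + gamma * G_{t+1} backwards over one trajectory."""
    returns = np.zeros(len(rewards))
    g = 0.0
    for t in reversed(range(len(rewards))):
        g = rewards[t] + gamma * g
        returns[t] = g
    return returns

# A flat rollout [s1,a1,r1, ..., sn,an,rn] regrouped into parallel arrays.
rollout = [(0.1, 0, 1.0), (0.2, 1, 0.0), (0.3, 0, 2.0)]
states, actions, rewards = map(np.array, zip(*rollout))
returns = discounted_returns(rewards)

# The critic regresses V(s_t) toward `returns`; the actor is updated
# with advantages A_t = G_t - V(s_t).
```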
-
### Issue Checklist
- [X] I am using NexT version 8.0 or later.
- [X] I have already read the [Troubleshooting page of Hexo](https://hexo.io/docs/troubleshooting) and [Troubleshooting page of NexT…
-
## Motivation
### 1. Consistent style for `torch.nn.modules.loss.*Loss`
In `torch.nn.modules.loss`, there are many `*Loss` classes subclassing `nn.Module`. The `Loss.__init__()` does not take other `nn…