-
Hi, impressive work! Your paper evaluated the tasks with ACT, diffusion policy, and some RL methods, but I haven't found those algorithms in this codebase. Do you plan to open-source this part of the code? Since…
-
I have recently been using the OpenSpiel codebase for a research project and need to modify the reward settings in the games. However, I found that the rewards are encapsulated within pyspiel.so, maki…
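Since rewards computed inside pyspiel.so can't be edited without rebuilding the library, one workaround is to shape them on the Python side. A minimal sketch, assuming the standard `open_spiel.python.rl_environment` wrapper; `shape_reward` is a hypothetical user-defined function, not part of OpenSpiel:
```python
# Sketch: shaping rewards outside pyspiel.so by wrapping the Python
# RL environment. `shape_reward` is a hypothetical helper, not an
# OpenSpiel API.
from open_spiel.python import rl_environment


def shape_reward(reward):
    # Hypothetical example: rescale the raw reward and add a small
    # per-step penalty to encourage shorter games.
    return 10.0 * reward - 0.01


class ShapedRewardEnv:
    """Wraps rl_environment.Environment and rewrites its rewards."""

    def __init__(self, game_name):
        self._env = rl_environment.Environment(game_name)

    def reset(self):
        return self._env.reset()

    def step(self, actions):
        time_step = self._env.step(actions)
        # rewards is None on the initial time step, so guard for that.
        if time_step.rewards is not None:
            shaped = [shape_reward(r) for r in time_step.rewards]
            time_step = time_step._replace(rewards=shaped)
        return time_step


env = ShapedRewardEnv("tic_tac_toe")
```
This keeps the compiled game untouched and confines the modified reward logic to your own training loop.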
-
I have tried some RL algorithms, but none of them manages to land successfully.
Do you have a working example, e.g. PPO with specific hyperparameters?
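For what it's worth, a minimal sketch using Stable-Baselines3's PPO with hyperparameters close to the RL Zoo's tuned values for Gymnasium's LunarLander-v2. These are assumptions; whether they transfer to this repository's lander task is untested:
```python
# Sketch: PPO on a LunarLander-style task with Stable-Baselines3.
# Hyperparameters roughly follow the SB3 RL Zoo values tuned for
# LunarLander-v2; treat them as a starting point only.
import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("LunarLander-v2")  # "LunarLander-v3" on newer Gymnasium

model = PPO(
    "MlpPolicy",
    env,
    n_steps=1024,      # rollout length per update
    batch_size=64,
    n_epochs=4,
    gamma=0.999,       # long horizon: landing reward arrives late
    gae_lambda=0.98,
    ent_coef=0.01,     # extra exploration helps avoid hovering
    verbose=1,
)
model.learn(total_timesteps=1_000_000)
model.save("ppo_lander")
```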
-
### What happened + What you expected to happen
I am trying to run a regression test on the cartpole example and am running into the issue below.
```
(rayvenv) shpa7847@UCB-TDLQ372645 bsk_rl % pyth…
```
-
I can obtain the episode reward mean from the train result, but it fluctuates heavily, which makes it hard to judge when to stop training, so I would like to use the evaluation results instead.
…
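If this refers to RLlib (an assumption from the wording), evaluation can be run periodically during training and the smoother evaluation metric used as a stopping signal. A minimal sketch; the exact result keys vary across Ray versions:
```python
# Sketch: periodic evaluation in RLlib (assumed from the wording).
# The evaluation metric averages over fresh episodes and is usually
# smoother than the training episode reward mean.
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    .evaluation(
        evaluation_interval=5,     # evaluate every 5 train iterations
        evaluation_duration=20,    # average over 20 episodes
        evaluation_duration_unit="episodes",
    )
)
algo = config.build()

for i in range(200):
    result = algo.train()
    if "evaluation" in result:
        # Key layout differs between Ray versions; this matches the
        # older API stack.
        eval_mean = result["evaluation"]["episode_reward_mean"]
        print(f"iter {i}: eval episode_reward_mean = {eval_mean:.1f}")
        if eval_mean >= 475:       # stop once evaluation looks solved
            break
```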
-
https://github.com/Alescontrela/AMP_for_hardware/blob/bfb0dbdcf32bdf83a916790bddf193fffc7e79b8/rsl_rl/rsl_rl/algorithms/amp_ppo.py#L235
When using state normalization, the `sample_amp_expert` tuple…
-
Hi, first of all, great work. This is a very useful library for research on RL and NLP. It would be very helpful if off-policy RL methods like Q-learning, SAC, etc. could be added, along with benc…
-
When training and testing RL policies (arti_mani/algorithms/rl_iam/sac_train_segpts_PNfeat.py & arti_mani/algorithms/rl_iam/sac_eval_segpts_PNfeat.py), you can choose the **frameweight_sample** met…
-
- Value based RL
- [ ] DQN
- [ ] Rainbow DQN
- [ ] [CQL](https://sites.google.com/view/cql-offline-rl)
- Value based + Policy based RL
- [x] DDPG
- [ ] [TD3](https://spinni…
-
Our current baseline RL algorithm is DQN (more accurately, it is DDQN). The named algorithm uses epsilon-greedy policies so that it at least has a chance of fully exploring the environment in question. Using epsi…
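For reference, a minimal sketch of epsilon-greedy action selection with a linearly decaying epsilon; the names and the schedule are illustrative, not taken from this codebase:
```python
# Sketch: epsilon-greedy action selection with linear decay.
# Illustrative only; names and schedule are not from this codebase.
import random

import numpy as np


def epsilon_greedy(q_values, epsilon):
    """With probability epsilon explore, otherwise act greedily."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))   # random action
    return int(np.argmax(q_values))              # greedy action


def linear_epsilon(step, start=1.0, end=0.05, decay_steps=50_000):
    """Anneal epsilon from `start` to `end` over `decay_steps` steps."""
    frac = min(step / decay_steps, 1.0)
    return start + frac * (end - start)
```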