-
- [x] Foreword (2nd Edition): RL
- [x] Preface: RL
- [x] 1 Introduction: RL, JM, JN
- [x] 2 Geographic data in R: RL
- [x] 3 Attribute data operations: RL
- [x] 4 Spatial data operations: RL
- […
-
https://github.com/Alescontrela/AMP_for_hardware/blob/bfb0dbdcf32bdf83a916790bddf193fffc7e79b8/rsl_rl/rsl_rl/algorithms/amp_ppo.py#L235
When using state normalization, the `sample_amp_expert` tuple…
-
The current algorithm being used is DQN or, more specifically, DDQN. This off-policy algorithm is capable of adapting to the environment during episodes. However, this feature is not necessary for this pr…
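As a hedged sketch of the Double DQN idea mentioned above (all names and the toy numbers below are illustrative, not taken from the project's code): the online network selects the greedy next action, while the target network evaluates it, which reduces the overestimation bias of vanilla DQN.

```python
import numpy as np

def ddqn_targets(rewards, dones, next_q_online, next_q_target, gamma=0.99):
    """Double DQN bootstrap targets.

    rewards, dones: shape (batch,); dones marks terminal transitions.
    next_q_online, next_q_target: shape (batch, n_actions).
    The online net picks argmax actions; the target net evaluates them.
    """
    best_actions = np.argmax(next_q_online, axis=1)
    evaluated = next_q_target[np.arange(len(rewards)), best_actions]
    return rewards + gamma * (1.0 - dones) * evaluated

# Toy batch of 2 transitions (second one is terminal).
r = np.array([1.0, 0.0])
d = np.array([0.0, 1.0])
q_on = np.array([[0.2, 0.8], [0.5, 0.1]])
q_tg = np.array([[0.3, 0.6], [0.4, 0.2]])
targets = ddqn_targets(r, d, q_on, q_tg)
# first: 1.0 + 0.99 * 0.6 = 1.594; second (terminal): 0.0
```

In vanilla DQN the target net both selects and evaluates the action; splitting these two roles is the only change DDQN makes to the update.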
-
* ACO (heuristic-based swarm algorithms)
* ACO_LS (our approach)
* OR-Tools (serves as ground truth, but should be considered a baseline)
* RL (L2D, Jsp-env, etc.): reinforcement-learning-based algor…
-
Hi all,
I want to apply multi-agent reinforcement learning, specifically the algorithms PPO, TRPO, DDPG, and A2C. I don't understand how to write a Carla environment for these algorithms. Is any …
-
Hi, first of all, great work. This is a very useful library for research on RL and NLP. It would be very helpful if it were possible to add off-policy RL methods like Q-learning, SAC, etc., along with benc…
-
Hi there!
Thank you for developing SBX! I'm currently working with SB3 for real-time robot control and was wondering whether SBX supports frame stacking (`framestack`) with the `DummyVecEnv` wrapper. Additionally, can …
-
Implement the best practices from the multi-agent RL community and Stable-Baselines3 into our algorithm. Further, analyse similarities between the PettingZoo multi-agent implementation and the current RL implementa…
-
Our algorithm has a long gradient chain, and in RL the speedup from JAX is very noticeable. I would like to ask: would using JAX in GPUDrive be much faster than torch (after applying `jax.jit()`)?
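As a minimal sketch of the point about `jax.jit()` (the function and parameter names below are hypothetical, not GPUDrive code): JAX traces the whole forward-plus-backward computation once and compiles it with XLA, so a long chain of small ops, which is slow to dispatch eagerly, fuses into one compiled kernel that can be reused every call.

```python
import jax
import jax.numpy as jnp

# Hypothetical toy loss with a deliberately long gradient chain.
def chained_loss(params, x):
    for w in params:
        x = jnp.tanh(w * x)   # each step extends the gradient chain
    return jnp.sum(x)

# jit compiles the entire backward pass end-to-end; the first call
# traces and compiles, subsequent calls reuse the compiled kernel.
grad_fn = jax.jit(jax.grad(chained_loss))

params = [0.5] * 50                 # 50 links in the chain
g = grad_fn(params, jnp.ones(4))    # one gradient entry per link
```

Whether this beats torch in practice depends on op sizes and how much of the pipeline can be traced; the win is largest when the chain consists of many small ops, as described above.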
-
### What happened + What you expected to happen
Running the example script `rllib/examples/connectors/flatten_observations_dict_space.py` raises an error because the order of Connectors in the `env_t…