EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
Apache License 2.0
175 stars 14 forks source link

[FEATURE] Add support for efficient recurrent models #54

Open smorad opened 3 months ago

smorad commented 3 months ago

Feature

Revisiting Recurrent Reinforcement Learning with Memory Monoids provides a method to combine recurrent models with standard, nonrecurrent RL losses. This should provide support for S5, LRU, FFM, Linear Transformer, etc recurrent models in stoix. Note that models like LSTM or GRU would be intractable under this paradigm, and would require significantly more effort to integrate into stoix.

Proposal

Benchmarking (Optional)

Definition of done

We have an example that can solve a few POPGym tasks

Mandatory checklist before making a PR

Links / references / screenshots

EdanToledo commented 3 months ago

On the roadmap and seems very exciting to have implemented. Any help would be greatly appreciated.