pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
https://pytorch.org/rl
MIT License

[Feature Request] Implement Efficient Ops to Compute Returns #140

Open xiaomengy opened 2 years ago

xiaomengy commented 2 years ago

In RL algorithms, it is very common to compute return-like quantities from trajectories. Computing such returns with plain Python for-loops is inefficient. To improve efficiency, we should abstract some general return ops and implement efficient PyTorch kernels for them. Some examples of such returns are listed below.

  1. TD-lambda return
  2. GAE in PPO
  3. ...
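To make the inefficiency concrete, here is a sketch of the kind of per-step Python loop the issue is referring to, using GAE as the example. The function name and signature are illustrative, not an existing TorchRL API:

```python
import torch

def gae_advantages(rewards, values, next_values, dones, gamma=0.99, lmbda=0.95):
    """Naive backward-loop GAE: one Python iteration per time step,
    which is exactly the pattern that gets slow on long trajectories."""
    T = rewards.shape[0]
    advantages = torch.zeros_like(rewards)
    gae = torch.zeros_like(rewards[0])
    for t in reversed(range(T)):
        not_done = 1.0 - dones[t]
        # TD residual: delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * next_values[t] * not_done - values[t]
        # GAE recursion: A_t = delta_t + gamma * lambda * A_{t+1}
        gae = delta + gamma * lmbda * not_done * gae
        advantages[t] = gae
    return advantages
```

The per-step Python overhead dominates for long trajectories, which is what motivates vectorized or C++ kernels.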
vmoens commented 2 years ago

Great suggestion! So far we have implemented a vectorized version of the lambda return that runs several orders of magnitude faster than regular for-loops, without requiring any C++ code. I'd love to see how much more we can gain by designing a C++ kernel for it, if that's what you have in mind.
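One way such a vectorization can work (a sketch under simplifying assumptions, not necessarily TorchRL's actual implementation): the backward recursion is a discounted suffix sum, which can be expressed as a single matrix-vector product for a single done-free trajectory:

```python
import torch

def vec_discounted_suffix_sum(deltas, gamma, lmbda):
    """Vectorized discounted sum: out[t] = sum_{j >= t} (gamma*lmbda)^(j-t) * deltas[j].
    Replaces the per-step backward Python loop with one matmul
    (O(T^2) memory, but no Python-level iteration).
    Assumes a single trajectory with no episode boundaries."""
    T = deltas.shape[0]
    idx = torch.arange(T, dtype=deltas.dtype)
    # exponents[t, j] = j - t; entries below the diagonal are clamped then masked
    exponents = (idx.unsqueeze(0) - idx.unsqueeze(1)).clamp(min=0)
    decay = torch.triu((gamma * lmbda) ** exponents)
    return decay @ deltas
```

Applied to the TD residuals, this yields the GAE advantages (or, with the appropriate decay, the lambda return) in one shot; the quadratic memory cost is the usual trade-off of this trick.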

Here are the things I would keep in mind in this endeavour:

  1. We'd like the resulting functions to be compatible with functorch. That means the operations should be -- at least -- twice differentiable.
  2. I think it's fine to use C++ to optimize things, but knowing how researchers work, we should keep the non-optimized functions as well: people may want to copy-paste them and tweak them at will.
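Requirement (1) can be sanity-checked directly with autograd: differentiate the op once with `create_graph=True`, then differentiate the result again. The minimal loop-based TD(lambda) return below is a hypothetical helper for illustration, not TorchRL's actual implementation:

```python
import torch

def td_lambda_return(rewards, values, gamma=0.99, lmbda=0.95):
    """Loop-based TD(lambda) return, kept differentiable on purpose.
    `values` has one extra entry for bootstrapping: values[t] ~ V(s_t)."""
    g = values[-1]
    returns = []
    for t in reversed(range(rewards.shape[0])):
        # G_t = r_t + gamma * ((1 - lambda) * V(s_{t+1}) + lambda * G_{t+1})
        g = rewards[t] + gamma * ((1 - lmbda) * values[t + 1] + lmbda * g)
        returns.append(g)
    return torch.stack(returns[::-1])

values = torch.randn(6, requires_grad=True)
rewards = torch.randn(5)
loss = td_lambda_return(rewards, values).pow(2).sum()
# First derivative, keeping the graph so we can differentiate again:
(g1,) = torch.autograd.grad(loss, values, create_graph=True)
# Second derivative: this call must succeed for functorch compatibility.
(g2,) = torch.autograd.grad(g1.sum(), values)
```

If the second `torch.autograd.grad` call raises, the op is not twice differentiable as written, which would rule out some functorch use cases.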