Closed Kermalis closed 2 weeks ago
Hi. Thanks for trying RLMatrix
Yes cuda is used by default if available.
IContinuousEnvironment will not work and has been completely removed. I have not included a replacement yet. It will be something like this: https://github.com/asieradzk/RL_Matrix/blob/master/src/RLMatrix.Common/IEnvironmentAsync.cs I've decided to no longer reset env for users based on step counters, since some use-cases may not require that.
Looks like you might be using an older version, you can clone the one from the repo for best experience. There have been a lot of changes between now and the nuget...
I'll update you when ContinousEnv becomes available (maybe today)!
I got you homie. Try now. I've added IContinuousEnvironmentAsync you can use it with
LocalContinuousRolloutAgent
Clone the repo, its not on the nuget.
There are 2 caveats: Only works with training: true Only works with 1 environment.
Going to take weekend now but thanks for motivating me to put this in.
Its a bit hard to get right because I insist on using shared parameters for multiple discrete and continuous heads so I've made a mistake somewhere when slicing output tensors.
Example:
https://github.com/asieradzk/CartPoleForTesting/blob/master/TrivialContinuousEnvironmentAsync%20.cs
Awesome, it looks great. I won't be able to try it out for a while though. I'll close this issue for now
Hello, I'm currently in the process of porting a (failed) TorchSharp agent over to RL Matrix. Honestly thanks so much for this library. I'm just wondering if you're planning on using the TorchSharp-cuda-windows or TorchSharp-cuda-linux packages eventually? I was using cuda in my previous environment and was hoping it was in the plans
EDIT: I stepped through the debugger and saw that the tensors were already using cuda. I just assumed it didn't since I didn't see it in the README.
I guess while I'm here writing this, I was also wondering if there was an example for a
IContinuousEnvironment<>
? This is the environment I need since I don't need discrete actions, but the examples were all regularIEnvironment<>
from what I saw. Maybe I missed it as well lolI'm asking since I'm running into this error:![image](https://github.com/asieradzk/RL_Matrix/assets/29823718/10216bf0-9f89-4402-bbdc-bf8c092afc6e)
My env is extremely basic, I am trying to set up 3 inputs and 3 (continuous) outputs with 0 discrete outputs.
I also noticed
stepCounter
never increases from 0