-
## Question
Why does the zoo call the standard `make_vec_env()` for all environments, including Atari, when sb3 has a special function for them, `make_atari_env()`?
## Train of thought
- train.py calls …
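
For context, a minimal sketch of what the two helpers do differently, based on stable-baselines3's documented `env_util` API (the wrapper details below follow `AtariWrapper`'s defaults):

```python
from stable_baselines3.common.env_util import make_atari_env, make_vec_env

# Plain vectorized env: no Atari-specific preprocessing is applied.
vec_env = make_vec_env("BreakoutNoFrameskip-v4", n_envs=4, seed=0)

# Atari helper: wraps each env in AtariWrapper (noop reset, frame skip,
# 84x84 grayscale resize, episodic life, reward clipping) before vectorizing.
atari_env = make_atari_env("BreakoutNoFrameskip-v4", n_envs=4, seed=0)
```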
-
Hi! It would be awesome to be able to implement LSTM policies in this library, like in the former version. Is there a straightforward way to accomplish this with the current version?
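
For what it's worth, recurrent policies have since landed in sb3-contrib rather than stable-baselines3 itself; a minimal sketch, assuming sb3-contrib is installed:

```python
from sb3_contrib import RecurrentPPO

# RecurrentPPO with an LSTM policy, roughly the sb3-contrib analogue of the
# old PPO2 + MlpLstmPolicy combination from stable-baselines.
model = RecurrentPPO("MlpLstmPolicy", "CartPole-v1", verbose=1)
model.learn(total_timesteps=10_000)
```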
-
**Important Note: We do not do technical support, nor consulting** and don't answer personal questions via email.
Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [R…
-
- [x] I have marked all applicable categories:
+ [x] exception-raising bug
+ [x] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…
-
Hi, I followed the instructions in Design a New Learning Environment to build a Rollerball project. I completed the 3D Ball demo beforehand, and both training a model and running the pre-trained model work in that dem…
-
Hi @alex-petrenko,
I ran the code on dmlab-30 with exactly the same arguments/configurations as in the README.
However, as shown in the figure below, the obtained scores (mean capped) are lower than the…
-
Related issue #171
As seen in #171, cloudpickle/pickle can easily fail when transferring models between Python versions (and in the case of some other shenanigans). There is currently no way to address t…
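
One workaround sketch (not the fix proposed in the issue), assuming stable-baselines3's `get_parameters()`/`set_parameters()` API: save only the raw state dicts, so nothing beyond tensors needs to be pickled.

```python
import torch
from stable_baselines3 import PPO

model = PPO("MlpPolicy", "CartPole-v1")

# Save only the parameter state dicts; torch.save avoids cloudpickling the
# whole algorithm object, which is what breaks across Python versions.
torch.save(model.get_parameters(), "ppo_params.pt")

# Restore into a freshly constructed model on the other end.
fresh = PPO("MlpPolicy", "CartPole-v1")
fresh.set_parameters(torch.load("ppo_params.pt"))
```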
-
For custom environments, do I need to normalize the observation array, or is it done by stable-baselines internally?
This is the learning code:
```
env = MyEnv(config)
policy_kwargs = di…
```
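
For reference: stable-baselines does not normalize observations internally; the usual way to add it is the `VecNormalize` wrapper. A minimal sketch, reusing the `MyEnv` and `config` names from the snippet above:

```python
from stable_baselines3 import PPO
from stable_baselines3.common.vec_env import DummyVecEnv, VecNormalize

# MyEnv/config come from the snippet above. VecNormalize keeps a running
# mean/std of observations and normalizes them on the fly.
env = DummyVecEnv([lambda: MyEnv(config)])
env = VecNormalize(env, norm_obs=True, norm_reward=False)

model = PPO("MlpPolicy", env)
```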
-
### What is the problem?
I found that multiple runs of PPO still have different performance even when we set the same seed.
How can we obtain exactly the same result with the same seed?
Current strate…
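
A common starting point is to seed every RNG the training stack touches; a sketch below (the `seed_everything` helper name is hypothetical, and this alone does not guarantee bitwise reproducibility, since CUDA kernels and parallel rollouts can still introduce nondeterminism):

```python
import random
import numpy as np
import torch

def seed_everything(seed: int) -> None:
    # Seed Python, NumPy, and PyTorch (CPU and all CUDA devices).
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Trade speed for determinism in cuDNN kernels.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False

seed_everything(42)
```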
-
I've read similar questions (e.g. [#30](https://github.com/hill-a/stable-baselines/issues/30)) that were asked here about loading the model after training, but I still could not figure out what th…
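
For reference, the basic loading pattern in stable-baselines (the TF1 version the linked issue refers to); the saved-model path here is hypothetical:

```python
import gym
from stable_baselines import PPO2

# "ppo2_cartpole" is a placeholder for whatever path model.save() wrote.
env = gym.make("CartPole-v1")
model = PPO2.load("ppo2_cartpole", env=env)  # env only needed to keep training

obs = env.reset()
action, _states = model.predict(obs, deterministic=True)
```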