stable-baselines Search Results

1000+ results
for stable-baselines

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

hill-a/stable-baselines #666

[question] HER does not sample very last state of episode as…

It seems to me that when HER samples an achieved goal from the replay buffer it never samples the very last state of the episode. Is this intended? As a consequence, the sampling strategy "final" …

nicoguertler updated 4 years ago
3
hill-a/stable-baselines #745

[potential bug] HER Replay buffer observations

Hi, it seems like the state features that are added to the replay buffer in the HindsightExperienceReplayWrapper are features for the full observation-dictionary (observation, achieved-goal, desire…

johannes-dornheim updated 4 years ago
2
rafaelpossas/bayes_sim #1

Pickle file loading issue

Dear @rafaelpossas Thanks for the code. I was trying your code for running an experiment but it seems pickle file is not loading properly. I am getting following error. ``` Traceback (most re…

eshafeeqe updated 3 years ago
1
hill-a/stable-baselines #566

Parallel rollout implementation in HER+DDPG?

In ddpg.py, the parameter `nb_rollout_steps` is an integer containing the number of rollout steps. I believe that this is the same as the parameter `T` in [OpenAI baselines](https://github.com/openai/…

RyanRizzo96 updated 4 years ago
2
evilsocket/pwnagotchi #166

find a way to optimize AI loading times

TensorFlow takes minutes to import on a Raspberry Pi Zero W and that's probably because of the huge .so file with native primitives it has to load, among other things. Given the nature of the project,…

evilsocket updated 4 years ago
6
hill-a/stable-baselines #1044

[question] Issue with multiple instances for DDPG-MPI from s…

Hello, I am pretty new to MPI. I am using stable-baselines DDPG for a custom environment. Everything is working fine and I am getting good results as well. Question: When I use MPI and run the co…

UtkarshMishra04 updated 3 years ago
5
flow-project/flow #787

Exceptions in Traffic Light Grid RL Examples

## Bug Description I am running into an issue with the traffic_light_grid examples from both the stable_baselines and rllib set of examples. For the stable baselines example, the script runs…

acjoseph updated 3 years ago
11
lbertge/baselines #2

About ppo2

Hello, thank you for your code sharing. https://github.com/openai/baselines/pull/1027.At here, You said you don't get much success for intergration ppo2 into gail.Can you tell me what kind of "not su…

Kailiangdong updated 4 years ago
6
thu-ml/tianshou #1157

Unable to replicate original PPO performance

- [x] I have marked all applicable categories: + [x] exception-raising bug + [x] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [ ] ne…

rajfly updated 3 weeks ago
7
Stable-Baselines-Team/stable-baselines3-contrib #223

[Feature Request] STAC algorithm

### 🚀 Feature Build the STAC algorithm as a callable algorithm: https://arxiv.org/pdf/2002.12928.pdf ### Motivation Hyperparametrization is one of the most time/cost expensive thing when training R…

EloyAnguiano updated 5 months ago
4

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for stable-baselines

1000+ results
for stable-baselines