-
Building off the JSONs of the best hyperparameters saved in https://github.com/DLR-RM/rl-baselines3-zoo/pull/140, I have a script that takes each one and trains it 10 times (by default) to determ…
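A minimal sketch of that kind of script, assuming the saved JSONs each hold one hyperparameter set; `train_once` is a hypothetical stand-in for the actual training entry point:

```python
import json
import statistics
from pathlib import Path
from typing import Callable

def evaluate_hyperparams(
    json_dir: str,
    train_once: Callable[[dict, int], float],  # hypothetical: (params, seed) -> final score
    n_runs: int = 10,
) -> dict:
    """Train each saved hyperparameter set n_runs times with different
    seeds and report the mean/stdev of the final scores."""
    results = {}
    for path in sorted(Path(json_dir).glob("*.json")):
        params = json.loads(path.read_text())
        scores = [train_once(params, seed) for seed in range(n_runs)]
        results[path.stem] = {
            "mean": statistics.mean(scores),
            "stdev": statistics.pstdev(scores),
        }
    return results
```

Averaging over several seeds like this is what separates a genuinely good hyperparameter set from a lucky single run.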
-
run: pip install -e .
return: ERROR: Could not build wheels for detectron2, which is required to install pyproject.toml-based projects
detail:
39\detectron2\model_zoo\configs\new_baselines
…
-
**Is your feature request related to a problem? Please describe.**
The current version of SMARTS contains many different components, but these components can conflict and cause problems with other…
-
Hi there,
When doing hyperparameter tuning with rl-zoo, I often accidentally test a parameter set that produces invalid or explosive policies for a particular algorithm/environment.
Occasionally t…
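One way to cut those runs short is a simple divergence guard that aborts once episode rewards become non-finite or blow up. This is a sketch of the idea only, not tied to any particular library's callback API:

```python
import math

class DivergenceGuard:
    """Stop a training run early when episode rewards look divergent:
    NaN/inf, or larger in magnitude than a user-chosen threshold."""

    def __init__(self, max_abs_reward: float = 1e6):
        self.max_abs_reward = max_abs_reward
        self.stopped = False

    def check(self, episode_reward: float) -> bool:
        """Return True to continue training, False to stop this run."""
        if not math.isfinite(episode_reward) or abs(episode_reward) > self.max_abs_reward:
            self.stopped = True
            return False
        return True
```

Wired into a per-episode callback, this lets a hyperparameter sweep skip to the next trial instead of burning the full budget on an exploded policy.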
-
Hello,
I've tried in vain to find suitable hyperparameters for SAC to solve MountainCarContinuous-v0.
Even with hyperparameter tuning (see the "add-trpo" branch of [rl baselines zoo](https:…
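For what it's worth, MountainCarContinuous-v0 is a sparse-reward task where exploration matters more than most hyperparameters; temporally correlated action noise such as an Ornstein-Uhlenbeck process is one common remedy, since uncorrelated noise rarely builds enough momentum to reach the goal. A stdlib-only sketch (the parameter values are illustrative, not tuned):

```python
import random

class OrnsteinUhlenbeckNoise:
    """Temporally correlated exploration noise:
    x += theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1).
    Successive samples are correlated, so the perturbed actions push
    in a consistent direction for a while instead of jittering."""

    def __init__(self, mu=0.0, theta=0.15, sigma=0.5, dt=1.0, seed=None):
        self.mu, self.theta, self.sigma, self.dt = mu, theta, sigma, dt
        self.rng = random.Random(seed)
        self.x = mu

    def sample(self) -> float:
        self.x += (self.theta * (self.mu - self.x) * self.dt
                   + self.sigma * (self.dt ** 0.5) * self.rng.gauss(0.0, 1.0))
        return self.x
```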
-
Hi,
Currently, I have used 4 algorithms from stable-baselines on the Roboschool HumanoidFlagrunHarder task. My evaluation metric is the mean reward over 100 episodes. Basically: PPO2 is perfect, A2…
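When ranking algorithms by mean reward over 100 episodes, it helps to report a confidence interval as well, so the comparison accounts for run-to-run noise. A small helper using only the stdlib (normal approximation, illustrative only):

```python
import statistics

def mean_reward_with_ci(rewards, z=1.96):
    """Mean episode reward plus a normal-approximation confidence
    interval (z=1.96 gives ~95%). Returns (mean, lower, upper)."""
    n = len(rewards)
    mean = statistics.mean(rewards)
    sem = statistics.stdev(rewards) / n ** 0.5 if n > 1 else 0.0
    return mean, mean - z * sem, mean + z * sem
```

If the intervals of two algorithms overlap heavily, "A beats B" may just be seed luck.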
-
I'm trying to resume the model training and I'm getting some strange results. Using SubProcVecEnv and VecNormalize on a custom environment:
```python
from stable_baselines.common.policies import MlpPoli…
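A frequent cause of strange results when resuming with VecNormalize is that its running observation statistics are not restored along with the model, so the reloaded policy sees differently normalized inputs. The sketch below is a pure-Python illustration of the kind of running mean/variance such a wrapper tracks and why it must be saved and reloaded; it is not the stable-baselines API itself:

```python
import pickle

class RunningMeanStd:
    """Running mean/variance of a stream of observations, in the spirit
    of what VecNormalize tracks internally. Losing this state on resume
    changes the normalization the model was trained against."""

    def __init__(self, epsilon=1e-4):
        self.mean, self.var, self.count = 0.0, 1.0, epsilon

    def update(self, x: float) -> None:
        # Parallel-update (Chan et al.) form for a single new sample.
        delta = x - self.mean
        total = self.count + 1.0
        self.mean += delta / total
        m2 = self.var * self.count + delta * delta * self.count / total
        self.var = m2 / total
        self.count = total

    def save(self, path: str) -> None:
        with open(path, "wb") as f:
            pickle.dump(self.__dict__, f)

    def load(self, path: str) -> None:
        with open(path, "rb") as f:
            self.__dict__.update(pickle.load(f))
```

The practical takeaway: persist the normalization statistics together with the model checkpoint, and restore both before continuing training.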
-
### ❓ Question
Hello, I'm trying to solve the FrozenLake-v1 environment with `is_slippery=True` (non-deterministic) with the Stable Baselines3 A2C algorithm. I can solve the 4x4 version but I can't …
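One thing worth checking on the larger map is the discount factor: the single terminal reward sits at least 14 moves from the start on the 8x8 layout (the Manhattan distance), and slippery dynamics make realistic successful episodes much longer, so the learning signal at the start state shrinks as gamma**steps. A quick illustrative calculation:

```python
def discounted_signal(gamma: float, steps: int) -> float:
    """Magnitude of a single terminal reward of 1.0 as seen from a
    state `steps` transitions away, ignoring transition stochasticity."""
    return gamma ** steps

# With gamma = 0.9 and a 50-step successful episode, the signal at the
# start state is under 1%, which can stall learning on the 8x8 map.
```

Raising gamma (e.g. toward 0.99) and relying on an entropy bonus for exploration are the usual first adjustments for the bigger, sparser map.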
-
Hi, I'm using PPO1 in my experiments. I scale the action to [-1, 1] and rescale it in the environment. The results are relatively good except for one issue: the actions are very unstable and have …
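For reference, the usual affine map between a policy's [-1, 1] range and the environment's action bounds looks like this (a generic sketch, not PPO1-specific code); getting it and its inverse exactly symmetric rules out one common source of action artifacts:

```python
def scale_action(action: float, low: float, high: float) -> float:
    """Map an action from the policy's [-1, 1] range to [low, high]."""
    return low + 0.5 * (action + 1.0) * (high - low)

def unscale_action(action: float, low: float, high: float) -> float:
    """Inverse map from the env's [low, high] range back to [-1, 1]."""
    return 2.0 * (action - low) / (high - low) - 1.0
```

If the actions still oscillate after that, jitter usually comes from the policy itself (e.g. a large action standard deviation) rather than from the rescaling.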
-
Thank you for open-sourcing such great code!
I have questions about your IQN implementation, especially how it reproduces the scores reported by the paper.
First, your config file https://…
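As background for the question, the per-sample quantile Huber loss that QR-DQN/IQN-style methods minimize can be written in a few lines. This is an illustrative stdlib-only version, independent of this repository's actual implementation:

```python
def huber(u: float, kappa: float = 1.0) -> float:
    """Huber loss L_kappa(u): quadratic near zero, linear in the tails."""
    return 0.5 * u * u if abs(u) <= kappa else kappa * (abs(u) - 0.5 * kappa)

def quantile_huber_loss(td_error: float, tau: float, kappa: float = 1.0) -> float:
    """rho^kappa_tau(u) = |tau - 1{u < 0}| * L_kappa(u) / kappa.
    The asymmetric weight |tau - 1{u < 0}| is what makes the minimizer
    the tau-quantile of the target distribution rather than the mean."""
    indicator = 1.0 if td_error < 0.0 else 0.0
    return abs(tau - indicator) * huber(td_error, kappa) / kappa
```

In IQN the taus are sampled uniformly per transition and the loss is averaged over all (tau, target-quantile) pairs, so small config differences (number of taus, kappa) can noticeably shift reported scores.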