-
I am trying to use debugging to figure out the process of A2C method in tf2 branch
so I add the code in a2c.py as follows:
`if __name__ == '__main__':
import gym
learn('mlp',gym.vector.ma…
-
### Describe the bug
While starting a sweep through `.yml` file, the agent did not correctly started the sweep with the right combinations of parameters. Instead some hyperparameter combo is repeated…
-
Importing A2C agent from rlberry.agents.torch automatically tries to import optuna (which is not part of rlberry's install dependencies, but of "default" dependencies which are part of extra dependenc…
-
Hi author, I ran the code [https://github.com/koryakinp/A2C](url), I modified the environment and just want to test it on the gym environment such as pong. While I got the **_NotImplementedError_** er…
-
-
hello,
i get this error, when i run code "python main.py --env-name "PongNoFrameskip-v4"
i don't know what happed, my env is:
python3.6.3
Package Version
----------------- -------
a…
-
您好,我直接使用demo_A2C_PPO.py训练pendulum环境下的A2C算法无法收敛,可能算法实现上有问题。AgentDiscreteA2C算法仅继承了AgentDiscretePPO,并未实现自己的update_net函数
-
1. Where is 'processed_full' defined?
```
----> 1 data_risk_indicator = processed_full[(processed_full.date=TRAIN_START_DATE)]
2 insample_risk_indicator = data_risk_indicator.drop_duplicates…
-
I am training an A2C agent and I want to frequently save the model.
The issue I am having is that too many tensorboard files are being opened and never closed. This causes the program to crash as i…
-
**Describe the bug**
I am working on this [notebook](https://github.com/AI4Finance-Foundation/FinRL/blob/master/examples/FinRL_Ensemble_StockTrading_ICAIF_2020.ipynb) and, when I run this code
`df_s…