-
Reminder todo after internship.
Mostly for meta-bandit and gridworld tasks
-
Add the A2C algorithm which is the synchronous version of the algorithm described in this paper https://arxiv.org/pdf/1602.01783.pdf
and described here: https://medium.com/emergent-future/simple-rei…
-
**Which entry point should I specify(Steady-CLI 3.2.5)?**
I'm just tried with specifying main class in pom, but it didn't work (vulas.reach.sourceDir = app).
Also i tried just point to src of proj…
-
https://github.com/AI4Finance-Foundation/FinRL-Tutorials/blob/master/4-Optimization/FinRL_HyperparameterTuning_using_Optuna_basic.ipynb
I am getting the following error while recreating the above n…
-
hi, not found vmix and vdn_a2c in modules
-
-
Versions:
uniforms: ^3.10.1
uniforms-mui: ^3.10.1
```
type A1 = {
id_type: 'a1';
id: string;
};
type A2 = {
id_type: 'a2';
id: string;
};
type C = {
name: string;
addres…
-
When trying to run this part:
```
train_df = data_split(df, start=config.TRAIN_START_DATE,
end=config.TRAIN_END_DATE)
stock_dimension = len(train_df.tic.unique())
state_…
-
For example when I run a2c.py -r "runs/a2c/a2c_cartpole.ini" tons of errors pop up.
Regardless I like that you've implemented a lot of algorithms and put them here. It's very useful for someone new…
ghost updated
5 years ago
-
I noticed while browsing the RL examples that the PPO implementation is actually A2C (which there's already an example for). On line 141, this line:
```Rust
let action_loss = (-advantages.detach()…