Closed internetcoffeephone closed 4 years ago
Additionally, I imagine some train_* files can be merged when parametrized, there's a lot of duplicate code in there. Separating the experiment categories (baseline, visible actions, influence, moa) from their algorithms (A3C, A2C, DQN) so they can vary independently would be ideal, although I'm not sure whether it's easy to do.
These changes are all very outdated, and I'm getting rid of config_parser in my fork. Thus, closing.
Implemented the config file as mentioned in https://github.com/eugenevinitsky/sequential_social_dilemma_games/issues/151.
4 questions:
I haven't reproduced the results from the paper yet. Currently it takes me 6 days to take the 3e8 steps required per experiment - I'm in the process of requesting more powerful hardware.