[x] Finish code template for single-agent DDPG algorithm
[x] Finish code template for multi-agent QMIX algorithm
[x] Solve CSS and div problems for the hyperparameter boxes
[x] Finish code generation for current DQN and IQL code templates
For the NEAR future
[ ] Create a flexible reward framework: the user can design the reward function using blocks or an input in the website to design mathematical functions
Important Design Question (to discuss)
The deep learning system is an incipient type of system (to teach and implement deep learning algorithms). The main objectives of this system are to:
model reward function easily
choose RL algorithm and hyperparameters easily
How to expand the algorithm 'block' to facilitate the understanding by the user? For example, how Chris Olah's explain the LSTM network...
To Do (Urgent)
For the NEAR future
Important Design Question (to discuss)
The deep learning system is an incipient type of system (to teach and implement deep learning algorithms). The main objectives of this system are to:
How to expand the algorithm 'block' to facilitate the understanding by the user? For example, how Chris Olah's explain the LSTM network...
For the future