assume-framework / assume

ASSUME - Agent-based Simulation for Studying and Understanding Market Evolution
https://assume.readthedocs.io

Run_learning loop Cleaning #352

Closed kim-mskw closed 1 month ago

kim-mskw commented 1 month ago

The run_learning function in loader_csv currently tears down the entire simulation and world and restarts them for the DRL part. As a result, a number of parameters that the continuous learning process needs across multiple runs of the same simulation horizon have to be stored.

We discussed the option of resetting the time in the mango agents. This is technically possible, but it would require rescheduling a number of tasks, such as those of the markets. That rescheduling is already done in the world reset function, which is called at the end of each simulation and triggers a new set-up. Any solution that does not use the reset function (which is what we use now as well) would end up as some sort of "Frankenstein" solution mixing old and new agents. That is bound to introduce new, unforeseeable problems, which cannot be justified merely by the learning loop being messy.
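To make the trade-off concrete, here is a minimal toy sketch of the pattern described above: a world that is fully reset for every episode, plus a small state object that carries the learning parameters across runs. It does not use the ASSUME or mango APIs; `ToyWorld`, `LearningState`, the dummy reward, and this `run_learning` signature are all hypothetical stand-ins chosen for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class LearningState:
    """Values that must outlive a single world (all names hypothetical)."""
    episode: int = 0
    replay_buffer: list = field(default_factory=list)
    best_reward: float = float("-inf")


class ToyWorld:
    """Toy stand-in for a simulation world; NOT the ASSUME World class."""

    def __init__(self, horizon: int):
        self.horizon = horizon
        self.reset()

    def reset(self) -> None:
        # A full reset rewinds the clock AND reschedules every task,
        # so no stale or missing tasks are left over from the previous run.
        self.clock = 0
        self.scheduled = [("market_opening", t) for t in range(self.horizon)]

    def run(self, state: LearningState) -> float:
        total = 0.0
        for _name, t in self.scheduled:
            self.clock = t
            reward = float(t + state.episode)  # dummy reward, illustration only
            state.replay_buffer.append((state.episode, t, reward))
            total += reward
        self.scheduled = []  # every scheduled task has been consumed
        return total


def run_learning(horizon: int, episodes: int) -> LearningState:
    """Hypothetical learning loop: same horizon, repeated for several episodes."""
    state = LearningState()
    world = ToyWorld(horizon)
    for episode in range(episodes):
        state.episode = episode
        world.reset()  # fresh clock and fresh task schedule each episode
        total = world.run(state)
        state.best_reward = max(state.best_reward, total)
    return state


if __name__ == "__main__":
    final_state = run_learning(horizon=3, episodes=2)
    print(len(final_state.replay_buffer), final_state.best_reward)
```

In this toy model, rewinding only `world.clock` after a run would leave `world.scheduled` empty, so the next episode would execute nothing. That is the rescheduling problem in miniature, and it is why the sketch goes through `reset()` every time and keeps only `LearningState` alive between runs.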

Instead we agreed on the following: