Closed aezarebski closed 1 month ago
Need to pass times through here.
The database should have the following structure (eventually)
/Simulations/
/Simulation_001/
TemporalMeasurements (dataset with columns MeasurementTime, Prevalence, ReproductionNumber, Etc)
PickledTree (dataset)
SimulationXMLConfig (attribute storing the XML as a string)
PresentTime (attribute storing the time of the last sample)
TreeHeight (attribute storing the tree height)
Initial work on this issue should just focus on getting the temporal measurements included into the current database. We can reorganise it into the structure above later.
https://github.com/aezarebski/derp-simulation/blob/a4ac52db4f24c5657b10c00f1293b22430e0b686/main.py#L436
The simulated dataset only includes measurements of the prevalence and cumulative infections at the present. This limits our ability to make predictive models that estimate these values through time using this dataset.
Suggested solutions
Option 2 seems the best for now but would involve a slight reorganisation of the database being produced. It would definitely simplify downstream usage though.