Issue 184: Add outputs to `generate_simulated_data()` fxn and package data

kaitejohnson commented 1 month ago

This PR closes #184. It adds the following outputs to the `generate_simulated_data()` function:

- Wastewater data beyond the forecast period (called `ww_data_eval`)
- Subpopulation-level count data

The goal of including these outputs is to compare forecast performance at the subpopulation level (for both sets of observations).

Some open questions:

- Should we include all of this as package data, or is it fine to have it only as outputs of `generate_simulated_data()`?
- Keeping the eval data as a separate dataset is probably redundant, but having two different datasets feels useful: otherwise we lose the fact that the data fit to the model in the vignette represents the bare minimum data requirements. I'm in favor of keeping them as separate datasets (since we want to encourage the use of vintaged data anyway), but I'm curious what others think.

kaitejohnson commented 3 weeks ago

Pending CI passing, this is ready for re-review.

CDCgov / ww-inference-model

Issue 184: Add outputs to `generate_simulated_data()` fxn and package data #220