An in-development R package and a Bayesian hierarchical model jointly fitting multiple "local" wastewater data streams and "global" case count data to produce nowcasts and forecasts of both observations
This PR closes #184. It adds the following outputs to the generate_simulated_data() function.
Wastewater data beyond the forecast period (called ww_data_eval)
Subpopulation level count data
Goal of including these outputs is to compare performance of forecasts at the subpopulation level (of both sets of observations).
Some open questions:
should we include all of this as package data? Or is it fine just to have as outputs of generate_simulated_data()?
the eval data as separate is probably redundant but it feels useful to have them as two different datasets otherwise we lose the fact that the data fit to the model in the vignette represents the bare minimum data requirements you need. I'm in favor of keeping them as separate datasets (since we want to encourage the use of vintaged data anyway), but curios others thoughts on this?
Thank you for your contribution @kaitejohnson :rocket:! Your github-pages is ready for download :point_right: here :point_left:!
(The artifact expires on 2024-11-07T01:26:31Z. You can re-generate it by re-running the workflow here.)
This PR closes #184. It adds the following outputs to the
generate_simulated_data()
function.ww_data_eval
)Goal of including these outputs is to compare performance of forecasts at the subpopulation level (of both sets of observations).
Some open questions:
generate_simulated_data()
?