Closed AFg6K7h4fhy2 closed 13 hours ago
Items that require further consideration (this PR is not done yet):
csvs
in ./output/
.csvs
.For gen_tru_csv.R
, I remembered that forecasttools
has recode_locations
. The file should be updated.
UPDATE: This has been done.
The args
for pull_nhsn
in gen_map_csv.R
can be abstracted further (e.g. the reference date argument).
From SB:
csv
files by reference-date
(called forecast-date
in some csvs
).For SB:
csv
s aligned with column in ./weekly-summaries/README.md
../weekly-summaries/utils/
.Move ./weekly-summaries/utils/
to ./src/code/
(from SB).
The folder structure ought (as presently decided) to look like either of:
weekly-summaries
(name subject to change)
all-forecasts
(name subject to change)YYYY-MM-DD
(i.e. the reference date)
file.csv
(name subject to change)or
weekly-summaries
(name subject to change)
all-forecasts
(name subject to change)YYYY-MM-DD.csv
(i.e. the reference date; name subject to change)The latter is preferred.
The only conceivable reason (at present) to use the former approach is if there would be a .txt
or .md
file with comments about the generated .csv
.
Generate files for first week of submission, account for the data anomalies. Does Inform want us to take into account exclusions? SB: exclude some locations for the first week of submissions.
EDIT: This has been fixed.
For / By Monday, December 02, 2024:
output
content.utils
.UPDATE: 2024-12-02
model_id
to model
forecast_teams
, forecast_fullnames
forecasts and truth data for US
is missing (in all three csvs)
all_forecasts.csv
model_id
-> model
forecast_teams
, forecast_fullnames
truth_data.csv
EDIT: This has been fixed.
What should the csv
files be named? Currently:
map.csv
truth_data.csv
all_forecasts.csv
I suggest the following structure:
weekly-summaries\reference-date\
reference-date-map.csv
reference-date-truth_data.csv
reference-date-all_forecasts.csv
where reference-date is of format YYYY-MM-DD
.
We add a new folder named by corresponding reference-date every week, so reference-date should not be hardcoded in the codes.
Re: https://github.com/CDCgov/covid19-forecast-hub/pull/118#issuecomment-2511902915
This works, thank you SB.
EDIT: This has been fixed.
GitHub Actions (GHA) will likely break relative paths:
# store base metadata path for use later
model_metadata_path <- "../../model-metadata/"
# get `covid19-forecast-hub` content
base_hub_path <- "../../"
hub_content <- hubData::connect_hub(base_hub_path)
These should change to arguments to the argparse. Multiple arguments: one for output folder and one for base hub path.
This note originated in a call between DHM, SB, TM.
EDIT: This has been fixed.
Issue is not expected here but renaming for consistency seems like a good idea (yaml
→ yml
):
EDIT: This has been fixed.
The author would appreciate guidance on handling the below
There is this warning as well that is generated from the author's code below:
Warning
Warning message:
In dplyr::left_join(dplyr::mutate(dplyr::mutate(forecasttools::pivot_hubverse_quantiles_wider(hubverse_table = current_forecasts, :
Detected an unexpected many-to-many relationship between `x` and `y`.
ℹ Row 213 of `x` matches multiple rows in `y`.
ℹ Row 1 of `y` matches multiple rows in `x`.
ℹ If a many-to-many relationship is expected, set `relationship = "many-to-many"` to silence this warning.
Code
dplyr::left_join(
model_metadata, by = "model_id") |>
Good catch @AFg6K7h4fhy2. @sbidari could you edit the metadata to list yourself and/or (as you prefer) the COVIDHub team (with the overall contact email) as model contributors on the Hub models? Thanks!
Re: https://github.com/CDCgov/covid19-forecast-hub/pull/118#issuecomment-2512740542
The author will pull once issue #120 is completed.
I think we should exclude the locations indicated here or at-least a subset of it for the first week (reference-date = 2024-11-23
). @dylanhmorris thoughts?
I forgot to mention this in the earlier meeting but had talked to @AFg6K7h4fhy2 previously about this
EDIT: These have been addressed.
The author would appreciate guidance on handling the below
This is likely due to the duplication of
model_id
s inmodel_metadata
. You can try to extract only the unique model_id entries before theleft_join
. Will look into why there are duplicates in themodel_metadata
in #123
This is ready to merge. Thanks a lot @AFg6K7h4fhy2
For the scope of this PR, please reference issue #93 .