Open liunelson opened 1 week ago
@liunelson We will also need:
@mwdchang got us an example of a dataset card that Terarium expects: https://github.com/DARPA-ASKEM/experiments/blob/nliu-funman/python_sandbox/notebooks/data/monthly_demo_july/dataset-card-example.json
Here are six datasets in decreasing order of priority.
Documentation comes in two parts:
Documentation also in two parts:
CSV file: export or API from here https://healthdata.gov/Hospital/COVID-19-Reported-Patient-Impact-and-Hospital-Capa/anag-cw7u/about_data
Documentation: the text in the same page
CSV file: COVID-19_Estimates.csv.xz
https://github.com/hsbadr/COVID-19_Estimates?tab=readme-ov-file
Documentation: README.md
in the repo
CSV files: us-counties-202*.csv
https://github.com/nytimes/covid-19-data
Documentation: README.md
in the repo
Note: This is a really big and probably the most challenging dataset to process since it attempts to track many COVID-related time-series down to the local level.
CSV file: COVID-19.csv.xz
https://github.com/CSSEGISandData/COVID-19_Unified-Dataset?tab=readme-ov-file
Documentation: "Case Types" of README.md
in the repo
I found the JSON schema + example for the AMR model metadata section: https://github.com/gyorilab/mira/blob/e468059089681c7cd457acc51821b5bd1074df04/docs/model_metadata_annotation.md?plain=1#L21
@jryu01
We need to also parse out these fields during document -> config extraction
description: this is a long form text description of what the parameter represents
units: units
distribution: a dictionary which contains the following keys, type
and parameters
type: distribution type,, cauchy, gaussian, exponential, ... etc
parameters: the parameters that dictate the shape of the distribution. A normal distribution will have parameters "mu" and "sigma" for mean and standard deviation. This field should be a dictionary
We will probably need to update this adapter so that it correctly reformats the new keys and values into a format that the HMI expects
For this task, Julian will need to know what each field in a model AMR means so that he can properly instruct an agent to extract and assign what to where.
The JSON schema for a PetriNet model AMR is here:
In addition to the schema, here's a description of most fields: