Closed AmandaDoyle closed 4 years ago
Axway
Manual
Never
(this is a one time upload only dataset)Google Sheets
None
Every Other Day
S3
Schedule
The script will check every week for new file uploads, if uploaded, then new table will be created, else ignore.
Since street_easy_rental_sales_index
is not time stamped in source data, so we will always update whenever streeteasy_weekly_nta
is updated.
Weekly
(every Wednesday)S3
Schedule
Checked every other day, and create a new table versioned by the day of update regardless of if new files are uploaded. (source file updates have irregular patterns, so we will default to check every other day)
every other day
S3
Schedule
Checked daily
every day
S3
Schedule
Checked daily, same mechanism as cuebiq_daily
every day
updated on the same update cycle as cuebiq_daily
https://github.com/MODA-NYC/db-recovery-data-partnership/blob/a3b308d05bfe81544878f611e57b66cc4ea57792/.github/workflows/cuebiq_weekly.yml#L6
S3
Schedule
Checked daily.
every day
Note that VERSION
comes from web-scraping of the following link https://visitdata.org/data-noncommercial
https://github.com/MODA-NYC/db-recovery-data-partnership/blob/a3b308d05bfe81544878f611e57b66cc4ea57792/recipes/foursquare/runner_county.sh#L24-L28
Update Trigger: Schedule Checked daily.
Google Drive
https://github.com/MODA-NYC/db-recovery-data-partnership/blob/master/recipes/foursquare/datacube.py
Schedule
Checked daily.
Daily
Axway
Scheduled
Weekly
(We check if there's a new file available and update every week)Note that the source data update cycle is irregular, even though the data itself is monthly, we will still update every week, just to make sure what we have is up-to-date https://github.com/MODA-NYC/db-recovery-data-partnership/blob/a3b308d05bfe81544878f611e57b66cc4ea57792/.github/workflows/linkedin.yml#L6
Schedule
Every other day
Axway
Manual
Unknown
@mgraber do we know the update cycle for ioby?
Github
Schedule
Every 3 days
Source data has irregular/infrequent update cycles by application design, hence we will default to check every other 3 days to ensure our files are the most up-to-date https://github.com/MODA-NYC/db-recovery-data-partnership/blob/a3b308d05bfe81544878f611e57b66cc4ea57792/.github/workflows/betanyc.yml#L6
Github
Schedule
Weekly
and Daily
https://github.com/MODA-NYC/db-recovery-data-partnership/blob/a3b308d05bfe81544878f611e57b66cc4ea57792/.github/workflows/opp_insights_weekly.yml#L6 https://github.com/MODA-NYC/db-recovery-data-partnership/blob/f0ca210eebcfc22fc6edad731b307931aba1c9b6/.github/workflows/opp_insights_daily.yml#L6
Google Drive
https://github.com/MODA-NYC/db-recovery-data-partnership/blob/master/recipes/oats/get_data.py
Schedule
Unknown
last updated: Aug 26, 2020
@mgraber do we have an update cycle for OATS?
closing, migrated to excel spread sheet in teams
Organized by data provider > output dataset briefly write how the output is updated. Information should include where the source data comes from, and how an update is triggered (i.e. Axway, and new file is uploaded). This can be a Wiki Page