MODA-NYC / db-recovery-data-partnership

Data pipelines for datasets that are part of the Recovery Data Partnership project
https://www1.nyc.gov/site/analytics/initiatives/recovery-data-partnership.page
12 stars 8 forks source link

Source: Cuebiq #17

Closed AmandaDoyle closed 3 years ago

AmandaDoyle commented 4 years ago

Status

Description

Output schema

cuebiq/cuebiq_mobility/latest/cuebiq_daily_mobility.csv

Daily cuebiq mobility index data for NYC

Input field name Data Type Output field name Output example
reference_date YYYY-MM-DD date 2020-06-20
week_name IYYY-IW year_week 2020-31
county_name text borough BX
county_name int borocode 1
census_block_group_id int cbg2010 360050004002
mobility_index numeric mobility_index 3.78327867
weight numeric weight 4.8801

Visit Index Data

cuebiq/cuebiq_daily/latest/cuebiq_daily_visits.zip

Daily visits to each business type within the NYC market area

Input field name Data Type Output field name Output example
reference_date YYYY-MM-DD date 2020-06-20
- IYYY-IW year_week 2020-31
sector text sector Automotive
vertical text vertical Car Brands
brand text brand Lexus
naics6_code numeric naics6_code 444130
roll_avg_7days_cvi numeric roll_avg_7days_cvi 1.5344
ly_avg_7days_cvi numeric ly_avg_7days_cvi 1.4097
roll_avg_7days_cvi_per_store numeric roll_avg_7days_cvi_per_store 0.1096
ly_avg_7days_cvi_per_store numeric ly_avg_7days_cvi_per_store 0.0939
cuebiq_weekly is discontinued **Visit Index Data** Formally cuebiq_weekly_visits | Input field name | Data Type | Output field name | Output example | |------------------|-------------|-------------------|----------------| | week_label | `YYYY-MM-DD` | date | 2020-07-27 | | visit_week_cd | `IYYY-IW` | year_week | 2020-31 | | market_area_code | `int` | market_area_code | 501 | | market_area | `text` | market_area | New York (NY) | | brand | `text` | brand | Lexus | | vertical | `text` | vertical | Car Brands | | sector | `text` | sector | Automotive | | cvi | `numeric` | cvi | 0.1223 | | cvi_per_store | `numeric` | cvi_per_store | 0.0083 |

cuebiq/cuebiq_travelers/latest/cuebiq_daily_travelers.csv

Daily report of number of travelers to counties within the NYC Region.

Input field name Data Type Output field name Output example
reference_date sdate date 2020-06-20
county_code text fips_county 36001
county_name text county Albany
origin_state text origin_state Alabama
last_14_days_travelers int last_14_days_travelers 141
last_14_days_travelers_sip int last_14_days_travelers_sip 41
last_14_days_travelers_not_sip int last_14_days_travelers_not_sip 100
travelers_not_sip_daily_miles int travelers_not_sip_daily_miles 16

Use cases

1) Times used Cuebiq data https://www.nytimes.com/interactive/2020/08/18/business/economy/coronavirus-economic-recovery-states.html?smid=em-share

https://www.cuebiq.com/visitation-insights-switchers/

Resources

https://help.cuebiq.com/hc/en-us/articles/360041350092-Cuebiq-Mobility-Visit-Index-Feed-Specs#h_f7587488-0205-49a8-b418-bc24ecc0b9b9 https://www.cuebiq.com/adventures-of-clara/

mgraber commented 4 years ago

cuebiq_daily currently contains values in the field naics6_code that are not valid 6-digit NAICS codes. What are they? Should they get cleaned/NULLed?

mgraber commented 4 years ago

Suggestion for output table names:

@AmandaDoyle thoughts?