jasonasher / dc_doh_hackathon

Repository for the DC DOH Hackathon on September 23rd, 2017

Extract Features from Basic Business License Data #11

Open jasonasher opened 7 years ago

jasonasher commented 7 years ago

Start with the Basic Business License data in the /Data Sets/Basic Business Licenses/ folder in Dropbox.

Write a script that uses this data to produce a feature data table for

  1. The number of new business licenses issued in the last 4 weeks; and
  2. The number of business licenses in effect in each week.
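As a rough illustration of the two counts for a single week, here is a minimal Python sketch. It rests on assumptions the issue does not state: that each license has an issue date and a start/end date for its period of validity, that "the last 4 weeks" means the 28 days ending on the Sunday of the week in question, and that "in effect" means the validity period overlaps the week. The function names are hypothetical.

```python
from datetime import date, timedelta


def iso_week_bounds(iso_year, iso_week):
    """Return the Monday and Sunday of the given ISO-8601 week."""
    monday = date.fromisocalendar(iso_year, iso_week, 1)  # Python 3.8+
    return monday, monday + timedelta(days=6)


def issued_last_4_weeks(issue_dates, iso_year, iso_week):
    """Count licenses issued in the 28 days ending on the week's Sunday.

    The exact window is an assumption; adjust it if the organizers
    define "last 4 weeks" differently.
    """
    _, sunday = iso_week_bounds(iso_year, iso_week)
    window_start = sunday - timedelta(days=27)
    return sum(window_start <= d <= sunday for d in issue_dates)


def in_effect(periods, iso_year, iso_week):
    """Count licenses whose (start, end) period overlaps the week."""
    monday, sunday = iso_week_bounds(iso_year, iso_week)
    return sum(start <= sunday and end >= monday for start, end in periods)
```

In a full script these counts would be computed separately for each license category and census block.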

You can find the data format and examples on the Feature Dataset Format tab in this document.

Basic business licenses can have one or more categories, found in the LICENSECATEGORY column of the source data.

Input: CSV files with data for each given year

Output: A CSV file with

- 1 row for each feature id, business license category (feature type) and subtype, and each week, year, and census block
- The dataset should include the following columns:

  - feature_id: The ID for the feature, in this case, "business_licenses_issued_last_4_weeks" or "business_licenses_in_effect"
  - feature_type: Business license category, found in the LICENSECATEGORY column of the source data.
  - feature_subtype: Left blank
  - year: The ISO-8601 year of the feature value
  - week: The ISO-8601 week number of the feature value
  - census_block_2010: The 2010 Census Block of the feature value
  - value: The value of the feature, i.e. the number of business licenses of the specified type either new in the previous 4 weeks or active during the week and year in question in the given census block.
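To make the output layout concrete, here is a hedged Python sketch that turns per-license observations into rows with these columns and serializes them as CSV. It assumes the input has already been reduced to one `(license_category, date, census_block)` tuple per license counted toward the week containing that date. `build_rows` and `to_csv` are hypothetical helper names, and the category and block values in the usage example are made up. The ISO-8601 year and week come from `date.isocalendar()`, which matters near year boundaries where the ISO year differs from the calendar year.

```python
import csv
import io
from collections import Counter
from datetime import date

COLUMNS = ["feature_id", "feature_type", "feature_subtype",
           "year", "week", "census_block_2010", "value"]


def build_rows(feature_id, records):
    """Aggregate (license_category, date, census_block) tuples into rows.

    Each tuple stands for one license counted toward the ISO week
    containing the given date (an assumption about the input shape).
    """
    counts = Counter()
    for category, day, block in records:
        iso_year, iso_week, _ = day.isocalendar()
        counts[(category, iso_year, iso_week, block)] += 1
    return [
        {"feature_id": feature_id, "feature_type": category,
         "feature_subtype": "",  # left blank per the column description
         "year": year, "week": week,
         "census_block_2010": block, "value": n}
        for (category, year, week, block), n in sorted(counts.items())
    ]


def to_csv(rows):
    """Serialize rows to CSV text with the columns in the listed order."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

For example, with a made-up category and block ID:

```python
records = [
    ("Restaurant", date(2017, 1, 1), "110010001001000"),   # ISO year 2016, week 52
    ("Restaurant", date(2017, 9, 23), "110010001001000"),  # ISO year 2017, week 38
]
print(to_csv(build_rows("business_licenses_in_effect", records)))
```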

When you are finished:

- Submit a pull request on GitHub (or upload your scripts)
- Upload any files to Dropbox

Need more information? Flag Jason or Elizabeth, or ask your question in the comments below and we'll respond as soon as we can!

histed commented 7 years ago

code to do this in this pull request: https://github.com/jasonasher/dc_doh_hackathon/pull/51