CityOfNewYork / CROL-Overview

City Record Online parsing libraries and supporting files

Parsing AdditionalDescriptions for the Date and Time #22

Closed ghost closed 9 years ago

ghost commented 9 years ago

Current next step(s):

Objective: Check whether the AdditionalDescriptions field contains readable dates and times.

Sub-tasks:

  1. Use dateutil.parser to find all valid dates and times in the AdditionalDescriptions field.
  2. Create a Summary spreadsheet that lists the number of dates and times found for each record ID.
  3. Create a Detail spreadsheet that lists each date and time for each record ID.

Delivery: Estimated Monday the 23rd
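
A rough sketch of how the Summary and Detail outputs could be built, offered as an illustration rather than the code actually used for this task. To stay dependency-free it swaps dateutil.parser for a regular expression plus `datetime.strptime`. The pattern, the two date formats, the record IDs, and the names `find_dates`, `build_reports`, and `to_csv` are all assumptions made for this example.

```python
import csv
import io
import re
from datetime import datetime

# Hypothetical pattern: matches "November 17, 2014" or "11/17/2014".
DATE_PATTERN = re.compile(
    r"\b(?:[A-Z][a-z]+ \d{1,2}, \d{4}|\d{1,2}/\d{1,2}/\d{4})\b"
)
# Formats tried in order for each candidate (assumed, not exhaustive).
FORMATS = ("%B %d, %Y", "%m/%d/%Y")


def find_dates(text):
    """Return every candidate in `text` that parses as a real calendar date."""
    found = []
    for match in DATE_PATTERN.finditer(text):
        for fmt in FORMATS:
            try:
                found.append(datetime.strptime(match.group(), fmt).date())
                break
            except ValueError:
                continue  # wrong format or impossible date; try the next one
    return found


def build_reports(records):
    """Build (summary_rows, detail_rows) from a {record_id: description} dict."""
    summary, detail = [], []
    for record_id, text in records.items():
        dates = find_dates(text)
        summary.append((record_id, len(dates)))                       # one row per record
        detail.extend((record_id, d.isoformat()) for d in dates)      # one row per date
    return summary, detail


def to_csv(header, rows):
    """Serialize rows to CSV text that a spreadsheet program can open."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()
```

Calling `build_reports` on a dict of descriptions keyed by record ID returns the two row lists; passing each to `to_csv` with a header such as `("record_id", "date_count")` gives text that can be saved as a `.csv` file.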
ghost commented 9 years ago

Update on extracting dates and times from the AdditionalDescriptions field.

Code: https://github.com/CityOfNewYork/CROL-PDF/blob/master/Planned%20Implementation/code/DateFind.py

Estimated Run Time: 90 minutes

Problems (see Request ID 20141117106 for an example):

  1. Not all dates and times are retrieved from the field; at most the first three dates are returned.
  2. Plain numbers are extracted as dates.
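
One possible way to approach both problems, sketched here as an assumption and not as what DateFind.py does: scan the whole field with `re.finditer`, which has no cap on the number of matches, and accept a token only if it has an explicit date or time shape and passes `datetime.strptime`. Bare digit runs never match the pattern, so they are not reported as dates. The pattern, the formats, and the name `extract_all` are hypothetical, and the standard library stands in for dateutil.parser.

```python
import re
from datetime import datetime

# Candidates must carry date or time punctuation, so a bare number
# such as a request ID cannot match (assumed shapes, not exhaustive).
CANDIDATE = re.compile(
    r"\b(?:"
    r"[A-Z][a-z]+ \d{1,2}, \d{4}"      # November 17, 2014
    r"|\d{1,2}/\d{1,2}/\d{4}"          # 11/17/2014
    r"|\d{1,2}:\d{2} (?:AM|PM)"        # 10:00 AM
    r")\b"
)
FORMATS = ("%B %d, %Y", "%m/%d/%Y", "%I:%M %p")


def extract_all(text):
    """Return every date or time token in `text`, in order of appearance."""
    results = []
    for match in CANDIDATE.finditer(text):  # iterates over all matches, no limit
        token = match.group()
        for fmt in FORMATS:
            try:
                datetime.strptime(token, fmt)  # rejects values like 13/45/2014
            except ValueError:
                continue
            results.append(token)
            break
    return results
```

The trade-off is that only the listed shapes are recognized; any other date style in the field would need its own pattern and format added.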
cds-amal commented 9 years ago

This task is closed and has been moved to #37.