Closed patrickm02L closed 1 year ago
In Product Sync 7/20/22.
Here’s the OCR/image extraction: https://github.com/cityofaustin/atd-airflow/blob/master/dags/python_scripts/cr3_extract_diagram_ocr_narrative.py. It’s in the airflow repo, which is submoduled into the prefect repo, and called from here. https://github.com/cityofaustin/atd-prefect/blob/main/flows/vision-zero/cr3_ocr_narrative_extract_diagram/cr3_ocr_narrative_extract_diagram.py.
A new CR3 form will be implement on 1/1/23 as mandated by TxDOT. The improvement adds new field as outlined in the Functional Requirements Specification document which include:
Updated Code sheet v0.5
In order to manage this change, we will need to update the OCR to capture the data from the new fields. @frankhereford has outlined the following minor changes to bring operations back to normal operation:
Additionally, to support detecting if the CR3 is an old-style or a new one we'll need to extend the constellation test to tell us if it's digital end-to-end and also which CR3 form style we're looking at.