cladteam / CCDA_OMOP_by_Python

2 stars 3 forks source link

convert to upper case before hashing for an ID #143

Closed chrisroederucdenver closed 1 week ago

chrisroederucdenver commented 2 weeks ago

The file bennis_shauna_ccda.xml has three locations, itself a bit interesting, all basically identical, two are in Menlo Park, and one in MENLO PARK. The different capitalization of course gives it a different ID.

chrisroederucdenver commented 2 weeks ago

similar for eHX_Terry.xml ANYTOWN and Anytown, MAIN ST. and Main St.

It shows an additional issue, that it repeats identical lines. Easily solved with a "select distinct" equivalent on output.

chrisroederucdenver commented 1 week ago

added upper() in create_hash() in data_driven_parse.py