OHDSI / ETL-CMS

Workproducts to ETL CMS datasets into OMOP Common Data Model
Apache License 2.0
94 stars 52 forks source link

Feature/fix merge script #75

Open byeungchun opened 2 years ago

byeungchun commented 2 years ago

Hi,

Thank you for this repo. It is really helpful to parse SynPuf. I found range function on merge.py doesn't include 20th file. So, I fix it.

Also, I add Python3 merge script using (DASK)[https://docs.dask.org/en/latest/generated/dask.dataframe.multi.concat.html] library which is faster than file open function.

Thank you. Kind regards, Byeungchun