usc-isi-i2 / dig-etl-engine

Download DIG to run on your laptop or server.
http://usc-isi-i2.github.io/dig/
MIT License

Duplication of desired_docs field in status.json #270

Open elmaestro08 opened 5 years ago

elmaestro08 commented 5 years ago

Datasets whose identifiers contain uppercase letters are duplicated in the status.json file in the project's working_dir: as in the example below, desired_docs ends up with a second, all-lowercase key for the same dataset. This causes the desired flag in the DIG UI to be reset to zero for that dataset, so the data is not ingested into the system.

Example status.json:

{
  "desired_docs": { "imfCPI": 0, "imfcpi": 1 },
  "added_docs": { "imfCPI": 0 },
  "total_docs": { "imfCPI": 1 }
}
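
To check whether a project is affected, something like the following might help. It is only a minimal sketch: `find_case_collisions` is a hypothetical helper, not part of DIG, and it assumes status.json has the shape shown above (top-level fields mapping dataset identifiers to counts). It groups the keys of one field by their lowercased form and reports any group with more than one spelling.

```python
import json
from collections import defaultdict


def find_case_collisions(status: dict, field: str = "desired_docs") -> dict:
    """Return {lowercased_id: [spellings]} for ids that differ only by case.

    Hypothetical helper for diagnosis; not part of the DIG code base.
    """
    groups = defaultdict(list)
    for key in status.get(field, {}):
        groups[key.lower()].append(key)
    # Keep only identifiers that appear under more than one spelling.
    return {low: sorted(keys) for low, keys in groups.items() if len(keys) > 1}


# The example status.json from this issue, inlined so the sketch is self-contained.
status = json.loads(
    '{ "desired_docs": { "imfCPI": 0, "imfcpi": 1 },'
    ' "added_docs": { "imfCPI": 0 },'
    ' "total_docs": { "imfCPI": 1 } }'
)

collisions = find_case_collisions(status)
print(collisions)  # {'imfcpi': ['imfCPI', 'imfcpi']}
```

For a real project, load the file with `json.load` from the project's working_dir instead of the inlined string. An empty result means no field keys differ only by case.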