globaldothealth / list

Repository for Global.health: a data science initiative to enable rapid sharing of trusted and open public health data to advance the response to infectious diseases.
MIT License

Add ability to ingest data sources stored in zip files #1155

Closed: calremmel closed this issue 3 years ago

calremmel commented 3 years ago

The dataset for Mexico is published as a zip file: http://datosabiertos.salud.gob.mx/gobmx/salud/datos_abiertos/datos_abiertos_covid19.zip

It would be great if it were possible for data ingestion to automatically handle data sources that are stored as zip files or other compressed file formats.

Alternatively, we could extract such sources ourselves, rehost the extracted files on our own servers, and link to those, but that seems less than ideal.

attwad commented 3 years ago

The retrieval function could decompress the data before storing it. Let me look at how we could do that.
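
For illustration, a minimal sketch of what decompress-on-retrieval might look like, using only the Python standard library. The helper name `maybe_unzip` is hypothetical and not taken from the repository, and the sketch assumes the archive holds exactly one data file, which may not be true of every source.

```python
import io
import zipfile


def maybe_unzip(raw: bytes) -> bytes:
    """Return the single file inside `raw` if it is a zip archive, else `raw` unchanged.

    Hypothetical helper: assumes a zipped source contains exactly one data file.
    """
    buf = io.BytesIO(raw)
    if not zipfile.is_zipfile(buf):
        # Not a zip archive: pass the payload through untouched.
        return raw
    buf.seek(0)
    with zipfile.ZipFile(buf) as archive:
        # Ignore directory entries; keep only real files.
        members = [name for name in archive.namelist() if not name.endswith("/")]
        if len(members) != 1:
            raise ValueError(f"expected one file in archive, found {len(members)}")
        return archive.read(members[0])
```

The retrieval step could call such a helper on the downloaded bytes right before writing them to storage, so that parsers downstream only ever see the extracted file. Detecting the format from the content rather than from the URL suffix avoids depending on how a source names its files. A large archive would likely need streaming extraction instead of reading everything into memory.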