covidatlas / li

Next-generation serverless crawler for COVID-19 data
Apache License 2.0
57 stars 33 forks source link

Add scraper for MEX #408

Closed jzohrab closed 4 years ago

jzohrab commented 4 years ago

Original issue https://github.com/covidatlas/coronadatascraper/issues/672, transferred here on Sunday Apr 05, 2020 at 18:23 GMT


Location name

Mexico

Source URL

https://coronavirus.gob.mx/, from the federal government's ministry of health.

Notes/comments

This is more of a bookmark than anything else - just caching this will be difficult as it seems to be nested more deeply than Argentina.

At the bottom of the site there are some videos / links in some sort of auto-scrolling frame. Each day it appears they have a press conference, and it seems each one gets a page: e.g. April 4. URLs for those appear easy to generate:

https://coronavirus.gob.mx/YYYY/MM/DD/conferencia-D-de-mmm/

where mmm is the full month name in Spanish, all lower case.

For example, I spot checked March 4th and it exists: https://coronavirus.gob.mx/2020/03/04/conferencia-4-de-marzo/

Each of the press conference pages links to a PDF with a link whose text is "Comunicado técnico". URLs for those PDFs seem pretty consistent except for one number I can't decipher. e.g.

https://www.gob.mx/cms/uploads/attachment/file/538947/Comunicado_Tecnico_Diario_COVID-19_2020.03.04.pdf https://www.gob.mx/cms/uploads/attachment/file/545219/Comunicado_Tecnico_Diario_COVID-19_2020.04.03.pdf https://www.gob.mx/cms/uploads/attachment/file/545266/Comunicado_Tecnico_Diario_COVID-19_2020.04.04.pdf

Content of the PDF apparently can change. I can't imagine doing anything but manual data entry on this one. Currently our source for Mexico is https://github.com/CSSEGISandData/COVID-19 but at least the more recent PDFs here have death counts per state (but not case counts).

jzohrab commented 4 years ago

(Transferred comment)

Hey @shaperilio I came across this resource as well that might be worth looking at. Unfortunately I don't know exactly how to verify whether this is the same or different data:: https://datos.gob.mx/

jzohrab commented 4 years ago

(Transferred comment)

@NateBaldwinDesign I saw that too - It looks to be the equivalent of our own data.gov. I tried a few covid related search terms and didn't find anything.

jzohrab commented 4 years ago

(Transferred comment)

@shaperilio and @NateBaldwinDesign detailed data for Mexico are now available as .csv files at the state and municipality level. Please see: https://github.com/covidatlas/coronadatascraper/issues/1024