dataesr / affiliation-matcher

Matcher for affiliations - link raw affiliation to ROR ids, country and RNSR
MIT License
22 stars 1 forks source link

Insee data download fails #60

Closed TesteFixe closed 2 years ago

TesteFixe commented 2 years ago

Hi,

I have currently a problem with Insee data that can't be downloaded. "make load" rise:

requests.exceptions.SSLError: HTTPSConnectionPool(host='www.insee.fr', port=443): Max retries exceeded with url: /fr/statistiques/fichier/4652957/ZE2020_au_01-01-2021.zip (Caused by SSLError(SSLError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:852)'),))

And then ror indexes don't exist in elastic (just Rnsr and Grid)... Any help would be welcome :)

Thanks!

ericjeangirard commented 2 years ago

Yes indeed, the certificate does not seem valid, the commit should fix this (it forces requests not to verify the certificate)

TesteFixe commented 2 years ago

Thanks a lot! But there is still a problem on the "make load" step: the ror file from zenodo isn't recognize as a zip...and the load fails. (maybe an env. problem --Windows 10 Pro 19044, docker desktop 4.12, wsl 2, ubuntu 20.4-- or a library one?).

ericjeangirard commented 2 years ago

I've opened a new issue for this one : https://github.com/dataesr/affiliation-matcher/issues/61 will look into it next week!