nmontalva / ccaa-surnames

Code developed to analyse surnames data as part of Project Fondecyt N°11160402
3 stars 2 forks source link

download_pdfs(): HTTP error 403 #4

Open nmontalva opened 5 years ago

nmontalva commented 5 years ago

Access to OTCA directory is now forbidden. Files can be accessed via URL, but not directories, so the script does not work (at least we already have the files).

> download_pdfs()
Error in open.connection(x, "rb") : HTTP error 403.
rhz commented 5 years ago

Oh no, we will have to parse some html files to get the links to the files I guess. I remember we talked about this at some point and there were reasons why parsing the htmls were not a good idea. Maybe some of the files were not accesible from the html? We will see. I can't work on this at the moment but maybe next week? How urgent is it for you?

nmontalva commented 5 years ago

Do not worry, I figured out a workaround for the urgency. I would better suggest to give it a full week or two around December, so we can also see other things.

Best,

Nicolás Montalva

On Wed, 31 Oct 2018 at 01:06, Ricardo Honorato-Zimmer < notifications@github.com> wrote:

Oh no, we will have to parse some html files to get the links to the files I guess. I remember we talked about this at some point and there were reasons why parsing the htmls were not a good idea. Maybe some of the files were not accesible from the html? We will see. I can't work on this at the moment but maybe next week? How urgent is it for you?

— You are receiving this because you were assigned. Reply to this email directly, view it on GitHub https://github.com/nmontalva/ccaa-surnames/issues/4#issuecomment-434554260, or mute the thread https://github.com/notifications/unsubscribe-auth/ACYWHX9QAxRHPGMI_Of2rSpfspoP2DVLks5uqSHhgaJpZM4YDBvw .