metachris / pdfx

Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.
http://www.metachris.com/pdfx
Apache License 2.0
1.03k stars 113 forks source link

Adding Timeout CLI parameter #51

Open dustywhite7 opened 2 years ago

dustywhite7 commented 2 years ago

I ran into some problems where a file was hanging because the urlopen command never timed out, so I added an option for users to specify that they want requests to time out after 10 seconds. This parameter is then passed down to the urlopen function. I think it goes with #43.

This is my first pull request, so let me know if I messed anything up.