rafguns / doidownloader

You give it DOIs, it gives you the article PDFs
MIT License
0 stars 0 forks source link

Handle relative fulltext links #5

Closed rafguns closed 1 year ago

rafguns commented 1 year ago

Example: https://iorj.hse.ru/en/2021-16-2/484825594.html contains the following snippet:

<meta name="citation_pdf_url" content="/data/2021/09/28/1418532779/VESTNIK Т16(3)2021-52-69.pdf" />

We store these links as is, whuch leads to a MissingSchema error. We should resolve them to the full URL.

rafguns commented 1 year ago

Seems like we handled this already in https://github.com/rafguns/doidownloader/commit/54c0d74632026bb916fcc1e2e29765e0366b1ae4 ! I'll focus on merging branch async first. It has this and other niceties, such as a (partially) class-based design.