internetarchive / iare

An interactive IARI JSON viewer
GNU Affero General Public License v3.0

URLs from PDF are inaccurate and do not reflect link in original document #48

Closed by mojomonger 1 year ago

mojomonger commented 1 year ago

From:

https://www.foundationforfreedomonline.com/wp-content/uploads/2023/03/FFO-FLASH-REPORT-REV.pdf

I see this URL:

https://www.cisa.gov/topics/election-security/foreign-influence-operations-and-disinformation

But the IARE parser represents it as:

https://www.cisa.gov/topics/election-security/foreign-influence-operations-and-

This results in a false 404, because the truncated URL does not exist even though the original link does.
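The truncation point (right after `and-`) suggests the URL was wrapped across two lines in the PDF text and only the first line was matched. That is an assumption, not a confirmed diagnosis. The sketch below is not IARI's code; the function names `extract_urls_naive` and `extract_urls_rejoined` and the sample text are hypothetical, and it only illustrates how a line-by-line match could produce this result and how a rejoin heuristic would change it.

```python
import re

# Hypothetical illustration only; this is not how IARI extracts links.
URL_RE = re.compile(r"https?://[^\s<>\"']+")


def extract_urls_naive(text: str) -> list[str]:
    """Match URLs as-is, so a URL wrapped across lines is cut at the break."""
    return URL_RE.findall(text)


def extract_urls_rejoined(text: str) -> list[str]:
    """Rejoin a URL that was wrapped right after a hyphen, then match."""
    # Drop the newline only when the text before it is a URL ending in "-"
    # and the next line starts with a non-space character.
    joined = re.sub(r"(https?://\S*-)\n(?=\S)", r"\1", text)
    return URL_RE.findall(joined)


# Assumed shape of the extracted PDF text: the URL wraps after "and-".
sample = (
    "See https://www.cisa.gov/topics/election-security/"
    "foreign-influence-operations-and-\n"
    "disinformation for details."
)

print(extract_urls_naive(sample))
print(extract_urls_rejoined(sample))
```

The rejoin step is a heuristic: it can over-join when a URL really does end in a hyphen and ordinary text follows on the next line. Where the PDF carries link annotations, reading the target from those is likely more reliable than reconstructing it from the visible text.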

dpriskorn commented 1 year ago

This is an IARI library issue; see https://github.com/internetarchive/iari/issues/753