internetarchive / iari

Import workflows for the Wikipedia Citations Database
GNU General Public License v3.0
11 stars 9 forks source link

Bug: Url from wikitext gets parsed twice, one being erroneous #891

Closed mojomonger closed 1 year ago

mojomonger commented 1 year ago

using the endpoint for reference info:

https://archive.org/services/context/iari/v2/statistics/reference/dd8603ac

{
id: "dd8603ac",
template_names: [
"cite journal"
],
wikitext: "* {{cite journal|last= Fischer|first= Steven Roger|year= 1995|title= Preliminary Evidence for Cosmogonic Texts in Rapanui's Rongorongo Inscriptions|journal= Journal of the Polynesian Society |issue=104|pages=303–21|url=http://www.jps.auckland.ac.nz/document/Volume_104_1995/Volume_104%2C_No._3/Preliminary_evidence_for_cosmogonic_texts_in_Rapanui%26apos%3Bs_Rongorongo_inscriptions%2C_by_Steven_Roger_Fischer%2C_p_303-322/p1}}",
type: "general",
footnote_subtype: "",
flds: [
"auckland.ac.nz"
],
urls: [
"http://www.jps.auckland.ac.nz/document/Volume_104_1995/Volume_104%2C_No._3/Preliminary_evidence_for_cosmogonic_texts_in_Rapanui%26apos%3Bs_Rongorongo_inscriptions%2C_by_Steven_Roger_Fischer%2C_p_303-322/p1",
"http://www.jps.auckland.ac.nz/document/Volume_104_1995/Volume_104%2C_No._3/Preliminary_evidence_for_cosmogonic_texts_in_Rapanui%26apos%3Bs_Rongorongo_inscriptions%2C_by_Steven_Roger_Fischer%2C_p_303-322/p1}}">"
],
. . .

there two url entries. as can be seen, the second one is simply the first one with extra curly brace pair at the terminus.

This is a wikitext parsing error.

dpriskorn commented 1 year ago

This should already be fixed but I'd not deployed yet. See https://github.com/internetarchive/iari/pull/887. What article does this reference appear in?

dpriskorn commented 1 year ago

closing, please reopen with link to the article endpoint that touches this reference.