addshore / wikicrowd

Tool for crowd sourced micro edits for Wikimedia
https://wikicrowd.toolforge.org/
MIT License
7 stars 4 forks source link

Script is grabbing bolded quotation marks in aliases #49

Open PiotrGackowski opened 2 years ago

PiotrGackowski commented 2 years ago

Sometimes on wiki editor also bolded quotation marks so instead "AAA" he/she used "AAA" Is any way to strip/remove such suggestions?

image
PiotrGackowski commented 2 years ago

Also - on Polish wikipedia we are using Polish quotation marks. Instead of standard English "AAAA" we are writing „AAAA”

waldyrious commented 2 years ago

Coincidentally, I recently edited an article on English Wikipedia to move the quotation marks out of the bold formatting, according to Wikipedia:Manual of Style/Titles#Additonal markup and Wikipedia:Manual of Style#Quotation marks in article openings, precisely because I came across the same issue.

I assume Wikicrowd is using the bold formatting to extract the aliases, and indeed that seems to be the case in the article you came across. Ideally the fix should be applied directly to the article, but I agree that Wikicrowd could be smart and bypass those cases entirely, as it isn't its responsibility to notify editors of such formatting errors.

addshore commented 2 years ago

So would trimming quotes from all detected things be acceptable? Are there cases where quotes would actually be wanted? Should it only trim if quotes are on both ends perhaps?

PiotrGackowski commented 2 years ago

Probably trimming from both ends will be fine. Inside (as <USS "Iowa" (BB-61)>) it can be problematic, so leave it.

Please remember about Polish quotation marks. People are creazy about them on pl.wiki, we have bots that are changing English quotation marks to Polish one.