siznax / wptools

Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
MIT License
574 stars 78 forks source link

Need solid unit tests for language variants (esp. Chinese) #52

Closed siznax closed 6 years ago

siznax commented 8 years ago

As shown in #50, getting language variants right is complicated by:

  1. incorrect API data
  2. incomplete support by wikisites
  3. knowing what to expect under ideal conditions

We need some solid test cases that operate on some title/page/item with variants that are correct.

siznax commented 8 years ago

See also https://meta.wikimedia.org/wiki/Automatic_conversion_between_simplified_and_traditional_Chinese

siznax commented 8 years ago

The English Wikipedia API docs mention a converttitles parameter:

converttitles: Convert titles to other variants if necessary. Only works if the wiki's content language supports variant conversion. Languages that support variant conversion include gan, iu, kk, ku, shi, sr, tg, uz, zh. Type: boolean (details)

but it doesn't seem to do anything useful (query)

@seansay, do you see anything helpful here? https://www.mediawiki.org/wiki/API:Query/zh

siznax commented 6 years ago

I'm going to close this until/unless we have more activity around variants.