alkamid / wiktionary

AlkamidBot's scripts
http://pl.wiktionary.org
MIT License

rewrite porzucone.py to use pagelinks dump #11

Closed: alkamid closed this issue 9 years ago

alkamid commented 9 years ago

Right now Page.getReferences() is used to count references to a page, and this method queries the live wiki for every page. That is overkill, and it makes finding orphaned pages take forever.

Options:

  1. Use (adapt) pywikibot/lonelypages.py
  2. Use the pagelinks dump, where all links are listed
  3. Use pywikibot's xmlreader?
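
To make option 2 concrete, here is a minimal sketch of counting incoming links offline from a pagelinks SQL dump. It is not the code in porzucone.py. It assumes the older MediaWiki row layout `(pl_from, pl_namespace, 'pl_title', pl_from_namespace)`; newer dumps use a different schema, so the regex would need adjusting. The function names and the `read_dump` path argument are hypothetical.

```python
import gzip
import re
from collections import Counter

# Assumed row layout (older MediaWiki schema, not verified against a current dump):
# (pl_from, pl_namespace, 'pl_title', pl_from_namespace)
ROW_RE = re.compile(r"\((\d+),(-?\d+),'((?:[^'\\]|\\.)*)',(-?\d+)\)")


def count_incoming_links(lines, namespace=0):
    """Count pagelinks rows per target title in the given namespace."""
    counts = Counter()
    for line in lines:
        # Only the INSERT statements carry link rows; skip schema and comments.
        if not line.startswith("INSERT INTO"):
            continue
        for _from_id, ns, title, _from_ns in ROW_RE.findall(line):
            if int(ns) == namespace:
                # Titles stay as stored in the dump: underscores instead of
                # spaces, SQL escapes (e.g. \') left untouched.
                counts[title] += 1
    return counts


def find_orphans(all_titles, counts):
    """Return the titles that no pagelinks row points to."""
    return [title for title in all_titles if counts[title] == 0]


def read_dump(path):
    """Yield text lines from a gzipped SQL dump (path is a placeholder)."""
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as dump:
        yield from dump
```

With a real dump, `find_orphans(titles, count_incoming_links(read_dump(path)))` would replace one on-line query per page with a single pass over the file. Whether redirects or self-links should count as references is left open here.
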
alkamid commented 9 years ago

Actually, I'm quite happy with the current script, which runs in 3+ hours (https://github.com/alkamid/wiktionary/commit/bc95045d6c72a4f27e9d2fdd6839cd4a948974be). Closing; I might revisit this at some point.