The new ArticleFilter can now filter the pages that are included in the
revision dump according to their namespaces. The prefixes are read from the
siteinfo section in the xml dump.
Currently, the namespaces are hard-coded in the class DiffToolThread
ArticleFilter nameFilter = new ArticleFilter(Arrays.asList(new Integer[]{0,1}));
The filter is set to include articles (namespace 0) and talk pages(namespace 1)
and reject everything else. It already works language independently.
The namespaces that are to be included in the revision db should be passed to
the filter via the configuration file (and thus be made configurable via the
ConfigGUI)
Original issue reported on code.google.com by oliver.ferschke on 22 Jul 2011 at 3:41
Original issue reported on code.google.com by
oliver.ferschke
on 22 Jul 2011 at 3:41