Open renepickhardt opened 9 years ago
OK, I found https://github.com/marcusklang/wikiforia/blob/bd1e9d4f3fd4bcaad0776bd58399a233b00a9d20/src/main/java/se/lth/cs/nlp/wikipedia/lang/BarConfig.java which basically tells me that Bavarian should be supported.
Yeah, I have tried the German edition of Wikipedia. It doesn't work either.
I just tried the English version of Simple Wikipedia and got the same result.
@thesoulshell you need to use the absolute path... /home/user/enwiki.xml.bz2
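A minimal sketch of the absolute-path suggestion, assuming the dump sits in the current directory and reusing the jar and dump file names mentioned elsewhere in this thread (the variable names `dump` and `abs_dump` are only for illustration). It prints the command instead of running it; drop the `echo` to actually start the conversion.

```shell
# File name taken from this thread; adjust to your own dump.
dump="barwiki-20151002-pages-articles-multistream.xml.bz2"

# Turn the (possibly relative) path into an absolute one using only POSIX tools.
abs_dump="$(cd "$(dirname "$dump")" && pwd)/$(basename "$dump")"

# Show the resulting call; remove "echo" to execute it.
echo java -jar wikiforia-1.2.1.jar -pages "$abs_dump" -output "$(pwd)/res.xml"
```

Whether this resolves the error depends on the cause; it only rules out relative-path handling as a factor.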
Hey Marcus, I tried
So far so good (OK, I had a little trouble figuring out that the easiest way to respect the external dependencies is to switch to the target directory and run from there).
then
When I now run `java -jar wikiforia-1.2.1.jar -pages barwiki-20151002-pages-articles-multistream.xml.bz2 -output res.xml`
I receive the following output:
Looking at https://github.com/marcusklang/wikiforia/blob/5672123ec7eb24801a40276c3e7083e977279838/src/main/java/se/lth/cs/nlp/mediawiki/parser/MultistreamBzip2XmlDumpParser.java#L480
I see that some class fields must not be initialized, but I didn't debug any further.
`ls` shows me that the file res.xml was created, so I assume that argument passing works and that something else among the class fields is not set correctly.
Did I do something wrong? Is the tool just not working with the Bavarian Wikipedia? Comparing git hashes, I found this in the git log
which made me wonder whether my dump could be the reason. Thanks for the help! I am not particularly interested in the Bavarian Wikipedia, but I wanted to test the tool with small data (:
Best, Rene