PiRSquared17 / jwpl

Automatically exported from code.google.com/p/jwpl

Missing stand-alone parser library #19

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
I need only the MediaWiki parser. There used to be a jar for it, but now 
there is none. Could you please put it back up?

Original issue reported on code.google.com by w4nderl...@gmail.com on 1 Jun 2011 at 10:47

GoogleCodeExporter commented 9 years ago
The old jars haven't been removed. They are still available. You just have to 
switch from "Current downloads" to "All downloads" on the download page.

However, the parser is still contained in the new jars. You'll find it in 
JWPL_API_0.6.0.jar.

Original comment by oliver.ferschke on 1 Jun 2011 at 5:57

GoogleCodeExporter commented 9 years ago
Yes, but that jar bundles a lot of things that the parser alone doesn't 
need, such as Lucene (which conflicts with my project, as I use another version 
of Lucene as a library). And yes, the older parsers are there, but, if 
possible, I would also like to have the latest parser on its own. I don't know 
precisely what improvements have been made, but on the Italian Wikipedia article 
dataset the older parser returned some errors regarding references within the 
text of a Wikipedia page, while the 0.6.0 parser does not, so it would be 
extremely helpful for me to have the 0.6.0 parser alone. If it is too much 
work, no problem; I'll download the source and do it on my own.

Original comment by w4nderl...@gmail.com on 1 Jun 2011 at 11:30

GoogleCodeExporter commented 9 years ago
I will create new jars including source files in the course of this week. 
As far as I know, there has never been a parser-only jar on Google Code.
Which one do you mean? Then I can build a similar package.

Original comment by oliver.ferschke on 1 Jun 2011 at 11:39

GoogleCodeExporter commented 9 years ago
I have to say I haven't seen them myself, but my colleague, who first told me 
to use JWPL to parse a Wikipedia page, told me there was one. In any case, I 
think it would be useful.
Another useful thing would be a bit more documentation about the parser, like 
how to use a modular parser and how to build parsers for languages other than 
German (the tags for images, categories, and other elements are different).

Original comment by maria.mo...@gmail.com on 7 Jun 2011 at 3:41

GoogleCodeExporter commented 9 years ago

Original comment by oliver.ferschke on 11 Jun 2011 at 9:57

GoogleCodeExporter commented 9 years ago
The parser was never available as a stand-alone package and will not be 
published as one. However, it CAN be used independently of JWPL. Anyone who 
wants to do this is invited to use the JWPL API package.

The documentation issue is a valid point. The existing documentation in the 
wiki here on Google Code should be extended.

Original comment by oliver.ferschke on 13 Aug 2011 at 6:44