rehamaltamimi / gwtwiki

Automatically exported from code.google.com/p/gwtwiki
0 stars 0 forks source link

PlainTextConverter output is not optimal #6

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Wikipedia article "Anarchism"
2. Parsing the text of this page and convert it to plain text using
   String txt = wikiModel.render(new MyPlainTextConverter(false), str);

What is the expected output? What do you see instead?
I would expect that the output is plain, readable text. Instead, commands
(e.g., {{pp-move-indef}}, {{cite journal}}, {{main}}, etc.) are still
included. Also, footnote/reference texts are inlined, which destroys the
readability. As I noticed, only the first of multiple footnotes is
included. Some of the language-related links (e.g., sco:Anarchism) are not
filtered.

What version of the product are you using? On what operating system?
Using bliki.3.0.11.zip on a Windows XP machine, Java 1.5, Eclipse

Please provide any additional information below.
First of all, great work! However, some additional extensions would boost
the quality of bliki. These include optimal plain text output that can be
fed into further analysis tools (e.g., computational linguistics,
TextMining, etc.). Thus, the getBodyString() method in the TagNode class
(htmlcleaner package) has to be adapted.

Original issue reported on code.google.com by Hassler....@gmail.com on 8 Apr 2009 at 7:17

GoogleCodeExporter commented 8 years ago

Original comment by axelclk@gmail.com on 17 Apr 2009 at 4:08

GoogleCodeExporter commented 8 years ago
Although it's not perfect. Could you please try this example:
http://code.google.com/p/gwtwiki/source/detail?r=502

Original comment by axelclk@gmail.com on 26 Aug 2009 at 4:51

GoogleCodeExporter commented 8 years ago
I agree. The output should not have the {{}} or  [].  citations seem to be the 
big one 
here...

Original comment by sportman...@gmail.com on 26 Feb 2010 at 8:16

GoogleCodeExporter commented 8 years ago

Original comment by axelclk@gmail.com on 27 Jan 2012 at 3:49

GoogleCodeExporter commented 8 years ago

Original comment by axelclk@gmail.com on 3 Jul 2012 at 5:41