TU-Berlin / project-mlp

a machine learning approach for processing mathematical language in scientific documents

find a library that converts HTML to plain text #3

Open physikerwelt opened 9 years ago

physikerwelt commented 9 years ago

... this should be available... it would be optimal if highlighting could be preserved in the same way as we support it for wikilinks and other wiki markup blocks #2

alexeygrigorev commented 9 years ago

jsoup should be good for this purpose
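
For illustration, a minimal sketch of the plain-text conversion with jsoup, assuming jsoup is on the classpath. The class name `HtmlToText` and the method `toPlainText` are hypothetical and not part of project-mlp. The sketch only strips the markup; it does not preserve highlighting, which would need a custom traversal of the parsed document.

```java
import org.jsoup.Jsoup;

// Hypothetical helper, not taken from project-mlp.
public class HtmlToText {

    // Parses the HTML and returns its text content with all tags removed.
    // jsoup normalizes whitespace in the returned string.
    public static String toPlainText(String html) {
        return Jsoup.parse(html).text();
    }

    public static void main(String[] args) {
        String html = "<p>Hello <b>world</b></p>";
        System.out.println(toPlainText(html)); // prints "Hello world"
    }
}
```

`Jsoup.parse` also accepts malformed HTML and builds a best-effort document tree from it, which is useful for markup scraped from real pages.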