eonum / medtextcollector

Scripts for the collection of online medical texts and definitions
MIT License
1 stars 0 forks source link

Preprocess Wiki Output, parsing update #10

Open fabmue opened 7 years ago

fabmue commented 7 years ago

Replace newline chars '\n' with whitespace ' '. Maybe remove picture annotations, titles, etc.