eonum / medtextcollector

Scripts for the collection of online medical texts and definitions
MIT License
1 stars 0 forks source link

Consider BeatifulSoup for HTML to text extraction #13

Open tschimbr opened 6 years ago

tschimbr commented 5 years ago

https://github.com/buriy/python-readability

See chapter 3 in "Applied Text Analysis with Python"