TeamHG-Memex / html-text

Extract text from HTML
MIT License

.extract_text returning incorrect format. #29

Closed. hg0428 closed this issue 3 years ago

hg0428 commented 3 years ago

When I use .extract_text on any HTML, it always returns: b'Hello\n\nWorld!' The b'' prefix is visible, yet the value is of the string data type. I also do not want the \n\n there. I have guess_layout set to False; if I set it to True, it just puts an actual newline there instead of the \n.

I need this to be fixed soon.
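
A string that shows a b'' prefix and a literal \n is what Python produces when str() is called on a bytes object. Whether that is what happens here is an assumption; the report does not show the calling code. The sketch below uses only the standard library and a made-up payload (raw is hypothetical) to reproduce the symptom and show how decoding avoids it:

```python
# Hypothetical payload, standing in for bytes read from a file or an HTTP response.
raw = b"<p>Hello</p>\n\n<p>World!</p>"

# str() on bytes returns the repr: a str that starts with b' and holds
# backslash-n character pairs instead of real newline characters.
as_repr = str(raw)

# Decoding turns the bytes into real text with real newlines.
as_text = raw.decode("utf-8")

print(type(as_repr).__name__, as_repr)
print(type(as_text).__name__, repr(as_text))

# Collapsing all whitespace runs, if the blank line between blocks is unwanted.
flat = " ".join("Hello\n\nWorld!".split())
print(flat)
```

If this is the cause, passing the decoded text (as_text above) to html_text.extract_text instead of the str() result should remove the b'' prefix. The whitespace-collapsing line is a generic Python idiom applied to the extracted text afterwards, not an option of the library.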