First, thanks for writing and maintaining this speedy library. It makes a huge difference when parsing thousands of documents compared to other parsers.
Anyway, while attempting to strip the data tag from some content, I noticed that the library seems to choke on this specific tag. Here's some code to reproduce it:
from selectolax.parser import HTMLParser
html = """test"""
tree = HTMLParser(html)
tree.unwrap_tags(["data"])
Output: Segmentation fault (core dumped)
I'm running selectolax==0.2.6 on Python 3.6.9.
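In case it helps with checking other versions: since the crash kills the interpreter, the snippet can be run in a child process so the caller survives and can report what happened. This is only a rough sketch; the helper names run_isolated and describe are made up for illustration and are not part of selectolax, and the signal check assumes a POSIX system.

```python
import signal
import subprocess
import sys

# Reproduction snippet from this report, run in a child interpreter so that a
# segfault does not take down the calling process.
SNIPPET = '''
from selectolax.parser import HTMLParser
html = """test"""
tree = HTMLParser(html)
tree.unwrap_tags(["data"])
'''


def run_isolated(code):
    """Run `code` in a fresh Python process and return its exit status.

    Hypothetical helper, not part of selectolax.
    """
    result = subprocess.run([sys.executable, "-c", code])
    return result.returncode


def describe(returncode):
    """Turn a subprocess return code into a short human-readable message."""
    # On POSIX, a negative return code means the child was killed by a signal.
    if returncode < 0:
        return "killed by " + signal.Signals(-returncode).name
    return "exited with status %d" % returncode


if __name__ == "__main__":
    print(describe(run_isolated(SNIPPET)))
```

Running this under different selectolax versions should show whether the child is still being killed by a signal or exits normally.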