Open xujiang1 opened 4 years ago
Interesting. I’m marking it as a bug, although I am not 100% sure it is one, and even if it is, it is probably an upstream issue in lxml.
I don’t think Parsel intends to require that input HTML be standards-compliant. Ideally, anything that a browser accepts we should accept as well, because HTML documents in the wild care more about browser support than about standards compliance.
Browsers seem to accept this syntax.
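When I use a CSS selector, I cannot get the p tag under h3. The result is as follows:
When I replace the p tag with a different tag, it is selected normally:
Result: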