taoqf / node-html-parser

A very fast HTML parser, generating a simplified DOM, with basic element query support.
MIT License
1.11k stars 107 forks source link

Node html parser ignore tbody #232

Closed Celtian closed 1 year ago

Celtian commented 1 year ago

For example on this page

https://www.transfermarkt.com/slug/kader/verein/62/plus/1

node-html-parser ignores tbody and does not work ...

td.posrela > table > tbody > tr:nth-child(1) > td.hauptlink > a

Now I have to use simplified selector instead ... which is wrong ...

td.posrela > table tr:nth-child(1) > td.hauptlink > a

dbm03 commented 1 year ago

Same happening to me, it is 100% sure the tbody tag is being ignored

Maxdola commented 1 year ago

Are there any Updates or a workaround, since I've now also run into this issue and I need the content within the tbody...

taoqf commented 1 year ago

I'm afraid this would not be fixed, cause there are no thead or tbody in rawhtml, We could query tbody only in browser because browser will add tbody where aren't any tbody in table.