jhy / jsoup

jsoup: the Java HTML parser, built for HTML editing, cleaning, scraping, and XSS safety.
https://jsoup.org
MIT License
10.9k stars 2.17k forks

It doesn't work on asynchronously loaded pages #492

Closed liefdiy closed 9 years ago

liefdiy commented 9 years ago

You can try parsing this URL: http://detail.tmall.com/item.htm?id=4044334956&spm=181.1109657.0.0.lEfDyA and then look for the "span.tm-price" element. The result is null because that part of the page is loaded asynchronously.

SubiRUN commented 9 years ago

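Jsoup is a lightweight library that does not include a JavaScript engine, so it only sees the HTML the server sends; anything a script adds afterwards never appears in the parsed document. That makes this expected behavior rather than a jsoup issue. You could try a browser automation tool such as HtmlUnit instead.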
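
To illustrate the point, here is a minimal sketch of what jsoup sees when an element is created by a script. The HTML string, the `price-box` id, and the class name `StaticParseDemo` are made up for the example and are not taken from the page in the report; only the `span.tm-price` selector comes from it. The script is kept as raw text and never executed, so the selector finds nothing.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class StaticParseDemo {
    // Hypothetical markup: the price span exists only inside a script
    // that a browser would run after the page loads.
    static final String HTML =
        "<html><body>"
        + "<div id=\"price-box\"></div>"
        + "<script>"
        + "document.getElementById('price-box').innerHTML ="
        + " \"<span class='tm-price'>99.00</span>\";"
        + "</script>"
        + "</body></html>";

    // Parses the given HTML and returns the first span.tm-price, or null if none exists.
    static Element findPrice(String html) {
        Document doc = Jsoup.parse(html);
        return doc.selectFirst("span.tm-price");
    }

    public static void main(String[] args) {
        // The script is never run, so the span is absent from the parsed tree.
        System.out.println("price element: " + findPrice(HTML));

        // The script body is still available, but only as unexecuted text.
        Document doc = Jsoup.parse(HTML);
        System.out.println("script text: " + doc.selectFirst("script").data());
    }
}
```

If the same span is present in the server-sent markup, `findPrice` returns it as usual, so the difference comes down to whether the content needs JavaScript to appear.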