I am trying to parse parts of a weirdly formatted website, where .at_xpath() and .at_css() don't help much. Is it somehow possible to retrieve the raw HTML that a Node or Session instance represent?
if I am not mistaken, you should be able to use session.body() to get the HTML and session.document() to get a parsed version of the document (using the lxml library).
Dear Niklas,
I am trying to parse parts of a weirdly formatted website, where .at_xpath() and .at_css() don't help much. Is it somehow possible to retrieve the raw HTML that a Node or Session instance represent?
Kind regards, Arne