utkarshkukreti / select.rs

A Rust library to extract useful data from HTML documents, suitable for web scraping.
MIT License
971 stars 69 forks source link

Support quirks mode #28

Open grahamc opened 7 years ago

grahamc commented 7 years ago

I have a somewhat malformed document that I would like to parse. However, without quirks mode (http://doc.servo.org/html5ever/tree_builder/struct.TreeBuilderOpts.html) I'm unable to meaningfully parse the document.

utkarshkukreti commented 7 years ago

Could you give a quick example of a string that parses differently with/without quirks mode?

Also I'm currently using RcDom to do the tree building and as far as I can see, it doesn't actually use the value of quirks_mode anywhere, only stores it: https://github.com/servo/html5ever/blob/master/src/rcdom.rs although I may have missed something!