HTML Extract References is not very robust

o0111 / ruralcafe

Automatically exported from code.google.com/p/ruralcafe

0 stars 0 forks source link

HTML Extract References is not very robust #24

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago

Since we rolled our own, this method is not very robust. We probably want to 
borrow an open source library for this. Somewhat ties into the caching of 
dynamic pages (Issue 3).

Original issue reported on code.google.com by shouldab...@gmail.com on 10 Oct 2010 at 8:43

GoogleCodeExporter commented 8 years ago

I used HTML Agility Pack as a library. It yields the same results and can 
replace our HTML parser. Nevertheless I am still using 
HtmlParser.LinkTagAttributes and
HtmlParser.EmbeddedObjectTagAttributes, as I did not find a better way to 
detect references.

Is there anything else to be done here?

Original comment by satiaher...@gmx.de on 2 May 2013 at 3:51

Changed state: Started

GoogleCodeExporter commented 8 years ago

Nothing more to be done.

Original comment by satiaher...@gmx.de on 9 May 2013 at 4:52

Changed state: Fixed