There are various directives in use in the web to tell crawlers to ignore
sections of documents. The HTML parser should understand these directives
(it should probably also be possible to configure it to ignore them).
A good list of such directives is at:
http://wunderwood.org/most_casual_observer/2007/05/selective_page_indexing_direc
t.html
Original issue reported on code.google.com by boulton.rj@gmail.com on 30 Nov 2007 at 1:15
Original issue reported on code.google.com by
boulton.rj@gmail.com
on 30 Nov 2007 at 1:15