Hi, I'm working on an indexer/crawler that indexes the content of the Weld website for search (see
https://issues.jboss.org/browse/ORG-2763). The problem is that there is no unique CSS selector that picks out the real content of a page without all the common parts (header, main menu, footer). Indexing only the real content can improve search relevance.
Could you please add a unique CSS class, e.g. content-wrapper, to the div that contains the real content of the page?
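
To illustrate what the crawler could do once such a class exists, here is a minimal sketch using only the Python standard library. It is not the actual indexer code: the names ContentExtractor and extract_content are hypothetical, and content-wrapper is just the class name suggested above.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ContentExtractor(HTMLParser):
    """Collects text found inside <div class="content-wrapper"> (hypothetical helper)."""

    def __init__(self, css_class="content-wrapper"):
        super().__init__()
        self.css_class = css_class
        self.depth = 0    # 0 means we are outside the wrapper div
        self.chunks = []  # text fragments collected inside the wrapper

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth > 0:
            # Already inside the wrapper: track nesting of child elements.
            self.depth += 1
        elif tag == "div":
            classes = (dict(attrs).get("class") or "").split()
            if self.css_class in classes:
                self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if self.depth > 0 and text:
            self.chunks.append(text)


def extract_content(html, css_class="content-wrapper"):
    """Return the text inside the wrapper div, skipping header, menu and footer."""
    parser = ContentExtractor(css_class)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

With a wrapper like this on every page, the indexer would no longer need per-page heuristics to drop the header, main menu and footer. The sketch assumes reasonably well-formed HTML; a real crawler would more likely use a proper CSS selector engine.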