Some search result snippets have big whitespace (NBSP) between sentences.
This occurs due to the coded whitespace in the webpage when its content was
extracted.
Whitespace should be trimmed and if the trimming is big (e.g., more than 4
spaces), it should insert an ellipsis to visually separate the sentences.
Example of snippets with big whitespace (not the best examples though):
http://arquivo.pt/nutchwax/search.jsp?query=http://dacp.pt/trofeus.htm%20ta%C3%A
7ahitsPerDup=0&dedupField=site
"Troféu Taça DACP DOGUE ALEMÃO CLUBE DE PORTUGAL Largo do Araújo, 59
4465-680 LEÇA DO BALIO ..."
http://arquivo.pt/nutchwax/search.jsp?l=pt&query=jogos+ol%C3%ADmpicos+atenas+htt
p%3A%2F%2Fwww.aaop.pt%2F
"Jogos Olímpicos / Atenas 2004 De 13 Ago 2004 a 29 Ago 2004 ver toda a agenda
Natação ... para os ..."
Original issue reported on code.google.com by devel.da...@vcruz.net on 1 Aug 2012 at 2:05
Original issue reported on code.google.com by
devel.da...@vcruz.net
on 1 Aug 2012 at 2:05