Open ThomasSchellenbergNextCentury opened 7 years ago
@jasonslepicka
After talking with @saggu, it sounds like the issue is that the `<br/>` tags are only in `knowledge_graph`, but the highlights only use `content_extraction` or `indexed`. In order to have both highlights and line breaks, we need to either:
- Include the `<br/>` tags within `content_extraction`
- Highlight on `knowledge_graph`
We don't want to just use the raw text because part of the cleaning process includes removing excess newlines.
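
For reference, here is a rough sketch of what requesting highlights on a `knowledge_graph` field could look like if the query went straight to Elasticsearch. This is only an illustration: the dotted field path `knowledge_graph.description.value` is my guess at how the `knowledge_graph->description->value` value is mapped in the index, `build_highlight_query` is a made-up helper, and sandpaper's own query format may look different.

```python
import json


def build_highlight_query(text, field="knowledge_graph.description.value"):
    """Build an Elasticsearch search body that highlights matches in `field`.

    The default field path is an assumption about the index mapping.
    """
    return {
        "query": {"match": {field: text}},
        "highlight": {
            "fields": {
                # number_of_fragments=0 asks Elasticsearch to return the whole
                # field value with highlights instead of short fragments, so
                # any <br/> tags stored in the field stay in the result.
                field: {"number_of_fragments": 0}
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_highlight_query("example"), indent=2))
```

Returning the whole field rather than fragments matters here because the UI shows the full title/description, not a snippet.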
Breaks (`<br/>`) from titles/descriptions are being converted into carriage returns (`\r`) in the highlight results from sandpaper. We need the highlighted title/description text to have breaks in order to show line breaks in the DIG UI.

Here is the `knowledge_graph->description->value`:

Here is the `highlight->content_extraction.content_strict.text`:

Here is my sandpaper query on http://10.3.2.82:9876/search/coarse:

Here is the link to the ES document: http://10.1.94.103:9201/dig-etk-search/ads/CDFDF087781B7FCEFD7CEA46A739DAB72F26434CF6B7BE5D34865CAE48243B76
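
As a possible stopgap until the index is fixed, the carriage returns in the highlighted text could be turned back into `<br/>` before display. The sketch below is only an assumption about how that might look: `carriage_returns_to_breaks` is a hypothetical helper, not part of sandpaper or the DIG UI, and it assumes every `\r` (or `\r\n` / `\n`) in the highlight corresponds to one original break.

```python
import re


def carriage_returns_to_breaks(highlighted: str) -> str:
    """Replace each line ending (\\r\\n, \\r, or \\n) with a <br/> tag.

    Any highlight markup already in the string is left untouched.
    """
    # \r\n is listed first so a Windows-style ending becomes one break, not two.
    return re.sub(r"\r\n|\r|\n", "<br/>", highlighted)


if __name__ == "__main__":
    print(carriage_returns_to_breaks("first line\r<em>second</em> line"))
```

This would not fix the root cause, since the cleaning step still decides which breaks survive, but it would let the UI show breaks for whatever the highlight returns.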