localgovdrupal / localgov_demo

Example content for demonstrating the LocalGovDrupal distribution and to help with development.
GNU General Public License v2.0
0 stars 1 forks source link

[warning] An overlong word (more than 50 characters) was encountered while indexing: #57

Closed finnlewis closed 2 years ago

finnlewis commented 3 years ago

[warning] An overlong word (more than 50 characters) was encountered while indexing: committedkeepingvulnerableadultssafenewsafeguardinghead.<br />Since database search servers currently cannot index words of more than 50 characters, the word was truncated for indexing. If this should not be a single word, please make sure the "Tokenizer" processor is enabled and configured correctly for index Sitewide search.

When enabling the localgov_demo module we see lots of these warnings.

This might be mainly down to indexing rendered html that includes long links.

This thread might be useful:

https://www.drupal.org/project/search_api/issues/3069515

Specifically

https://www.drupal.org/project/search_api/issues/3069515#comment-13796223

finnlewis commented 2 years ago

@msayoung and @finnlewis just started looking at this again. We're exploring moving the tokeniser processor and /or defining the whitespace characters to include hyphens.