[warning] An overlong word (more than 50 characters) was encountered while indexing: committedkeepingvulnerableadultssafenewsafeguardinghead.<br />Since database search servers currently cannot index words of more than 50 characters, the word was truncated for indexing. If this should not be a single word, please make sure the "Tokenizer" processor is enabled and configured correctly for index Sitewide search.
When enabling the localgov_demo module we see lots of these warnings.
This might be mainly down to indexing rendered HTML that includes long links.
@msayoung and @finnlewis have just started looking at this again. We're exploring moving the tokeniser processor and/or defining the whitespace characters to include hyphens.
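To illustrate why treating hyphens as whitespace might help, here is a rough Python sketch. It is not Search API's actual tokenizer code, just a toy model of the idea. The hyphenated slug is a guess at what the original link text may have looked like, and `tokenize`, `overlong`, `whitespace_chars` and `ignored_chars` are made-up names for this example only.

```python
import re

# Limit quoted in the warning message.
MAX_WORD_LENGTH = 50


def tokenize(text, whitespace_chars=" ", ignored_chars=""):
    """Toy tokenizer: drop ignored characters, then split on whitespace characters."""
    if ignored_chars:
        # Ignored characters are removed, which joins the neighbouring words.
        text = re.sub("[" + re.escape(ignored_chars) + "]", "", text)
    # Whitespace characters act as word boundaries.
    parts = re.split("[" + re.escape(whitespace_chars) + "]+", text)
    return [part for part in parts if part]


def overlong(tokens):
    """Return the tokens that exceed the 50-character limit."""
    return [token for token in tokens if len(token) > MAX_WORD_LENGTH]


# Hypothetical slug: an assumption about the source of the word in the warning.
slug = "committed-keeping-vulnerable-adults-safe-new-safeguarding-head"

# Hyphens stripped: the slug collapses into one 55-character word.
joined = tokenize(slug, whitespace_chars=" ", ignored_chars="-")

# Hyphens treated as whitespace: the slug splits into short words.
split = tokenize(slug, whitespace_chars=" -")

print(overlong(joined))
print(overlong(split))
```

If the real processor behaves roughly like this, adding the hyphen to the whitespace characters should stop slugs of this shape from producing the warning, though long words without any separator would still be truncated.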
This thread might be useful:
https://www.drupal.org/project/search_api/issues/3069515
Specifically:
https://www.drupal.org/project/search_api/issues/3069515#comment-13796223