esmero / archipelago-deployment

Archipelago Commons Docker Deployment Repository
33 stars 15 forks source link

Upgrade OCR Solr field to tokenize on "," and others #252

Closed DiegoPino closed 7 months ago

DiegoPino commented 9 months ago

What?

This has come up before and better to fix here and on deployment-live so I don't forget next time. The Solr field definition for the OCR highlight field is not properly tokenizing on commas and other characters, forcing things like "Diego," to be exact search queries instead of allowing "Diego" to also match "Diego,".

I have fixed this before in Live instances so will copy the new field definitions (YAML file and Solr setup) into 1.1.0 and 1.1.0-iiif @alliomeria ping

DiegoPino commented 7 months ago

Solved in 1.3.0. See https://github.com/esmero/archipelago-deployment/commit/f141c1c8d28895638f71c928e8c5b71cf5e4a821