geonetwork / core-geonetwork

GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
http://geonetwork-opensource.org/
GNU General Public License v2.0
406 stars 482 forks source link

index appropriate translation of keywords provided by thesaurus #4458

Open jodygarnett opened 4 years ago

jodygarnett commented 4 years ago

The facets provide the ability to filter documents using the viewers language. However typing this same text does not find the same documents.

Describe the solution you'd like

When indexing the document the "any" field should pull out the appropriate keyword translation as provided by the thesaurus.

Describe alternatives you've considered

Adding every translation into the metadata document does not seem like a good alternative to work around this limitation.

Additional context

This report is based on the use of thesaurus by the luecene index (rather than specific to any schema).

jodygarnett commented 4 years ago

This is just an idea for discussion as I learn how indexing is done; was surprised keyword translations were not respected by text search

fxprunayre commented 4 years ago

Some options we have been investigated to help on this topic:

"any" analyzer is language specific, so we can't really mix keywords in different language in the same field.

jodygarnett commented 4 years ago

Speaking with @josegar74 today I may have a misunderstanding here, sorry I am full of questions while learning :P