wetneb / openrefine-wikibase

This repository has migrated to:
https://gitlab.com/nfdi4culture/ta1-data-enrichment/openrefine-wikibase
Other
100 stars 24 forks source link

Fuzzy-matching via ElasticSearch #80

Open wetneb opened 4 years ago

wetneb commented 4 years ago

The Wikidata search service supports fuzzy search by adding ~ to some words:

So, how should the reconciliation service use this? Add ~ to all the words in the query to enable fuzzy search everywhere? Only do it if the exact query did not return any candidate? Just let users add ~ to their own queries when they want to enable the feature? Any other strategy?

Cc @darecoder1999 who brought up the issue

thadguidry commented 3 years ago

There's quite a bit more exposed even beyond ~ from the ElasticSearch-backed CirrusSearch as documented here: https://www.mediawiki.org/wiki/Help:CirrusSearch

Extra Specific Wikibase features are mentioned:

Wikibase search The Wikibase extension defines some search keywords in order to make it easier to search for certain Wikibase items. This is useful on Wikidata and other Wikibase sites, including to search for images with Structured data on Wikimedia Commons. See Help:WikibaseCirrusSearch for details.

And documented here: https://www.mediawiki.org/wiki/Help:Extension:WikibaseCirrusSearch