NuGet / NuGetGallery

NuGet Gallery is a package repository that powers https://www.nuget.org. Use this repo for reporting NuGet.org issues.
https://www.nuget.org/
Apache License 2.0
1.54k stars 644 forks source link

[Azure Search] Consider supporting misspelled terms #7386

Open loic-sharma opened 5 years ago

loic-sharma commented 5 years ago

Is your feature request related to a problem? Please describe.

Our data shows that ~7.4% of "legacy" and ~1.5% "preview" search queries have no results. Many of these queries are due to misspelled terms.

See https://github.com/NuGet/NuGetGallery/issues/7128#issuecomment-515607493 for data analysis.

Describe the solution you'd like

Consider adding fuzzy search to our queries: https://docs.microsoft.com/en-us/azure/search/search-get-started-portal#fuzzy-search

Additional context

joelverhagen commented 5 years ago

Bounce should decrease and % of search queries that have no results must go down (as a ship gate, otherwise how do we measure success of this change).

loic-sharma commented 5 years ago

Verify that this doesn't make typosquatting worse. Find all package ids, determine how many are within 1/2/3 edit distances