NASA-PDS / planetary-data-engine

Free-text search capability for planetary data, services, tools, and information
Apache License 2.0
0 stars 0 forks source link

Design and develop a new tool to evaluate the quality of a set of search results #15

Open jjacob7734 opened 8 months ago

jjacob7734 commented 8 months ago

💡 Description

Design and develop a software tool that can assign a score to a set of search results to quantify the quality of the result set. The inputs to the tool are a set of search results and an a labeling of the top 10 documents in the search result set on a scale of 0 to 5 where 0 means completely irrelevant and 5 means extremely highly relevant. The expectation is that the documents of high relevance will appear at or near the top of the result set for a successful search. Only the top 10 documents need to be considered for evaluation in this iteration.

tloubrieu-jpl commented 7 months ago

@jjacob7734 is making progress on this ticket with a single test test suite (Cassini search).

jjacob7734 commented 7 months ago

The best metric I've found to evaluate the quality of a search technology is the Normalized Discounted Cumulative Gain (NDCG). That metric and other common ones are succinctly described here: https://ml-compiled.readthedocs.io/en/latest/metrics.html.

jjacob7734 commented 7 months ago

I am using this implementation in the scikit-learn module: https://scikit-learn.org/stable/modules/generated/sklearn.metrics.ndcg_score.html

jordanpadams commented 4 months ago

Not completed prior to project pause. Moving to icebox