cernopendata / cernopendata-client

CERN Open Data command-line client
http://cernopendata-client.readthedocs.io/
GNU General Public License v3.0

select record by title should use exact string matching #56

Open tiborsimko opened 4 years ago

tiborsimko commented 4 years ago

Looking up a record by title sometimes fails because the query returns more than one record:

$ cernopendata-client get-record --title '/BTau/Run2010B-Apr21ReReco-v1/AOD'
More than one record fit this title. This should not happen.

See two records: http://opendata.cern.ch/search?page=1&size=20&q=title:%22%2FBTau%2FRun2010B-Apr21ReReco-v1%2FAOD%22

However, there is precisely one dataset record matching the given title exactly, record ID 1, which should be returned...

We could look for an "exact" string match instead of a "substring" match, as it were, to fix this problem. Note that fixing this may require amending Elasticsearch settings on the server side.
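As a rough illustration of the "exact" match idea, here is a minimal client-side sketch. It assumes the search hits arrive as a list of dictionaries with `recid` and `title` keys; that shape, the function name `select_record_by_exact_title`, and the second sample hit are hypothetical and not taken from the client's actual code or from the server's real response.

```python
def select_record_by_exact_title(records, title):
    """Return the one record whose title equals `title` exactly.

    `records` is assumed to be a list of dicts with "recid" and "title"
    keys (a hypothetical shape used only for this sketch).
    """
    # Keep only hits whose title is identical to the requested one,
    # discarding hits that merely contain it as a substring.
    matches = [r for r in records if r.get("title") == title]
    if not matches:
        raise LookupError("No record has exactly this title.")
    if len(matches) > 1:
        raise LookupError("More than one record has exactly this title.")
    return matches[0]


# Made-up search hits: the second entry stands in for a near-match
# that a substring-style title query could also return.
hits = [
    {"recid": 1, "title": "/BTau/Run2010B-Apr21ReReco-v1/AOD"},
    {"recid": 99999, "title": "/BTau/Run2010B-Apr21ReReco-v1/AOD (hypothetical near-match)"},
]

record = select_record_by_exact_title(hits, "/BTau/Run2010B-Apr21ReReco-v1/AOD")
print(record["recid"])
```

Filtering on the client like this would only work around the ambiguity. A server-side exact match would still depend on how the title field is indexed in Elasticsearch.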

Originally posted by @tiborsimko in https://github.com/cernopendata/cernopendata-client/issues/19#issuecomment-683651334

utkarsh147-del commented 3 years ago

I want to work on this issue. Please assign it to me.

tiborsimko commented 3 years ago

Hi @utkarsh147-del, thanks for your kind offer. However, this issue requires server-side upgrades on the digital repository side, e.g. see https://github.com/cernopendata/opendata.cern.ch/issues/2930 and many issues in the forthcoming Infrastructure-Upgrade sprint. Hence we plan to address search upgrades there more holistically...