LD4P / qa_server

A rails engine with questioning authority gem installed to serve as an authority search server with normalized results.
Apache License 2.0
6 stars 3 forks source link

New Indexing: CERL - changes in accuracy tests #384

Open elrayle opened 3 years ago

elrayle commented 3 years ago

The new indexing scheme has the following impact on CERL accuracy tests:

image

sfolsom commented 3 years ago

Not sure what's happening with Johannes.

To satisfy the "Jacob Winter" test we may need to get closer to exact match with the rdaGr2:nameOfThePerson.

sfolsom commented 3 years ago

I checked again, and I'm still not sure why these passed before, but aren't now. They are spelled exactly the same in the search terms as they are in the label. See: https://data.cerl.org/thesaurus/_search?query=+Johannes+Philippus+de+Lignamine&from=0

elrayle commented 3 years ago

ACTION: @sfolsom will loosen Jacob Winter test ACTION: @eichmann will look into why the Johannes queries aren't working

sfolsom commented 3 years ago

Pull request for loosening Jacob Winter test: https://github.com/LD4P/qa_server/pull/417

elrayle commented 3 years ago

We noticed that the actual label is "Lignamine, Johannes Philippus ¬de¬". The results are picking up anything that has de as a work with or without the ¬ character. And that matches a lot of records.

elrayle commented 3 years ago

ACTION: Need to contact CERL to see if this data is an error on their side.

{
"uri": "http://thesaurus.cerl.org/record/cnp00948202",
"id": "http://thesaurus.cerl.org/record/cnp00948202",
"label": "***, ... ¬de¬"
},
{
"uri": "http://thesaurus.cerl.org/record/cnp00948207",
"id": "http://thesaurus.cerl.org/record/cnp00948207",
"label": "***, ... ¬de¬"
},
{
"uri": "http://thesaurus.cerl.org/record/cnp00972176",
"id": "http://thesaurus.cerl.org/record/cnp00972176",
"label": "***, ... ¬de¬"
},
{
"uri": "http://thesaurus.cerl.org/record/cnp01967027",
"id": "http://thesaurus.cerl.org/record/cnp01967027",
"label": "De***"
},