sfu-dhil / mvm

GNU General Public License v2.0
0 stars 3 forks source link

Manuscript search issues #99

Closed angelawachowich closed 3 years ago

angelawachowich commented 3 years ago

Hi Michael,

As Betty and I understand it, the manuscript search field should turn up results for call numbers and titles. We think there is an issue with the call number search though. Here are two examples of the problem:

  1. If I am looking for "Folger MS M.a. 15" I can't get results by searching "MS M.a. 15," "M.a. 15," or "15," and if I search "Folger MS M.a. 15," I am shown all the Folger manuscripts.
  2. If I'm looking for Beinecke Osborn c130, I can only find it by searching "c130" (an incorrect result when I search "c 130," "130," or "c.130"). If I include the word "Osborn" or "Beinecke Osborn," I am shown all the Beinecke manuscripts.

Can the manuscript search field be made less space/ punctuation sensitive, and can it be refined so that when you include the archive name in the call number, you aren't shown all manuscripts from that archive?

ubermichael commented 3 years ago

The problem is caused by the very short "words" in the manuscript names. MS and M.a. and 15 are too short for the text indexing algorithms and they get confused by it.

I can adjust the search code for manuscripts so that it does exact string matching instead. So if you search for 15 you will see results the results you expect (and also matches for 150 and 215 etc). Does that seem like a good solution?

ubermichael commented 3 years ago

I think I have a solution that should work.

If you quote the search term ("m.a." or "c130") it will do an exact search and find manuscripts with call numbers that include the quoted term. If you leave out the quotes the search works as before, searching the call number and the description fields.

Can you give it a try and see if it meets your needs?

angelawachowich commented 3 years ago

I'm a little confused. Why can I search short "words" and get results from some archives (eg Leeds Brotherton) and not others (eg Folger, Beinecke)?

Could we adjust the search codes? Searching in quotations is great if you know the exact call number (meaning the proper punctuation and spacing as we have recorded them), but it would be nice to have a backup for users who don't.

ubermichael commented 3 years ago

I think it's time to talk about searching in a bit more detail. What would a user who isn't sure of a call number search for? What would they expect to find?

Adjusting the search isn't a quick or simple or easy thing.

ubermichael commented 3 years ago

Also, can you provide more specific, repeatable examples of "Why can I search short "words" and get results from some archives (eg Leeds Brotherton) and not others (eg Folger, Beinecke)?" Without something I can try on my own there isn't much I can do to try to figure out what is going on.

schellenb commented 3 years ago

Hi all. Just to respond to some of the issues here:

schellenb commented 3 years ago

with regard to what a user not knowing a call number might search for, my hunch is: