upenndigitalscholarship / deep

MIT License
3 stars 1 forks source link

Problem with ignoring punctuation in filtering #228

Closed ZacharyLesser closed 2 months ago

ZacharyLesser commented 2 months ago

If you search

“Comedies, histories and tragedies”

the results are correct. But if you search

"“Comedies, histories, and tragedies”

it returns no hits.

If you search with no commas at all, you get correct results as well.

AND: if you search “comedies histories, and tragedies” — with just one comma but in a different place — you STILL get correct results. Same thing if you put the one comma after “and” (even though that is nonsensical in English)

The upshot: the search seems to break if there is more than one comma present. This must have something to do with the way the filter ignores punctuation, right? Can you figure out why it can ignore one comma but not two?