internetarchive / openlibrary

One webpage for every book ever published!
https://openlibrary.org
GNU Affero General Public License v3.0

Author search results pages show many more works per author than there actually are #9256

Closed stopregionblocking closed 3 weeks ago

stopregionblocking commented 5 months ago

Problem

Author search results have never shown fully accurate work counts, but recently the counts are frequently several times higher than the actual number of works.

Relevant URL(s)

Sample searches:

  - https://openlibrary.org/search/authors?q=jeeves
  - https://openlibrary.org/search/authors?q=jarid
  - https://openlibrary.org/search/authors?q=kom%27boa
  - https://openlibrary.org/search/authors?q=bishara

Reproducing the bug

  1. Search for an author
  2. Observe the number of works attributed on the author search result page
  3. Observe the number of works attributed on the author page
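
The comparison in the steps above could be scripted roughly as follows. This is only a sketch, not part of the original report: it assumes the JSON variants of these pages (`/search/authors.json`, whose `docs` carry `key`, `name`, and `work_count`, and `/authors/<key>/works.json`, which reports a `size`) behave as described in the public Open Library API docs. The helper names `fetch_json`, `find_mismatches`, and `compare_author_counts` are made up for illustration.

```python
import json
import urllib.parse
import urllib.request

# Assumption: production base URL; swap in http://localhost:8080 for a local dev instance.
BASE = "https://openlibrary.org"


def fetch_json(url):
    """Fetch a URL and parse the response body as JSON."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return json.load(resp)


def find_mismatches(search_docs, actual_count_for):
    """Return (key, name, reported, actual) for authors whose counts differ.

    search_docs: author docs from the search endpoint (assumed to carry
        'key', 'name', and 'work_count').
    actual_count_for: callable mapping an author key such as 'OL1A' to the
        number of works listed on that author's page.
    """
    mismatches = []
    for doc in search_docs:
        # Keys may arrive as 'OL1A' or '/authors/OL1A'; keep only the ID part.
        key = doc["key"].rsplit("/", 1)[-1]
        reported = doc.get("work_count", 0)
        actual = actual_count_for(key)
        if reported != actual:
            mismatches.append((key, doc.get("name"), reported, actual))
    return mismatches


def compare_author_counts(query, base=BASE):
    """Steps 1-3: search for authors, then compare each count to the author page."""
    search_url = f"{base}/search/authors.json?q={urllib.parse.quote(query)}"
    docs = fetch_json(search_url).get("docs", [])

    def actual_count_for(key):
        # Assumption: 'size' in works.json is the author's total number of works.
        works = fetch_json(f"{base}/authors/{key}/works.json?limit=1")
        return works.get("size", 0)

    return find_mismatches(docs, actual_count_for)
```

Calling `compare_author_counts("jeeves")` would then list the authors from the first sample search whose search-result count differs from their author-page count. The comparison logic is kept separate from the network calls so it can be checked against canned data.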

jchefdeville commented 5 months ago

Hi, I wrote a message on Gitter. I wanted to reproduce this issue, but I can't do it on localhost: I don't get any results :( Can anyone help me contribute?

scottbarnes commented 5 months ago

@jchefdeville, I can't speak to reproducibility, but in terms of finding results, the following may show some authors: http://localhost:8080/search/authors?q=*&mode=everything

cdrini commented 3 weeks ago

This is a duplicate of #9558. The fix has been merged, but it will require a full Solr reindex before it takes effect. That will likely happen in about a month.