TheStanfordDaily / archives-web

Helper functions and web app for METS/ALTO archive viewing.
https://archives.stanforddaily.com
6 stars 2 forks source link

use cloud search cursor #68

Open ufxela opened 4 years ago

ufxela commented 4 years ago

right now, we're limited to 10,000 search results for any search. to unlock maximum potential, use cloudsearch cursor

https://docs.aws.amazon.com/cloudsearch/latest/developerguide/paginating-results.html

epicfaace commented 4 years ago

Don't we already do this? If you navigate to a new page, you'll note that we do a network call like (note the start and size parameters):

https://ehabp6fuc5.execute-api.us-east-1.amazonaws.com/prod?q=test&size=20&start=20&highlight.article_text=%7Bformat:%27html%27,max_phrases:5%7D&fq=publish_date:[%271892-01-01T12:00:00Z%27,%272015-01-02T12:00:00Z%27]

ufxela commented 4 years ago

we are paginating, but we're not doing deep paginating (right now, the results for a query are limited to 10,000). under "Deep Paging Beyond 10,000 Hits in Amazon CloudSearch" it says that you can use a cursor to get more than 10,000 hits