internetarchive / fatcat-scholar

search interface for scholarly works
https://scholar.archive.org
Other
80 stars 14 forks source link

Continuous updates from fatcat catalog #4

Closed bnewbold closed 4 years ago

bnewbold commented 4 years ago

The current experimental index is a one-shot, based on a 2020-08 export of fatcat release entities. Of course we want updates to flow from fatcat to the scholar index in the same way that entity updates currently flow to the fatcat metadata search index.

The rough plan for this feature is:

Because scholar/fulltext index updates are relatively expensive compared to regular fatcat entity index updates, we might want to consider some optimizations:

bnewbold commented 4 years ago

This work is complete and has been running for a couple weeks now.