internetarchive / dweb-mirror

Offline Internet Archive project
https://www-dweb-mirror.dev.archive.org/
GNU Affero General Public License v3.0

Palm Leaf crawl #304

Closed mitra42 closed 4 years ago

mitra42 commented 4 years ago

We need to be able to crawl the Palm Leaf (palmleaf.org) items.

mitra42 commented 4 years ago

Test query, as it currently stands in the YAML config:

      - query: 'collection:Bali AND external-identifier:*palmleaf.org*'
        level: details
        search:
          sort: '-downloads'
          rows: 5
          level: details
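
For checking a query like this outside the crawler, a minimal sketch that builds the equivalent request URL for the public archive.org advanced search endpoint may help. This is an assumption about how one might test the query by hand, not a description of how dweb-mirror issues it internally; `build_search_url` is a hypothetical helper, and the mapping of `-downloads` to `downloads desc` is assumed from the endpoint's usual sort syntax.

```python
from urllib.parse import urlencode

# Assumption: the public advanced search endpoint on archive.org.
# dweb-mirror's own crawler may build its requests differently.
ADVANCED_SEARCH = "https://archive.org/advancedsearch.php"


def build_search_url(query, sort="-downloads", rows=5):
    """Return an advanced search URL for `query` (hypothetical helper)."""
    # Translate the YAML-style sort key ('-downloads') into 'downloads desc'.
    if sort.startswith("-"):
        sort_param = f"{sort[1:]} desc"
    else:
        sort_param = f"{sort} asc"
    params = [
        ("q", query),
        ("fl[]", "identifier"),  # only ask for item identifiers
        ("sort[]", sort_param),
        ("rows", rows),
        ("output", "json"),
    ]
    return f"{ADVANCED_SEARCH}?{urlencode(params)}"


# The query from the YAML task above, with the same sort and row count.
url = build_search_url("collection:Bali AND external-identifier:*palmleaf.org*")
print(url)
```

Opening the printed URL in a browser should list the identifiers the query matches, which gives a quick way to compare the result count against what the crawl picks up.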

mitra42 commented 4 years ago

Now getting the right size.

mitra42 commented 4 years ago

Works as a saved search, and now also as the collection bali-lontar-transcribed.