ukwa / ukwa-manage

Shepherding our web archives from crawl to access.
Apache License 2.0
10 stars 5 forks source link

Create and test Hadoop 020/3 Solr indexer #81

Open anjackson opened 2 years ago

anjackson commented 2 years ago

We want to start automatically indexing FC WARCs for full-text search.

Some of this will be done in ukwa-manage rather than here, but we'll need an Airflow runner.

Create a Solr indexer that: