apache / incubator-stormcrawler

A scalable, mature and versatile web crawler based on Apache Storm
https://stormcrawler.apache.org/
Apache License 2.0
887 stars 262 forks source link

Add an archetype for crawling with the OpenSearch module #1017

Closed jnioche closed 1 year ago

jnioche commented 1 year ago

Similar to the one using Elasticsearch but should save users the trouble of having to rename parameters and components if copying the ES one.

jnioche commented 1 year ago

With the latest version of the main branch

mvn clean install
cd /tmp
mvn archetype:generate -DarchetypeGroupId=com.digitalpebble.stormcrawler -DarchetypeArtifactId=storm-crawler-opensearch-archetype -DarchetypeVersion=2.7-SNAPSHOT

and follow the instructions in the README in the folder created by the archetype.