USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0

Standalone Docker image #166

Closed by buggtb 3 years ago

buggtb commented 5 years ago

For a scalable deployment, you don't need Solr etc. in the Docker image; a standalone image would be useful.

ghost commented 4 years ago

Are there any updates on the dockerization of Sparkler? It would be awesome to be able to try it quickly.

thammegowda commented 3 years ago

@x0rzkov Please see https://hub.docker.com/r/uscdatascience/sparkler/tags?page=1&ordering=last_updated