USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
411 stars 143 forks

Support for infinite crawl or until the end of all new URLs #152

thammegowda closed 6 years ago