USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
410 stars 143 forks source link

[NUTCH][MEMEX] Create Generator Plugin Interface #47

Open thammegowda opened 7 years ago

thammegowda commented 7 years ago

This plugin shall add customize the URL selection for fetching