USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0

New additions: plugin interfaces, plugin service, urlfilter, regex URL filter, config files.... #21

Closed (thammegowda closed 8 years ago)

thammegowda commented 8 years ago

Browse the commit log for the full details.

@karanjeets please review and merge. Let me know if you have questions!

chrismattmann commented 8 years ago

Please merge this; it's been sitting for 12 days.

karanjeets commented 8 years ago

@thammegowda - Looks good to me. Merging it into master.

chrismattmann commented 8 years ago

thanks @karanjeets @thammegowda