USCDataScience/sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
411 stars
143 forks
Add Regex URLFilter Plugin #18
Closed
thammegowda closed 8 years ago
thammegowda commented 8 years ago
Use the regex url filter plugin from Nutch
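
For context, here is a rough sketch of what a Nutch-style regex URL filter does. It assumes the rule format used by Nutch's `regex-urlfilter.txt`: each rule is a `+` (include) or `-` (exclude) followed by a regular expression, `#` starts a comment, the first rule whose pattern is found in the URL decides, and a URL that matches no rule is rejected. The class name `RegexUrlFilterSketch` and its `accept` method are hypothetical and are not taken from the Sparkler or Nutch code base.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical sketch of a Nutch-style regex URL filter.
// This is NOT the actual Sparkler or Nutch plugin code.
class RegexUrlFilterSketch {

    // One rule: include (+) or exclude (-) URLs in which the pattern is found.
    private static final class Rule {
        final boolean include;
        final Pattern pattern;

        Rule(boolean include, Pattern pattern) {
            this.include = include;
            this.pattern = pattern;
        }
    }

    private final List<Rule> rules = new ArrayList<>();

    // Parse rule lines; blank lines and lines starting with '#' are skipped.
    RegexUrlFilterSketch(List<String> lines) {
        for (String raw : lines) {
            String line = raw.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue;
            }
            char sign = line.charAt(0);
            if (sign != '+' && sign != '-') {
                throw new IllegalArgumentException("Rule must start with + or -: " + line);
            }
            rules.add(new Rule(sign == '+', Pattern.compile(line.substring(1))));
        }
    }

    // The first rule whose pattern is found in the URL decides.
    // URLs that match no rule are rejected.
    boolean accept(String url) {
        for (Rule rule : rules) {
            if (rule.pattern.matcher(url).find()) {
                return rule.include;
            }
        }
        return false;
    }
}
```

With the rules `-\.(gif|jpg|png)$` followed by `+^https?://`, such a filter would drop image URLs and keep other HTTP(S) URLs. Rule order matters because evaluation stops at the first match.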