USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
411 stars 143 forks source link

Performed code refactoring #138

Closed giuseppetotaro closed 6 years ago

giuseppetotaro commented 6 years ago

Performed code refactoring based on the following suggestions:

chrismattmann commented 6 years ago

@thammegowda please review

thammegowda commented 6 years ago

Sorry, I missed it last weekend. I will review it before the end of this week.

thammegowda commented 6 years ago

@giuseppetotaro Merged. Thanks.

I made a few edits (mostly optimizations, see https://github.com/USCDataScience/sparkler/commit/ee332d6795fa65f8f53c4af9c8dbf96e64c4e1c5 if you are curious how I merged score and status updates into a single update)

We need a plugin, and we need atleast one testcase to accompany that plugin, we dont have any!