USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
411 stars 143 forks source link

[Nutch][Memex] Make SOLR query for generator configurable through yaml #128

Closed sujen1412 closed 6 years ago

sujen1412 commented 6 years ago

Issue Description

This issue is linked to (and maybe considered as the first step to) https://github.com/USCDataScience/sparkler/issues/47

The user should be able to select the SOLR fields to group by, sort by and should be able to choose his/her own sort order.