Changes so Sparkler can be optionally configured to run in a Databricks spark environment
What changes were proposed in this pull request?
Changes that allow Sparkler to be optionally configured to run in a Databricks Spark environment.
Is this related to an already existing issue on sparkler?
204
Will it close an existing issue?
204
How was this patch tested?
The resulting fat jar was zipped up with the conf and plugin directories and copied to the Databricks File System (DBFS). A script then pulled the archive onto the master node of a cluster, unzipped it, and executed it. Sample crawls and scrapes were performed, with results persisted to a standalone Solr server on EC2. The results were then retrieved from Solr via its REST API.
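For reviewers who want to repeat the last verification step, reading results back from Solr over REST could look roughly like the sketch below. It only builds a request URL for Solr's standard `select` handler. The host, port, and the core name `crawldb` are assumptions for illustration, not values taken from this patch, and `solr_select_url` is a hypothetical helper.

```python
from urllib.parse import urlencode


def solr_select_url(base_url, core, query="*:*", rows=10):
    """Build a URL for Solr's select handler.

    base_url and core are placeholders; substitute the address of the
    standalone Solr server and the core that Sparkler writes to.
    """
    # urlencode percent-escapes the query so characters like '*' and ':' are safe
    params = urlencode({"q": query, "rows": rows, "wt": "json"})
    return f"{base_url.rstrip('/')}/solr/{core}/select?{params}"


if __name__ == "__main__":
    # Hypothetical host and core name, for illustration only
    print(solr_select_url("http://localhost:8983", "crawldb", rows=5))
```

The printed URL can be fetched with any HTTP client (for example `curl`) to confirm that crawled documents were persisted.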