USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
411 stars 143 forks source link

RemoteSolrException: Error from server at unknown field 'segment' #158

Closed micheladennis closed 6 years ago

micheladennis commented 6 years ago

I get this error after running a freshly build project: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/crawldb: ERROR: [doc=268D12B57DBCCBE3743A1811F3BA6FB8CA234EB3CA9D777A211AE77DFA8EC407] unknown field 'segment'

Reproduce:

Environment and Version Information

Please indicate relevant versions, including, if relevant:

I would like run a debub on Sparkler, but unable to move past the just doing an initial test.

micheladennis commented 6 years ago

Further Details on the Problem. I ran Dockler.sh in order to get Solr up and running, then run sparkler.sh outside of Docker. This error consistently reproduced if this happens.

Solution/Workaround