USCDataScience / sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
http://irds.usc.edu/sparkler/
Apache License 2.0
412 stars, 143 forks

warning: usage of JAVA_HOME is deprecated, use ES_JAVA_HOME #260

Open francesco1119 opened 1 year ago

francesco1119 commented 1 year ago

Issue Description

After I execute the command:

docker run -v elastic:/elasticsearch-7.17.0/data ghcr.io/uscdatascience/sparkler/sparkler:main inject -id myid -su 'https://www.nature.com/articles/s41586-022-04983-9'

I see the warning:

warning: usage of JAVA_HOME is deprecated, use ES_JAVA_HOME
warning: usage of JAVA_HOME is deprecated, use ES_JAVA_HOME
OpenJDK 64-Bit Server VM warning: Option UseConcMarkSweepGC was deprecated in version 9.0 and will likely be removed in a future release.

When I try to open the URL http://localhost:8983/solr/crawldb, nothing happens.

How to reproduce it

The warning appears at the third step of your guide.

Environment and Version Information

francesco1119 commented 1 year ago

Should the URL be available at http://localhost:8083/solr or http://localhost:8083/banana?
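
One way to narrow this down is to check which of the ports mentioned in this thread actually accept connections on the host. The docker run command in the report has no -p option, so with Docker's default networking no container port would be published to localhost; whether that is the cause here is an assumption, not something the thread confirms. The following is a minimal Python sketch; the helper names endpoint and is_port_open are made up for illustration, and the URL list is simply the three URLs quoted above.

```python
import socket
from urllib.parse import urlparse

# URLs quoted in this thread; which one (if any) the container serves is unknown.
CANDIDATE_URLS = [
    "http://localhost:8983/solr/crawldb",
    "http://localhost:8083/solr",
    "http://localhost:8083/banana",
]


def endpoint(url):
    """Return (host, port) for a URL, falling back to the scheme's default port."""
    parts = urlparse(url)
    default_port = 443 if parts.scheme == "https" else 80
    return parts.hostname, parts.port or default_port


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    for url in CANDIDATE_URLS:
        host, port = endpoint(url)
        state = "open" if is_port_open(host, port) else "closed or unreachable"
        print(f"{url}: port {port} is {state}")
```

If every port reports as closed, the service is probably not reachable from the host at all, which points at the container's port mapping rather than at the URL path.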

francesco1119 commented 1 year ago

Hello, is this project still maintained or is it deprecated?

thammegowda commented 1 year ago

Hi, I am one of the authors, and unfortunately this project is no longer among my top-10 priorities (I can't speak for the others, though). A lot has happened in life and career, and I am sorry to say that I no longer actively maintain it.

If anyone following this project is interested in maintaining or advancing it further, please reach out to me (https://twitter.com/thammegowda) or the director of USCDataScience (https://twitter.com/chrismattmann). We will be happy to help with the transition and grant permissions.