clemEssien / spark-nlp-phi-annotator

Apache License 2.0
1 stars 0 forks source link

Dockerize Spark NLP #10

Closed tschaffter closed 3 years ago

tschaffter commented 3 years ago

Tasks

tschaffter commented 3 years ago

@clemEssien I needed to install man (mkdir -p /usr/share/man/man1) before installing Java. I now have Java installed:

$ docker exec -it phi-annotator bash
root@3003013fd504:/# java -version
openjdk version "11.0.11" 2021-04-20
OpenJDK Runtime Environment (build 11.0.11+9-post-Debian-1deb10u1)
OpenJDK 64-Bit Server VM (build 11.0.11+9-post-Debian-1deb10u1, mixed mode, sharing)
tschaffter commented 3 years ago

Apache Spark is running by default on port 8080. I now run uWSGI on port 8081 to prevent conflict.

After running apt install iproute2 -y

root@8254dd7f5469:/opt/spark# ss -tunelp | grep 8081
tcp     LISTEN   0        100              0.0.0.0:8081           0.0.0.0:*      uid:33 ino:135958 sk:3 <->
root@8254dd7f5469:/opt/spark# ss -tunelp | grep 8080
tcp     LISTEN   0        1                0.0.0.0:8080           0.0.0.0:*      users:(("java",pid=395,fd=336)) ino:140877 sk:5 <->
tschaffter commented 3 years ago

Notes

EDIT: I could not access port 8080 because I had another service running on my computer on that port. After stopping it, I can now access the Apache Spark landing page.

image

tschaffter commented 3 years ago

I can successfully load Spark NLP from the terminal using the instruction listed here.

tschaffter commented 3 years ago

Replaced by https://github.com/nlpsandbox/phi-annotator-spark-nlp/pull/3