Closed Marcos-A closed 1 year ago
what does the /tmp/tika*log file return? I need to see what that is showing to debug this thanks @Marcos-A
Hi, @chrismattmann, thank you for your response. This is the content of the tika.log
file:
2023-02-23 23:55:03,003 [MainThread ] [INFO ] Retrieving file:////opt/pdf2xlsx/tika_server_files/tika-server-standard-2.6.0.jar to /tmp/tika-server.jar.
2023-02-23 23:55:03,016 [MainThread ] [INFO ] Retrieving file:////opt/pdf2xlsx/tika_server_files/tika-server-standard-2.6.0.jar.md5 to /tmp/tika-server.jar.md5.
2023-02-23 23:55:03,017 [MainThread ] [INFO ] Retrieving file:////opt/pdf2xlsx/tika_server_files/tika-server-standard-2.6.0.jar to /tmp/tika-server.jar.
2023-02-23 23:55:03,030 [MainThread ] [WARNI] Failed to see startup log message; retrying...
2023-02-23 23:55:08,032 [MainThread ] [WARNI] Failed to see startup log message; retrying...
2023-02-23 23:55:13,033 [MainThread ] [WARNI] Failed to see startup log message; retrying...
2023-02-23 23:55:18,034 [MainThread ] [ERROR] Tika startup log message not received after 3 tries.
2023-02-23 23:55:18,037 [MainThread ] [ERROR] Failed to receive startup confirmation from startServer.
On the other hand, this is the content of tika-server.log
:
Caused by: java.lang.ClassNotFoundException: org.apache.tika.server.core.TikaServerCli
Please, let me know if you need more information.
thanks @Marcos-A in your /tmp
directory are you seeing the corresponding tika-server-standard-2.6.0.jar
file? the error in tika-server.log
seems to indicate that the jar file is somehow corrupt. Can you check the contents of the jar file?
Thank you so much, @chrismattmann, your suspicions were correct. The JAR file was corrupt. The strangest thing is that I tried downloading it several times before, even with different versions of Tika Server without much success. Anyway, I tested it downloading the file again and it worked flawlessly. Thank you very much for your support and hard work.
anytime thank you @Marcos-A !
also thank you so much for the tip @Marcos-A !
Setting the
TIKA_SERVER_JAR
environment variable to a local file successfully tellspython-tika
to "download" this file and move it to/tmp/tika-server.jar
, but returns the following errors:Installed on a Docker container:
Local JAR files downloaded:
Java version:
Setup works as expected after unsetting the
TIKA_SERVER_JAR
environment variable, forcingpython-tika
to download the same version of the JAR file from Apache.I tried setting the
JAVA_HOME
environment variable to the Java installation directory, as well as thePATH
variable, with no further success.