-
Dear devs,
Dear @bsilverstein as the one who wrote the code for #3921,
while working on #4172, I am crafting a new Maven profile to generate smallest possible Docker images based on the [payara 5 …
-
**Is your feature request related to a problem? Please describe.**
This is more of a question rather than a feature request. I want to be able to retrieve the images from the documents i ingested i…
-
Tika was working for me until yesterday but today I get 504 error when tika server is being downloaded. I get the error when the PDF is being read
`parsed = parser.from_file('mypdf.pdf')`
Here is…
-
@chrismattmann
i do some performance test for multiple times call parser.frombuffer() method, some gridfs file in mongodb have greater than 50MB, log will print the below information:
New var:.…
-
After processing all the PDFs I needed, I noticed that the Tika server was still up and running in the background, eating a lot of RAM (usually between 1~1.8gb after having processed ~4000 PDFs).
`…
-
Hi,
When I try to crawl 10MB file below exception is getting thrown. Can you please check. I have attached the file that I was trying to crawl.
java.lang.OutOfMemoryError: GC overhead limit exceed…
-
The issue is that when testing the application in the terminal everything works correctly, but when trying to use those same lines of code in a Sikuli environment, the server is not able to start.
At…
-
I'm new to Tika and trying and trying to setup Tika on Databricks. Do I need to install both tika-python and tika server jar files on databricks cluster to make it work?
If so, how to change the Ti…
-
When crawled about 21 000 docs I have 1.19GB in store. The diskspace says there are only 3,72GB free of 14.68GB total. Fess and ES is the only thing running on this server. Is this normal?
![fess](…
-
Hey, not sure if there is a planned change already but I've encountered an issue with the LifecycleHooks class in the installTransformer method.
If I try and run my JUnit tests passing the junit-foun…