-
@chrismattmann
i prepare to parse the mongo gridfs file and take it as a flask api service and expose it to outer caller, whether have file size limitation for parse the file content using tika-…
-
Hello there,
Using tika jar on the command line interface, we can specify a path where tika will save the extracted images.
Can we have the same behaviour with the python wrapper ? I see that it …
-
I have an issue with the use of Tika for language detection. I first remarked that when I parse PDF files, the language was not included in the "metadata part" in most cases.
Thus, I tried to expli…
-
Like is there a way to build this from source? It seems that only debian files are provided which aren't fedora compatible (to the best of my knowledge).
-
Hi,
I am using tika-python I get an error when I execute this code
(pdb) parser.from_file('/home/ubuntu/workarea/dev-harvestor/harvestor-2/harvest-territory-stories/sample.pdf')
2019-07-31 14:1…
-
@ryantam626 It seems the issue is not solved for Jupyterlab 2.0.2. I added the `pip freeze`, sorry ..
I updated all the extensions and jupyterlab_code_formatter isn't showing on the left panel. j…
-
I get the following error:
```
2019-07-31 00:54:21,492 [MainThread ] [INFO ] Retrieving http://search.maven.org/remotecontent?filepath=org/apache/tika/tika-server/1.19/tika-server-1.19.jar to /v…
-
Hi
I have the desktop version installed on my system with the intention to be able to tag a large database of word documents and find reference across the whole database. So far so good but as I'm …
-
When cloning the repo, it downloads over 30MB of data, something that I consider kinda weird.
```
Cloning into 'tika-python'...
Warning: Permanently added the RSA host key for IP address '140.82.…
-
@dsmiley do you want to give it a shot and test the deploy stuff on mac?