-
Hi,
I am running tika-server-1.18.jar inside a Docker container, which I download and run using my own Dockerfile. I connect to it using the tika-python library from another container. This is no…
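
For reference, a minimal sketch of how such a setup is usually wired on the client side, assuming the server container is reachable under the hypothetical hostname `tika` on Tika's default port 9998. tika-python reads the `TIKA_SERVER_ENDPOINT` and `TIKA_CLIENT_ONLY` environment variables when it is imported, so they have to be set first. The helper name `configure_tika_client` is made up for illustration and is not part of the library.

```python
import os


def configure_tika_client(host: str = "tika", port: int = 9998) -> str:
    """Point tika-python at an already-running Tika server.

    `host` is an assumed Docker service name; adjust it to your setup.
    """
    endpoint = f"http://{host}:{port}"
    # tika-python reads these variables at import time,
    # so set them before `import tika`.
    os.environ["TIKA_SERVER_ENDPOINT"] = endpoint
    # Client-only mode: do not try to download or start a local server jar.
    os.environ["TIKA_CLIENT_ONLY"] = "True"
    return endpoint


endpoint = configure_tika_client()

# With the variables in place, the usual calls go to the remote server:
# from tika import parser
# parsed = parser.from_file("/path/to/file.pdf")
```

If the client still cannot reach the server, checking that both containers share a Docker network and that the hostname resolves from the client container is a reasonable first step.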
-
Since yesterday I can't authenticate anymore.
**Code**
from pydrive2.auth import GoogleAuth
from pydrive2.drive import GoogleDrive
gauth = GoogleAuth()
gauth.LocalWebs…
-
I've followed the installation instructions, using pip in a virtual environment:
- `python3 -m venv venv`
- `source venv/bin/activate`
- `pip install tika`
- `wget -O /tmp/tika-server.jar http://s…
-
When extracting many documents or large documents, Tika uses an excessive amount of memory while collecting official gazettes. This issue aims to:
- [ ] Investigate how much memory usage is acceptable
- [ ] How to mon…
-
Two broken scripts.
This one:
```scala
import io.archivesunleashed._
import io.archivesunleashed.udfs._
val languages = Array("th","de","ht")
RecordLoader.loadArchives("/path/to/warcs",s…
-
Upgrade to new release of Tika-Python
-
I'm using tika 1.23 successfully with Python 3.7.4 on one Windows 10 machine. However, I installed tika 1.23.1 (the latest version) on another Windows 10 machine running Python 3.8.1, and I get an except…
-
### `brew config` output
```shell
HOMEBREW_VERSION: 3.1.0
ORIGIN: https://github.com/Homebrew/brew
HEAD: ccb05e5e0b7b925c3d6b531fa43b016fa4d0b72a
Last commit: 3 hours ago
Core tap ORIGIN: https:/…
-
Hi @chrismattmann,
I have a question: how many languages does tika-python support when parsing attachments? I haven't tested all languages for parsing files that contain content in many languages …
-
Can you add a little documentation on how to run this repo?