tika-python Search Results

572 results
for tika-python

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

chrismattmann/tika-python #315

Mono-account and not multi-accounts :-(

What a shame, this "wrapper" has been developed for mono-account (e.g: one laptop, one user), not for multi-accounts .... ! When used with multi-accounts, there's a permission issue (concurrency acce…

enahwe updated 1 year ago
3
emory-libraries/OpenEmory #145

Review SOLR configs for Hyrax vs. SOLR 9

Related to a possible resolution to #143, we want to investigate if the SOLR 9 configuration in our sandbox environment may need additional optimization to work with a Hyrax application.

eporter23 updated 1 year ago
3
elastic/connectors #167

Include Tika in aws connector so the SUPPORTED_FILETYPE can …

### Problem Description The new AWS connector connects to S3 - people place standard data file types here,i.e., log.json, table.csv, and old.xml files. Our current support types are for programing l…

matt-isett updated 1 year ago
1
deepset-ai/haystack #482

Extract passage headers during processing of PDF documents

### What to do When converting PDF documents to txt with either apache tika or pdf2text we have some functionality to split the documents by passages afterwards. It would be beneficial to have per pa…

Timoeller updated 1 year ago
5
opensearch-project/documentation-website #419

[PROPOSAL] Overall Documentation Structure/Restructure

### Background Since the launch of OpenSearch, there has been some great progress in documenting the project. So far, the Open Distro documentation has been updated and moved to OpenSearch, and there…

ahopp updated 1 year ago
37
paperless-ngx/paperless-ngx #1364

[BUG] Classifier training much slower in 1.8.0 on aarch64

### Description Version 1.7.1 is working fine. When I update to 1.8.0 everything is working fine for 5 minutes and then htop shows me that "python3 manage.py qcluster" is using more than 100% of cpu,…

pedrom34 updated 1 year ago
44
elastic/connectors #200

Plug the ingest attachment

### Problem Description Right now we only extract a limited list of files. Let's use the ingest pipeline. Until Tika is available on edge (https://github.com/elastic/connectors-python/issues/167) …

tarekziade updated 1 year ago
1
deepset-ai/haystack #1345

PDFToTextConverter: [WinError 2] The system can't find the s…

Hi guys, I think what you are doing is very interesting. I am currently struggling with data Preprocessing(Tutorial 8). When I open my own pdf file in function PDFToTextConverter, I get the following…

ehsanVIP updated 1 year ago
8
deepset-ai/haystack #3139

Pinned outdated dependencies

Perhaps this isn't a bug, per se, but there seem to be various dependencies that are pinned to outdated versions. For example, throughout the codebase you seem to be using Tika `1.24.1`, which was…

nickchomey updated 2 years ago
2
deepset-ai/haystack #3625

ZMQError: Address already in use when using the multiprocess…

**Describe the bug** There seems to be some issue with multiprocessing in Python and haystack. If I import the multiprocessing library and **don't** import any haystack modules, I can run the fo…

burtonrj updated 1 year ago
8

上一页 1...21 22 23 24 25 26 27...58 下一页

572 results for tika-python

572 results
for tika-python