-
Extracting text using Tika fails (error 400) unless line 142 is commented out in enhance_extract_text_tika_server.py:
```
parsed = parser.from_file(
filename=filename,
…
-
Primary problem: whenever I try to instantiate the parser, an error involving warning logs appears and the code just dies:
```018-12-30 01:37:29,263 [MainThread ] [WARNI] Failed to see startup …
-
My work involves integrating Open Semantic Search and Atlassian products in what I call the [Team Investigative Environment](https://nealr.substack.com/p/iib-8-team-investigative-environment). The sys…
-
**Describe the bug**
A simple import of `pandas_profiling` fails, producing the above error.
**To Reproduce**
```
import pandas as pd
from pandas_profiling import ProfileReport
df = pd.DataFra…
-
**Submitting author:** @wrightaprilm (April Wright)
**Repository:** https://github.com/wrightaprilm/treesiftr
**Version:** v1.0.0
**Editor:** @juanklopper
**Reviewer:** @ethanwhite, @rachelss
**Archiv…
-
I have many folders of PDFs downloaded from a legacy database, some of which are corrupted and cannot be opened with Adobe Acrobat. I cannot attach an example here, sorry. mupdf struggles with these f…
-
**Describe the bug**
borb extracts only Chinese characters from a document that doesn't contain any Chinese characters at all.
**To Reproduce**
Get this PDF: https://arxiv.org/pdf/1601.03642.pdf a…
-
* [x] Generate initial MER-A annotations (w/Contains from jSRE) using the MTE pipeline
* [x] Add HasProperty annotations (train and apply jSRE model) #19
* [x] Perform expert review of Target, Cont…
-
### `brew gist-logs <formula>` link OR `brew config` AND `brew doctor` output
```shell
HOMEBREW_VERSION: 3.5.10-49-gb2ddb34
ORIGIN: https://github.com/Homebrew/brew
HEAD: b2ddb341a0489834dbbfcb57544d87c4…