-
The `unstructured` library, which we use to parse PDFs downloaded with `https`, is not used by the `arxiv` downloader. However, reading through it, it looks like it is much more capabl…
-
### Description
As discovered through https://github.com/espanso/espanso/issues/1612#issuecomment-2380593095, it is evident that commands run via `script` execute considerably faster than `shell` o…
-
**Describe the bug**
The CoreNLP server is not stopped automatically after the `with` statement in Python finishes. This happens both in interactive Python sessions and when running Python scripts as a …
-
I am using pdfplumber, which is built on top of pdfminer.six, but the issue is with the coordinates coming from pdfminer.six.
Here is the PDF:
[v2.pdf](https://github.com/pdfminer/pdfminer.six/files/6…
-
Many thanks for your sophisticated tool!
I'm hoping that it will improve my academic reading workflow by turning scanned PDFs into annotatable documents to be processed afterwards with [Zotero](htt…
-
Tracking plans for cherry-cola.
## Stability & Flexibility
Cherry-cola has to become stable with all the current features.
- [ ] Dynamic code synchronisation (DOM)
- [ ] Text content
- […
-
Hi,
Is there a very simple example of using spago to train my own models for Q&A? A single sentence would be enough, just to understand the process.
Thank you
-
The current readme/make instructions specify downloads of external prerequisites but do not appear to cover .NET prerequisites. Attempting to build via make-release.bat indicates numerous files related to .NET …
-
The sidebar menus would be easier to read if they were ordered alphabetically.
It would also make duplicates easier to spot.