-
@gsautter here is a PDF that causes problems when opening it; that is, I can't even load it properly:
[jElectronicPublishing.21.1.pdf](https://github.com/gsautter/goldengate-imagine/files/1950866/jElec…
-
There is already a table detection mechanism in tesseract, but unfortunately there seems to be no way to access the table structure through the API.
This could be done with only minimal changes…
-
# 🚀 Feature Request: PDF Data Extraction
### Description
The goal is to implement a PDF data extraction logic within the `implementation/pdfextractor` folder. The user can implement either `extrac…
-
This has the danger of just being another standard. However, the benefit of the NDEF standard is how compact it is (since it is a binary format).
However, there may be merit to adapting the NDEF standar…
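
To make the compactness point concrete, here is a minimal sketch that encodes one short NDEF Text record by hand and compares its size with a JSON equivalent. The helper name `ndef_text_record` and the JSON field names are hypothetical, chosen only for illustration; the sketch covers only the single-record, short-payload case and is not a full NDEF implementation.

```python
import json


def ndef_text_record(text: str, lang: str = "en") -> bytes:
    """Encode a single short NDEF Text record (well-known type 'T').

    Hypothetical helper for illustration: handles only one record per
    message, UTF-8 text, and payloads that fit the short-record form.
    """
    lang_bytes = lang.encode("ascii")
    if len(lang_bytes) > 63:
        raise ValueError("language code must fit in 6 bits")
    # Payload: status byte (language code length, UTF-8), language code, text.
    payload = bytes([len(lang_bytes)]) + lang_bytes + text.encode("utf-8")
    if len(payload) > 255:
        raise ValueError("short-record form allows at most 255 payload bytes")
    # Header flags: MB (0x80) | ME (0x40) | SR (0x10) | TNF well-known (0x01).
    header = 0x80 | 0x40 | 0x10 | 0x01
    # Layout: header, type length, payload length, type, payload.
    return bytes([header, 1, len(payload)]) + b"T" + payload


record = ndef_text_record("hi")
# Assumed JSON shape, used only as a size comparison.
as_json = json.dumps({"type": "text", "lang": "en", "text": "hi"}).encode("utf-8")

print(record.hex(), len(record), "bytes as NDEF")
print(len(as_json), "bytes as JSON")
```

The fixed overhead here is four bytes of record framing plus the status byte and language code, which is why the binary form stays small for short payloads.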
-
Some Metanorma documents are getting so large that the approach of outputting a single XML file (assuming that `data-uri-only` is turned on) is becoming a problem. We have reached a critical point where th…
-
**Is your feature request related to a problem? Please describe.**
I would finally like a fixed folder structure, independent of API changes (like #91), which everyone can set according to their own …
-
Here at the Stanford HIMC, we have a few multi-year customer studies. We would get the 2009 samples, process them, then repeat each year as the 2010, 2011, 2012, etc. samples come in.
Is there curren…
-
## Describe the bug
It seems that `Page.crop()`:
- on the one hand, by default expects cropping coordinates compatible with the page's bbox (`strict=True`),
- on the other hand, objects' coordinates are rel…
-
## Problem
When I try to extract font files from a PDF, no fonts are extracted. I've tried multiple files with different fonts, but none of them have worked. I have provided a sample PDF wh…
-
**Describe the issue**
Hello,
I am having an issue trying to add continuous clinical features to my oncoplot.
If I add more than one clinicalFeature that has continuous values, the colors applie…