pdf-extraction Search Results

1000+ results for pdf-extraction

Future-House/paper-qa #580

Reading markdown files?

Hello, I wondered what is the recommended way to use local markdown files with paperqa. Looking at [readers.py](https://github.com/Future-House/paper-qa/blob/HEAD/paperqa/readers.py#L287) it seems markdow…

aginiewicz updated 1 week ago
1
WorldModelers/DART #8

PDF extraction introducing stray double carriage returns of …

@reynoldsm88 Any double carriage return is going to introduce a sentence break during information extraction. So any time a double carriage return is in the middle of a sentence, that's quite de…

azamanian updated 5 years ago
2
termux/termux-packages #18393

[Package]: Pymupdf

### Why is it worth to add this package? PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. ### Home page URL …

masteoo updated 2 weeks ago
1
automeris-io/WebPlotDigitizer #176

Feature Request: Supervised Extraction from Vector .pdf Imag…

I often work with vector .pdf images. They contain essentially perfect representations of the data, but can be difficult to work with. Given the integration with pdfjs, it would be interesting as …

billdenney updated 5 years ago
4
julianhille/MuhammaraJS #389

Unable to modify PDF file, make sure that output file target…

Hi, I use `pdfWriter = muhammara.createWriterToModify(localPdfPath,{modifiedFilePath:destPdfPath});` to create my pdfWriter so I can read and add an annotations. It worked perfectly until now, when…

LudvikWiejowski updated 4 weeks ago
1
ml4ai/skema #433

`503 Service Temporarily Unavailable` after sending PDF for …

Whenever we send a PDF for extraction it seems to take the whole system down for a while. This is using the basic scenario PDF [found here](https://github.com/DARPA-ASKEM/knowledge-middleware/tree/mai…

brandomr updated 1 year ago
2
sciencehistory/scihist_digicoll #229

media-specific metadata extraction for our shrine-based atta…

chf_sufia displayed a "page count" for PDF original downloads. But our current app is not extracting "page count" from PDFs. While it's relatively easy to do that with shrine, the way we are d…

jrochkind updated 2 months ago
1
akaalias/obsidian-extract-pdf-highlights #6

Highlighter colour is not working after extraction of pdf.

Is this due to the recent obsidian update? I'm not sure. But I hope you will fix this soon enough. Thank you.

titan901 updated 3 years ago
3
nextcloud/files_fulltextsearch_tesseract #12

PDF Image Extraction does not auto-rotate landscape pages

This may be a problem with tesseract, or a setting that can be applied when creating the instance to ocr as an option -- not sure if that is even the best place to address the issue to be honest. I fo…

andrewborell updated 1 year ago
1
unjs/unpdf #17

Strange behavior of `getDocumentProxy`'s buffer when extract…

### Environment node v20.11.1 unpdf v0.11.0 ### Reproduction I got the original error in a server route of a Nuxt 3 project. Also, in the original app I performed other operations besides text/met…

ndrbrt updated 3 weeks ago
4
