document-scan-to-text Search Results

1000+ results
for document-scan-to-text

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

aloisdg/Doccou #9

Add DjVu support

> DjVu is a computer file format designed primarily to store scanned documents, especially those containing a combination of text, line drawings, indexed color images, and photographs. [...] DjVu has …

aloisdg updated 8 years ago
1
blindpandas/bookworm #128

Remove page IDs when saving image to text or scanning to tex…

## Is your feature request related to a problem? Please describe. When saving an image to a text file or selecting the Scan to Text File option and selecting a scanned book for text extraction using …

DraganRatkovich updated 2 years ago
5
Unstructured-IO/unstructured #2939

Text Extraction Issue: Greek Language PDFs Rendered with Inc…

**Describe the bug** I am evaluating the UnstructuredClient for processing PDF documents and am encountering an issue with the Greek language text extraction. When I attempt to extract text from PDF …

DarioBernardo updated 4 months ago
3
FreeUKGen/FreeBMD2 #397

Report a Data Problem - page layout

Suggested changes to layout ![Data Problem Report page-page-001.jpg](https://images.zenhubusercontent.com/5cab6cc818447254e461e741/4d8032d2-25f2-4d07-bd68-bea53f7a0384)

DeniseColbert updated 2 weeks ago
14
Islandora/documentation #1583

OCR image-only PDFs

Stemming from on [an email discussion](https://groups.google.com/g/islandora-dev/c/oPr1ZsJx-HA): Hypercube currently uses pdftotext to extract text embedded in a PDF OR tesseract to perform OCR on …

seth-shaw-unlv updated 1 year ago
9
googleapis/python-documentai-toolbox #159

Convert Document AI Object to Preserve Layout Text?

**Is your feature request related to a problem? Please describe.** I've been using Google Document AI for text extraction from scanned documents, and it's been working well in terms of extractin…

raad-altaie updated 4 months ago
6
dotnet/vscode-csharp #7505

C# Extension throws error on hover over diagnostic from Semg…

Type: Bug ## Issue Description ## The C# extension cannot handle code actions when there are diagnostics from the Semgrep Extension included in the request. Hovering over a Semgrep diagnosti…

jkinsfather updated 1 day ago
2
googleapis/python-firestore #939

grpc._channel._MultiThreadedRendezvous fails; StatusCode.UNA…

#### Environment details - OS type and version: I use Windows locally, and Linux in prd - Python version: I use the latest release of Python, which is currently Python 3.12.4 - pip version:…

brent-ridian updated 1 week ago
4
ohmg-dev/OldInsuranceMaps #1

Add not georeferenceable option to preparation stage

https://oldinsurancemaps.net/resource/328 is an example of a sheet that isn't a map. Currently, this sheet would sit in the unprepared area, but that's a difference between a page not yet reviewed but…

BNAndras updated 1 week ago
7
pgvector/pgvector #259

Understanding HNSW + filtering

Hi, I would like to understand how the current implementation handles HNSW + filtering. Imagine you have a table: ```sql CREATE TABLE document ( id UUID PRIMARY KEY, text TEXT NOT NULL, …

Palmik updated 2 weeks ago
84

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for document-scan-to-text

1000+ results
for document-scan-to-text