-
Create a Python script that utilizes OCR libraries (e.g., Tesseract-OCR, Pytesseract) to extract text from images. The script should accept image files as input and output the extracted text. Ideal fo…
-
### Description
![obraz](https://github.com/cryptpad/cryptpad/assets/168080232/64d7a1dc-0f09-44b6-a475-cbac4bd93b32)
With longer translation terms on the + New Document type tiles, name of documen…
-
I understand you can switch the direction of the entire corpus interface in the `corpusConfig` setting in the ..format.blf file, but is it possible to customize things for this to work on the document…
-
When using the language server within VSCode, use properly virtualized text documents, rather than `virtual-uris`
-
### Bug Description
When `num_workers` > 1, the llama_cache file is empty. When `num_workers=1`, the IngestionPipeline can cache normally
### Version
0.11.17
### Steps to Reproduce
```
if __name…
-
I'm using `System.Text.Json.Serialization` attributes/convertors for configuring serialization of my classes, but I think Seq logger is using `Newtonsoft.Json` so when I use the @ character to log an …
-
## Streaming text-to-speech synthesis needs to be documented properly
1. The text-to-speech example at https://cloud.google.com/text-to-speech/docs/samples/tts-quickstart contains working Go code, …
-
### Describe the bug
Hi, to start off I thank anyone who trie to help me in any way.
So, I was trying to open Oobabooga for the first time: I let it download and install everything, I put the righ…
-
### What is the bug?
I am using text_chunking and text_embedding processor to ingest documents into an index. The [text_chunking search example](https://opensearch.org/docs/latest/search-plugins/text…
-
When posting OCR request, we can choose two type of response.
A TEXT_DETECTION response includes the detected phrase, its bounding box, and individual words and their bounding boxes:
A DOCUMENT_…