-
Why not add native pdfium text extraction/search support for use with AndroidPdfViewer Beta 3.2.0? Please do it. Thank you.
-
Right now, the styles definition is represented under the `` element, which by spec is required to be an immediate child of the `` element. Hyperview fragments by spec only return elements that can be…
-
Hello all,
I am trying to use python-docx to extract data with particular formatting from a file that contains both text and inline images. Can someone please suggest an approach for do…
-
The basic function works alright, but it's really messy. Maybe it can be refactored into the same two-step process as `split_text_by_speakers`:
1. get text indices with metadata
2. split text with th…
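
A rough Python sketch of that two-step shape, for illustration only. The marker format (an upper-case label followed by a colon at the start of a line) and the names `Marker`, `find_markers`, and `split_by_markers` are assumptions made up for this example; they are not taken from the project or from `split_text_by_speakers` itself.

```python
import re
from typing import List, NamedTuple, Tuple


class Marker(NamedTuple):
    start: int  # index where the marker begins
    end: int    # index just past the marker
    label: str  # metadata attached to the text that follows


# Hypothetical marker format: "LABEL: " at the start of a line.
MARKER_RE = re.compile(r"^([A-Z][A-Z0-9_ ]*):[ \t]*", re.MULTILINE)


def find_markers(text: str) -> List[Marker]:
    """Step 1: collect text indices together with their metadata."""
    return [Marker(m.start(), m.end(), m.group(1)) for m in MARKER_RE.finditer(text)]


def split_by_markers(text: str, markers: List[Marker]) -> List[Tuple[str, str]]:
    """Step 2: slice the text using only the indices from step 1."""
    segments = []
    for i, marker in enumerate(markers):
        # A segment runs up to the next marker, or to the end of the text.
        stop = markers[i + 1].start if i + 1 < len(markers) else len(text)
        segments.append((marker.label, text[marker.end:stop].strip()))
    return segments
```

Keeping the index search separate from the slicing means the messy matching logic can be tested on its own, and the splitting step stays the same whatever the markers represent.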
-
The point is to transform the push-lexer into a pull-lexer, that is, dumb it down a notch and push more responsibility to the parsers. This may lead to duplication in each parser, at which point we'…
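
To make the push/pull distinction concrete, here is a minimal pull-lexer sketch in Python. It is only an illustration under assumptions: the project's actual language, token kinds, and class names are not shown in this excerpt, so `Token`, `PullLexer`, and the tiny token grammar below are invented for the example. The lexer does nothing until the parser asks for the next token.

```python
import re
from typing import NamedTuple, Optional


class Token(NamedTuple):
    kind: str
    text: str
    pos: int


# Hypothetical token grammar, chosen only to keep the example small.
TOKEN_RE = re.compile(
    r"(?P<NUMBER>\d+)|(?P<IDENT>[A-Za-z_]\w*)|(?P<OP>[-+*/()=])|(?P<WS>\s+)"
)


class PullLexer:
    """The parser drives the lexer by calling next_token() when it wants input."""

    def __init__(self, source: str) -> None:
        self.source = source
        self.pos = 0

    def next_token(self) -> Optional[Token]:
        while self.pos < len(self.source):
            match = TOKEN_RE.match(self.source, self.pos)
            if match is None:
                raise SyntaxError(
                    f"unexpected character {self.source[self.pos]!r} at {self.pos}"
                )
            start = self.pos
            self.pos = match.end()
            if match.lastgroup != "WS":  # whitespace is skipped, not returned
                return Token(match.lastgroup, match.group(), start)
        return None  # end of input
```

With this shape the lexer holds no parser state and makes no decisions about structure; each parser decides when to pull and what to do with the token, which is where the duplication mentioned above could come from.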
-
This crate registers 11 functions with the `rhai` runtime. These functions create a sort of "devai Agent" DSL. I propose these functions be documented, first in their "src/" definitions, then in a st…
-
Taking doii-rsb-0001-100-01.txt, we see that the text extracted using PyTesseract is as follows:
```
ォ ョ ン が 行 は 刀 英 貸 が 全 岡 的 に ボイラ コ ッ ト さ れ 、 輝 生 も 反 政府 的 ス ョ ー ガ ン を 掃 げ 大 示威 軍 動
を 展開 し 、 ボ ー ス …
```
-
# Feature Request
Currently it's impossible to set the prop `textLength` on `` or `` as it's not being extracted by `./lib/extract/extractText.tsx`.
Could the maintainers add the prop `textLengt…
-
**Describe the bug**
A tweet causes the sync to stop working
```
🦣 client √ connected
☁️ client √ connected
profile-sync √ task finished
content-mapper √ tweets: total:…
```
-
### Translation:
When you select an entire text box and press **Ctrl+C** to copy, then paste while entering text, the paste produces code such as:
```json
{
"type": "excalidraw/clipboa…
```