extract-text Search Results

1000+ results
for extract-text

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Yuras/pdf-toolbox #62

workflow to extract text

hi - i just used #master to do the common thing of extracting all text from a pdf. it worked, thanks for the nice library! it took a while to figure out how to do it and required more contortions t…

eflister updated 4 years ago
1
pmrowla/pylivemaker #51

Missing text to extract

* pylivemaker version: 09-05-2020 * Python version: 3.7 * Operating System: Win 10 @pmrowla did you notice that there text exists to translate that's not handled by extract, extractcsv or extrac…

LioMajor updated 4 years ago
6
strapi/blocks-react-renderer #55

[bug]: TextInlineNode and other types aren't exported

### What version of `@strapi/blocks-react-renderer` are you using? - ### What's Wrong? The following types aren't exported: `TextInlineNode`, `ParagraphBlockNode`, `QuoteBlockNode`, `CodeBloc…

monolithed updated 1 week ago
1
py-pdf/pypdf #2998

TypeError: unhashable type: 'ArrayObject' when reading inlin…

Error extracting text from document ## Environment Which environment were you using when you encountered the problem? ```bash $ python -m platform Windows-11-10.0.22631-SP0 $ python -c "…

neeraj9 updated 1 day ago
1
openfoodfacts/openfoodfacts-ai #309

Use LLMs to extract ingredient lists from raw text

Successful test using ChatGPT (GPT-3.5): ``` Extract ingredient lists from the following texts. The ingredient list should start with the first ingredient and include allergy, label or origin info…

raphael0202 updated 1 month ago
3
adbar/trafilatura #751

Support for sidemap parsing from text instead of urls

While working with your library, I noticed that content can be extracted from a site by passing the response text to the `extract()` function. However, I found that the `sitemap_search()` function onl…

NiClassic updated 1 week ago
1
langgenius/dify #10004

Document extractor 's output sometime is string and sometime…

### Self Checks - [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [X] I have s…

zhuqingchao updated 1 week ago
1
All-Hands-AI/OpenHands #4486

Improve browser agent' scraping/processing web content

**Summary** Currently the generated axtree content for retrieved websites incurs a huge amount of tokens and cost. Maybe below combination of Playwright with BeautifulSoup can save tokens, cost an…

tobitege updated 3 weeks ago
3
jrmuizel/pdf-extract #23

Extract text from string

Currently `extract_text` only supports `AsRef` but what if the user wants to input from `String`? Why not take in anything that implements `Read` instead?

pickfire updated 1 year ago
4
Alpha4615/gibberish-detection #11

False positives

FYI, I tried implementing this to detect gibberish emails. Using `john.doe@gmail.com` and extracting the text before the @ symbol... `john.doe` is detected as gibberish by this library.

nabilfreeman updated 2 weeks ago
1

上一页 1...13 14 15 16 17 18 19...100 下一页

1000+ results for extract-text

1000+ results
for extract-text