-
hi - i just used #master to do the common thing of extracting all text from a pdf. it worked, thanks for the nice library! it took a while to figure out how to do it and required more contortions t…
-
* pylivemaker version: 09-05-2020
* Python version: 3.7
* Operating System: Win 10
@pmrowla did you notice that there text exists to translate that's not handled
by extract, extractcsv or extrac…
-
### What version of `@strapi/blocks-react-renderer` are you using?
-
### What's Wrong?
The following types aren't exported: `TextInlineNode`, `ParagraphBlockNode`, `QuoteBlockNode`, `CodeBloc…
-
Error extracting text from document
## Environment
Which environment were you using when you encountered the problem?
```bash
$ python -m platform
Windows-11-10.0.22631-SP0
$ python -c "…
-
Successful test using ChatGPT (GPT-3.5):
```
Extract ingredient lists from the following texts. The ingredient list should start with the first ingredient and include allergy, label or origin info…
-
While working with your library, I noticed that content can be extracted from a site by passing the response text to the `extract()` function. However, I found that the `sitemap_search()` function onl…
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
**Summary**
Currently the generated axtree content for retrieved websites incurs a huge amount of tokens and cost.
Maybe below combination of Playwright with BeautifulSoup can save tokens, cost an…
-
Currently `extract_text` only supports `AsRef` but what if the user wants to input from `String`? Why not take in anything that implements `Read` instead?
-
FYI, I tried implementing this to detect gibberish emails.
Using `john.doe@gmail.com` and extracting the text before the @ symbol...
`john.doe` is detected as gibberish by this library.