-
**例行检查**
[//]: # '方框内填 x 表示打钩'
- [ ] 我已确认目前没有类似 issue
- [ ] 我已完整查看过项目 README,以及[项目文档](https://doc.fastgpt.in/docs/intro/)
- [ ] 我使用了自己的 key,并确认我的 key 是可正常使用的
- [ ] 我理解并愿意跟进此 issue,协助测试和提供反馈
…
-
I've been looking into text extraction algorithm and it seems the single most time consuming is following part:
`prune_unwanted_nodes(one_of_the_trees, OVERALL_DISCARD_XPATH)`, since `OVERALL_DISCARD…
-
- [Describo](https://describo.github.io)
Describo is an AI-powered metadata editor and research tool that transforms your data into linked, discoverable insights. Describo creates metadata conformi…
-
#### **Data we can collect using public resources**
1. **Engagement Metrics**:
- Data: Likes, comments, and shares for posts/accounts.
- Source: Instagram Graph API (limited to authenticate…
-
@yoelcortes, @yalinli2, @sarangbhagwat Hello, could I ask you some questions about the liquid-liquid extraction unit? Thanks for your help.
a) Does the ```partition coefficients``` in the tutorial…
-
[`Propagator`](https://hackage.haskell.org/package/hs-opentelemetry-api-0.1.0.0/docs/OpenTelemetry-Propagator.html#t:Propagator)s have `inboundCarrier`s for extraction and `outboundCarrier`s for injec…
-
I was wondering whether there is a functionality to not wipe all the html in the extraction process, for example, for the 10-ks it would be nice to know what is for example tables, lists, headings etc…
-
## 🌟 Feature Overview
This feature allows users to upload PDF files and ask questions about their content, utilizing Google Generative AI for accurate and quick answers.
## 🤔 Why this feature?
Th…
-
Trying to understand how to pdfiototext.c works. The code seem to output too many extra unnecessary spaces for this PDF.
![image](https://github.com/michaelrsweet/pdfio/assets/2600624/7d0bf550-c5fe…
-
### Do you need to file an issue?
- [x] I have searched the existing issues and this bug is not already filed.
- [ ] My model is hosted on OpenAI or Azure. If not, please look at the "model providers…