-
Try to send the PDF (or URL to download) to the LLM.
Also flux.ai API might be worth to explore.
Note that some datasheets are images without text:
* [BSZ070N08LS5ATMA1](https://github.com/open-p…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Hello team, I use Ollama service to handle LLM server, and I use **Llama 3**
I used the…
-
To compare different pipelines (LLMs, pdf2img, pdf2txt) we need a benchmark.
## 1. Choose a sub-set of datasheets of each manufacturers
* consider special PDFs that need OCR
* scrambled text
#…
-
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
Hi! I'm currently working with `ragas` to test different RAG …
-
PyTorch is dead. Long live JAX.
https://neel04.github.io/my-website/blog/pytorch_rant/
LLM Compressor
https://github.com/vllm-project/llm-compressor
https://neuralmagic.com/blog/llm-compressor-i…
-
https://arxiv.org/pdf/2408.02442
> Structured generation, the process of producing content in standardized formats like JSON and XML, is widely utilized in real-world applications to extract key ou…
-
### Common description
I have encountered an issue where Baml parses the response from the LLM incorrectly. Specifically, it seems that one block type is being incorrectly converted into another bloc…
-
```
datasheets/mcc/MCB70N10YA-TP.pdf
datasheets/mcc/MCAC100N10YHE3-TP.pdf
datasheets/mcc/MCACL120N10YA-TP.pdf
datasheets/mcc/MCACL120N10Y-TP.pdf
datasheets/diodes/DMT10H9M9LCT.pdf
datasheets/inf…
-
I can't find a way to set a custom endpoint for PDFSearchTool.
I tried various key names in 'config' without success:
```python
pdf_tool = PDFSearchTool(
config=dict(
…
-
### Describe your problem
Why not convert tables parsed from PDF and Word files into Markdown format? Is it because HTML format is better recognized by LLM?
Table Markdown format, I mean like th…