-
**Summary**
Currently the generated axtree content for retrieved websites consumes a huge number of tokens and incurs a high cost.
Maybe the combination of Playwright with BeautifulSoup below can save tokens, cost an…
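
To make the token-saving idea concrete, here is a minimal sketch of reducing a page's HTML to its visible text before it is sent to a model. It is not the code from this issue. It assumes the HTML string has already been fetched (for example with Playwright's `page.content()`), and it uses the standard-library `html.parser` as a stand-in so the snippet has no dependencies; BeautifulSoup's `get_text()` would play the same role. The names `VisibleTextExtractor` and `html_to_text` are hypothetical.

```python
from html.parser import HTMLParser


class VisibleTextExtractor(HTMLParser):
    """Collect visible text, skipping script/style/noscript content."""

    SKIPPED_TAGS = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped elements.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def html_to_text(html: str) -> str:
    """Hypothetical helper: return the visible text of an HTML string."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

Whether plain text is enough depends on the agent: it drops the element roles and identifiers that an axtree carries, so this trades structure for size.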
-
Hi Andrei, great meeting you last night. Here is the structured output functionality I mentioned in LiteLLM - https://docs.litellm.ai/docs/completion/json_mode#pass-in-json_schema. Getting back Pydan…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a…
-
Hello,
Thanks for the great work. I wanted to use the initial example in the `README.md`:
```python
from datasets import load_dataset
from setfit import SetFitModel, Trainer, TrainingArgument…
-
Is there a way to provide a password so that pdf2txt can extract text from pdf (together with the read-only password -P)?
-
I propose these changes to the run-information-batch-001.csv, run-information-batch-002.csv, run-information-batch-001_column-descriptions.csv and run-information-batch-002_column-descriptions.csv:
1…
-
There are a few pre-existing Python packages for this...
- pypdf
- slate
- pdfminer
-
Incomplete extraction of data for `quest_request_items_conditional`, regardless of whether the value is numeric or text (see screenshot)
![Ashampoo_Snap_sobota 31 augusta 2024_13h6m41s]…
-
Hi everyone,
I am currently using the ibd2sql tool to recover data from MySQL InnoDB .ibd files. While most of the data is extracted correctly, I've noticed that certain fields, especially text fie…
-
1. Installed Marker from the dev branch, under Win11. For some reason it always skips all of Chapter 5 -> "V. Instructions, Procedures, and Drawings"
Document attached:
[10CFR50AppB_LibOff.pdf](ht…