-
Subtitle extraction fails for several files, but the plugin still makes an attempt on them every run.
-
If I call `uv python install 3.12` on W11 with uv 0.3.0, I get:
```
Searching for Python versions matching: Python 3.12
cpython-3.12.5-windows-x86_64-none ------------------------------ 4.47 MB/…
-
in order to use sponsorblock you need to use the native video link extraction for mpv. But this means the link is extracted twice and this can take quite a while. So it would be nice to be able to jus…
-
I have created pdf from its docx version in which sections and subsections were created by built in heading styles instead of numbering .It is not able to recognise few subsections inside sections
-
Hello Team,
Thank you for the comprehensive survey. Can you please include the work [PII-Compass: Guiding LLM training data extraction prompts towards the target PII via grounding](https://aclanth…
-
### What happens?
`SELECT columns.v4_c6 FROM read_ndjson_auto(...)` is no longer working in DuckDB v1.1.* for the following JSON structure:
```
{"location":"67/820/data/1777410541745082368/854288f7…
-
# Problem
We all love type-safe interaction between system components.
It's also a great idea to be able to use the same TypeScript types both in our server and client codebases, which is often re…
-
this project looks very interesting to me, is it possible to use Kadot for keyword extraction?
-
### What happens?
While extracting field from JSON I get the error "Conversion Error: Failed to cast value to numerical" due to WHERE clause, but when adding some parenthesis to the clauses the err…
-
This example shows that data is duplicated and words are squished together even though they are distinct in the html.
python:
```
from trafilatura import extract
html_string = """
…