-
The current version **does** support extraction of full text and metadata from rechtspraak but does not return data to the user when called explicitly. This needs to be fixed.
-
### Microsoft PowerToys version
0.81.1
### Installation method
PowerToys auto-update
### Running as admin
No
### Area(s) with issue?
TextExtractor
### Steps to reproduce
Have an image with ty…
-
Text similarities or mentions of one another can be used to construct these networks.
-
**What would you like to be added**:
SBOM formats such as CycloneDX and SPDX support including the full text of a license with a component. It would be great if syft could extract this information wh…
-
### Microsoft PowerToys version
v0.76.2
### Installation method
WinGet
### Running as admin
Yes
### Area(s) with issue?
TextExtractor
### Steps to reproduce
When selecting Text Extractor from…
-
I was trying out the tutorial. However, when partitioning the PDF provided in tutorial, I did not observe that the font-style of the text being stored in the Metadata for the element.
Is the font-s…
-
in chartSpeak.py ,I got `ModuleNotFoundError: No module named 'theme_extract.similar_text'`. What's theme_extract? I didn't find any thing about this. Is it a module, a NLPL model or something else.
-
What are similar job done in extracting insights data from papers and books?
Google, watch videos, ask chatgpt about the similar way
-
for this url = "https://www.aia.com/en/health-wellness/healthy-living/healthy-mind/Managing-financial-stress",
I use
downloaded = trafilatura.fetch_url(url) trafilatura.bare_extraction(downloaded, u…
-
### Description of the bug
Can not read the `.docx` file. It worked perfectly on v1.24.6.
Logs:
```
Traceback (most recent call last):
File "~/d.py", line 15, in
print(extract_text(…