-
I performed the demos of both the regular text extraction and the HTML extraction found on the README. The text extraction worked as expected. However, the HTML extraction simply returned the original…
-
**Issue by [will3216](https://github.com/will3216)**
_Wed Nov 4 18:48:57 2015_
_Originally opened as https://github.com/codelucas/newspaper/issues/168_
----
In extractors.py:173 it says that the p…
-
**Describe the issue**:
Stepping through the example for Text Vectorization on dask.org fails at line 4 of cell 5.
**Minimal Complete Verifiable Example**:
https://examples.dask.org/machine-learn…
-
Is it possible to extract sequence of html (or AXTree) and actions from the trace.zip file?
-
# What?
When we produce (from the HOCR/PDFALTO) extraction the pure OCR text we keep the HTML entity encoding. This hurts Views display since internally, twig can not decode the entities and will d…
-
In the section https://docs.vulkan.org/guide/latest/vertex_input_data_processing.html#_filling_in_components , the text quotes the specification as follows:
> For the opposite case, the spec says…
-
i have an issue that is about the portal html file. I change it from .../webpages/compressed/captive.html. The captive.html is from the extraction of the captive.html.gz. I mod the html file (page), i…
-
# Task management with org-roam Vol. 2: Categories
Automatic category extraction from org-roam
[https://d12frosted.io/posts/2020-06-24-task-management-with-roam-vol2.html](https://d12frosted.io/post…
-
### Self-Hosted Version
24.8.0
### CPU Architecture
x64_64
### Docker Version
27.1.2, build d01f264
### Docker Compose Version
2.29.1
### Steps to Reproduce
Follow up of
https://github.com/g…
amenk updated
8 hours ago
-
**Describe the bug**
NotImplementedError: File format not supported
I am facing this error where the expected result is the list of tables
However I am getting a Implementation exception instead
…