-
- Version 3.5.2 Windows/64 bit
- Printing is atrociously slow ! Two pages per minute at best ! Anyway to improve this?
Great piece of software, but might switch to an alternative if we are not…
-
**Describe the bug**
I am getting the following error when extracting text and images from pdf:
`
PIL.UnidentifiedImageError: cannot identify image file '/tmp/tmpjy0tjjjd/2c2e244f-8f8e-46de-a7bc-2e…
-
```
What steps will reproduce the problem?
1. Use pdfium_test to convert the attached pdf to a ppm
2. Open the ppm and extreme jags are present in the frame around the image
What is the expected outp…
-
Currently, pdfocr converts b/w and grayscale pdf to ppm format in color and runs tesseracts on them. Therefore the output file size of pdfocr is about 10 to 100 times bigger than the the input file in…
-
```
What steps will reproduce the problem?
1. Use pdfium_test to convert the attached pdf to a ppm
2. Open the ppm and extreme jags are present in the frame around the image
What is the expected outp…
-
```
What steps will reproduce the problem?
1. Use pdfium_test to convert the attached pdf to a ppm
2. Open the ppm and extreme jags are present in the frame around the image
What is the expected outp…
-
```
What steps will reproduce the problem?
1. Use pdfium_test to convert the attached pdf to a ppm
2. Open the ppm and extreme jags are present in the frame around the image
What is the expected outp…
-
Hello there!
I am using your code to recreate some plots for my data. However, I got an error when I tried to create the ggseqlogo plots with chromvar assay. In particular, when I run this code
…
-
I actually run your code: 01_semi_structured_data.ipynb in collab
```
from typing import Any
from pydantic import BaseModel
from unstructured.partition.pdf import partition_pdf
raw_pdf_elem…
-
As an initial use case for Observations, we are investigating how to model soil test data in ADAPT. This diagram illustrates (at left) how ADAPT models field operations and (at right) my own initial…