-
**Describe the bug**
When element coordinates are used as bounding boxes, the coordinates differ between the default strategy and the 'hi_res' strategy.
**To Reproduce**
```
sudo apt-get inst…
-
### Description of the bug
I was trying to remove all text from PDF files. My Python script looks like this:
```python
for page in document:
    info = json.loads(page.get_text('json'…
-
```python
from gmft import AutoTableFormatter, TATRFormatConfig, TATRTableFormatter
config = TATRFormatConfig()
config.total_overlap_reject_threshold = 0.5
formatter = TATRTableFormatter(config = …
-
Hi,
First of all, thanks for `mupdf.js` (and in WASM, no less); it is super cool!
I come from a `PyMuPDF` background, where we could load epub, xps, cbz, mobi, fb2, and svg file formats and work with the…
-
### Description of the bug
In some documents whose optional content layers are already disabled, the rendered pages still contain those layers.
Example link: https://dropmefiles.com/zTbp4
### How to reproduce th…
-
`.captions()` is pretty slow: I estimate about 415 ms, which is much longer than `df()`.
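
To make an estimate like this reproducible, a small timing helper can be used. This is only a sketch: `time_call_ms` is a hypothetical helper name, not part of any library, and it simply reports the median wall-clock time of a zero-argument callable over a few runs.

```python
import time
from statistics import median


def time_call_ms(fn, repeats=5):
    """Return the median wall-clock time of fn() in milliseconds.

    `fn` is any zero-argument callable; `repeats` is the number of timed runs.
    The median is used so that one slow outlier run does not skew the result.
    """
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    return median(samples)
```

Assuming `ft` is the table object the two methods are called on (a placeholder name here), `time_call_ms(ft.captions)` and `time_call_ms(ft.df)` would give directly comparable numbers.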
-
#### Preserving document structure and formatting
To preserve the document's structure and formatting during translation, keep in mind that the `python-docx` library lets you work not only with …
-
**Is your feature request related to a problem? Please describe.**
Adding an image with an arbitrary transformation is not currently supported, but this is needed when trying to rebuild a PDF from the…
-
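Help needed, I get this error: Is poppler installed and in PATH?
I searched online and could not find a solution. I have already installed the required pip packages as described in the README.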
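
A quick way to narrow this down is to check whether the Poppler command-line tools are visible on `PATH` at all. The sketch below assumes the error comes from a wrapper that shells out to Poppler binaries such as `pdftoppm` and `pdfinfo`. The helper names are hypothetical, and the check only uses the standard library.

```python
import shutil

# Poppler command-line tools that PDF-to-image wrappers commonly invoke
# (an assumption; the exact set depends on the wrapper in use).
POPPLER_TOOLS = ("pdftoppm", "pdfinfo")


def find_poppler_tools(tools=POPPLER_TOOLS):
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in tools}


def missing_poppler_tools(tools=POPPLER_TOOLS):
    """Return the names of the tools that could not be found on PATH."""
    return [name for name, path in find_poppler_tools(tools).items() if path is None]
```

If `missing_poppler_tools()` returns a non-empty list, the pip packages alone are not enough: Poppler itself is a separate system package (on macOS it is typically installed with `brew install poppler`).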
```
Traceback (most recent call last):
File "/Users/kasusa/Documents/GitHub/Python-Remove-Watermark/watermark.py…
-
Your code is licensed under Apache 2.0, a permissive license that lets people use your software even for commercial purposes. But you have PyMuPDF as one of your requirements. This softw…