-
I was testing the conversion of various pdf files to images and found that sometimes text is missing (randomly) from the final produced image. These are more complex pdf files so that may be the reaso…
-
Hi, I'm trying to merge the following 4 files. They each contain 26,400 pages of the same image, with some small amount of unique text overlayed on top, on every page.
Despite the 4 input PDF's tot…
-
Of course this is not an "issue", merely a bunch of questions hoping that someone here may be able to help.
While learning Khmer I have also got into Khmer script. I have found that while most word…
mbert updated
2 months ago
-
```
What steps will reproduce the problem?
1. Create a docx file with Arabic language, this is written in the right to
left direction.
2. Convert the docx to pdf using itext via XWPF converter
3. The…
-
```
What steps will reproduce the problem?
1. Create a docx file with Arabic language, this is written in the right to
left direction.
2. Convert the docx to pdf using itext via XWPF converter
3. The…
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
Export in different file formats:
- [x] Markdown
- [ ] Pdf
- [ ] Plain text
- [ ] HTML
- [ ] Word (docx?/doc?/rtf?)
Feel free to suggest other file formats.
-
### Environment
node v20.11.1
unpdf v0.11.0
### Reproduction
I got the original error in a server route of a Nuxt 3 project. Also, in the original app I performed other operations besides text/met…
-
### Your use case
#### What would you like to do?
#### Why would you like to do it?
When you want to find the photo or pdf that was shared in a room to see a list of all files that was shared. …
-
The Needs OCR function needs to be improved. Currently we do this to determine if something that is OCR eligible should be OCRd.
### The Situation
```
if content.strip() == "" or pdf_has_ima…