jonaswinkler / paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents
https://paperless-ng.readthedocs.io/en/latest/
GNU General Public License v3.0
5.38k stars 352 forks source link

[BUG] Downloaded filenames don't support Unicode #1737

Open mpldr opened 2 years ago

mpldr commented 2 years ago

Describe the bug When downloading a document where the Correspondent/Title contains special characters, these characters are mangled:

- 2022-10-18 Stadt Gro�enhain Meldebest�tigung.pdf
+ 2022-10-18 Stadt Großenhain Meldebestätigung.pdf

To Reproduce Steps to reproduce the behavior:

  1. Find document in list
  2. Click on 'Download'
  3. See Filename

Expected behavior The filename does not contain invalid symbols

Screenshots Not applicable

Relevant information

slankes commented 2 years ago

paperless-ng is pretty much abandoned. Have a look at https://github.com/paperless-ngx/paperless-ngx for a maintained fork.

mpldr commented 2 years ago

Thanks, will do.

bangboomben commented 1 year ago

didn't solve it to me though...

[2023-01-14 00:44:37,557] [INFO] [paperless.management.consumer] Adding /usr/src/paperless/consume/34242536 - Einkommensteuererklrung.pdf to the task queue.

[2023-01-14 00:44:37,647] [ERROR] [paperless.handlers] Creating PaperlessTask failed: 'utf-8' codec can't encode character '\udce4' in position 30: surrogates not allowed

EDIT: worked for me as I installed the ubuntu base in german