veraPDF / veraPDF-library

Industry supported, open source PDF/A validation library
http://verapdf.org/software
GNU General Public License v3.0

CLI error with German Umlaut #1467

Open jh0563 opened 1 month ago

jh0563 commented 1 month ago

Running the veraPDF CLI with a source file name that contains German umlauts results in an error. (Windows 11, veraPDF version: 1.26.2)

What encoding is expected for file names? We tried ANSI, UTF-8, UTF-16 and ASCII.

Running it from the command line gives this result:

verapdf -f 0 "C:\temp\ÄÖÜ.pdf"
Jul 23, 2024 1:37:01 PM org.verapdf.apps.utils.ApplicationUtils filterPdfFiles
SCHWERWIEGEND: File C:\temp\├âÔÇ×├âÔÇô├â┼ô.pdf doesn't exist.
Jul 23, 2024 1:37:01 PM org.verapdf.cli.VeraPdfCliProcessor processFilePaths
SCHWERWIEGEND: There are no files to process.

bdoubrov commented 1 month ago

This looks like a duplicate of #741. It is a known issue with the Java executable not supporting Unicode command-line arguments.
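To illustrate how a name like ÄÖÜ can end up garbled this way, here is a small Python sketch. It shows one chain of conversions that happens to reproduce the name in the log above: UTF-8 bytes misread as Windows-1252, written out as UTF-8 again, and displayed in OEM code page 850. This chain is an assumption based only on the log output, and the helper `garble` is hypothetical; it is not confirmed that Java or veraPDF does exactly this internally.

```python
def garble(name: str) -> str:
    """Apply one hypothetical chain of mis-decodings to a file name."""
    # Step 1 (assumed): the UTF-8 bytes of the name are misread as
    # Windows-1252, the usual "ANSI" code page on Western European Windows.
    misread = name.encode("utf-8").decode("cp1252")
    # Step 2 (assumed): the misread text is written out as UTF-8 again and
    # the console renders those bytes using OEM code page 850.
    return misread.encode("utf-8").decode("cp850")


observed = "├âÔÇ×├âÔÇô├â┼ô"  # garbled name copied from the log above
print(garble("ÄÖÜ") == observed)  # prints True
```

If this reading is right, the name is already damaged before veraPDF looks for the file, which would explain why changing the encoding on the caller's side did not help.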

The easiest workaround is to place your file(s) in a folder whose name does not contain any international characters and pass that folder name as the argument to veraPDF. It will then run a batch job and pick up all file names inside the folder correctly.
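The workaround can be scripted. The following Python sketch copies a PDF into a staging folder with an ASCII-only path and builds the command that passes the folder, not the file, to veraPDF. The helper name `stage_for_verapdf` and the staging location are hypothetical, and reusing the `-f 0` option from the original report together with a folder argument is an assumption.

```python
import shutil
from pathlib import Path


def stage_for_verapdf(pdf, staging_dir):
    """Copy pdf into an ASCII-named folder and return the veraPDF command."""
    pdf = Path(pdf)
    staging_dir = Path(staging_dir)
    # The folder path itself must be plain ASCII; the file name inside it
    # may keep its umlauts, since only the folder goes on the command line.
    if not str(staging_dir).isascii():
        raise ValueError(f"staging folder is not ASCII-only: {staging_dir}")
    staging_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy2(pdf, staging_dir / pdf.name)
    # Assumption: same options as in the report, with the folder as argument.
    return ["verapdf", "-f", "0", str(staging_dir)]
```

On a machine where `verapdf` is on the PATH, the returned list could be handed to `subprocess.run`, for example `subprocess.run(stage_for_verapdf(r"C:\temp\ÄÖÜ.pdf", r"C:\temp\verapdf_in"))`.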

We'll investigate whether there are better solutions for this issue.