kermitt2 / pdfalto

PDF to XML ALTO file converter
GNU General Public License v2.0
213 stars 68 forks source link

File name encoding Windows #129

Open FriedrichF opened 3 years ago

FriedrichF commented 3 years ago

When i run pdfAlto in Windows with a file including a space and Umlaut (ä,ü,ö), i am getting a i/O Error: Couldn't open file

This happens only in combination of space and umlaut. Without space or umlaut everything is fine.

It seems that the encoding is not correct. Here is the resulting error with corrupted filename: I/O Error: Couldn't open file '"C:\BeispielGerte informationprodukt.pdf"'

Original filename: C:\BeispielGeräte informationprodukt.pdf