dharple / detox

Tames problematic filenames
BSD 3-Clause "New" or "Revised" License

Please provide a way to retain German Umlaute ÄÜÖäüö #99

Closed (schlichtanders closed 5 months ago)

schlichtanders commented 1 year ago

I tinkered with everything I could think of. The .tbl definitions seem to be interpreted in an opaque way that depends on whether the table is a safe, iso, or utf8 table.

I was able to rewrite unicode.tbl to retain ÄÜÖöüä; however, as soon as I run a subsequent safe table, they are destroyed again.

Unfortunately, overwriting something like


#
# Chars to translate to _
#

0x0022      _   # "
0x002a      _   # *
0x002f      _   # /
0x003a      _   # :
0x003f      _   # ?
0x005c      _   # \
0x007c      _   # |

#
# Chars to translate to -
#

0x003c      -   # <
0x003e      -   # >

in unicode.tbl is simply ignored. I don't know what is going on...

I guess the easiest solution would be to allow an arbitrary mapping within unicode.tbl, so that no safe.tbl is needed. It is really surprising to me that unicode.tbl does not support this already; the default value in unicode.tbl also seems to be completely ignored.
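
To make the requested behavior concrete, here is a minimal Python sketch of a single-pass mapping that replaces only the characters listed in the table excerpt above and leaves everything else, including ÄÜÖäüö, untouched. This is an illustration of the desired result only; it is not detox's code or its table format, and the names `TRANSLATION` and `sanitize` are made up for this example.

```python
# Illustrative only: not detox's implementation or .tbl syntax.
# The character lists are copied from the table excerpt above.
TO_UNDERSCORE = '"*/:?\\|'   # 0x22 0x2a 0x2f 0x3a 0x3f 0x5c 0x7c
TO_DASH = "<>"               # 0x3c 0x3e

# Hypothetical name: one combined table, so no second "safe" pass is needed.
TRANSLATION = str.maketrans(
    {**{c: "_" for c in TO_UNDERSCORE}, **{c: "-" for c in TO_DASH}}
)


def sanitize(name: str) -> str:
    """Replace the listed characters; leave all other characters unchanged."""
    return name.translate(TRANSLATION)


if __name__ == "__main__":
    # prints: Übergröße_ _Äpfel_ -neu-_.txt
    print(sanitize('Übergröße: "Äpfel" <neu>?.txt'))
```

Because characters that are not in the mapping pass through unchanged, the umlauts survive without needing a separate rule for them.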

dharple commented 1 year ago

Hi, which version of detox are you using? Can you please post your detoxrc and translation table? Thanks!

dharple commented 5 months ago

Version 3 of detox removes all of the transliteration, so this should no longer be a problem.