pbs / pycaption

Python module to read/write popular video caption formats
Apache License 2.0
256 stars 136 forks source link

Added UTF-8 decode #158

Closed AlexEKoren closed 3 years ago

AlexEKoren commented 7 years ago

Added the decoding of "new_content" with an "ignore" flag so that if there are invalid characters, or characters outside the ASCII 128 range that are UTF-8 encoded, they can still be decoded and written. An example is the "¿" character.

ana-nichifor commented 3 years ago

Hi, I made a few tests with the ¿ character and it seems to be read and written properly. Maybe you can provide an example that fails?