Open creffett opened 5 years ago
This is not a matter of non-ASCII, it's a matter of non-UTF8:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe0 in position 9189: invalid continuation byte
Wikipedia says:
SubRip's default output encoding is configured as Windows-1252. However, output options are also given for many Windows code pages as well as Unicode encodings, such as UTF-8 and UTF-16, with or without Byte Order Mark (BOM). Therefore, there's no de facto character encoding standard for .srt files, which means that any SubRip file parser must attempt to use Charset detection. Unicode Byte Order Mark (BOM) are typically used to aid detection.
Which means that as much as I want to close this as "won't fix, git gud and use UTF-8", I shouldn't. I'll look into fixing this at some point, maybe. In the meantime, I suggest just converting your file to UTF-8.
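For anyone who wants a stopgap for the conversion, here is a rough sketch of what it could look like in Python. It is not what the project does: the function names decode_subtitle_bytes and convert_to_utf8 are made up for illustration, and the fallback to Windows-1252 is an assumption based only on the Wikipedia note above about SubRip's default. It checks for a BOM first, then tries plain UTF-8, then falls back to cp1252. A file in some other code page would still come out garbled, so treat this as a guess rather than real charset detection.

```python
import codecs
from pathlib import Path

# BOM prefixes and the codec that consumes each one.
# "utf-8-sig" and "utf-16" both strip the BOM while decoding.
_BOMS = (
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
)


def decode_subtitle_bytes(raw: bytes) -> str:
    """Decode subtitle bytes: BOM first, then UTF-8, then cp1252 (assumed fallback)."""
    for bom, encoding in _BOMS:
        if raw.startswith(bom):
            return raw.decode(encoding)
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: a non-UTF-8 file without a BOM is Windows-1252.
        # errors="replace" covers the few byte values cp1252 leaves undefined.
        return raw.decode("cp1252", errors="replace")


def convert_to_utf8(src: str, dst: str) -> None:
    """Re-encode the file at src as UTF-8 (no BOM) and write it to dst."""
    text = decode_subtitle_bytes(Path(src).read_bytes())
    # Write bytes rather than text so existing line endings are left untouched.
    Path(dst).write_bytes(text.encode("utf-8"))
```

Calling convert_to_utf8("input.srt", "output.srt") (both paths are placeholders) writes a UTF-8 copy that should no longer raise the UnicodeDecodeError above.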
An SRT file containing non-ASCII characters will often trigger an exception when it's chosen for a meme, as seen in the following backtrace: