Closed darkranger-red closed 9 years ago
This is definitely something I'd like to support, but I'm not sure how (if at all!) it's covered by the RTF specs. Can you give me a couple of example RTF files to test against?
OK, I will collect some files when I am back at the office on Monday.
Btw, maybe using an incremental decoder would be the right approach?
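To illustrate the incremental decoder idea, here is a minimal sketch using the standard library's codecs.getincrementaldecoder. It is not pyth's actual code: the helper name decode_bytes_incrementally and the choice of cp936 (GBK) as the default are assumptions made for the example. The point is only that the decoder buffers a lead byte until its trail byte arrives, so bytes can be fed one at a time.

```python
import codecs


def decode_bytes_incrementally(byte_values, encoding="cp936"):
    """Hypothetical helper: decode one byte at a time.

    The incremental decoder keeps an incomplete multibyte sequence in its
    internal buffer and only emits a character once it is complete.
    """
    decoder = codecs.getincrementaldecoder(encoding)(errors="replace")
    pieces = []
    for value in byte_values:
        # Returns "" while a multibyte character is still incomplete.
        pieces.append(decoder.decode(bytes([value])))
    # Flush the decoder; a dangling lead byte becomes U+FFFD with "replace".
    pieces.append(decoder.decode(b"", final=True))
    return "".join(pieces)


# D6 D0 and CE C4 are the GBK byte pairs for two Chinese characters.
text = decode_bytes_incrementally([0xD6, 0xD0, 0xCE, 0xC4])
```

With this approach the reader would not need to know in advance whether a given byte is a complete character or the first half of one.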
Multibyte codepages are fixed in 381a3067add074fb5cf48fbc5e56f5b7ba28d795 and your test file now works.
Hello guys,
CJK stands for Chinese, Japanese, and Korean. Many older RTF writers don't store these characters as Unicode, so using pyth to read CJK text from such documents raises a "UnicodeDecodeError": CJK codecs need 4 hex digits (two bytes) per character, not 2.
I modified plugins/rtf15/reader.py to meet my own needs, but I still hope someone can write better code to deal with this issue.
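A small sketch of why this fails, assuming the RTF file writes one CJK character as two \'xx hex escapes and uses code page 936 (GBK). The fragment and variable names below are made up for illustration and are not taken from pyth:

```python
import re

# Hypothetical RTF fragment: one GBK character written as two \'xx escapes.
rtf_fragment = r"\'d6\'d0"

# Collect the hex digits of every \'xx escape and turn them into raw bytes.
hex_escapes = re.findall(r"\\'([0-9a-fA-F]{2})", rtf_fragment)
raw = bytes(int(h, 16) for h in hex_escapes)

# Decoding a single escape (2 hex digits) on its own fails, because the
# lead byte is only half of a character.
try:
    raw[:1].decode("cp936")
    single_byte_failed = False
except UnicodeDecodeError:
    single_byte_failed = True

# Decoding both bytes together (4 hex digits) gives one character.
decoded = raw.decode("cp936")
```

So a reader that decodes each \'xx escape separately will break on these files, while joining consecutive escapes before decoding works.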
1) Add this first:
2) Add number 936:
3) Change to 'ignore':
4):
5)