joshy / striprtf

Stripping rtf to plain old text
http://striprtf.dev
BSD 3-Clause "New" or "Revised" License

Fixes #37 - Handle RTF created with TX_RTF32 #40

Closed plessbd closed 1 year ago

plessbd commented 1 year ago

Fixes #37

TX_RTF32 allows UTF-8 encoded characters within an RTF that is declared as cpg1252 (`\ansicpg1252`, i.e. Windows-1252).
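To illustrate the problem, here is a minimal sketch of one way to handle it. It is not the code from this PR. It assumes the UTF-8 bytes show up as runs of `\'hh` hex escapes, and it tries UTF-8 on each run before falling back to the declared code page. The names `decode_hex_run` and `decode_hex_escapes` are hypothetical and not part of striprtf.

```python
import re

# One or more consecutive \'hh hex escapes, e.g. \'c3\'a9
HEX_RUN = re.compile(r"(?:\\'[0-9a-fA-F]{2})+")
HEX_BYTE = re.compile(r"\\'([0-9a-fA-F]{2})")


def decode_hex_run(run: str, declared: str = "cp1252") -> str:
    # Hypothetical helper: collect the raw bytes of the whole run first,
    # so a multi-byte UTF-8 sequence is decoded as a unit.
    data = bytes(int(h, 16) for h in HEX_BYTE.findall(run))
    try:
        # Assumption: if the run is valid UTF-8, the writer meant UTF-8.
        return data.decode("utf-8")
    except UnicodeDecodeError:
        # Otherwise fall back to the code page the RTF header declares.
        return data.decode(declared, errors="replace")


def decode_hex_escapes(fragment: str, declared: str = "cp1252") -> str:
    # Replace every run of hex escapes with its decoded text.
    return HEX_RUN.sub(lambda m: decode_hex_run(m.group(0), declared), fragment)


print(decode_hex_escapes(r"caf\'c3\'a9"))  # UTF-8 bytes inside a cp1252 document
print(decode_hex_escapes(r"caf\'e9"))      # ordinary cp1252 escape
```

The heuristic is ambiguous by nature: a document that really means the two cp1252 characters `Ã©` would also be read as `é`, so a real fix may want to apply it only when the generator is known to be TX_RTF32.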

joshy commented 1 year ago

Thx a lot!

joshy commented 1 year ago

Oh wow, I will review the changes. Thanks @stevengj

plessbd commented 1 year ago

So I was able to do most of these in https://github.com/joshy/striprtf/pull/42. @stevengj and @joshy, if you have more updates, let me know; I am willing to iterate on this.

The only one I couldn't "easily" do was https://github.com/plessbd/striprtf/blob/code-review-cleanup/striprtf/striprtf.py#L193-L198, as doing that broke a bunch of tests. I might just be doing it wrong; please comment on https://github.com/joshy/striprtf/pull/42 and I will update.