fleetingbytes / rtfparse

RTF Parser
MIT License
12 stars 8 forks source link

Issue with ordered and unordered lists #38

Open xfoldvar opened 1 day ago

xfoldvar commented 1 day ago

I just tried your lib and found out that when extracting HTML from the MSG file that contains RTF with ordered/unordered lists, these lists are corrupt / displayed incorrectly.

Unordered lists: They have rendered bullets and right after the bullet there is a * character - this should not be present.

Ordered lists: Their items are duplicated. See the attached SS. Eg when there is an ordered list like this:

  1. Item

It does look like this:

  1. 1.Item

image

Original MSG file.

Formátování z Outlook__2024_10_16.zip

fleetingbytes commented 21 hours ago

Wow, this looks like an interesting bug. Thanks for the example file. I no longer have an easy access to a computer with MS Outlook, or other MS Office products, so this will be a bit hard for me to explore and verify a fix.

Also, I am leaving for vacation for the next 11 days. But I will look into this once I am back.