xoofx / markdig

A fast, powerful, CommonMark compliant, extensible Markdown processor for .NET
BSD 2-Clause "Simplified" License
4.21k stars 444 forks source link

Extra space after list item is placing contents in code block #800

Closed Matt-Scheetz closed 1 month ago

Matt-Scheetz commented 1 month ago

Rendering LLM output (gpt-4o) and I've observed the contents after the header for a numbered list item is getting placed in a code block (See content after 10): image

Looking at the raw text and I see that there is an extra space between the Numbered header and the content image

As the images states there are 4 spaces after 10 and 3 spaces after all other numbered headers.

When I pasted the raw text to have it rendered and remove the extra space, the text renders without the code block: image

Using Markdig v 0.36.2

xoofx commented 1 month ago

There is not much we can do. Their Markdown output is invalid CommonMark, as 4 spaces will always result in a code block as per the spec here.

You can try it here on GitHub and it will be the same:

    - *Description*: too bad
    - Hello world

Will give you this, similar to markdig:

- *Description*: too bad
- Hello world