JetBrains / markdown

Markdown parser written in Kotlin
Apache License 2.0
706 stars, 78 forks

Non-English text is rendered "span per word" in HTML #45

Closed katepol closed 4 years ago

katepol commented 4 years ago

This breaks subsequent copying and pasting of the text.

katepol commented 4 years ago

I see a test, https://github.com/valich/intellij-markdown/blob/master/test/data/parser/ruby18936.txt, which asserts this behavior. Could you please tell me why?

valich commented 4 years ago

Files were moved, correct link: https://github.com/valich/intellij-markdown/blob/master/src/commonTest/resources/data/parser/ruby18936.txt

It was acceptable behaviour back then. I need to either make the lexers more Unicode-friendly or merge tokens post-lexing.
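
To illustrate the second option, here is a minimal sketch of merging tokens after lexing. It is written in Python for brevity and is not the library's code: the `Token` class, the `merge_adjacent` helper, and the `TEXT` / `WHITE_SPACE` type names are all hypothetical, and the one-token-per-character input only stands in for whatever a lexer that is not Unicode-aware might emit.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Token:
    """Hypothetical token: a type name plus a half-open [start, end) span."""
    type: str
    start: int
    end: int


def merge_adjacent(tokens: List[Token], mergeable: str = "TEXT") -> List[Token]:
    """Collapse runs of contiguous tokens of the `mergeable` type into one token."""
    merged: List[Token] = []
    for tok in tokens:
        prev = merged[-1] if merged else None
        if (
            prev is not None
            and prev.type == mergeable
            and tok.type == mergeable
            and prev.end == tok.start  # only merge tokens that touch
        ):
            # Extend the previous token instead of appending a new one.
            merged[-1] = Token(mergeable, prev.start, tok.end)
        else:
            merged.append(tok)
    return merged


source = "привет мир"
# Assumed lexer output: one TEXT token per non-space character.
tokens = [
    Token("WHITE_SPACE" if ch == " " else "TEXT", i, i + 1)
    for i, ch in enumerate(source)
]
result = merge_adjacent(tokens)
for tok in result:
    print(tok.type, repr(source[tok.start:tok.end]))
```

With the fragments merged, a renderer that emits one element per token would produce a single element per text run rather than one per fragment. Whether the actual fix took this route or changed the lexer instead is not stated in this thread.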

valich commented 4 years ago

Fixed in 0.2.0.pre-58