Awkee / pandoc

Automatically exported from code.google.com/p/pandoc
GNU General Public License v2.0
0 stars 0 forks source link

unicode combining character messes table parsing #260

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
I'm using version 1.6 from cabal. When I give pandoc any cell, in Emacs table 
mode, with a combining character, like the combining acute (U+0301), all cells 
to the right get a | as part of the string inside it. It behaves normally if I 
delete that character. I tried latex and html output, both get the |. Attached 
goes an example.

Thanks!

Original issue reported on code.google.com by bike...@gmail.com on 5 Oct 2010 at 4:03

Attachments:

GoogleCodeExporter commented 9 years ago
Thanks. Pandoc counts characters for table alignment, and it doesn't (yet) know 
the difference between combining characters and other characters.  I'll look 
into fixing this when I get a chance.

Data.Char can distinguish these nonspacing characters:
Prelude Data.Char> generalCategory '\x0301'
NonSpacingMark

Original comment by fiddloso...@gmail.com on 5 Oct 2010 at 7:14

GoogleCodeExporter commented 9 years ago

Original comment by fiddloso...@gmail.com on 27 Oct 2010 at 4:11

GoogleCodeExporter commented 9 years ago
Fixed by ff93a8e7891d8537c713d6d1b0fd4409c5e43ebe

Original comment by fiddloso...@gmail.com on 27 Jan 2012 at 8:45