jgm / pandoc

Universal markup converter
https://pandoc.org
Other
34.29k stars 3.36k forks source link

citeproc fails to convert some cedilla accents in TeX format #989

Closed mitchelltd closed 11 years ago

mitchelltd commented 11 years ago

for example, BibDesk saves accented characters in a TeX format along the lines of {\c c} (this is a latin lower case c with cedilla, ç)

Usually pandoc-citeproc converts the TeX format into roman, e.g. {\^o} into ô. But this fails with certain characters (I know of ç and Ç) and pandoc-citeproc outputs \c c instead of ç.

From inspecting the code, I suspect this can be fixed in bibutils/lib/latex.c by adding \\c c etc

jgm commented 11 years ago

+++ mitchelltd [Sep 18 13 06:07 ]:

for example, [1]BibDesk saves accented characters in a TeX format along the lines of {\c c} (this is a latin lower case c with cedilla, ç)

Usually pandoc-citeproc converts the TeX format into roman, e.g. {^o} into ô. But this fails with certain characters (I know of ç and Ç) and pandoc-citeproc outputs \c c instead of ç.

From inspecting the code, I suspect this can be fixed in bibutils/lib/latex.c by adding \c c etc

This looks like a bibutils bug, then. Why don't you report it to the bibutils maintainer?

Note: I'm working on a direct bibtex -> pandoc-citeproc YAML converter that will properly handle all such cases.

mitchelltd commented 11 years ago

Fine. I've done that here.

I wasn't sure whether you maintained your own version of bibutils or not.

The direct convertor sounds good !

Thanks for your work on Pandoc, btw - I'm using it with nvAlt and Marked for my daily academic notes, and it is invaluable. I don't know of another way of handling bibliographic info well using markdown, without going via LaTeX.

jgm commented 11 years ago

Thanks - closing the bug here, then!