Closed PeterSommerlad closed 4 years ago
It may be caused by some incompatibilities between pandoc and panflute. I'm not sure whether it is introduced by the newest update. Does pandoc-include v0.8.3 work for this file?
If only v0.8.4 causes this problem, you could debug it with the development version: clone this repo and log the intermediate JSON value to a file.
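As a rough starting point, the JSON that pandoc hands to a filter can also be captured outside of pandoc-include. The sketch below is not part of pandoc-include; the script name `dump_ast.py`, the function `dump_and_check`, and the log file name `pandoc-ast.json` are all made up for illustration. It only saves whatever arrives on stdin and reports where Python's `json` module stops parsing, if it does.

```python
import json
import sys


def dump_and_check(raw, log_path="pandoc-ast.json"):
    """Save the raw JSON text to log_path and try to parse it.

    Returns None if parsing succeeds, otherwise a short description
    of where the parser gave up. (Hypothetical helper, not a
    pandoc-include or panflute API.)
    """
    with open(log_path, "w", encoding="utf-8") as fh:
        fh.write(raw)
    try:
        json.loads(raw)
    except json.JSONDecodeError as err:
        return f"line {err.lineno}, column {err.colno}: {err.msg}"
    return None


if __name__ == "__main__":
    problem = dump_and_check(sys.stdin.read())
    print(problem or "JSON parsed cleanly", file=sys.stderr)
```

Assuming the script is saved as `dump_ast.py`, it could be run as `pandoc input.md -t json | python dump_ast.py`, which leaves the AST in `pandoc-ast.json` for inspection.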
Feel free to update this thread if you still find it difficult to debug it.
Bisecting the offending documents, I found spurious Unicode characters <U+2028> (line separator) and <U+2029> (paragraph separator) in the markdown generated from Word. After deleting those, I no longer get these errors. They might already have been present in the original Word file, which I cannot check right now (lacking MS Word), but it might be worth investigating why the JSON reader is unhappy with them.
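For anyone hitting the same thing, here is a minimal sketch of how the two characters could be located and deleted with Python instead of bisecting by hand. The function names `find_separators` and `strip_separators` are invented for this example; nothing here comes from pandoc-include itself.

```python
import re

# U+2028 (line separator) and U+2029 (paragraph separator)
SEPARATORS = re.compile("[\u2028\u2029]")


def find_separators(text):
    """Return (line_number, column, code_point) for each occurrence.

    Lines are split on "\n" only, because str.splitlines() would itself
    treat U+2028 and U+2029 as line breaks and skew the line numbers.
    """
    hits = []
    for line_no, line in enumerate(text.split("\n"), start=1):
        for match in SEPARATORS.finditer(line):
            code_point = f"U+{ord(match.group()):04X}"
            hits.append((line_no, match.start() + 1, code_point))
    return hits


def strip_separators(text, replacement=""):
    """Delete the separators, or substitute `replacement` for each one."""
    return SEPARATORS.sub(replacement, text)
```

Reading each markdown file with `open(path, encoding="utf-8", newline="")` and passing the text through `find_separators` should show where the characters sit before `strip_separators` removes them.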
I conclude it is a bug in the JSON reader of Python 3.9. However, I am not familiar enough with Python development to feel comfortable creating a bug report there.
Hi, I am using pandoc 2.11.1.1 and pandoc-include
I split a large markdown file that was generated from a docx, and unfortunately pandoc-include fails with a JSON error. I checked the individual files, and they are all valid UTF-8. So what could be the problem here?
Error message attached. Could it be that the pandoc and corresponding panflute updates created some incompatibilities with pandoc-include? I had to debug other filters using panflute for that reason, but I have no idea how to deal with the underlying JSON error.
Thanks for your help.
Regards, Peter.