Closed junxu-ai closed 1 year ago
Markdown support is deprecated, and converting first to HTML and then using something else to convert that HTML to Markdown (as you're already doing) is what's recommended.
Thanks @mwilliamson.
i'm just wondering if the additional step would introduce more format errors.
Markdown support is deprecated, and converting first to HTML and then using something else to convert that HTML to Markdown (as you're already doing) is what's recommended.
yes html2text would work
It seems that the coversion to markdown is not fully impletmented. the code shows that it calls the html function.
Currently, i use mammoth to convert docx into html first, and then markdownify to convert the html to markdown.
If you're reporting a bug or requesting a feature, please include:
a minimal example document temp.docx
If you're reporting a bug, it's also useful to know what platform you're running on, including: