haesleinhuepf / git-bob-playground

BSD 3-Clause "New" or "Revised" License

Implement DOCX to Markdown conversion functionality with pypandoc #107

Closed: github-actions[bot] closed this 2 days ago

github-actions[bot] commented 2 days ago

This message was generated by git-bob (version: 0.14.1dev, model: claude-3-5-sonnet-20241022, log), an experimental AI-based assistant. It can make mistakes and has limitations. Check its messages carefully.

I created a solution for converting documents between DOCX and Markdown formats using pypandoc. It lives in a Jupyter notebook that defines two main functions: docx_to_markdown() and markdown_to_docx(). To test the functionality, I added a sample DOCX file, test.docx, containing formatting elements such as bullet points and text styles. I also added the required dependencies, python-docx and pypandoc, to requirements.txt. The notebook is set up to convert test.docx to test.md. It currently shows an import error, which would be resolved once the dependencies are installed.

closes #106