This PR allows for nicely formatted, automagically generated zine versions of posted articles.
The underlying tools are based on Pandoc, a tool that converts between common formats (most notably HTML, markdown, and PDF) and XeTeX, a modern LaTeX distribution. Some conditioning of the intermediary markdown text is also done using regular expressions in Python.
The basic order of operations is:
Use Pandoc to scrape the posted HTML into markdown, capturing the article's images and references
Use Python to strip nonsense text from the HTML header and footer, and other minor reformatting
Use Pandoc to render a PDF via LaTeX
All the heavy lifting is done in a LaTeX template used to achieve this last step.
This PR allows for nicely formatted, automagically generated zine versions of posted articles.
The underlying tools are based on Pandoc, a tool that converts between common formats (most notably HTML, markdown, and PDF) and XeTeX, a modern LaTeX distribution. Some conditioning of the intermediary markdown text is also done using regular expressions in Python.
The basic order of operations is:
All the heavy lifting is done in a LaTeX template used to achieve this last step.
This is related to #5.