Mozilla's PDF.js generates a canvas view which makes it easier to retain styles and layout. This is not really what a markdown converter should do.
That said, I've been wondering if there's a decent way to either extract embedded images as inline encoded images from markdown, or perhaps have the option to extract the content and use a headless version of the canvas render to perhaps embed images of the original pages from the PDF. Both could get included inline as base64 images.
Would this be useful here or is this outside the scope of what this project wants to do?
Is there a better way to achieve what I'm describing?
If there's interest in what I've described, I'm happy to do the bulk of the work to make it happen, but I'd appreciate some guidance so we end up with a PR that meets the project's expectations.
Mozilla's PDF.js generates a canvas view which makes it easier to retain styles and layout. This is not really what a markdown converter should do.
That said, I've been wondering if there's a decent way to either extract embedded images as inline encoded images from markdown, or perhaps have the option to extract the content and use a headless version of the canvas render to perhaps embed images of the original pages from the PDF. Both could get included inline as base64 images.