jekyll / jekyll-import

:inbox_tray: The "jekyll import" command for importing from various blogs to Jekyll format.
https://import.jekyllrb.com
MIT License

Convert WordPress to markdown #506

Closed collimarco closed 1 year ago

collimarco commented 1 year ago

I am trying to convert a blog from a WordPress export file to Jekyll.

I see that Jekyll just imports the WordPress HTML as-is (with useless classes, HTML comments, etc.).

  1. Is there any option to convert the posts to a clean markdown format?
  2. Also, is there any way to choose which front matter attributes we want to keep (most of them are useless)?

parkr commented 1 year ago

#32, #49, #138 are old issues I found relating to this.

I haven't (yet) found an HTML to markdown converter that I find good enough to recommend to everyone. Maybe we could add some docs for the WP importer to point users to these kinds of tools.

parkr commented 1 year ago

Looks like in #138, a user posted a pretty good script which uses pandoc.

But the short answer is that no, Jekyll import doesn't currently translate HTML to markdown for the WordPress importer.
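
For anyone landing here, a minimal sketch of what such a post-processing step could look like. This is not the script from #138; it is an illustration that assumes pandoc is installed and on the PATH, that the importer wrote `.html` files into `_posts`, and that the front matter is simple enough to filter line by line. The names `KEEP_KEYS`, `convert_post`, and `convert_directory` are hypothetical, and the list of keys to keep is only an example.

```python
import subprocess
from pathlib import Path

# Hypothetical choice of front matter keys worth keeping; adjust to taste.
KEEP_KEYS = {"layout", "title", "date", "categories", "tags"}


def split_front_matter(text):
    """Return (front_matter_lines, body) for a file that starts with '---'."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return [], text
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            return lines[1:i], "\n".join(lines[i + 1:])
    return [], text  # no closing delimiter: treat everything as body


def filter_front_matter(fm_lines, keep=KEEP_KEYS):
    """Keep top-level keys in `keep`, plus their indented or list continuation lines."""
    kept, keeping = [], False
    for line in fm_lines:
        # A top-level key starts in column 0 and is not a list item.
        is_top_level = bool(line) and not line[0].isspace() and not line.startswith("-")
        if is_top_level:
            keeping = line.split(":", 1)[0].strip() in keep
        if keeping:
            kept.append(line)
    return kept


def html_to_markdown(html):
    """Convert HTML to GitHub-flavored Markdown by shelling out to pandoc."""
    result = subprocess.run(
        ["pandoc", "--from=html", "--to=gfm", "--wrap=none"],
        input=html, capture_output=True, text=True, check=True,
    )
    return result.stdout


def convert_post(text, converter=html_to_markdown, keep=KEEP_KEYS):
    """Filter the front matter and convert the HTML body of one post."""
    fm_lines, body = split_front_matter(text)
    front = "\n".join(["---", *filter_front_matter(fm_lines, keep), "---"])
    return front + "\n\n" + converter(body).strip() + "\n"


def convert_directory(posts_dir="_posts"):
    """Write a .md file next to every .html post (originals are left in place)."""
    for path in Path(posts_dir).glob("*.html"):
        markdown = convert_post(path.read_text(encoding="utf-8"))
        path.with_suffix(".md").write_text(markdown, encoding="utf-8")
```

Calling `convert_directory("_posts")` after running the importer would produce Markdown copies of the posts. The line-based filter avoids a YAML dependency but will not handle every front matter layout, and pandoc's output may still need manual cleanup, so check the results before deleting the original `.html` files.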

jekyllbot commented 1 year ago

This issue has been automatically marked as stale because it has not been commented on for at least two months.

The resources of the Jekyll team are limited, and so we are asking for your help.

If this is a bug and you can still reproduce this error on the master/main branch, please reply with all of the information you have about it in order to keep the issue open.

If this is a feature request, please consider whether it can be accomplished in another way. If it cannot, please elaborate on why it is core to this project and why you feel more than 80% of users would find this beneficial.

This issue will automatically be closed in two months if no further activity occurs. Thank you for all your contributions.