fourier / rss-bot-diasp

RSS bot for diaspora supporting several users and feeds
5 stars 0 forks source link

Convert HTML to Markdown and some ideas. #2

Open kaffeeringe opened 8 years ago

kaffeeringe commented 8 years ago

Thank you for your great script!

It would be cool if the script converted the HTML from the RSS-feed into markdown so that the articles keep their form. Diaspora escapes HTML. So my current results don't look too nice...

Maybe it would also be cool if the output was configurable - A headline a link and some tags would suffice for my use.

AFAIK Wordpress feeds also contain tags - you could also post them on diaspora…

fourier commented 8 years ago

Thanks for ideas. I have tags support in mind, just need to implement them. But about the conversion from HTML to Markdown I doubt it could be reliable, since HTML itself is not really well-defined. However I could take a look at it.

kaffeeringe commented 8 years ago

Any result of a converter will be better than having all the escaped HTML in posts. ;-)

Maybe you could test this one here: https://github.com/aaronsw/html2text (By the late great Aaron Swartz)

kaffeeringe commented 8 years ago

And there is an active fork of it: https://github.com/Alir3z4/html2text

xuv commented 8 years ago

:+1: I like the idea of having html2text. Or maybe another solution would be to just be able to decativate posting the content of the post. Just posting a link + title could be sufficient.

xuv commented 8 years ago

Sorry for the spam here, but pypandoc seems to do conversion from html to markdown.

fourier commented 8 years ago

hi, I'm not a Diaspora user anymore. I could revisit it later however. Does anyone uses this bot ? I haven't heard anything since diasporaforum.org died...

xuv commented 8 years ago

@fourier well, I don't know if anyone else is running the bot. But I have an instance running it now. Planning to add more Rss feed to it :)