alphapapa / org-web-tools

View, capture, and archive Web pages in Org-mode
GNU General Public License v3.0
638 stars 33 forks source link

"Pandoc failed" on certain websites #42

Open ghost opened 3 years ago

ghost commented 3 years ago

On some web pages I get an error "Pandoc failed". On others capture succeeds. For example on these two pages randomly chosen:

https://sachachua.com/blog/2007/12/planner-basic-configuration/ http://muto.ca/b/19-Rmail.html

The first fails, but the second succeeds.

herop commented 3 years ago

I do agree, even though I'm on the most recent versions of both pandoc and emacs: NOT working: https://aeon.co/essays/democracy-is-common-and-robust-historically-and-across-the-globe Working (but real mess regarding images): https://psyche.co/guides/how-to-approach-the-lifelong-project-of-language-learning Working - fairly - well: https://theconversation.com/carbon-offsets-offer-a-fantasy-of-capitalism-without-crises-155730

Since @revrari posted in January, it would be nice to see some love here. :-)

alphapapa commented 3 years ago

I can't offer support for how third-party tools interact with individual web sites. You're welcome to use this tracker to help debug the problem, and if you find a bug in this package or a sensible workaround, maybe we can apply it here.

alphapapa commented 3 years ago

Also, please see the note about Pandoc in the readme.

herop commented 3 years ago

Understood. Your comments in the Readme suggest, that issues would occur due to older versions of it, though.But you may close the issue if they are caused by pandoc. Kind regards and kudos for your excellent work!

alphapapa commented 3 years ago

I'd prefer to leave this issue open until it can be fixed or worked around, or at least improved (see the relevant TODO in the source code). If no one has the time or interest to work on it now, maybe someone will later.