nkanaev / yarr

yet another rss reader
MIT License
3.03k stars 224 forks source link

RSS feed parsing issue? #88

Closed stepelu closed 2 years ago

stepelu commented 2 years ago

The RSS feed https://francisbach.com/feed/ results in the parse error XML syntax error on line 1815: illegal character code U+000C. I might be mistaken but I cannot find this unicode character in the feed itself, so I am unsure about what might be happening here. Tested on Linux, git HEAD as time of this Issue creation.

nkanaev commented 2 years ago

\x0c is there, on line 1786 (attached feed content: francisbach.xml.txt)

Go's xml decoder is strict in that regard. yarr should be able to handle such nuisances.

nkanaev commented 2 years ago

the fix has landed on master branch. needs thorough testing and will be available in the upcoming release.