Athou / commafeed

Google Reader inspired self-hosted RSS reader.
https://www.commafeed.com
Apache License 2.0
2.59k stars 361 forks source link

Error while adding gazzetta.it rss feeds #1260

Open Paolo7297 opened 4 months ago

Paolo7297 commented 4 months ago

Whenever I try to add a gazzetta.it feed (Like https://www.gazzetta.it/dynamic-feed/rss/section/Calcio/Serie-A.xml), it throws this error: org.xml.sax.SAXParseException: DOCTYPE is disallowed when the feature "http://apache.org/xml/features/disallow-doctype-decl" set to true.. There's a way to bypass the error? It works great in other rss reader. Thanks!

Screenshot 2024-02-21 alle 14 21 15

Athou commented 4 months ago

There is a DOCTYPE declaration at the top of the feed, which is unusual.

image

The parser CommaFeed is using actively blocks feeds with a DOCTYPE declaration for security reasons (see https://github.com/rometools/rome/issues/203 and https://en.wikipedia.org/wiki/Billion_laughs_attack).

Maybe I can remove the DOCTYPE from the XML before the parsing occurs, I'll see what I can do.

In the mean time, you could contact the website to ask them to remove the DOCTYPE declaration.

Paolo7297 commented 4 months ago

Thanks! Actually their contact form isn't working, I hope it will in the next days

travisbeard commented 2 months ago

I also have several that could not be imported when switching from feedly. It would be nice to have an option ignore.

Athou commented 2 months ago

I also have several that could not be imported when switching from feedly. It would be nice to have an option ignore.

Do you get the same error as above? What are the feed urls that are not working?

travisbeard commented 2 months ago

This is no longer bothering me. I was able to find the sites all had 2 feeds, one with and one without. The website parser in commafeed finds the wrong one by default, but i was able to find the 2nd feed for all these sites worked.