RSS-Bridge / rss-bridge

The RSS feed for websites missing it
https://rss-bridge.org/bridge01/
The Unlicense

feeds created with rss-bridge incompatible with feed informer #1372

Closed destgub closed 1 year ago

destgub commented 4 years ago

I make intensive use of RSS feeds and usually manage them with the 'feed informer' site (union, conditional filters, sorting). Until now I have never had compatibility problems with the various RSS feeds found on the internet and entered on the platform, but those generated by rss-bridge do not seem to be recognized correctly. In most cases they show up untitled and with empty contents, which makes them unmanageable. Is this a feed informer problem or an rss-bridge problem? Thanks

logmanoriginal commented 4 years ago

Hard to say. Maybe feed informer requires a different format than the one you chose. What feed format do you use? Do you receive any parsing errors?

Please also try https://validator.w3.org/feed/#validate_by_input to check if your feed is valid.

destgub commented 4 years ago

I tried both Atom and MRSS, but in both cases feed informer shows every element as empty and the title as 'untitled'. In Firefox with the 'want my rss' extension, all the feeds created with rss-bridge (both Atom and MRSS) display correctly. But no feed created with rss-bridge passes validation at https://validator.w3.org/feed/#validate_by_input; I receive "XML parsing error: :1:0: syntax error"

em92 commented 4 years ago

I receive "XML parsing error: :1:0: syntax error"

Looks like w3's validator is broken. Checked this: https://validator.w3.org/feed/check.cgi?url=https%3A%2F%2Ffeed.eugenemolotov.ru%2F%3Faction%3Ddisplay%26bridge%3DVk%26u%3Did7%26format%3DAtom

It shows 400 bad request. Looked into server logs, and I get this:

128.30.52.139 - - [02/Dec/2019:11:41:55 +0500] "GET /?action=display&bridge=Vk&u=id7&format=Atom HTTP/1.1" 400 37 "-" "FeedValidator/1.3"

It just requests the wrong URL.

Checked alternative, it looks fine: http://www.feedvalidator.org/check.cgi?url=https%3A%2F%2Ffeed.eugenemolotov.ru%2F%3Faction%3Ddisplay%26bridge%3DVk%26u%3Did7%26format%3DAtom
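For reference, the `url` parameter in both validator links is just the percent-encoded feed address. A minimal Python sketch of building such a link follows; the helper name `validator_link` is made up for illustration, and the `check.cgi?url=` shape is simply copied from the links above:

```python
from urllib.parse import quote


def validator_link(feed_url: str,
                   base: str = "http://www.feedvalidator.org/check.cgi") -> str:
    """Build a validator link by percent-encoding the whole feed URL."""
    # safe="" also encodes ':', '/', '?', '&' and '=', so the feed's own
    # query string is not mistaken for parameters of the validator.
    return f"{base}?url={quote(feed_url, safe='')}"


feed = "https://feed.eugenemolotov.ru/?action=display&bridge=Vk&u=id7&format=Atom"
print(validator_link(feed))
```

If the feed address is pasted into the validator without encoding, everything after the first `&` may be read as a separate parameter, which is one way a validator can end up requesting a different URL than intended.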

logmanoriginal commented 4 years ago

@destgub Do you get better results with the alternative mentioned by @em92?

destgub commented 4 years ago

With http://www.feedvalidator.org the feeds created with rss-bridge come out valid, but they still do not work properly on feed informer. So I guess the problem is with this other site, "feed informer", but I'm not sure.

logmanoriginal commented 4 years ago

Do you receive any error messages from Feed Informer? If not, perhaps it's a good idea to get in touch with them to check why it doesn't work.

destgub commented 4 years ago

The result from feed informer is "Valid, but interoperability with the widest range of feed readers could be improved by implementing the following recommendations." I already wrote to their support several times a while ago and they never answered; I will try again. But what I don't understand is why, out of the 69 feeds that I manage on feed informer, this problem occurs only with the ones generated by rss-bridge (in particular, it seems to happen when there are symbols in the title like + - * )

logmanoriginal commented 4 years ago

But what I don't understand is why, out of the 69 feeds that I manage on feed informer, this problem occurs only with the ones generated by rss-bridge (in particular, it seems to happen when there are symbols in the title like + - * )

Does that mean you have some feeds generated by RSS-Bridge that have no issues and some that do? Which bridges fail for you?

If possible please also include a few queries so we can test them.

destgub commented 4 years ago

No. I have numerous feeds generated directly by the sites I follow, and they all work correctly through FeedInformer, while ALL the feeds created with rss-bridge have problems on FeedInformer, although sometimes some items of the generated feeds are displayed correctly. I am not very familiar with code; here is an example of a feed generated with rss-bridge from a Facebook page, and the same feed inserted in feed informer. Generated with rss-bridge: https://bridge.suumitsu.eu/?action=display&bridge=Facebook&context=User&u=trasportopubbliconapoli&media_type=all&skip_reviews=on&limit=-1&format=Mrss

Same inserted in FeedInformer with no filters https://feed.informer.com/digests/LBHUKGQBF7/feeder.rss

logmanoriginal commented 4 years ago

Thanks for the examples, this is extremely helpful. It turns out feed informer doesn't handle unicode symbols. For example (first is original, second is feed informer):

1) Okay

<title>Il servizio di informazione sul trasporto pubblico di...</title>
<title>Il servizio di informazione sul trasporto pubblico di...</title>

2) Truncated

<title>DEVIAZIONE #LINEABUS 502 🔀🚍 Causa lavori in via Terracina,...</title>
<title>DEVIAZIONE #LINEABUS 502 </title>

3) Empty

<title>📢 #Anminforma che martedì 10 Dicembre le OO.SS.  USB, ORSA e...</title>
<title></title>

In fact, they truncate any text that contains symbols:

<description>&lt;a href="https://www.facebook.com/trasportopubbliconapoli/"&gt;&lt;img src="https://scontent-cdt1-1.xx.fbcdn.net/v/t1.0-1/p50x50/57750937_2367415529947970_974723255661232128_n.jpg?_nc_cat=102&amp;_nc_ohc=QfYCABJMApYAQlmPrVYN2RN55ytKZ32xh0Rn5sa7PihLdB7CSdTJkZC8Q&amp;_nc_ht=scontent-cdt1-1.xx&amp;oh=6ac5d060c50097135cfb3f2e3cd7be10&amp;oe=5E7B6494" alt="" /&gt;&lt;/a&gt;&lt;a href="https://www.facebook.com/trasportopubbliconapoli/"&gt;Anm Napoli&lt;/a&gt;&lt;a href="https://www.facebook.com/trasportopubbliconapoli/posts/2794505113905674"&gt;&lt;/a&gt;&lt;a href="https://www.facebook.com"&gt;&lt;i&gt;&lt;/i&gt;&lt;/a&gt;&lt;p&gt;Il servizio di informazione sul trasporto pubblico di 🚍🚇🚡🅿️Anm torna domattina alle ore 06:30.&lt;/p&gt;</description>
<description>&lt;a href="https://www.facebook.com/trasportopubbliconapoli/"&gt;&lt;img src="https://scontent-cdt1-1.xx.fbcdn.net/v/t1.0-1/p50x50/57750937_2367415529947970_974723255661232128_n.jpg?_nc_cat=102&amp;_nc_ohc=QfYCABJMApYAQmUaxGqJl7whswKuoJVtBsygxt7aKoMqx9NG3BS38-DHw&amp;_nc_ht=scontent-cdt1-1.xx&amp;oh=5997a221421e2d2c071149f7152db27b&amp;oe=5E7B6494" alt="" /&gt;&lt;/a&gt;&lt;a href="https://www.facebook.com/trasportopubbliconapoli/"&gt;Anm Napoli&lt;/a&gt;&lt;a href="https://www.facebook.com/trasportopubbliconapoli/posts/2794505113905674"&gt;&lt;/a&gt;&lt;a href="https://www.facebook.com"&gt;&lt;i&gt;&lt;/i&gt;&lt;/a&gt;&lt;p&gt;Il servizio di informazione sul trasporto pubblico di </description>

RSS and Atom neither permit nor forbid the use of symbols, so it's a valid feed. There also doesn't seem to be any resource from feed informer about their parsing algorithm and limitations.

Do you have the same issues with Atom instead of MRSS?
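The pattern in the examples above is consistent with the text being cut at the first character outside the Basic Multilingual Plane (emoji such as U+1F500 take 4 bytes in UTF-8). That is only a guess about feed informer's behaviour, not something they document. The Python sketch below (the function name `cut_at_first_astral` is made up, and the third title is shortened) just reproduces the observed output under that assumption:

```python
def cut_at_first_astral(text: str) -> str:
    """Return text up to (not including) the first code point above U+FFFF."""
    for i, ch in enumerate(text):
        if ord(ch) > 0xFFFF:
            return text[:i]
    return text


titles = [
    "Il servizio di informazione sul trasporto pubblico di...",
    "DEVIAZIONE #LINEABUS 502 \U0001F500 Causa lavori in via Terracina,...",
    "\U0001F4E2 #Anminforma che marted\u00ec 10 Dicembre ...",
]
for title in titles:
    # repr() makes trailing spaces and empty results visible.
    print(repr(cut_at_first_astral(title)))
```

The first title comes back unchanged, the second stops right after "502 ", and the third is empty, matching the "Okay", "Truncated" and "Empty" cases.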

destgub commented 4 years ago

Do you have the same issues with Atom instead of MRSS?

Thank you very much. Yes, I see the same issues. Atom created with rss-bridge: https://bridge.suumitsu.eu/?action=display&bridge=Facebook&context=User&u=trasportopubbliconapoli&media_type=all&skip_reviews=on&limit=-1&format=Atom

Atom with same feed inserted in FeedInformer with no filters: https://feed.informer.com/digests/NYYXKIXBIU/feeder.atom

I'm running several tests with other encodings on feed informer (I can choose between utf-8, iso-8859-1, iso-8859-6, iso-8859-8, CP1251, GB2312), but I always get the same problem. I will try to send another email to the platform; perhaps if I describe the precise problem they can resolve it. Otherwise, would there be a way to eliminate these symbols? When extracting RSS from social networks they are unfortunately very often present.
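One possible workaround, sketched in Python, is to post-process the feed text before handing it to feed informer and drop the characters that appear to trigger the truncation. This assumes the problem characters are the ones outside the Basic Multilingual Plane (emoji), which matches the examples above but is not confirmed. `strip_symbols` is a made-up helper, not part of rss-bridge.

```python
import re

# Code points above U+FFFF (most emoji), plus the variation selector and
# zero-width joiner that often accompany them.
_SYMBOLS = re.compile("[\U00010000-\U0010FFFF\uFE0F\u200D]")


def strip_symbols(text: str) -> str:
    """Remove emoji-like characters and collapse the leftover whitespace."""
    cleaned = _SYMBOLS.sub("", text)
    return re.sub(r"\s+", " ", cleaned).strip()


print(strip_symbols(
    "DEVIAZIONE #LINEABUS 502 \U0001F500 Causa lavori in via Terracina,..."
))
```

Where to run such a step is left open (for example a small script that fetches the rss-bridge feed and re-serves the cleaned copy). ASCII symbols like + - * are not touched by this pattern.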

dvikan commented 2 years ago

@destgub Is this still an issue?

dvikan commented 1 year ago

@destgub ?