miniflux / v2

Minimalist and opinionated feed reader
https://miniflux.app
Apache License 2.0
6.97k stars 728 forks source link

Unable to find any feed in https://www.marcusfolkesson.se/categories/blog/ #2844

Closed Necior closed 2 months ago

Necior commented 2 months ago

Steps to reproduce

  1. Go to the "New feed" page
  2. Paste https://www.marcusfolkesson.se/categories/blog/ as the URL
  3. Click "Find a feed"

Observed behavior

Got Unable to find any feed. message.

Expected behavior

miniflux finds https://www.marcusfolkesson.se/categories/blog/index.xml and uses it.

Extra comment

I found the feed manually by viewing the source of the page. As of writing this, /html/body/footer/div/div/div/ul/li[6]/a points to:

<a href="/categories/blog/index.xml" rel="alternate" type="application/rss+xml" title="Marcus Folkesson">
                           <span class="fa-stack fa-lg">
                               <i class="fas fa-circle fa-stack-2x"></i>
                               <i class="fas fa-rss fa-stack-1x fa-inverse"></i>
                           </span>
                       </a>

from which I was able to manually extract the feed URL.

fguillot commented 2 months ago

Auto-discovery will work only if the link (<link> tag) is located in <head> section of the HTML page. The HTML code that you are referring to is just a normal HTML link on the page.

For reference: https://www.rssboard.org/rss-autodiscovery#element-link

To support autodiscovery, a link element must be added to the header, as shown in this HTML markup from The RSS Blog:

<html>
  <head>
    <title>The RSS Blog</title>
    <link rel="alternate" type="application/rss+xml" title="RSS"
      href="http://feeds.feedburner.com/TheRssBlog">
  </head>
  <body>
    <!-- the web page's contents -->
  </body>
</html>

Same thing for other feed formats like Atom: https://en.wikipedia.org/wiki/Atom_(web_standard)#Including_in_HTML

However, Miniflux should be able to discover the feeds by trying well-known URLs. Add a trailing slash to the URL and it should work: https://www.marcusfolkesson.se/categories/blog/

image

For reference: https://github.com/miniflux/v2/blob/95201fc5cff83938efe32f0bc17f4f333195cf9c/internal/reader/subscription/finder.go#L191-L201

Necior commented 2 months ago

Auto-discovery will work only if the link (<link> tag) is located in <head> section of the HTML page. The HTML code that you are referring to is just a normal HTML link on the page.

Thanks for the context. I emailed the author of the blog to let him know.

However, Miniflux should be able to discover the feeds by trying well-known URLs. Add a trailing slash to the URL and it should work: https://www.marcusfolkesson.se/categories/blog/

I tried with a trailing slash but it didn't help. What helped was upgrading my instance to 2.2.0 :tada: Thanks!