samuelclay / NewsBlur

NewsBlur is a personal news reader that brings people together to talk about the world. A new sound of an old instrument.
http://www.newsblur.com
MIT License
6.92k stars 1k forks source link

Images not showing for certain feed #1784

Closed NylonDiamond closed 1 year ago

NylonDiamond commented 1 year ago

I'm not seeing any issues in this RSS feed but it's not showing any images. It shows fine on feedly but now in NewsBlur. Any ideas? Is there a way to customize the feed before NewsBlur renders it?

Here is a sample of the feed:

<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
    xmlns:content="http://purl.org/rss/1.0/modules/content/"
    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:atom="http://www.w3.org/2005/Atom"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
    xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    >

<channel>
    <title>CG Persia</title>
    <atom:link href="https://cgpersia.com/feed" rel="self" type="application/rss+xml" />
    <link>https://cgpersia.com</link>
    <description></description>
    <lastBuildDate>Mon, 06 Feb 2023 22:01:50 +0000</lastBuildDate>
    <language>en-US</language>
    <sy:updatePeriod>
    hourly  </sy:updatePeriod>
    <sy:updateFrequency>
    1   </sy:updateFrequency>

    <item>
        <title>Udemy &#8211; 3ds Max &#038; Corona Renderer: An In-Depth Guide for Beginners.</title>
        <link>https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html</link>
                    <comments>https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html#respond</comments>

        <dc:creator><![CDATA[Diptra]]></dc:creator>
        <pubDate>Mon, 06 Feb 2023 22:01:50 +0000</pubDate>
                <category><![CDATA[2D]]></category>
        <category><![CDATA[CG Releases]]></category>
        <category><![CDATA[Downloads]]></category>
        <category><![CDATA[Learning]]></category>
        <category><![CDATA[Tutorials]]></category>
        <category><![CDATA[Videos]]></category>
        <category><![CDATA[3ds Max]]></category>
        <category><![CDATA[corona render]]></category>
        <category><![CDATA[Udemy]]></category>
        <guid isPermaLink="false">https://cgpersia.com/?p=188225</guid>

                    <description><![CDATA[Duration 2h 24m Project Files Included MP4 Title: Udemy &#8211; 3ds Max &#38; Corona Renderer: An In-Depth Guide for Beginners. Info: Unleash Your 3D Design Potential with 3ds Max &#38; Corona Renderer: A Comprehensive Course for Beginners What you’ll learn The basics of 3ds Max and Corona Renderer, including interface navigation and essential tools. Techniques [&#8230;]]]></description>
                                        <content:encoded><![CDATA[<p><img decoding="async" loading="lazy" class="aligncenter" src="https://imgur.com/kVfvzUo.jpg" alt="Post Cover" width="575" height="350" /></p>
<p style="text-align: center"><strong>Duration 2h 24m Project Files Included MP4</strong></p>
<p></p>]]></content:encoded>

                    <wfw:commentRss>https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html/feed</wfw:commentRss>
            <slash:comments>0</slash:comments>

            </item>
    </channel>
</rss>
samuelclay commented 1 year ago

That feed looks valid and feedparser is able to parse it:

>>> import feedparser
>>> fp = feedparser.parse("""<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
...     xmlns:content="http://purl.org/rss/1.0/modules/content/"
...     xmlns:wfw="http://wellformedweb.org/CommentAPI/"
...     xmlns:dc="http://purl.org/dc/elements/1.1/"
...     xmlns:atom="http://www.w3.org/2005/Atom"
...     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
...     xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
...     >
... 
... <channel>
...     <title>CG Persia</title>
...     <atom:link href="https://cgpersia.com/feed" rel="self" type="application/rss+xml" />
...     <link>https://cgpersia.com</link>
...     <description></description>
...     <lastBuildDate>Mon, 06 Feb 2023 22:01:50 +0000</lastBuildDate>
...     <language>en-US</language>
...     <sy:updatePeriod>
...     hourly</sy:updatePeriod>
...     <sy:updateFrequency>
...     1</sy:updateFrequency>
...     
...     <item>
...             <title>Udemy &#8211; 3ds Max &#038; Corona Renderer: An In-Depth Guide for Beginners.</title>
...             <link>https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html</link>
...                                     <comments>https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html#respond</comments>
...             
...             <dc:creator><![CDATA[Diptra]]></dc:creator>
...             <pubDate>Mon, 06 Feb 2023 22:01:50 +0000</pubDate>
...                             <category><![CDATA[2D]]></category>
...             <category><![CDATA[CG Releases]]></category>
...             <category><![CDATA[Downloads]]></category>
...             <category><![CDATA[Learning]]></category>
...             <category><![CDATA[Tutorials]]></category>
...             <category><![CDATA[Videos]]></category>
...             <category><![CDATA[3ds Max]]></category>
...             <category><![CDATA[corona render]]></category>
...             <category><![CDATA[Udemy]]></category>
...             <guid isPermaLink="false">https://cgpersia.com/?p=188225</guid>
... 
...                                     <description><![CDATA[Duration 2h 24m Project Files Included MP4 Title: Udemy &#8211; 3ds Max &#38; Corona Renderer: An In-Depth Guide for Beginners. Info: Unleash Your 3D Design Potential with 3ds Max &#38; Corona Renderer: A Comprehensive Course for Beginners What you’ll learn The basics of 3ds Max and Corona Renderer, including interface navigation and essential tools. Techniques [&#8230;]]]></description>
...                                                                             <content:encoded><![CDATA[<p><img decoding="async" loading="lazy" class="aligncenter" src="https://imgur.com/kVfvzUo.jpg" alt="Post Cover" width="575" height="350" /></p>
... <p style="text-align: center"><strong>Duration 2h 24m Project Files Included MP4</strong></p>
... <p></p>]]></content:encoded>
...                                     
...                                     <wfw:commentRss>https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html/feed</wfw:commentRss>
...                     <slash:comments>0</slash:comments>
... 
...                     </item>
...     </channel>
... </rss>""")
>>> fp.bozo
False
>>> fp
{'bozo': False, 'entries': [{'title': 'Udemy – 3ds Max & Corona Renderer: An In-Depth Guide for Beginners.', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'Udemy – 3ds Max & Corona Renderer: An In-Depth Guide for Beginners.'}, 'links': [{'rel': 'alternate', 'type': 'text/html', 'href': 'https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html'}], 'link': 'https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html', 'comments': 'https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html#respond', 'authors': [{'name': 'Diptra'}], 'author': 'Diptra', 'author_detail': {'name': 'Diptra'}, 'published': 'Mon, 06 Feb 2023 22:01:50 +0000', 'published_parsed': time.struct_time(tm_year=2023, tm_mon=2, tm_mday=6, tm_hour=22, tm_min=1, tm_sec=50, tm_wday=0, tm_yday=37, tm_isdst=0), 'tags': [{'term': '2D', 'scheme': None, 'label': None}, {'term': 'CG Releases', 'scheme': None, 'label': None}, {'term': 'Downloads', 'scheme': None, 'label': None}, {'term': 'Learning', 'scheme': None, 'label': None}, {'term': 'Tutorials', 'scheme': None, 'label': None}, {'term': 'Videos', 'scheme': None, 'label': None}, {'term': '3ds Max', 'scheme': None, 'label': None}, {'term': 'corona render', 'scheme': None, 'label': None}, {'term': 'Udemy', 'scheme': None, 'label': None}], 'id': 'https://cgpersia.com/?p=188225', 'guidislink': False, 'summary': 'Duration 2h 24m Project Files Included MP4 Title: Udemy &#8211; 3ds Max &#38; Corona Renderer: An In-Depth Guide for Beginners. Info: Unleash Your 3D Design Potential with 3ds Max &#38; Corona Renderer: A Comprehensive Course for Beginners What you’ll learn The basics of 3ds Max and Corona Renderer, including interface navigation and essential tools. Techniques [&#8230;]', 'summary_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': 'Duration 2h 24m Project Files Included MP4 Title: Udemy &#8211; 3ds Max &#38; Corona Renderer: An In-Depth Guide for Beginners. Info: Unleash Your 3D Design Potential with 3ds Max &#38; Corona Renderer: A Comprehensive Course for Beginners What you’ll learn The basics of 3ds Max and Corona Renderer, including interface navigation and essential tools. Techniques [&#8230;]'}, 'content': [{'type': 'text/html', 'language': None, 'base': '', 'value': '<p><img alt="Post Cover" class="aligncenter" height="350" src="https://imgur.com/kVfvzUo.jpg" width="575" /></p>\n<p style="text-align: center;"><strong>Duration 2h 24m Project Files Included MP4</strong></p>\n<p></p>'}], 'wfw_commentrss': 'https://cgpersia.com/2023/02/udemy-3ds-max-corona-renderer-an-in-depth-guide-for-beginners-188225.html/feed', 'slash_comments': '0'}], 'feed': {'title': 'CG Persia', 'title_detail': {'type': 'text/plain', 'language': None, 'base': '', 'value': 'CG Persia'}, 'links': [{'href': 'https://cgpersia.com/feed', 'rel': 'self', 'type': 'application/rss+xml'}, {'rel': 'alternate', 'type': 'text/html', 'href': 'https://cgpersia.com'}], 'link': 'https://cgpersia.com', 'subtitle': '', 'subtitle_detail': {'type': 'text/html', 'language': None, 'base': '', 'value': ''}, 'updated': 'Mon, 06 Feb 2023 22:01:50 +0000', 'updated_parsed': time.struct_time(tm_year=2023, tm_mon=2, tm_mday=6, tm_hour=22, tm_min=1, tm_sec=50, tm_wday=0, tm_yday=37, tm_isdst=0), 'language': 'en-US', 'sy_updateperiod': 'hourly', 'sy_updatefrequency': '1'}, 'headers': {}, 'encoding': 'utf-8', 'version': 'rss20', 'namespaces': {'content': 'http://purl.org/rss/1.0/modules/content/', 'wfw': 'http://wellformedweb.org/CommentAPI/', 'dc': 'http://purl.org/dc/elements/1.1/', '': 'http://www.w3.org/2005/Atom', 'sy': 'http://purl.org/rss/1.0/modules/syndication/', 'slash': 'http://purl.org/rss/1.0/modules/slash/'}}

Feel free to re-open this ticket if you can point to the changes you'd like to see in the code.