dplocki / podcast-downloader

The Python script for downloading new mp3 from RSS given channels
GNU General Public License v3.0
118 stars 14 forks source link

<pp:url> ??? #76

Open Blackspirits opened 1 month ago

Blackspirits commented 1 month ago

https://rss.podplaystudio.com/3233.xml

this part shows 3 episodes, but it only downloads the first one “<enclosure url=”https://”

but the next two won't download because it shows “https:/”

how do I get it to download?

thanks

part of RSS:

<item>
<description>
<![CDATA[ ]]>
</description>
<enclosure url="https://bauernordic-pods.sharp-stream.com/pt/3233/o_natal_das_maes_e_as_questoes_q_4dc0e99a-9e02-4ce2-be11-138613036255_normal.mp3?aw_0_1st.episodeid=139172&aw_0_1st.collectionid=3233" length="24693072" type="audio/mpeg"/>
<guid isPermaLink="false">http://podcastmcr.iol.pt/rcomercial/KZ2DJMP0-FAV2-GIUP-17JH-OMKROIFQDUJH.mp3</guid>
<pubDate>Fri, 23 Oct 2020 19:16:10 +0000</pubDate>
<title>O Natal das mães e as questões que realmente importam </title>
<itunes:duration>00:25:43</itunes:duration>
<itunes:episodeType>full</itunes:episodeType>
<itunes:explicit>false</itunes:explicit>
<itunes:image href="https://studio-podplay.imgix.net/img/5183105aca1d266621467f945734d123.jpg?auto=compress&h=1500&w=1500&s=e7e59bb9cc45d6ace72c3bea680b5cb8"/>
<pp:original-pubdate>1603480570</pp:original-pubdate>
<pp:exclusive>false</pp:exclusive>
</item>
<pp:episode>
<pp:description>
<![CDATA[ ]]>
</pp:description>
<pp:url>https://bauernordic-pods.sharp-stream.com/pt/3233/a_emissao_a_cantar_com_um_anunci_f30f9cba-7679-41fb-a321-bce961564ff9_normal.mp3?aw_0_1st.episodeid=139171&aw_0_1st.collectionid=3233</pp:url>
<pp:mimetype>audio/mpeg</pp:mimetype>
<pp:size>33249115</pp:size>
<pp:guid>http://podcastmcr.iol.pt/rcomercial/IYQNQUMI-D9JD-EGIZ-Z67R-MK91VQD8W2G4.mp3</pp:guid>
<pp:pubdate>1603396907</pp:pubdate>
<pp:title>A emissão a cantar com um anúncio muito especial </pp:title>
<pp:duration>2078</pp:duration>
<pp:type>full</pp:type>
<pp:explicit>false</pp:explicit>
<pp:id>139171</pp:id>
<pp:exclusive>false</pp:exclusive>
<pp:image>https://studio-podplay.imgix.net/img/5183105aca1d266621467f945734d123.jpg?auto=compress&h=1500&w=1500&s=e7e59bb9cc45d6ace72c3bea680b5cb8</pp:image>
</pp:episode>
<pp:episode>
<pp:description>
<![CDATA[ ]]>
</pp:description>
<pp:url>https://bauernordic-pods.sharp-stream.com/pt/3233/hoje_os_ouvintes_foram_reportere_eed3edbd-ba9a-482e-a4f1-e605158e9f7c_normal.mp3?aw_0_1st.episodeid=139170&aw_0_1st.collectionid=3233</pp:url>
<pp:mimetype>audio/mpeg</pp:mimetype>
<pp:size>14445967</pp:size>
<pp:guid>http://podcastmcr.iol.pt/rcomercial/OH58EM2S-JSZY-J0YL-4PMC-R4ONIITI9FU9.mp3</pp:guid>
<pp:pubdate>1603309056</pp:pubdate>
<pp:title>Hoje os ouvintes foram repórteres por um dia </pp:title>
<pp:duration>902</pp:duration>
<pp:type>full</pp:type>
<pp:explicit>false</pp:explicit>
<pp:id>139170</pp:id>
<pp:exclusive>false</pp:exclusive>
<pp:image>https://studio-podplay.imgix.net/img/5183105aca1d266621467f945734d123.jpg?auto=compress&h=1500&w=1500&s=e7e59bb9cc45d6ace72c3bea680b5cb8</pp:image>
</pp:episode>
<pp:episode>
<pp:description>
<![CDATA[ ]]>
dplocki commented 1 month ago

Hi

For parsing the XML file the script is using the feedparser library. So, it has to be a valid RSS file. Those tags pp: seems to be custom made (by the RSS provided server) - they not recognized by the library.

How to do this? The easiest way would be use an XML parser, and find all pp:url values. Having those links you can downloaded them.