EngineeringKiosk / webpage

Landing page hosted on netlify
0 stars 1 forks source link

Retrieve Podcast episode links from Spotify, Apple Music, Amazon Music and Google automatically #26

Closed andygrunwald closed 2 years ago

andygrunwald commented 2 years ago

The python script in https://github.com/EngineeringKiosk/webpage/blob/main/scripts/podcast-feed-to-content.pycreates the initial podcast episode entry as Markdown based on the Podcast RSS feed.

The links for the particular platforms (Spotify, etc.) still need to be added manually. See https://github.com/EngineeringKiosk/webpage/blob/bb46c27db352a0d5772a7b72563f4e25b4e63071/scripts/podcast-feed-to-content.py#L270-L273

Goal of this task

See if we can automate the retrieval of the links from Spotify, Apple Music, Amazon Music and Google for each podcast episode.

andygrunwald commented 2 years ago

Spotify done in

Apple Podcast done in

andygrunwald commented 2 years ago

Google Podcast done in

andygrunwald commented 2 years ago

Amazon Music is a bit more tricky, because it loads the content async and the API is not very accessible from our script (without building a proper session, etc).

I am not sure if it is really worth the effort and putting hours into it to "reverse engineer" it. For now, I did a small automation/workaround:

  1. I skip crawling Amazon Music website (due to JS load scripts)
  2. I added a small python script to check for all missing player URLs in all podcast episodes in https://github.com/EngineeringKiosk/webpage/commit/aa54487c4f57d716a530f54f7cd53cbbd5807c78
  3. I added a new GitHub action that checks once a month which player URLs are missing - If we have missing ones, it fails and we get an email - See https://github.com/EngineeringKiosk/webpage/runs/6954562212?check_suite_focus=true and https://github.com/EngineeringKiosk/webpage/commit/abdf135e1c53ecfc6b84f9a11680e9687170068c
andygrunwald commented 2 years ago

All done here.