akhilrex / podgrab

A self-hosted podcast manager/downloader/archiver tool to download podcast episodes as soon as they become live with an integrated player.
GNU General Public License v3.0

Transcripts #148

Open jtagcat opened 3 years ago

jtagcat commented 3 years ago

I'd love it if transcripts were supported. Fetching them, though, is difficult: some podcasts provide them right in the show notes, others on the episode's webpage (e.g. https://www.20k.org/episodes/433).

Custom per-feed importers would have to be added, but this issue is a request for the base feature only. For English and other popular languages, auto-captioning is available.
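To make the per-feed importer idea concrete, here is a minimal sketch in Go (the language Podgrab is written in). Everything in it is an assumption for illustration: the `TranscriptImporter` interface, the `markerImporter` type, the `findTranscript` helper, the example feed URL, and the "Transcript:" marker convention are all hypothetical and not part of Podgrab.

```go
package main

import (
	"fmt"
	"strings"
)

// TranscriptImporter is a hypothetical interface; Podgrab does not define it.
type TranscriptImporter interface {
	// Matches reports whether this importer handles the given feed URL.
	Matches(feedURL string) bool
	// Extract returns the transcript found in the show notes,
	// or ok=false when none is present.
	Extract(showNotes string) (transcript string, ok bool)
}

// markerImporter handles feeds that embed the transcript in the show notes
// after a fixed marker (an assumed convention, for illustration only).
type markerImporter struct {
	feedPrefix string
	marker     string
}

func (m markerImporter) Matches(feedURL string) bool {
	return strings.HasPrefix(feedURL, m.feedPrefix)
}

func (m markerImporter) Extract(showNotes string) (string, bool) {
	_, after, found := strings.Cut(showNotes, m.marker)
	if !found {
		return "", false
	}
	text := strings.TrimSpace(after)
	return text, text != ""
}

// findTranscript tries each registered importer in order and returns the first hit.
func findTranscript(importers []TranscriptImporter, feedURL, showNotes string) (string, bool) {
	for _, imp := range importers {
		if !imp.Matches(feedURL) {
			continue
		}
		if text, ok := imp.Extract(showNotes); ok {
			return text, true
		}
	}
	return "", false
}

func main() {
	importers := []TranscriptImporter{
		markerImporter{feedPrefix: "https://example.com/feed", marker: "Transcript:"},
	}
	notes := "An episode about sound.\nTranscript:\nHello and welcome."
	text, ok := findTranscript(importers, "https://example.com/feed.xml", notes)
	fmt.Println(ok, text)
}
```

An importer that scrapes the episode webpage, or one that falls back to auto-captioning, could implement the same interface, so the base feature would not need to know about individual feeds.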

Being able to search through all of the text would be brilliant. This could be achieved by regularly exporting¹ the transcripts (with references to where each piece of text came from) so that external tools can index them.

¹ Or rather, grabbing them from the data directory or from an API endpoint (potentially gigabytes of data?), exported/indexed regularly and automatically.

Highlights and notes (both text and time) would also be nice, but that, too, may be out of scope.