advplyr / audiobookshelf

Self-hosted audiobook and podcast server
https://audiobookshelf.org
GNU General Public License v3.0
5.78k stars 393 forks source link

[Enhancement]: Adding Transcription/Subtitle Viewing Support #2919

Open mfcar opened 2 months ago

mfcar commented 2 months ago

Describe the feature/enhancement

Transcription/Subtitle support ===

Summary

Add initial support for transcriptions. Apple now supports transcriptions in podcasts.

Some audiobook files have transcriptions, and currently, we can use tools based on Whisper to transcribe audio to text.

In fact, most software based on Whisper support transcribing audio to text and exporting it as an SRT or VTT file. VTT is a native format for the web, and SRT is a common format for subtitles.

I'm creating this issue to discuss the best way to implement transcription support on the web player. I'm trying to implement some features on the pull request #

Podcast transcription is supported by:

Possible tasks:

Note: I think we need define a standard for multi-language transcriptions. For example use some prefix in the file name like en- for English and es- for Spanish.

UI Ideas on the Web Player:

What's the best way to display the transcription on the web player?

Screenshot 2024-05-04 at 20 47 33
Screenshot 2024-05-02 at 09 09 47

Related

barolo commented 2 weeks ago

IMHO, when it comes to the UI you should combine both, the big panel for browsing and the panel below controls perhaps just with the current line but bigger. You have to take into account accessibility, some folks will want it to be resizable.