jooyoungseo / youtubecaption

Downloading YouTube Subtitle Transcription in a Tidy Tibble Data_Frame in R
https://jooyoungseo.github.io/youtubecaption
GNU General Public License v3.0
38 stars 1 forks source link

Multiple captions for the same language #5

Open jobreu opened 3 years ago

jobreu commented 3 years ago

First of all, thank you for this great package! We just used this in a workshop on working with YouTube data and found it extremely helpful (especially since the get_captions function from the tuber package does not seem to work anymore. While using the package in the workshop, we were wondering whether it may be possible to add an option for choosing between different caption tracks for the same language. For some videos, there are automatically generated (ASR) and manually generated caption tracks for the same language. If that is not (easily) possible, could you maybe say which track the function picks if there is more than one caption track for the same language? The most recently created/edited one or does it select manually generated captions if they exist and ASR otherwise?

rangaro commented 3 years ago

Is there any update on this issue? We want to mention this package in a book chapter, but as of now, we would also have to mention this limitation.

jooyoungseo commented 3 years ago

Sorry for my delayed response. I will investigate this issue this weekend, and will get back to you all! Thank you very much for your patience.

jooyoungseo commented 3 years ago

It would be greatly appreciated if either of you could provide me with a sample video URL for better reproducibility.

rangaro commented 3 years ago

Thanks for looking into this issue!

ID = 3TNkWTRNNYE

This is one of my own videos where YouTube automatically created subtitles (as it always does; those get the stamp "ASR"). Afterwards, I manually edited the subs and thus created another set of subtitles.

You can use the package "tuber" to get a list of the subtitle tracks: list_caption_tracks(video_id = "3TNkWTRNNYE").

If you wish, I can create an additional test set of subtitles for this video.