jdepoix / youtube-transcript-api

This is a Python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles, and it requires neither an API key nor a headless browser, unlike Selenium-based solutions!
MIT License
2.54k stars 279 forks

Why are the subtitles obtained incomplete? #274

Closed 2322312693 closed 2 months ago

2322312693 commented 2 months ago

The first sentence is cut off, and the rest of it appears in the second subtitle:

{ "duration": 6.56, "start": 0.16, "text": "what are you going to do with us now I'm taking you to your parents can't we just " },
{ "duration": 8.28, "start": 6.72, "text": "forget the whole thing no then I'll get in trouble with my boss please we'll also get " },

2322312693 commented 2 months ago

Please help me.

jdepoix commented 2 months ago

The transcripts are grouped into snippets as they are displayed on YouTube. Each snippet corresponds to one subtitle line in the web player. Therefore, it's to be expected that they usually don't correspond to a full sentence.

I will close this now, as there is nothing we can do here.
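
If continuous text is needed, the snippets can be joined on the caller's side. Below is a minimal sketch, assuming the snippets are a list of dicts with a `text` key, as in the output quoted in the question. `merge_snippets` is a hypothetical helper, not part of youtube-transcript-api.

```python
def merge_snippets(snippets):
    """Join the text of consecutive snippets into one string.

    Assumes each snippet is a dict with a "text" key, as in the
    output quoted in the question. Hypothetical helper, not part
    of the library.
    """
    words = []
    for snippet in snippets:
        # split() drops leading/trailing whitespace and collapses
        # repeated spaces, so snippet boundaries join cleanly.
        words.extend(snippet["text"].split())
    return " ".join(words)


# The two snippets from the question
snippets = [
    {"duration": 6.56, "start": 0.16,
     "text": "what are you going to do with us now I'm taking you to your parents can't we just "},
    {"duration": 8.28, "start": 6.72,
     "text": "forget the whole thing no then I'll get in trouble with my boss please we'll also get "},
]

merged = merge_snippets(snippets)
print(merged)
```

This only removes the line breaks between snippets. Automatically generated subtitles such as the ones above carry no punctuation, so sentence boundaries cannot be recovered by joining alone.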