ChakshuGautam/whisper-hinglish
Build pipeline for downloading and processing audio from URL #4 (Open)
rayaanoidPrime opened this issue 6 months ago
rayaanoidPrime commented 6 months ago
Building a pipeline to automate Issue #3
[ ] Write a script to download the audio and transcript (captions) from a YouTube URL.
[ ] Write a script to download audio and transcript files from podcast sources.
[ ] Handle format conversions from `mp3`, `mp4`, etc. to `wav`.
[ ] Check the token distribution of the audio source between `hi` and `eng` tokens.
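The last two tasks could be sketched roughly as below. This is only an illustration, not the pipeline the issue will end up with: the function names (`ffmpeg_to_wav_cmd`, `convert_to_wav`, `token_distribution`) are hypothetical, the conversion assumes the `ffmpeg` CLI is installed and that 16 kHz mono output is wanted, and the `hi`/`eng` split is a script-based heuristic (Devanagari vs. Latin characters) that would count romanized Hindi as `eng`.

```python
import re
import subprocess
from collections import Counter
from pathlib import Path

# Heuristic assumption: a token is "hi" if it contains any Devanagari
# character, "eng" if it contains a Latin letter, otherwise "other".
DEVANAGARI = re.compile(r"[\u0900-\u097F]")
LATIN = re.compile(r"[A-Za-z]")


def ffmpeg_to_wav_cmd(src, dst=None, sample_rate=16000):
    """Build an ffmpeg command that converts mp3, mp4, etc. to mono WAV."""
    src = Path(src)
    dst = Path(dst) if dst else src.with_suffix(".wav")
    return [
        "ffmpeg", "-y",           # overwrite the output if it exists
        "-i", str(src),           # input file (any format ffmpeg can read)
        "-vn",                    # drop video streams (e.g. for mp4)
        "-ac", "1",               # mono
        "-ar", str(sample_rate),  # resample (16 kHz is an assumption)
        str(dst),
    ]


def convert_to_wav(src, dst=None, sample_rate=16000):
    """Run the conversion; requires the ffmpeg CLI on PATH."""
    cmd = ffmpeg_to_wav_cmd(src, dst, sample_rate)
    subprocess.run(cmd, check=True)
    return Path(cmd[-1])


def classify_token(token):
    """Label one whitespace-separated token by the script it uses."""
    if DEVANAGARI.search(token):
        return "hi"
    if LATIN.search(token):
        return "eng"
    return "other"


def token_distribution(text):
    """Return the fraction of hi / eng / other tokens in a transcript."""
    counts = Counter(classify_token(t) for t in text.split())
    total = sum(counts.values())
    if total == 0:
        return {"hi": 0.0, "eng": 0.0, "other": 0.0}
    return {label: counts[label] / total for label in ("hi", "eng", "other")}
```

For example, `token_distribution("मैं office जा रहा हूँ")` labels four of the five tokens as `hi` and one as `eng`. If the transcripts turn out to be mostly romanized, a word-level language identifier would be needed instead of the script check.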