OpenPecha / stt-split-audio

MIT License

STT0035: Create a pipeline to download stt_nw data from GitHub and convert it to the proper format for upload into STT pecha tools. #6

Open gangagyatso4364 opened 1 week ago

gangagyatso4364 commented 1 week ago

Description

Create a pipeline that downloads the STT data from the release page of the Tibetan news audio GitHub repo and converts the audio into the format required for training data. Then split the audio as usual and upload the transcript CSV to the STT pecha tool database.
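A minimal sketch of the download-and-convert step, for discussion only. It assumes the audio files are attached as assets on the repo's GitHub releases, that `ffmpeg` is installed, and that the "proper format" is 16 kHz mono WAV; none of these are confirmed in this issue. All function names (`list_audio_assets`, `fetch_releases`, `ffmpeg_command`, `download_and_convert`) are hypothetical, and the repo owner and name are left as parameters because they are not given here. Splitting and the CSV upload are not covered.

```python
import json
import subprocess
import urllib.request
from pathlib import Path

# Assumption: which file extensions count as audio is a guess.
AUDIO_SUFFIXES = (".mp3", ".wav", ".m4a", ".flac")


def list_audio_assets(releases):
    """Return (name, download_url) pairs for audio assets in a releases payload."""
    assets = []
    for release in releases:
        for asset in release.get("assets", []):
            name = asset["name"]
            if name.lower().endswith(AUDIO_SUFFIXES):
                assets.append((name, asset["browser_download_url"]))
    return assets


def fetch_releases(owner, repo):
    """Fetch the release list for a repo from the GitHub REST API."""
    url = f"https://api.github.com/repos/{owner}/{repo}/releases"
    req = urllib.request.Request(url, headers={"Accept": "application/vnd.github+json"})
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def ffmpeg_command(src, dst, sample_rate=16000):
    """Build an ffmpeg command that converts src to mono audio at sample_rate Hz."""
    # Assumption: 16 kHz mono is the target; adjust to the real training format.
    return ["ffmpeg", "-y", "-i", str(src), "-ac", "1", "-ar", str(sample_rate), str(dst)]


def download_and_convert(owner, repo, out_dir):
    """Download every audio asset and write a converted WAV copy next to it."""
    raw_dir = Path(out_dir) / "raw"
    wav_dir = Path(out_dir) / "wav"
    raw_dir.mkdir(parents=True, exist_ok=True)
    wav_dir.mkdir(parents=True, exist_ok=True)
    for name, url in list_audio_assets(fetch_releases(owner, repo)):
        raw_path = raw_dir / name
        urllib.request.urlretrieve(url, raw_path)
        wav_path = wav_dir / (Path(name).stem + ".wav")
        subprocess.run(ffmpeg_command(raw_path, wav_path), check=True)
```

Calling `download_and_convert(owner, repo, "stt_nw_data")` with the actual owner and repo name would leave the original files in `stt_nw_data/raw` and the converted ones in `stt_nw_data/wav`, ready for the usual splitting step. Unauthenticated GitHub API requests are rate limited, so a token may be needed for large releases.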

Completion Criteria

The stt_nw data is shown in the stt.pecha.tools stats.

Implementation Plan

Subtasks

gangagyatso4364 commented 1 week ago

Previous work on GitHub download by sp: https://github.com/OpenPecha/saymore-report-generator/blob/main/elan_to_segments/download_full_audio_gh.ipynb