OpenPecha / news-with-audio-data

MIT License

TTS data preparation from News data #1

Open kaldan007 opened 1 month ago

kaldan007 commented 1 month ago

Description: We currently have full-length news audio and the corresponding news transcripts. We would like to split the news text and audio into segments to train our STT and TTS models.

Implementation:

Subtask:

kaldan007 commented 1 month ago

please run on two point catalog

tenzinchoedon commented 1 month ago

| Sr. no | ID | Audio link | Audio text link | Audio Duration (hh:mm:ss) | Speaker name | Speaker Gender | News channel | Publishing Year |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
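
A minimal sketch of how one catalog row with the columns above could be represented and how the `hh:mm:ss` duration could be converted to seconds, for example to total up the available audio. The class name `CatalogRow`, the field types, and the helper names are assumptions for illustration, not part of the existing catalog tooling.

```python
from dataclasses import dataclass
from typing import Iterable


def parse_duration(hhmmss: str) -> int:
    """Convert an 'hh:mm:ss' string into a number of seconds."""
    hours, minutes, seconds = (int(part) for part in hhmmss.strip().split(":"))
    if hours < 0 or not (0 <= minutes < 60 and 0 <= seconds < 60):
        raise ValueError(f"invalid duration: {hhmmss!r}")
    return hours * 3600 + minutes * 60 + seconds


@dataclass
class CatalogRow:
    # Field names mirror the catalog columns; the types are assumptions.
    sr_no: int
    id: str
    audio_link: str
    audio_text_link: str
    audio_duration: str  # formatted as hh:mm:ss
    speaker_name: str
    speaker_gender: str
    news_channel: str
    publishing_year: int

    @property
    def duration_seconds(self) -> int:
        return parse_duration(self.audio_duration)


def total_hours(rows: Iterable[CatalogRow]) -> float:
    """Sum the durations of all rows and return the total in hours."""
    return sum(row.duration_seconds for row in rows) / 3600
```

Keeping the duration as the original string and deriving seconds on demand means a row can be written back to the sheet unchanged.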

tenzinchoedon commented 1 month ago

Link to the Google Sheet: https://docs.google.com/spreadsheets/d/1H1QVQgux6C170GSKKJ10wzNh0I-o1OL4BlWqe2BfViA/edit?gid=378785097#gid=378785097

kaldan007 commented 1 month ago

@tenzinchoedon can you explore any existing libraries to differentiate between male and female audio?

tenzinchoedon commented 1 month ago

Here are some Python libraries that can be used to distinguish between male and female voices in audio files:

1. pyAudioAnalysis

2. Librosa

3. SpeechRecognition
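
To illustrate the kind of signal these libraries would work with, here is a rough pure-Python sketch of a pitch-based heuristic: estimate the fundamental frequency of a voiced frame by autocorrelation, then compare it to a threshold. It does not use any of the libraries above. The function names and the 165 Hz threshold are assumptions for illustration only; male and female pitch ranges overlap, so this is a first-pass filter rather than a reliable classifier. In practice a library pitch tracker such as `librosa.pyin` would likely replace `estimate_pitch_hz`.

```python
import math


def estimate_pitch_hz(samples, sample_rate, fmin=60.0, fmax=400.0):
    """Estimate the fundamental frequency of one voiced frame by autocorrelation.

    Returns 0.0 when no positive autocorrelation peak is found (e.g. silence).
    """
    if not samples:
        return 0.0
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(samples) - 1)
    mean = sum(samples) / len(samples)
    centered = [s - mean for s in samples]
    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        score = sum(centered[i] * centered[i + lag]
                    for i in range(len(centered) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else 0.0


def guess_speaker_gender(pitch_hz, threshold_hz=165.0):
    """Label a pitch estimate; the threshold is an assumed, tunable value."""
    if pitch_hz <= 0:
        return "unknown"
    return "male" if pitch_hz < threshold_hz else "female"


def sine_frame(freq_hz, sample_rate=8000, n_samples=800):
    """Synthetic test tone standing in for a voiced speech frame."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n_samples)]
```

For real recordings the estimate would need to be taken over many voiced frames and aggregated (for example with a median), since a single frame can easily land on noise or silence.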

gangagyatso4364 commented 1 month ago

Analyze how helpful text transfer is on the new audio data. Find a case where both a news anchor and an interviewee are speaking. It should have:

  1. The news anchor's speech present in the audio text file.
  2. The interviewee's speech not present in the audio text file.
  3. Also compare the text transfer result before and after.
  4. Analyze how well text transfer works compared to audiobook data.
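
One way the before/after comparison in point 3 could be quantified is a character error rate (CER) against a manually checked reference transcript. This is a sketch under that assumption: it presumes a reference exists for the segment, and the function names are hypothetical. Characters are counted as Unicode code points, which may not match syllable-level errors in Tibetan text.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (insert, delete, substitute)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def char_error_rate(reference, hypothesis):
    """Edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def cer_improvement(reference, before, after):
    """Positive values mean the transcript after text transfer is closer to the reference."""
    return char_error_rate(reference, before) - char_error_rate(reference, after)
```

Running the same metric on an audiobook segment and a news segment would give a rough basis for the comparison in point 4.
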
kaldan007 commented 1 month ago