pydatadelhi / talks

Talks at PyData Delhi Meetups
44 stars 13 forks source link

Machine Learning Techniques for Audio and Speech synthesis #73

Open kdhingra307 opened 6 years ago

kdhingra307 commented 6 years ago
manojpandey commented 6 years ago

@Dawny33 Please review sir ;)

Dawny33 commented 6 years ago

The topic looks pretty interesting.

@kdhingra307 Pl add more details regarding how you want to structure the session.

kdhingra307 commented 6 years ago

Hi @Dawny33 yaa sure

this is kind of overview of what would be comprehensible for most

  1. A brief overview of human-computer interaction, why is it so important.
  2. Quick basics into signals and what are the major differences between audio and speech synthesis. Intro to statistical and neural vocoders.
  3. Is neural powerful or statistical?? comparison between two (will be using wavenet and griffin-lim)
  4. Ongoing work in the field of Speech processing (will include tacotron-2 demo and quickly go through the tacotron-2 model)
  5. What I am currently doing(making tacotron-2 faster and near real-time)

your suggestions are most welcome

Ridhwanluthra commented 6 years ago

@kdhingra307 any update on the slides and jupyter notebook? We are planning a meetup on 16th and would love you have your talk then.

kdhingra307 commented 6 years ago

@Ridhwanluthra I will add those slides in 1 or 2 days

I would love to give the talk at the next meetup

kdhingra307 commented 6 years ago

Hey guys

I have attached a link to the GitHub repository which contains summary.pdf, this mainly includes the subjects I will be specifically sharing tomorrow

But in general, I will use jupyter notebook, as these will allow an interactive communication. I have yet to add the notebook in the repository, will add them in a couple of hours and update you guys too.

https://github.com/evolution_of_audio_and_speech_synthesis

manojpandey commented 6 years ago

@MSanKeys963 can we slate this for next meetup?

MSanKeys963 commented 6 years ago

This was delivered in the last meetup. I've added the appropriate labels.