linuxchixin / talks

Propose your talks/workshops here
14 stars 9 forks source link

Speech Recognition using Python #44

Open anjalibhavan opened 5 years ago

anjalibhavan commented 5 years ago

Abstract Computers can tell us whether we’re happy, sad, angry or any of the several emotions we feel. Computers can understand what we’re saying and answer back. How does all this magic happen? This concept of teaching a program to analyze speech and understand it is called speech recognition. I’ll talk about speech recognition and its various nuances, and how it is handled using Python.

About

I’ll talk about various branches of speech recognition such as speech emotion recognition and text generation based on speech data, and speech recognition implementations on hardware as well. Here is a basic summary of what all I will cover: 1.Speech recognition: what is it, why is it required - concepts like spectral analysis, MFCCs (Mel Frequency Cepstral Coefficients), Fourier transforms, signal processing etc. 2.How Python can make speech recognition easier 3.Branches and new areas of speech recognition: speech emotion recognition, sentiment analysis etc., work done in these fields in the past few decades 4.How speech recognition models are built: acoustic and language models etc. 5.Resources like blogs, libraries, toolkits etc. for studying and getting started with speech recognition models in Python 6.Basic workflow and tips on how to create your first speech recognition model using Python 7.A brief on various repositories of speech databases, how they can be accessed and prepared for input to speech models 8.Other media such as video data and face emotion recognition, resources for studying them up further 9.Applications and future scope, closing remarks I will cover the basics of how speech is read, processed and quantified, concepts like the Fourier Transform and spectral analysis, the various Python libraries and resources that exist for the same, and how one can build their own speech recognition system easily. Perhaps an Alexa 2.0?

Pre-requisites

Basic knowledge of Python and machine learning. Expected duration

15-20 minutes

Level

Intermediate

Speaker Bio

I am a third year undergrad at Delhi Technological University. I am passionately fond of data science and machine learning, and have worked on several projects and authored research papers on the same. My research area particularly centers around ensemble learning and methods, and I've started taking an interest in speech recognition systems in recent months. I have worked with professors across several universities, and am always up for discussing Python, machine learning and data science with anyone.

- Can be done after the talk/workshop -

Include link to slides here

Include link to video here

cocoa1231 commented 5 years ago

Hi! Would you be available to give this talk on the fifth of August?

nihilistdbanana commented 5 years ago

Hi, Are you available to give this talk on the 12th of may?

anjalibhavan commented 5 years ago

Hi! Erm, I don't want to deliver this talk here, it's nowhere related to Linux or anything interesting (:P). If, however, I'm able to prepare a new talk by tonight, I'll add a new issue request for that instead.

nihilistdbanana commented 5 years ago

Thanks a ton