abhisheks008 / DL-Simplified

Deep Learning Simplified is an Open-source repository, containing beginner to advance level deep learning projects for the contributors, who are willing to start their journey in Deep Learning. Devfolio URL, https://devfolio.co/projects/deep-learning-simplified-f013
https://quine.sh/repo/abhisheks008-DL-Simplified-499023976
MIT License
342 stars 294 forks source link

Hindi Voice Assistant #888

Open Amarta113 opened 1 month ago

Amarta113 commented 1 month ago

Deep Learning Simplified Repository (Proposing new issue)

:red_circle: Project Title : Hindi Voice Assistant
:red_circle: Aim : To develop a chatbot that takes input in hindi audio and give output in hindi text and audio
:red_circle: Dataset :
:red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


πŸ“ Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing πŸš€

All the best. Enjoy your open source journey ahead. 😎

github-actions[bot] commented 1 month ago

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

abhisheks008 commented 1 month ago

Are you using any dataset for this project? If yes, please share it. Also what are the deep learning models you are planning to implement here for this problem statement?

Amarta113 commented 1 month ago

@abhisheks008 I had not used any dataset. I used "whisper-large-v3" model for speech to text and "llama3-8b-8192" model to answer queries with groq API and gtts library for text to speech.

abhisheks008 commented 1 month ago

Any other models which will compete the whisper architecture?

Amarta113 commented 1 month ago

I used API therefore only whisper was available. Here is the documentation link

Amarta113 commented 1 month ago

It is streamlit application

abhisheks008 commented 1 month ago

Only implementing whisper will not qualify for this repository. You need to come up with at least 3 models for any problem statement. @Amarta113

Amarta113 commented 1 month ago

@abhisheks008 I also used llama3 for q&a and for text to speech I used google tts library which contains two algorithms Tacotron and WaveNet .

abhisheks008 commented 1 month ago

@abhisheks008 I also used llama3 for q&a and for text to speech I used google tts library which contains two algorithms Tacotron and WaveNet .

Cool I am assigning this issue to you. Make sure you complete it by today 6 PM IST.