abhisheks008 / DL-Simplified

Deep Learning Simplified is an Open-source repository, containing beginner to advance level deep learning projects for the contributors, who are willing to start their journey in Deep Learning. Devfolio URL, https://devfolio.co/projects/deep-learning-simplified-f013
https://quine.sh/repo/abhisheks008-DL-Simplified-499023976
MIT License
389 stars 338 forks source link

Hindi Voice Assistant #888

Open Amarta113 opened 3 months ago

Amarta113 commented 3 months ago

Deep Learning Simplified Repository (Proposing new issue)

:red_circle: Project Title : Hindi Voice Assistant
:red_circle: Aim : To develop a chatbot that takes input in hindi audio and give output in hindi text and audio
:red_circle: Dataset :
:red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


📍 Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing 🚀

All the best. Enjoy your open source journey ahead. 😎

github-actions[bot] commented 3 months ago

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

abhisheks008 commented 3 months ago

Are you using any dataset for this project? If yes, please share it. Also what are the deep learning models you are planning to implement here for this problem statement?

Amarta113 commented 3 months ago

@abhisheks008 I had not used any dataset. I used "whisper-large-v3" model for speech to text and "llama3-8b-8192" model to answer queries with groq API and gtts library for text to speech.

abhisheks008 commented 3 months ago

Any other models which will compete the whisper architecture?

Amarta113 commented 3 months ago

I used API therefore only whisper was available. Here is the documentation link

Amarta113 commented 3 months ago

It is streamlit application

abhisheks008 commented 3 months ago

Only implementing whisper will not qualify for this repository. You need to come up with at least 3 models for any problem statement. @Amarta113

Amarta113 commented 3 months ago

@abhisheks008 I also used llama3 for q&a and for text to speech I used google tts library which contains two algorithms Tacotron and WaveNet .

abhisheks008 commented 3 months ago

@abhisheks008 I also used llama3 for q&a and for text to speech I used google tts library which contains two algorithms Tacotron and WaveNet .

Cool I am assigning this issue to you. Make sure you complete it by today 6 PM IST.

Abhiiesante commented 1 month ago

Can you please assign this issue to me under 𝗚𝗦𝗦𝗼𝗖 '𝟮𝟰 𝗘𝘅𝘁𝗲𝗻𝗱𝗲𝗱, Hacktoberfest-accepted

abhisheks008 commented 4 weeks ago

Can you please assign this issue to me under 𝗚𝗦𝗦𝗼𝗖 '𝟮𝟰 𝗘𝘅𝘁𝗲𝗻𝗱𝗲𝗱, Hacktoberfest-accepted

Can you please share your approach and thoughts for solving this issue?