Avdhesh-Varshney / Jarvis

Jarvis is an AI-powered application that allows users to control various devices and applications through voice commands, offering tailored services based on user roles.
https://jarvis-ai-assistant.streamlit.app/
MIT License
44 stars 67 forks source link

πŸ“ƒ: Real time speech-to-text generation. #191

Closed jaidh01 closed 1 day ago

jaidh01 commented 5 days ago

:red_circle: Title: Real time speech-to-text generation. :red_circle: Aim: To build a system which eventually help us to convert our speech into text in real time. :red_circle: Brief Explanation:

The goal of this project is to develop a web-based application that leverages modern speech recognition technologies to transcribe spoken words into written text instantly. By using the browser's built-in speech recognition API, this project will allow users to interact with the system via voice commands, capturing both interim and final transcripts in real-time. This system can be used for various purposes such as dictation, real-time captions for meetings, and assisting individuals with speech disabilities. The project demonstrates seamless integration of front-end technologies (HTML, JavaScript) with speech-to-text capabilities, focusing on user-friendly interaction and continuous recognition for uninterrupted transcriptions.

I have also attached a video of it.

Screenshots πŸ“·

https://github.com/user-attachments/assets/b3f560a6-14b0-4e92-b72a-2a9773ce349f


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing πŸš€

All the best. Enjoy your open source journey ahead. 😎

Avdhesh-Varshney commented 4 days ago

Create a new category Voice Recognition Models, if it's a model.

jaidh01 commented 4 days ago

@Avdhesh-Varshney , it's not a model, it uses browser built in speech recognition api, hence I was thinking to add this in Programs section.

jaidh01 commented 4 days ago

@Avdhesh-Varshney @Zedoman , after reviewing the difficulty level, I request you to change the label of level from level 2 to level 3.