Rivier-Computer-Science / Adaptive-Learning

Adaptive mathematics learning using multiple autogen agents
GNU Affero General Public License v3.0
3 stars 12 forks source link

US25: As a user, I want to use speech recognition to input text into the adaptive learning GUI, so I can interact more naturally and conveniently with the application. #272

Open Ajaykumar8179086865 opened 2 months ago

Ajaykumar8179086865 commented 2 months ago

Conditions of Satisfaction (CoS): Voice-to-Text Input: Users should be able to speak into their microphone and have their speech converted to text within any text input field in the GUI. Accuracy: The speech recognition must accurately transcribe user speech into text, recognizing various accents and dialects effectively. Real-time Transcription: Speech should be transcribed in real-time, allowing for a seamless interaction experience as if the user were typing. Feedback Mechanisms: Provide visual and/or auditory feedback when the system is listening, processing, or stops speech recognition. Error Handling and Corrections: Users should have easy options to correct any misrecognitions by the system either by voice command or through a simple editing interface. Security and Privacy: Ensure that speech data is processed securely, with clear disclosures on how user data is handled and who has access to it, complying with relevant privacy laws.

Definition of Done (DoD): Code Complete: All frontend and backend code required for the speech-to-text functionality is fully implemented and integrated into the existing GUI. System Testing: Extensive testing confirms that the speech recognition accurately transcribes spoken language into text in the GUI under various conditions, including noisy environments and different user accents. Performance Metrics Met: The implementation does not degrade the system’s performance or responsiveness. User Documentation Updated: Documentation and user guides are updated to include instructions on how to use the speech recognition feature. Security Review Passed: The feature has undergone a security review to ensure it meets all necessary data protection standards. User Acceptance Testing Passed: Feedback from real users confirms that the feature works well and meets their needs.

Tasks:

glossner commented 2 months ago

@Ajaykumar8179086865 - this is accepted and a good project. I'm modifying a couple of your tasks and I want you to get through the first 3 in Sprint 1. Task 25.2 is 1 line of python code if you pick the right library. This will also need to integrate into the existing Chat panels so that users have the option to use the microphone or type.