Voice Interaction - user voice input

Item type: App Frontend, App Backend

Description: This feature aims to enhance the chat application by enabling users to interact with the AI agent using voice commands instead of typing. The user’s voice input will be recorded, transcribed to text, and sent to the AI agent. If the transcription is inaccurate, users will have the option to edit the text before it is sent. A voice input button will be added to the chat interface for easy access.

User Profile:

As a developer, I want to provide the app users the option to interact with the AI agent using voice command, so that they can communicate more conveniently without having to type.

Acceptance Criteria:

[x] Implement a voice input button to the chat interface, located next to the text input area.
[x] When the user presses the voice input button, start recording their voice, and convert the recorded voice input to text
[x] Display the transcribed text to the user.
[x] Provide an option to re-record the voice input if the transcription is inaccurate.
[x] Allow the user to edit the transcribed text before sending it to the AI agent.

Definition of Done

[x] The feature has been fully implemented and integrated into the app.
[x] The feature has been manually tested and works as expected without critical bugs.
[x] The feature code is documented with clear explanations of its functionality and usage.
[x] The feature code has been reviewed and approved by at least one team member.
[x] The feature branches have been merged into the main branch and closed.
[x] The feature utility, function and usage have been documented in the respective project wiki on github.

amosproj / amos2024ss06-health-ai-framework