amosproj / amos2024ss06-health-ai-framework

Ailixir is an application that utilises LLMs and custom user input to generate AI agent prototypes specialised in fields such as health, economics, physics etc. The prototypes enable the user, which is an entrepreneur-developer, to compare the results produced by different LLMs.
MIT License
7 stars 1 forks source link

Voice Interaction - user voice input #196

Closed Ghemechis closed 1 week ago

Ghemechis commented 2 weeks ago

Item type: App Frontend, App Backend

Description: This feature aims to enhance the chat application by enabling users to interact with the AI agent using voice commands instead of typing. The user’s voice input will be recorded, transcribed to text, and sent to the AI agent. If the transcription is inaccurate, users will have the option to edit the text before it is sent. A voice input button will be added to the chat interface for easy access.

User Profile:

As a developer, I want to provide the app users the option to interact with the AI agent using voice command, so that they can communicate more conveniently without having to type.

Acceptance Criteria:

Definition of Done