IDinsight / ask-a-question

Trustworthy question-answering AI plugin for chatbots in the social sector with advanced content performance analysis.
https://ask-a-question.com
BSD 3-Clause "New" or "Revised" License
28 stars 4 forks source link

[128] - Refactoring of the E2E speech endpoint #395

Closed MustafaAkolawala closed 2 months ago

MustafaAkolawala commented 2 months ago

Reviewer: @amiraliemami Estimate: 20 min


Ticket

Fixes: JIRA_TICKET_LINK

Description

This PR implements a new end-to-end speech workflow design via the voice-search endpoint to make the . This change isolates speech functionality, making it more intuitive and easier to implement

Goal

To create a dedicated, isolated endpoint for the end-to-end speech workflow, featuring an optional generate_tts flag. This flag enables the generation of voice notes from the LLM response, streamlining speech functionality implementation and improving overall intuitiveness.

Changes

How has this been tested?

dev environment docker-compose unit Tests

How to test this?

  1. Configure the Speech_Api environment variables in template.core_backend.env.
  2. Launch the Docker containers using: docker compose -f docker-compose.yml -f docker-compose.dev.yml -f docker-compose.speech.yml -p aaq-stack watch
  3. Send a POST request to the voice-search endpoint.

To-do before merge (optional)

Once VoiceApi/GCP is merged, will merge this to main

Checklist

Fill with x for completed.