saharmor / whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
MIT License
777 stars 140 forks source link

Integrate MonsterAPI for Advanced Audio Transcription in Whisper Playground #49

Open DheerajQblocks opened 5 months ago

DheerajQblocks commented 5 months ago

Title:

Description:

This PR introduces the MonsterAPI integration into the Whisper Playground, aiming to enhance the audio transcription capabilities of our application. By leveraging MonsterAPI's robust transcription features, we provide users with more accurate and flexible transcription options. This integration includes the ability to specify various transcription parameters such as language, transcription format, and the option to diarize or remove silence from the audio.

Key Features Added:

Technical Changes:

  1. Added MonsterApiClient.js to handle API requests to MonsterAPI.
  2. Updated App.js to include MonsterAPI transcription logic in the audio processing flow.
  3. Introduced new environment variable REACT_APP_MONSTERAPITOKEN for API authentication.
  4. Enhanced the settings section in the UI to include additional transcription options provided by MonsterAPI.

Dependencies Added:

Setup Instructions:

To utilize MonsterAPI's transcription service, an API token is required. Follow these steps to configure your environment:

  1. Obtain an API token from MonsterAPI.
  2. Create a .env file in the root directory of the project (if not already present).
  3. Add the following line to the .env file:
    REACT_APP_MONSTERAPITOKEN=your_monsterapi_token_here
  4. Ensure the .env file is included in your .gitignore to prevent exposing your API token.

Testing:

Screenshots:

image image

Notes for Reviewers:

Please review the changes focusing on the integration of MonsterAPI and the added environment variable management. Feedback on the UI enhancements for setting transcription options would also be appreciated.