Closed SarveshAtawane closed 5 months ago
The updates introduce several new features and components, including Docker environments, real-time audio transcription interfaces, issue reporting integrations with Discord and FastAPI, and enhanced testing suites. Key additions encompass client-side utilities, server-side functionalities for handling audio streams, and a buffering strategy factory. Additionally, several files are created to support testing, transcription management, and HTTP request handling.
| Files/Groups | Change Summary |
|---|---|
| `Transcription/Dockerfile`, `Report_issues/Dockerfile` | New Dockerfiles for the transcription server and issue reporting setup. |
| `Transcription/client/VoiceStreamAI_Client.html` | New HTML client interface for real-time audio transcription. |
| `Transcription/client/utils.js` | Client-side utilities for WebSocket communication and audio processing. |
| `Transcription/src/main.py`, `server.py`, `buffering_strategy_factory.py`, `post_serve.py`, `audio_utils.py`, `asr/faster_whisper_asr.py`, `vad/pyannote_vad.py` | New and updated server-side scripts for handling real-time transcription, buffering strategies, audio file management, VAD, and ASR functionalities. |
| `Transcription/modified_text_files/...` | Introduced changes related to a legal case section. |
| `Transcription/test/server/test_server.py` | New test suite for the Server class and its functionalities. |
| `Report_issues/.env`, `Report_issues/.gitignore` | Environment and gitignore configurations for issue reporting. |
| `Report_issues/main.py` | New file for integrating a Discord bot with FastAPI for issue reporting. |
| `Report_issues/requirements.txt` | Lists dependencies for the issue reporting setup. |
| `Report_issues/Readme.md` | Sample CURL requests for submitting issues to a local server. |
sequenceDiagram
participant Client as HTML Client
participant WebSocket as WebSocket Server
participant VAD as VAD Pipeline
participant ASR as ASR Pipeline
Client->>+WebSocket: Initiate Connection
WebSocket-->>Client: Connection Established
Client->>WebSocket: Start Streaming Audio
WebSocket->>VAD: Process Audio Stream
VAD-->>WebSocket: VAD Results
WebSocket->>ASR: Transcribe Audio
ASR-->>WebSocket: ASR Results
WebSocket-->>Client: Transcription Results
Client->>Client: Display Transcription
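
The flow in the diagram above can be sketched in plain Python. This is a minimal illustration, not the code from this PR: `fake_vad`, `fake_asr`, `handle_stream`, and `demo` are hypothetical names, the VAD and ASR steps are trivial stand-ins for pyannote and faster-whisper, and an async generator takes the place of the WebSocket connection.

```python
import asyncio
from typing import AsyncIterator, Awaitable, Callable, List


def fake_vad(chunk: bytes) -> bool:
    """Hypothetical VAD stand-in: any non-zero byte counts as speech."""
    return any(chunk)


def fake_asr(audio: bytes) -> str:
    """Hypothetical ASR stand-in: report how much speech was buffered."""
    return f"<{len(audio)} bytes of speech>"


async def handle_stream(
    chunks: AsyncIterator[bytes],
    send: Callable[[str], Awaitable[None]],
) -> None:
    """Buffer speech chunks; when silence arrives, transcribe and send."""
    buffer = bytearray()
    async for chunk in chunks:
        if fake_vad(chunk):
            buffer.extend(chunk)
        elif buffer:
            await send(fake_asr(bytes(buffer)))
            buffer.clear()
    if buffer:
        # Flush any speech still buffered when the stream ends.
        await send(fake_asr(bytes(buffer)))


async def demo() -> List[str]:
    """Feed a short fake stream through the pipeline and collect results."""
    results: List[str] = []

    async def chunks() -> AsyncIterator[bytes]:
        for chunk in (b"\x01\x02", b"\x03", b"\x00\x00", b"\x04"):
            yield chunk

    async def send(text: str) -> None:
        results.append(text)

    await handle_stream(chunks(), send)
    return results
```

Calling `asyncio.run(demo())` sends one transcription for the first two speech chunks and a second one for the chunk after the silence.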
sequenceDiagram
participant User as User
participant FastAPI as FastAPI Server
participant DiscordBot as Discord Bot
User->>+FastAPI: Submit Issue
FastAPI-->>-User: Issue Acknowledged
FastAPI->>DiscordBot: Forward Issue
DiscordBot-->>DiscordBot: Process Message
DiscordBot->>User: Notify Issue Logged (via Discord)
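
One way to read the diagram above is that the HTTP handler acknowledges the user immediately and hands the issue to the bot through a queue. The sketch below uses only the standard library and assumes that design; `Issue`, `submit_issue`, `bot_worker`, and `post_message` are hypothetical names, and neither FastAPI nor a Discord client is actually involved.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable, List, Tuple


@dataclass
class Issue:
    category: str      # e.g. "transcription", "OCR", "validation"
    description: str


async def submit_issue(queue: asyncio.Queue, issue: Issue) -> dict:
    """Stand-in for the HTTP endpoint: enqueue the issue, then acknowledge."""
    await queue.put(issue)
    return {"status": "received"}


async def bot_worker(
    queue: asyncio.Queue,
    post_message: Callable[[str], Awaitable[None]],
) -> None:
    """Stand-in for the bot: forward queued issues until a None sentinel."""
    while True:
        issue = await queue.get()
        if issue is None:
            break
        await post_message(f"[{issue.category}] {issue.description}")


async def demo() -> Tuple[dict, List[str]]:
    """Submit one issue and return the acknowledgement and posted messages."""
    queue: asyncio.Queue = asyncio.Queue()
    posted: List[str] = []

    async def post_message(text: str) -> None:
        posted.append(text)

    worker = asyncio.create_task(bot_worker(queue, post_message))
    ack = await submit_issue(queue, Issue("transcription", "Speaker labels are missing"))
    await queue.put(None)  # tell the worker to stop
    await worker
    return ack, posted
```

Decoupling the two sides with a queue keeps the acknowledgement fast even if posting to the chat channel is slow or temporarily failing.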
In a Docker world, we spin our dreams,
With WebSockets and Python, linked at the seams.
Real-time whispers, transcriptions so keen,
VAD and ASR in a codebase so clean.
Discord hears your plight, through FastAPI's light,
Bugs take flight, in a system so bright.
🎶🐰🐾
7 Implemented transcription view for live and recorded files, ensuring proper file format saving as specified in the ticket.
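Summary by CodeRabbit

New Features
- `VoiceStreamAI_Client.html`: HTML client interface for real-time audio transcription.
- `utils.js`: client-side utilities for WebSocket communication and audio processing.
- `BufferingStrategyFactory` for creating instances of different buffering strategies.
- `Server` class for managing WebSocket connections for real-time audio transcription.
- `pyannote_vad.py`: voice activity detection (VAD).
- `Report_issues/main.py`: Discord bot integrated with FastAPI for issue reporting.
- `Report_issues/Readme.md`: sample CURL requests for submitting issues to a local server.

Bug Fixes

Documentation
- `Readme.md` with instructions for reporting transcription, OCR, and validation issues.

Tests
- Test suite for the `Server` class to ensure proper transcription processing.

Chores
- `.gitignore` to exclude the `.env` file in `Report_issues`.
- `requirements.txt` with necessary dependencies for the Discord bot and FastAPI server.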
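
As an illustration of what a factory for buffering strategies can look like, here is a minimal registry-based sketch. It is not the implementation in `buffering_strategy_factory.py`: the strategy classes, their parameters, the registry keys, and the `create` method are all hypothetical, and only the class name `BufferingStrategyFactory` comes from the summary above.

```python
from typing import Callable, Dict


class FixedLengthStrategy:
    """Hypothetical strategy: process audio every N seconds."""

    def __init__(self, chunk_length_seconds: float = 3.0) -> None:
        self.chunk_length_seconds = chunk_length_seconds


class SilenceAtEndStrategy:
    """Hypothetical strategy: process a chunk once it ends in silence."""

    def __init__(self, silence_seconds: float = 0.5) -> None:
        self.silence_seconds = silence_seconds


class BufferingStrategyFactory:
    """Map a strategy name to a constructor (illustrative only)."""

    _registry: Dict[str, Callable[..., object]] = {
        "fixed_length": FixedLengthStrategy,
        "silence_at_end": SilenceAtEndStrategy,
    }

    @classmethod
    def create(cls, name: str, **kwargs: float) -> object:
        """Build the named strategy, passing keyword options through."""
        try:
            strategy_cls = cls._registry[name]
        except KeyError:
            raise ValueError(f"Unknown buffering strategy: {name!r}") from None
        return strategy_cls(**kwargs)
```

With this shape, `BufferingStrategyFactory.create("fixed_length", chunk_length_seconds=2.0)` returns a configured strategy, and an unknown name raises `ValueError` instead of failing later during streaming.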