mediar-ai / screenpipe

24/7 local AI screen & mic recording. Works with Ollama. Llama3.2 control your computer. Alternative to Rewind.ai & Zapier. Open. Secure. You own your data. Rust.
https://screenpi.pe
MIT License

[bug] Repetitive Transcription During Silence or White Noise Periods #525

Open cyberpapiii opened 4 hours ago

cyberpapiii commented 4 hours ago


Description

Running Version 0.5.6 (20241017.030846)

Apple M3 Max MacBook Pro (14-inch, Nov 2023), Memory: 128 GB

macOS Sequoia Version 15.1 Beta (24B5077a)

Screenpipe's transcription feature generates repetitive text during periods of silence or white noise. This reduces the accuracy of meeting summaries and may be related to how the app processes audio input when no distinct speech is detected.

The repetitive text appears as:

"The world is a great place to be able to get the world to be able to get the world..." or other disjointed phrases.

This redundancy prevents the app from providing meaningful summaries and indicates a potential bug in the transcription process or AI settings.
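
One way to flag this kind of looping output after the fact is to measure how many word n-grams in a transcript repeat an earlier one. The sketch below is only an illustration, not Screenpipe code: the function names, the trigram size, and the 0.5 threshold are all assumptions chosen for the example.

```rust
use std::collections::HashMap;

/// Fraction of word n-grams in `text` that duplicate an earlier n-gram.
/// Returns 0.0 when the text has fewer than `n` words.
/// (Hypothetical helper, not part of Screenpipe.)
fn repeated_ngram_ratio(text: &str, n: usize) -> f64 {
    // Normalize case so "The world" and "the world" count as the same words.
    let words: Vec<String> = text
        .split_whitespace()
        .map(|w| w.to_lowercase())
        .collect();
    if n == 0 || words.len() < n {
        return 0.0;
    }
    // Count each distinct n-gram using a sliding window over the words.
    let mut counts: HashMap<&[String], usize> = HashMap::new();
    for gram in words.windows(n) {
        *counts.entry(gram).or_insert(0) += 1;
    }
    let total = words.len() - n + 1;
    let distinct = counts.len();
    (total - distinct) as f64 / total as f64
}

/// Flags a transcript as a probable loop when more than half of its
/// trigrams are repeats. The 0.5 threshold is an assumed value.
fn looks_like_loop(text: &str) -> bool {
    repeated_ngram_ratio(text, 3) > 0.5
}
```

A transcript chunk flagged by `looks_like_loop` could be dropped or marked before it reaches the summary step. The threshold would need tuning against real recordings, since short loops embedded in longer normal speech score lower.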

[Screenshot: CX 10-17-2024 @ 05 15 26PM]

Current Settings

[Screenshot: CX 10-17-2024 @ 05 14 38PM]

Steps to Reproduce

  1. Open the Screenpipe app with the current settings.
  2. Start a recording where there are expected periods of silence or white noise.
  3. Check the transcription output in the "meeting and conversation history" section.

Expected Behavior

Actual Behavior

Suggested Fixes

  1. Investigate and adjust the audio processing algorithm, particularly for the whisper-large-turbo model, to handle silence and white noise more effectively.
  2. Implement filters to prevent repetitive text from affecting transcription outputs.
  3. Consider adding a silence detection feature that skips transcription during prolonged quiet periods.
  4. Explore options to fine-tune the AI model to better distinguish between speech and non-speech audio.
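
As a rough illustration of suggestion 3, a simple energy gate can skip audio chunks whose level falls below a threshold before they are sent to the transcription model. This is a minimal sketch under stated assumptions: mono `f32` samples in the range -1.0 to 1.0, hypothetical function names, and a caller-supplied threshold. It is not how Screenpipe currently processes audio.

```rust
/// Root-mean-square level of a mono chunk of samples in [-1.0, 1.0].
/// Returns 0.0 for an empty chunk.
fn rms(samples: &[f32]) -> f32 {
    if samples.is_empty() {
        return 0.0;
    }
    let sum_sq: f32 = samples.iter().map(|s| s * s).sum();
    (sum_sq / samples.len() as f32).sqrt()
}

/// Returns true when the chunk is loud enough to be worth transcribing.
/// `threshold` is an assumed tuning parameter (for example 0.01).
fn should_transcribe(samples: &[f32], threshold: f32) -> bool {
    rms(samples) >= threshold
}
```

An energy gate like this only addresses quiet periods. White noise at a meaningful volume would still pass it, so rejecting that case would need a voice activity detector or a similar speech/non-speech classifier, in line with suggestion 4.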

Additional Notes

Attachments

Please address this issue to improve transcription accuracy and summary reliability in meetings with varying audio conditions.

linear[bot] commented 4 hours ago

MED-211 [bug] Repetitive Transcription During Silence or White Noise Periods

louis030195 commented 3 hours ago

#521 related