mediar-ai / screenpipe

open source 24/7 screen & voice recording for the age of superintelligence
https://screenpi.pe
MIT License
8.7k stars 492 forks source link

identify persons in audio #306

Open louis030195 opened 1 month ago

linear[bot] commented 1 month ago

MED-92 identify persons in audio

NicodemPL commented 1 month ago

I tried myself with paynnote audio - this works pretty good. fully local. but python based. https://github.com/pyannote/pyannote-audio

highly recommended - tried different systems (also paid) and this one is really efficient and delivers good quality.

On top of this - once you have separate audio - microphone and display this is even more promising for better quality meeting notes. I've developed a Python tool that combines Whisper transcription and Pyannote diarization to create comprehensive meeting transcript. This automated system transcribes audio, identifies speakers, and integrates the results, laying the groundwork for AI-assisted prompt for good notes generations. Still got some issues on my side but its basically working and 100% local. So this is doable for sure. And it beats Rewind.ai / Limitless for sure :) Locally.

louis030195 commented 1 month ago

/bounty 100

definition of done:

rules:

algora-pbc[bot] commented 1 month ago

💎 $100 bounty • Screenpi.pe

Steps to solve:

  1. Start working: Comment /attempt #306 with your implementation plan
  2. Submit work: Create a pull request including /claim #306 in the PR body to claim the bounty
  3. Receive payment: 100% of the bounty is received 2-5 days post-reward. Make sure you are eligible for payouts

Thank you for contributing to mediar-ai/screenpipe!

Add a bounty • Share on socials