NavodPeiris / speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names
MIT License

On windows? + questions #17

Closed APISeeker closed 1 month ago

APISeeker commented 8 months ago

Hello, while installing I saw this:

GPU execution requires the following NVIDIA libraries to be installed:

cuBLAS for CUDA 11
cuDNN 8 for CUDA 11

But when I click the link for cuBLAS for CUDA 11 and go to the Windows links, it shows this:

The NVIDIA HPC SDK for Windows will be available at a later date.

1) Can I install this anyway? I am really interested.
2) Why do we need to give admin rights to make it work?
3) Have you compared it to pyannote? Is it different, or better?

Thanks

NavodPeiris commented 8 months ago
  1. I haven't tried installing the CUDA toolkit, as I don't have a GPU. However, I have provided a Colab notebook that installs cuBLAS, so you can run it on a Colab GPU. If you don't need GPU execution, you can just pip install and run on CPU; in that case you don't need to install cuBLAS.
  2. The speaker recognition function requires admin rights to access files.
  3. It uses pyannote for diarization, so that part is the same. The only difference is that it also does speaker recognition.
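
Whether a given machine can use the GPU path or has to fall back to CPU can be checked before running anything heavy. The following is a minimal sketch, assuming PyTorch (a dependency of pyannote) is the backend in use; `pick_device` is a hypothetical helper name and not part of speechlib.

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda" if PyTorch is installed and sees a CUDA GPU, else "cpu".

    Hypothetical helper for illustration; speechlib does not provide it.
    """
    # If PyTorch is not installed at all, only CPU execution is possible.
    if importlib.util.find_spec("torch") is None:
        return "cpu"

    import torch

    # torch.cuda.is_available() is False when no usable CUDA device or
    # driver is found, in which case we fall back to CPU.
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print(f"Selected device: {pick_device()}")
```

If this reports `cpu` on a machine that does have an NVIDIA GPU, the CUDA libraries or driver are probably not visible to PyTorch, and the Colab notebook is likely the quicker route.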
APISeeker commented 8 months ago

Hello, I absolutely, 100% (no, 100000%) need a GPU. I can't wait long for results, unless you have a solution that uses multiple cores of a single processor to make it faster (a suggestion).

OK, thank you. I will try the Colab notebook to see how it works.

I don't know how pyannote works yet, so I don't really know the difference. I guess your solution is better because you can use "audio samples" to specify precisely who is who and avoid mistakes.