alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0
7.37k stars 1.04k forks source link

vosk transcription not loading in jupyter notebook, excessively long loading time that does not conclude #1492

Open Rahil8533 opened 6 months ago

Rahil8533 commented 6 months ago

I'm trying to do ASR speech to text transcription using the VOSK API, I have downloaded all the required models and and imported the required modules however my transcription simply does not load here is my code below

here is my code ` from vosk import Model, KaldiRecognizer import os import queue import vosk import sys import json import wave model_dir="data/raw/vosk-model-small-en-us-0.15" model = vosk.Model(model_dir) with wave.open("Audio_Files/EN/checkin.wav") as wf: assert wf.getnchannels() == 1, "must be a mono wav" assert wf.getsampwidth() == 2, "must be a 16bit wav" assert wf.getcomptype() == "NONE", "must be PCM data"

rec = vosk.KaldiRecognizer(model, wf.getframerate())
while True:
    data = wf.readframes(4000)
    if rec.AcceptWaveform(data):
        res = json.loads(rec.Result())
        print(res["text"])

` Ive been seeing his result for 40 minutes image