Closed Matthias84 closed 2 years ago
Have you tried clearing the contents of your cache directory ($MP4GREP_CACHE) and then transcribing again? Does this issue only occur when you use the --transcribe option?
Hi, i cleaned the cache, but if I fire up the line without explicitly adding model param, it seems to be confused, even if the var is set properly?
mp4grep --transcribe ~/Musik/Marc-Uwe\ Kling/Qualityland\ 2/1-02\ Qualityland\ 2.0.mp3 Model directory "model" not found. Exiting.
echo $MP4GREP_MODEL /home/matthias/Downloads/mp4grep-0.1.1/model
Have you exported $MP4GREP_MODEL so that the variable also exists in the shell through which mp4grep is run? Is the variable set in your .bashrc? You may need to run export MP4GREP_MODEL=/home/..../model
before running mp4grep --transcribe
.
The VAR is set and available. Also if I reset the Model VAR in my current Terminal session, the same error persist. This behaviour is on my Ubuntu 21.10 Laptop and on my Linux Mint Uma Desktop as well :disappointed: Any idea for further debugging?
What kind of audio format are you using? The reason for the parse error is likely that Vosk is not transcribing your audio input correctly when your audio/video is converted to a 16000 MHz, single-channel audio file. Also, what version of the JDK do you have installed? Try running java --version
.
My JRE is
openjdk 11.0.13 2021-10-19
OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.20.04)
OpenJDK 64-Bit Server VM (build 11.0.13+8-Ubuntu-0ubuntu1.20.04, mixed mode, sharing)
This test sample of a audiobook is
Stream 0
Codec: MPEG Audio layer 1/2 (mpga)
Kanäle: Stereo
Abtastrate: 44100 Hz
Bits pro Sample: 32
Bitrate: 192 kB/s
I tried different MP3s and a different Vosk model, but the JSonSyntaxException persists :disappointed:
Vosk takes as input single-channel, 16000hz wav files. It's possible that the conversion process is not working properly. You could try converting to the correct format via ffmpeg first, and then running mp4grep on them. I'd also recommend seeing if your input works using some of the examples in the vosk-api repository. Here's the command to convert to the correct wav format: ffmpeg -i [input.mp4/mp3/etc] -acodec pcm_s16le -ac 1 -ar 16000 [output.wav]
.
@Matthias84 has this been fixed?
Sorry was busy in RL, need some time for experiments though ...
Should be fixed in the most recent release.
Hi there, I struggle on the usage of your tool. After the installation of the last release, I encounter the following error
mp4grep --model $MP4GREP_MODEL --transcribe mytestfile.mp3
I tried different audio files and different models, always with this error. If I call the tool again, it just raises 100% transcription, but no text is plottet.