Mozilla-Ocho / llamafile

Distribute and run LLMs with a single file.
https://llamafile.ai

Bug: Failed to read audio file #556

Open misters2008 opened 2 months ago

misters2008 commented 2 months ago

Contact Details

makintosh_2008@ukr.net

What happened?

Hi, I've downloaded whisper-large-v3.llamafile and added ".exe" to it. It opened a command window and showed me "whisper server listening at http://127.0.0.1:8080". I opened this link in a browser and saw an interface to upload an audio file and set a temperature.

I uploaded a 10 MB .mp3 audio file, and when I submitted it, the page showed only a single phrase: "Failed to read .Wav file"

Meanwhile, the command window shows this: "Received request: Howcut.mp3 ID3: converting to wav... ID3: failed to open audio file: Invalid argument (we support .wav, .mp3, .flac, and .ogg) error: failed to read WAV file"

How do I fix this?
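
Before digging into the server, it may help to rule out the file itself. A minimal sketch, assuming the usual leading bytes for the four formats named in the error message (the "ID3" tag or an MPEG frame sync for MP3, "RIFF"/"WAVE" for WAV, "fLaC" for FLAC, "OggS" for Ogg). The helper names `sniff_audio_format` and `sniff_file` are hypothetical and are not part of llamafile or whisperfile:

```python
from typing import Optional


def sniff_audio_format(header: bytes) -> Optional[str]:
    """Guess the audio container from the first bytes of a file.

    Hypothetical helper for illustration; it only looks at magic bytes
    and does not validate the rest of the file.
    """
    if header[:4] == b"RIFF" and header[8:12] == b"WAVE":
        return "wav"
    if header[:4] == b"fLaC":
        return "flac"
    if header[:4] == b"OggS":
        return "ogg"
    # MP3 files commonly start with an ID3v2 tag ...
    if header[:3] == b"ID3":
        return "mp3"
    # ... or directly with an MPEG audio frame sync (11 set bits).
    if len(header) >= 2 and header[0] == 0xFF and (header[1] & 0xE0) == 0xE0:
        return "mp3"
    return None


def sniff_file(path: str) -> Optional[str]:
    """Read the first 12 bytes of a file and guess its format."""
    with open(path, "rb") as f:
        return sniff_audio_format(f.read(12))
```

If `sniff_file("Howcut.mp3")` returns "mp3", the upload is likely a normal MP3 and the failure is more plausibly on the conversion side than in the file.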

Version

I don't know how to check, since I downloaded a single .llamafile and didn't download a separate app. The opened llamafile.exe doesn't let me enter anything into the command line.

What operating system are you seeing the problem on?

Windows

Relevant log output

No response

WilliamTambellini commented 2 months ago

Same issue here on Linux.

WilliamTambellini commented 2 months ago

whisperfile-0.8.13 seems to work though:

$ ./whisperfile-0.8.13 -m ggml-large-v3.bin -f jfk.wav
whisper_init_from_file_with_params_no_state: loading model from 'ggml-large-v3.bin'
...
[00:00:00.300 --> 00:00:09.000]   And so, my fellow Americans, ask not what your country can do for you, ask what you
[00:00:09.000 --> 00:00:11.000]   can do for your country.

gnmarten commented 2 months ago

AFAIK, the WAV file needs to be in a specific format. Convert it via ffmpeg -i "your mp3 or whatever file" -ar 16000 -ac 2 -c:a pcm_s16le -y output.wav (I did not have success with the --convert flag within whisperfile itself).
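
For anyone scripting this, here is a rough sketch of the same conversion driven from Python, plus a quick check that the result is 16 kHz, 16-bit PCM. The argument list simply mirrors the ffmpeg command above. The function names `ffmpeg_convert_args` and `is_16khz_pcm16` are hypothetical, and treating 16 kHz / 16-bit as the required format is an assumption taken from that command, not from the whisperfile documentation:

```python
import wave
from typing import List


def ffmpeg_convert_args(src: str, dst: str = "output.wav") -> List[str]:
    """Build the argument list for the ffmpeg command quoted above.

    Hypothetical helper: 16 kHz sample rate, 2 channels, signed 16-bit
    little-endian PCM, overwriting dst if it already exists.
    """
    return ["ffmpeg", "-i", src, "-ar", "16000", "-ac", "2",
            "-c:a", "pcm_s16le", "-y", dst]


def is_16khz_pcm16(path: str) -> bool:
    """Return True if the WAV file has a 16 kHz rate and 16-bit samples."""
    with wave.open(path, "rb") as w:
        return w.getframerate() == 16000 and w.getsampwidth() == 2


# Usage (requires ffmpeg on PATH):
#   import subprocess
#   subprocess.run(ffmpeg_convert_args("input.mp3"), check=True)
#   print(is_16khz_pcm16("output.wav"))
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.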

cjpais commented 1 month ago

This should be resolved by the latest whisperfile code in #568 and the following commit: 74dfd2