V-Sekai / godot-whisper

An GDExtension addon for the Godot Engine that enables realtime audio transcription, supports OpenCL for most platforms, Metal for Apple devices, and runs on a separate thread.
MIT License
47 stars 5 forks source link

.wav file issues #74

Open JBlank19 opened 1 month ago

JBlank19 commented 1 month ago

Good day!

The audio transcription node does not work with most of the .wav files. For example the .wav files produced from godot itself when recording the mic. However, it does work with the capture node. It seems some issue with the formatting of the input data.

fire commented 1 month ago

As far as I know we wrote the capture node because we weren't able to get the record node to work 3-4 years ago.

AllenDang commented 1 week ago

@fire My god, I just wasted like half a day trying to figure out why the recorded wav doesn't work...

fire commented 1 week ago

@AllenDang Here was the original design documentation. https://github.com/godotengine/godot-proposals/issues/2013