Closed emcf closed 2 months ago
Looking to support mp3, wav
mp3
wav
Audio is not standard in commercial multimodal models today in 2024. Because of this, I am also looking to transcribe audio to text, probably via Whisper.
FIxed by #12
Looking to support
mp3
,wav
Audio is not standard in commercial multimodal models today in 2024. Because of this, I am also looking to transcribe audio to text, probably via Whisper.