gpt-omni / mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
https://arxiv.org/abs/2410.11190
MIT License
1.31k stars 151 forks source link

Does it support passing in multiple audio files and text prompts? #19

Open binzhouu opened 3 days ago

mini-omni commented 3 days ago

hi, for now, the model is only trained on single turn dialogue data, so it does not support multiple files.