Does it support passing in multiple audio files and text prompts?

gpt-omni / mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

https://arxiv.org/abs/2410.11190

MIT License

1.31k stars 151 forks source link

Open binzhouu opened 3 days ago

mini-omni commented 3 days ago

hi, for now, the model is only trained on single turn dialogue data, so it does not support multiple files.