end-4 / dots-hyprland

Modern, feature-rich and accessible desktop configuration.
https://end-4.github.io/dots-hyprland-wiki/en/
GNU General Public License v3.0
3.06k stars 196 forks

[Feature] GPT-4o with audio support #500

Open codewithkenzo opened 1 month ago

codewithkenzo commented 1 month ago

Mr. end-4, I know you want it too

H0mire commented 1 month ago

Can't wait for OpenAI to release GPT-4o with STT and TTS support. :o

end-4 commented 1 month ago

Oxygen API provides that for free. idk if it's real, but it does use emojis like the GPT-4o on poe.com. Is it 4 or 4o? No clue.

idk how to include sound yet

H0mire commented 1 month ago

Yeah, currently "audio" is usually generated through a TTS service, which you would have to integrate separately. OpenAI hinted that they will release GPT-4o with audio processing, which is basically native TTS and STT without a separate model or service. This results in low latency, like a normal human conversation, and the ability to process emotional expression.
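
For illustration only, the "separate service" setup described above might be sketched roughly like this. Everything here is a hypothetical stand-in: `speech_to_text`, `text_model`, and `text_to_speech` are placeholder functions, not calls from OpenAI, Oxygen, or this repo, and the "audio" is just UTF-8 bytes so the sketch runs without any real service.

```python
# Rough sketch of a cascaded voice pipeline: STT -> text model -> TTS.
# All three stage functions are hypothetical placeholders, not real APIs.

def speech_to_text(audio: bytes) -> str:
    """Placeholder STT: pretend the audio bytes are UTF-8 text."""
    return audio.decode("utf-8")


def text_model(prompt: str) -> str:
    """Placeholder text-only model: echoes the prompt back."""
    return "You said: " + prompt


def text_to_speech(text: str) -> bytes:
    """Placeholder TTS: pretend UTF-8 bytes are synthesized audio."""
    return text.encode("utf-8")


def cascaded_voice_reply(audio_in: bytes) -> bytes:
    """Run the three stages in sequence.

    Each stage would be its own service call in a real setup, so the
    delays add up, and tone or emotion in the input is dropped once it
    has been reduced to plain text.
    """
    transcript = speech_to_text(audio_in)
    reply_text = text_model(transcript)
    return text_to_speech(reply_text)


reply_audio = cascaded_voice_reply(b"hello")
print(reply_audio)
```

A model with native audio would, as far as the comment above describes it, collapse these three hops into a single call, which is where the lower latency would come from.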