-
OpenAI's GPT-4o model supports text, image, and audio as both input and output. Its understanding is finer than the usual STT > model > TTS approach because the model has direct access to user behavior, emotions, etc.…
-
### Confirm this is a feature request for the .NET library and not the underlying OpenAI API
- [X] This is a feature request for the .NET library
### Describe the feature or improvement you are requ…
-
It would be nice if you could capture the participant's screen share and feed the context to the LLM.
-
### Due diligence
- [X] I have done my due diligence in trying to find the answer myself.
### Topic
Other / All
### Question
Hi,
I saw [on hackernews](https://news.ycombinator.com/it…
-
Looking at
https://guide.pycord.dev/voice/receiving
It appears it's also possible for the bot to receive audio. With this, it's possible to create a back-and-forth voice mode for Gemini models.
The out…
-
I want to build something where real-time audio comes in and there is a single face. For example, if I say "Hi" and I have integrated OpenAI with this, OpenAI will generate "Hello", this imag…
-
Hi,
I am getting the following error when I run `npm run generate`:
```
No valid data found at path: cache\index_store.json starting new store.
```
I have tried the following operating systems:
- Windows 10
- M…
-
I am trying to create a Node console application, but
I am getting multiple (sometimes 10+) calls made to my tool functions,
even when using the `get_weather` example from the docs:
…
-
I have a scenario where it may take a bit of time to run a tool (I need to make other model calls), so I'm trying to use manual tool calls, but they don't work for a number of reasons. One issue I can w…
-
Is the g711_ulaw format supported? I followed the Twilio example and am sending this session update:
```ts
const sessionUpdate = {
type: 'session.update',
session: {
turn_de…