openai-php / client

⚡️ OpenAI PHP is a supercharged community-maintained PHP API client that allows you to interact with the OpenAI API.
MIT License

Implement Assistants Streaming #367

Closed EthanBarlo closed 1 month ago

EthanBarlo commented 3 months ago

Description:

This PR implements Full Assistants Streaming API support. https://platform.openai.com/docs/api-reference/assistants-streaming/events

Related:

https://github.com/openai-php/client/issues/357

EthanBarlo commented 3 months ago

I'm fairly certain this is fully feature-complete with everything new in the API. Let me know if anything is spotted that might be missing, or any updates that need to be made.

EthanBarlo commented 3 months ago

@knash94 Thanks for reviewing it and resolving the type issue. I'm not doing any work with images, so wouldn't have spotted that one.

I've added the createAndRunStreamed function, and updated the contracts to match the new functionality.

Let me know if any more issues crop up from your testing.

eng-ahmad-sameer commented 2 months ago

Hey all, Is there anybody available to review this PR and maybe merge and release it so we can use the streaming functionality?

Thx for your efforts.

punyflash commented 2 months ago

@eng-ahmad-sameer, you can use it by aliasing the open-ai client version in composer.json:

"repositories": [
    {
        "type": "vcs",
        "url": "https://github.com/EthanBarlo/OpenAi-php-client.git"
    }
],
"require": {
    "openai-php/client": "dev-Implement-Assistants-Streaming as 0.8.4"
}

punyflash commented 2 months ago

@EthanBarlo was testing out and in most cases it works perfectly. Found 2 main issues:

$stream = $client->threads()->runs()->createStreamed(
    threadId: 'thread_tKFLqzRN9n7MnyKKvc1Q7868',
    parameters: [
        'assistant_id' => 'asst_gxzBkD1wkKEloYqZ410pT5pd',
    ],
);

$iterator = $stream->getIterator();

while ($iterator->valid()) {
    $item = $iterator->current();
    $iterator->next();

    // ----

    if ($item->event === 'thread.run.requires_action') {
        $stream = $client->threads()->runs()->submitToolOutputsStreamed(
            threadId: $item->data->threadId,
            runId: $item->data->id,
            parameters: [
                'tool_outputs' => [[
                    'tool_call_id' => 'call_KSg14X7kZF2WDzlPhpQ168Mj',
                    'output' => '12',
                ]],
            ]
        );
        $iterator = $stream->getIterator();
    }

    // ----
}
spont4e commented 2 months ago

> Hey all, Is there anybody available to review this PR and maybe merge and release it so we can use the streaming functionality?
>
> Thx for your efforts.

🥇@gehrisandro and 🥈@nunomaduro are the people

EthanBarlo commented 2 months ago

@punyflash Thanks for pointing out these issues

> Lack of type annotations for stream events. Needed to set them in place so my IDE could give me at least some hints about what is going on.

Yeah, this is something worth adding, but I don't actually know how to set those up. If you could point me to an example or request the necessary changes, that'd be helpful.
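For reference, one common way to provide such IDE hints in PHP is a PHPStan/Psalm-style generic docblock on the event wrapper. The class and property names below are purely illustrative, not the actual API of this library:

```php
<?php

/**
 * Hypothetical streamed-event wrapper, annotated so static analysers
 * and IDEs can narrow the type of $data per event.
 *
 * @template TData
 */
final class StreamedEvent
{
    /**
     * @param 'thread.run.created'|'thread.run.completed'|string $event
     * @param TData $data
     */
    public function __construct(
        public readonly string $event,
        public readonly mixed $data,
    ) {
    }
}

// With @template in place, tools like PHPStan infer the concrete
// type of ->data from the value passed at construction time.
$event = new StreamedEvent('thread.run.created', ['id' => 'run_123']);
```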

> The example with tool output is not working.

Yeah, that example does need to be updated. Here's an updated one, closer to how I've been using it. I'll probably update the example in the readme with this.

$stream = $client->threads()->runs()->createStreamed(
    threadId: 'thread_tKFLqzRN9n7MnyKKvc1Q7868',
    parameters: [
        'assistant_id' => 'asst_gxzBkD1wkKEloYqZ410pT5pd',
    ],
);

do {
    foreach ($stream as $response) {
        // $response->event: 'thread.run.created' | 'thread.run.in_progress' | .....
        // $response->data: ThreadResponse | ThreadRunResponse | ThreadRunStepResponse | ThreadRunStepDeltaResponse | ThreadMessageResponse | ThreadMessageDeltaResponse

        switch ($response->event) {
            case 'thread.run.created':
            case 'thread.run.queued':
            case 'thread.run.completed':
            case 'thread.run.cancelling':
                $run = $response->data;
                break;
            case 'thread.run.expired':
            case 'thread.run.cancelled':
            case 'thread.run.failed':
                $run = $response->data;
                break 3; // exit the switch, the foreach, and the do-while
            case 'thread.run.requires_action':
                // Overwrite the stream with the new stream started by submitting the tool outputs
                $stream = $client->threads()->runs()->submitToolOutputsStreamed(
                    threadId: $run->threadId,
                    runId: $run->id,
                    parameters: [
                        'tool_outputs' => [
                            [
                                'tool_call_id' => 'call_KSg14X7kZF2WDzlPhpQ168Mj',
                                'output' => '12',
                            ],
                        ],
                    ],
                );
                break;
        }
    }
} while ($run->status != 'completed');

petermjr commented 2 months ago

You can check these changes for adding faking capabilities. They can be implemented similarly in this branch https://github.com/petermjr/openai-php/compare/v0.8.4...v0.8.5

subet commented 2 months ago

Hi,

Thanks for the great and full-featured SDK. Is there an estimated timeline for when this feature might be integrated and when we might expect the next release?

Regards

EthanBarlo commented 2 months ago

@subet Unfortunately I have no idea for a timeline on this. In the meantime you can use the new functionality with this https://github.com/openai-php/client/pull/367#issuecomment-2046120856

I'm quite confident that all the functionality has been implemented in the same style as the rest of the API. We have at least 4 people using it on their projects without issues, so I do think it's close to being ready to go.

The next stage is for @gehrisandro and @nunomaduro to review the PR.

gehrisandro commented 2 months ago

Hi @EthanBarlo

Thank you very much for your PR! I am going to review it now.

slaven-ii commented 2 months ago

Hi, I have been actively tracking the progress of this PR, but lately it looks like there has been no movement since the tests were run. Is this what is blocking it?

Guervyl commented 1 month ago

Hi there, I haven't actually checked the code yet.

My question is: is it cost-optimized?

I'm new to OpenAI GPT and don't know much about their pricing, but they say there is an output cost per token.

The way this library currently fetches the response is by fetching all the messages: $messages = $client->threads()->messages()->list('...');. Does your code fetch the last message like this?

Because if output tokens were billed when fetching messages, it would be bad to fetch all messages again and again.

EthanBarlo commented 1 month ago

@Guervyl The output tokens refer to text generated by the AI models, so retrieving previous messages and old text generations does not cost anything.
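As a rough back-of-the-envelope sketch (all prices below are made up for illustration; check OpenAI's pricing page for real numbers), only the tokens a run actually sends and generates contribute to the bill:

```php
<?php
// Hypothetical per-token prices, for illustration only.
$inputPricePer1M  = 5.00;  // USD per 1M input tokens (assumed)
$outputPricePer1M = 15.00; // USD per 1M output tokens (assumed)

// A run that sends 1200 tokens of prompt/context and generates 350 new tokens.
$inputTokens  = 1200;
$outputTokens = 350;

// Listing previous messages re-downloads their text but generates nothing,
// so it adds no output-token cost on top of this.
$cost = ($inputTokens / 1_000_000) * $inputPricePer1M
      + ($outputTokens / 1_000_000) * $outputPricePer1M;

printf("Estimated run cost: $%.5f\n", $cost);
```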

gehrisandro commented 1 month ago

Thanks, @EthanBarlo, for your PR!

Changed some stuff to work similarly to the other streaming endpoints.

Will make a new release this week.

magarrent commented 1 month ago

Thank you guys! Amazing work.

Please note that the current version still sends the OpenAI-Beta assistants=v1 header, so it will crash if you're using new v2 parameters such as file_search.
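If you need v2 before a fix lands, one possible workaround (assuming the client factory's withHttpHeader() method behaves the same in the version you have installed; verify against your release) is to set the beta header yourself:

```php
<?php

require 'vendor/autoload.php';

// Sketch: force the Assistants v2 beta header on every request.
// Confirm withHttpHeader() exists in your installed openai-php/client version.
$client = OpenAI::factory()
    ->withApiKey(getenv('OPENAI_API_KEY'))
    ->withHttpHeader('OpenAI-Beta', 'assistants=v2')
    ->make();
```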