Hi, I'm testing with 2 Macbooks. I found the discussion #130 and It works fine with 1 or 2 devices. One thing I hope is, enabling subsequential requests in multiple devices.

For instance, if I pass the same request_id for every request from chatgpt_api, it works in a single device. However, not working in more than one device.

130 fixes this, by changing the request id in every request.

I'm kinda working on the impact of subsequential requests - where the KVs are continuously accumulated. In single device test, all-same-request-id queries shows decreasing speed in each http request(due to the memory limit - I'm using 8GB macbooks!) I hope to check this out while two devices used. With the current code (changing request id), no speed degradation is found (and also no context found in the answers).

At first, I attempted to put the requests through web UI. However, with multiple devices, the web UI stops in 3~4 subsequent requests.

not always, but most cases. getting frequent when accumulation goes.
Also single device not shows this issue. Don't know why, but the exo log shows the answers to the end-of-token. So I thought, it might be web UI's issue, so testing via chatgpt_api would be better. And now - this situation.

Could you share some ideas or thinkings to make the subsequential requests work?

exo-explore / exo

About #130, regarding subsequential requests #158

130 fixes this, by changing the request id in every request.