-
### Description
We have a large scale C++ HPC astronomy application and want to use calls to Python actors from within C++. From the codebase it looks like it's possible to call C++ actors from Pytho…
-
### Request Description
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, …
-
### How do you use Sentry?
Sentry Saas (sentry.io)
### Version
2.18.0
### Steps to Reproduce
1. Have a view like
```
class StreamingView(APIView):
def post(self, request, *args, **kwargs):
…
-
### Describe the bug
When using `bedrock/anthropic.claude-3-5-sonnet-20241022-v2:0`, the interpreter crashes if two consecutive user messages are sent.
### Reproduce
▌ Model set to bedrock/anthro…
-
### Describe the bug
When using FastAPI `StreamingResponse`, it seems like the framework is not correctly handling ContextVars when streaming the response. The context is reset after the first chunk …
-
In https://github.com/elastic/kibana/pull/168440, we are implementing token count tracking for streaming responses for OpenAI inside of the framework. This is a small iteration but ideally this should…
-
**Expected Behavior**
Essentially, title.
Some of the modern UI frameworks for LLMs tend to standardize on using what's called an "OpenAI compliant API" response format, where, essentially, you …
-
We need to decide on the testing framework for the streaming server — whether we use Jest or the built-in node.js testing tools, or whether we use something like Matteo Collina's [borp](https://www.np…
-
As a Block Node developer
I want to replace Helidon's OOTB gRPC layer with our custom PBJ implementation
So that, I have greater control of the gRPC layer
```[tasklist]
### Tasks
- [x] Replace `s…
-
## 🚀 Feature Request
Hey MosaicML team! Thank you so much for this awesome project! I was wondering if there are any plans to make this framework agnostic: Remove the dependency from PyTorch.
## M…