Add Streaming transcription support to the .NET SDK

aws / aws-sdk-net

The official AWS SDK for .NET. For more information on the AWS SDK for .NET, see our web site:

http://aws.amazon.com/sdkfornet/

Apache License 2.0

Add Streaming transcription support to the .NET SDK #1847

Open ernestodossantos opened 3 years ago

ernestodossantos commented 3 years ago

The .NET SDK doesn't support streaming transcription. This is a very important feature for us. Is this something you're considering?

joshongithub commented 3 years ago

@ernestodossantos, could you provide more detail about the use case and which AWS services the SDK would interact with? Thank you.

ernestodossantos commented 3 years ago

Hello @joshongithub,

We need to perform transcriptions in real time during a phone call. Our automated system processes calls, interacting with the caller through speech recognition (Amazon Transcribe) and text to speech (Amazon Polly). However, currently we can only do speech recognition by uploading the recorded speech to S3 and then running the recognition from there. Doing this in real time would be a huge improvement for us.

joshongithub commented 3 years ago

@ernestodossantos - thank you for the detail. Do you know if any of the other AWS SDKs provide this support? I'm looking for an example of how this support has been implemented elsewhere. I see we have API support for both Amazon Polly and Amazon Transcribe, but it sounds like implementing this feature would require a new library that combines the two.

ernestodossantos commented 3 years ago

Amazon Polly is fine; it can be used separately. The problem is with speech recognition, because we need to stream the audio in real time and get the recognition results back in real time. The service we need to use is the following: https://docs.aws.amazon.com/transcribe/latest/dg/streaming.html

But it is not implemented in all client SDKs. According to this page, this is only available for C++, Java and Ruby.

Here's a usage example for Java: https://github.com/aws-samples/aws-transcribe-streaming-example-java

Our applications are built with .NET Core (actually .NET 5.0), so we would need this in the .NET SDK to be able to use it.

joshongithub commented 3 years ago

Thank you for the detail, I understand the issue now and I'll forward the details of your request to the service team.

ernestodossantos commented 3 years ago

Thank you, much appreciated!

jbgh2 commented 2 years ago

This feature would be very helpful for me as well. The feature was added to the Go SDK in 2020 (https://github.com/aws/aws-sdk-go/pull/3048).