akras14 / speech-to-text

Example transcribing audio file (speech) to text with Google Cloud Speech API and Python

[HELP], Want to recognize the voice #1

Closed IAmVinnnn closed 6 years ago

IAmVinnnn commented 6 years ago

I have used the Google Cloud Speech-to-Text API, which is working well, but I need to show the speaker just above each line. Suppose I have an audio file in which 4 people are speaking. I want each person's label to appear just before his or her text, like this:

Person1: Here is the text of person1.
Person2: Here is the text of person2.
Person1: Here is another line of text from person1.
Person3: Here is the text of person3.

Can anyone let me know how I can get the speaker along with the text using the Google API?

akras14 commented 6 years ago

Unfortunately, I don't think this is currently possible.

I'll leave the issue open for now, in case somebody might have a better suggestion.

IAmVinnnn commented 6 years ago

@akras14 Thanks for the response. Let's see if anyone has a solution for this.

IAmVinnnn commented 6 years ago

I am going to close this issue because I am now going with the IBM Speech to Text API, and it's working as per my requirements. You guys can check HERE

ericbaranowski commented 6 years ago

Speaker diarization is currently available, but it's in beta. Since this project is using the SpeechRecognition module instead of the Google API directly, I'm not sure if you can modify the setting, but it's just two added fields in the request.

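# Beta client: the diarization fields were only exposed in v1p1beta1 at the time
from google.cloud import speech_v1p1beta1 as speech

config = speech.types.RecognitionConfig(
    encoding=speech.enums.RecognitionConfig.AudioEncoding.LINEAR16,
    sample_rate_hertz=16000,
    language_code='en-US',
    enable_speaker_diarization=True,   # tag each recognized word with a speaker
    diarization_speaker_count=2)       # expected number of speakers in the audio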

Google Cloud Speech Diarization
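
To get from diarized output to the "Person1: ..." layout asked for above, a rough sketch follows. It assumes you have already pulled (word, speaker tag) pairs out of the response; as far as I know each recognized word carries `word` and `speaker_tag` fields when diarization is enabled, but treat that as an assumption and check the response you actually get. `format_by_speaker` and the sample data are made up for illustration and are not part of any Google library.

```python
from itertools import groupby
from typing import Iterable, List, Tuple


def format_by_speaker(tagged_words: Iterable[Tuple[str, int]]) -> List[str]:
    """Collapse consecutive words with the same speaker tag into one line."""
    lines = []
    # groupby only merges adjacent items, so a speaker who talks again
    # later gets a new line instead of being merged with the earlier one.
    for tag, group in groupby(tagged_words, key=lambda pair: pair[1]):
        text = " ".join(word for word, _ in group)
        lines.append(f"Person{tag}: {text}")
    return lines


# Hypothetical sample. With the real API these pairs would be built from
# the word-level results, e.g. (w.word, w.speaker_tag) for each word
# (assumed field names).
sample = [("Hello", 1), ("there", 1), ("Hi", 2), ("How", 1), ("are", 1), ("you", 1)]

for line in format_by_speaker(sample):
    print(line)
```

Grouping only adjacent words keeps the conversation in order, which is what the requested layout needs; grouping by speaker across the whole file would lose the turn-taking.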

avikat commented 5 years ago

Can we convert a video file to text?