Kaljurand / K6nele

An Android app that offers speech-to-text user interfaces to other apps
http://kaljurand.github.io/K6nele/
Apache License 2.0
267 stars, 83 forks

Improve continuous dictation interface (applies to ICS and higher) #28

Open Kaljurand opened 9 years ago

Kaljurand commented 9 years ago

Original issue 24 created by Kaljurand on 2012-04-06T07:37:32.000Z:

Make it work better with apps that expect the continuous dictation interface (e.g. Evernote v3.6.2 on ICS). Currently some audio is not transcribed unless you leave a long, deliberate pause between sentences.

Kaljurand commented 9 years ago

Comment #1 originally posted by Kaljurand on 2012-04-06T08:17:56.000Z:

The audio that Evernote attaches to the note (as a WAV file) seems to come entirely from the speech recognizer (via the bufferReceived callback?) and is interpreted as having an 8 kHz sample rate. This seems compatible with Google Voice Search, but with Kõnele the saved audio sounds like 16 kHz audio played back as 8 kHz.

Note that the bufferReceived callback does not specify the sample rate. It also requires big-endian 16-bit samples for some reason...

http://developer.android.com/reference/android/speech/RecognitionService.Callback.html#bufferReceived(byte[])
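
One possible workaround, sketched here only as an illustration and not as Kõnele's actual code: if the recorder produces 16 kHz little-endian 16-bit mono PCM, the chunk could be halved to 8 kHz and byte-swapped to big-endian before it is handed to bufferReceived. The class name `PcmConvert` and the method `to8kBigEndian` are hypothetical, and the assumption that clients such as Evernote expect 8 kHz is only inferred from the behaviour described above. Averaging adjacent samples is a crude low-pass filter; a real implementation would probably want a proper resampler.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

public class PcmConvert {

    /**
     * Converts 16 kHz little-endian 16-bit mono PCM into
     * 8 kHz big-endian 16-bit mono PCM.
     * Trailing bytes that do not form a full pair of samples are dropped.
     */
    public static byte[] to8kBigEndian(byte[] pcm16kLittleEndian) {
        ByteBuffer in = ByteBuffer.wrap(pcm16kLittleEndian)
                                  .order(ByteOrder.LITTLE_ENDIAN);
        // Two input samples (4 bytes) become one output sample (2 bytes).
        int outSamples = pcm16kLittleEndian.length / 4;
        ByteBuffer out = ByteBuffer.allocate(outSamples * 2)
                                   .order(ByteOrder.BIG_ENDIAN);
        for (int i = 0; i < outSamples; i++) {
            int a = in.getShort();
            int b = in.getShort();
            // Average the pair (crude anti-aliasing), then write big-endian.
            out.putShort((short) ((a + b) / 2));
        }
        return out.array();
    }
}
```

The result of `PcmConvert.to8kBigEndian(chunk)` would then be the byte array passed to the callback, in place of the raw recorder output.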