mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Mozilla Public License 2.0
24.82k stars 3.93k forks source link

Why I call DS_IntermediateDecode always crash a few seconds later at the iphone device? #3778

Open czwldy opened 10 months ago

czwldy commented 10 months ago

For support and discussions, please use our Discourse forums.

If you've found a bug, or have a feature request, then please create an issue with the following information:

                stream!.feedAudioContent(buffer: shorts)
                print("czwxx\(byteSize):\(stream!.intermediateDecode())")  // I add the code here, at the SpeechRecognitionImpl class.

At the swift demo, I add stream!.intermediateDecode() after stream!.feedAudioContent(buffer: shorts), it would crash EXC_BAD_ACCESS (code=2, address=0x16e023ff0) at the function get_history_helper a few seconds later.

mkos11 commented 7 months ago

To analyze syllables or phonemes alongside DeepSpeech, consider integrating with libraries like NLTK or PyDictionary in Python. These libraries provide syllable information and phonetic details, complementing DeepSpeech's text-to-speech conversion. By combining these tools, you can create a comprehensive solution for a more in-depth linguistic analysis of spoken words.