mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Mozilla Public License 2.0
24.81k stars 3.93k forks source link

Benchmark the CTC against a RNN-Transducer model #753

Open kdavis-mozilla opened 6 years ago

kdavis-mozilla commented 6 years ago

See for example[1]

patrickms commented 6 years ago

It sounds like Deep Speech 3 also uses an RNN transducer.

Slides here:

https://sites.grenadine.co/sites/cmu-scs-lti/en/colloquium/items/392