kensho-technologies / pyctcdecode

A fast and lightweight python-based CTC beam search decoder for speech recognition.
Apache License 2.0

Character LM #64

Closed zelabean closed 2 years ago

zelabean commented 2 years ago

Does this library work with a character-level LM?

When I use a character-level LM for Korean, it doesn't work well.

lopez86 commented 2 years ago

The package currently assumes word-level LMs. The code ought to work with character-based LMs, but chances are performance will suffer compared to word-level LMs. There's a bit more discussion here: https://github.com/kensho-technologies/pyctcdecode/issues/50
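To make the word-level assumption concrete, here is a minimal sketch of one generic workaround: rewriting text so that every character is a space-separated token, which lets word-level LM tooling treat characters as "words". This is an assumption for illustration, not something the thread or pyctcdecode confirms. The helper names and the `|` boundary symbol are hypothetical, and whether this improves decoding for Korean is untested here.

```python
# Hypothetical helpers for illustration only; neither is part of pyctcdecode.

WORD_BOUNDARY = "|"  # assumed placeholder that stands in for a real space


def to_char_tokens(text: str) -> str:
    """Turn each character into its own space-separated token."""
    tokens = [WORD_BOUNDARY if ch == " " else ch for ch in text.strip()]
    return " ".join(tokens)


def from_char_tokens(tokens: str) -> str:
    """Invert to_char_tokens: drop token separators and restore real spaces."""
    return "".join(
        " " if tok == WORD_BOUNDARY else tok for tok in tokens.split(" ")
    )


sentence = "안녕 세상"
encoded = to_char_tokens(sentence)
print(encoded)                    # 안 녕 | 세 상
print(from_char_tokens(encoded))  # 안녕 세상
```

A corpus preprocessed this way could be used to train an n-gram LM whose "words" are single characters. The decoder's output vocabulary would have to use the same convention, so treat this as a starting point for experiments, not a drop-in fix.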

zelabean commented 2 years ago

Thank you for your help :)