jopedroliveira / speech_recog_uc

Speech processing ROS-package. Performs speech recognition and estimates the direction of arrival based on a real-time voice activity detection mechanism.
MIT License
13 stars 10 forks source link
direction-of-arrival robotics ros ros-node speech-recognition

Speech_Recognition_UC

Purpose

The Speech Recognition Package (speech_recog_uc) is a ROS (Robot Operative System) package designed by Universidade de Coimbra, that allow robots to perform high-reliability real-time speech recognition based on a real-time Voice Activity Detection algorithm, with high efficiency and low computational resources consumption.

Wiki

More info: http://wiki.ros.org/speech_recog_uc

Changelog

V 1.0.0

speech_recog_uc_basic: A package to perform real-time speech recognition using specific sample rate and number o channels (16kHz, mono).

speech_recog_uc: A package to perform real-time speech recognition using custom sample rate and number o channels (16kHz or 48kHz, mono or stereo), able to real-time estimate the sound source position (direction of arrival).

Others

Developed by: Jose Pedro Oliveira, joliveira@deec.uc.pt Fernando Perdigao, fp@deec.uc.pt

Acknowledgments: Gonçalo S. Martins Luis Santos Jorge Dias GrowMeUp project, funded by the European Union’s Horizon 2020 Research and Innovation Programme - Societal Challenge 1 (DG CONNECT/H) under grant agreement No 643647

Copyright (c) 2018 Universidade de Coimbra