gunawanlg / voice-to-text-bahasa

Apache License 2.0
0 stars 0 forks source link
python speech-recognition

Voice to Text Bahasa

Build Status codecov

gurih is an automatic speech recognition (ASR) python module, crafted specifically for recognizing and transcribing speech spoken in Bahasa. Given an audio input, gurih will perform the transcription of the input.

For example, given an input of 30.240s-long audio recording of Kancil story gurih will output the following transcription:

    Pada suatu hari, terjadilah kelaparan di sebuah pulau yang penduduknya 
    kebanyakan di huni oleh para Harimau. Mereka sangat kelaparan, 
    karena semakin hari tidak ada hewan yang dapat mereka mangsa. 
    Akhirnya, Raja Harimau mengutus Panglima dan para Prajuritnya untuk pergi 
    ke pulau kecil di sebrang dan kembali dengan membawa banyak makanan.

How to Install

Currently gurih can only be installed by first cloning it from the repository hosted on GitHub. The following is how you can install the module, from the command-line:

Usage

gurih can be used as a Python package inside third-party code directly as the .py script or embedded to .ipynb notebooks.

Implemented Functionality

gurih provides the following functionality commonly used in the automatic speech recognition task. Currently, gurih contains the following functionalities:

Versioning

This project use Semantic Versioning 2.0.0 from https://semver.org/.


Copyright © 2020 Gunawan Lumban Gaol, M Haries Ramdhani Ade M

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at: http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language overning permissions and limitations under the License.