gurih is an automatic speech recognition (ASR) Python module built specifically for recognizing and transcribing speech in Bahasa. Given an audio input, gurih produces a transcription of it.
For example, given a 30.240s-long audio recording of the Kancil story, gurih will output the following transcription:
Pada suatu hari, terjadilah kelaparan di sebuah pulau yang penduduknya
kebanyakan di huni oleh para Harimau. Mereka sangat kelaparan,
karena semakin hari tidak ada hewan yang dapat mereka mangsa.
Akhirnya, Raja Harimau mengutus Panglima dan para Prajuritnya untuk pergi
ke pulau kecil di sebrang dan kembali dengan membawa banyak makanan.
Currently, gurih can only be installed by first cloning its repository from GitHub. To install the module from the command line:
git clone https://github.com/Arc-rendezvous/voice-to-text-bahasa.git
cd voice-to-text-bahasa
pip install -e .
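After the editable install, it can be useful to confirm that the package is visible to the Python interpreter you intend to use. The following is a minimal sketch using only the standard library; the helper name `module_available` is hypothetical and not part of gurih, and the submodule names are the ones listed later in this README. It only checks that the modules can be located and makes no assumption about gurih's API.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if `name` can be located on the current import path.

    Hypothetical helper for illustration; it is not part of gurih.
    """
    try:
        return importlib.util.find_spec(name) is not None
    except ImportError:
        # Raised for dotted names when the parent package is missing
        # or fails to import.
        return False


if __name__ == "__main__":
    # Package and submodule names taken from this README.
    for name in ("gurih", "gurih.models", "gurih.data", "gurih.features"):
        status = "found" if module_available(name) else "NOT found"
        print(f"{name}: {status}")
```

If `gurih` is reported as not found, the most likely cause is that `pip install -e .` was run in a different environment than the one running the script.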
gurih can be used as a Python package in third-party code, either directly in a .py script or embedded in .ipynb notebooks.
gurih currently provides the following functionality commonly used in automatic speech recognition tasks:
gurih.models: contains the models used to perform the speech recognition task
gurih.data: contains the scripts needed for preprocessing the audio files and transcriptions
gurih.features: contains the scripts needed for the feature extraction process

This project uses Semantic Versioning 2.0.0 (https://semver.org/).
Copyright © 2020 Gunawan Lumban Gaol, M Haries Ramdhani Ade M
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at: http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.