A speech signal processing library in Python with emphasis on deep learning.
audlib provides a collection of utilities for developing speech-related applications using both signal processing and deep learning. The package offers the following high-level features:
Some use cases of audlib are:
audlib focuses on correctness, efficiency, and simplicity. Signal processing functionalities are mathematically checked whenever possible (e.g. constant overlap-add, istft(stft(X))==X
). Deep neural networks follow the PyTorch's convention.
sig.util.freqz
becomes sig.util.fftfreqz
nn.strfnet
is removed. See the official repository here.transform.stlogm
is removedaudioread
follows the interface of soundfile.read
audiowrite
follows the interface of soundfile.write
sr
is removed from all short-time transformspip install audlib
In the source directory, install the library with test dependencies:
pip install ".[tests]"
Run test:
python -m pytest tests
python setup.py sdist bdist_wheel
twine upload --repository-url https://upload.pypi.org/legacy/ dist/*
More extensive examples can be found in examples/
.
Please reach out to Raymond Xia (raymondxia@cmu.edu) for comments.
More information about our research groups can be found on at
Distributed under the MIT license. See LICENSE
for more information.