raymondxyy / pyaudlib

A speech signal processing library in Python with emphasis on deep learning.
MIT License
31 stars 6 forks source link

Dataset #1

Open raymondxyy opened 5 years ago

raymondxyy commented 5 years ago

Dataset

Updated: 12/27/2018, 11:47 PM


Generic Dataset

An abstract Dataset class is defined for easy batch processing. The base class is a direct copy of the Dataset class in PyTorch, with a few additional requirements:

Other Datasets

Apart from the generic Dataset class, some common speech-related datasets subclass Dataset:

Overview

See audlib.data.dataset for the abstract interfaces of Dataset and its subclasses. For implementations of specific datasets, see the Wall Street Journal (WSJ) module in audlib.data.wsj, or other dataset modules.