zbytek-team / speech-analysis

MIT License
0 stars 0 forks source link

Speech Analysis Project

This research project at the Gdańsk University of Technology aims to develop a method for identifying characteristic frequency ranges for different languages. The goal is to review methods of speech frequency analysis and, based on speech recordings for various languages, verify Dr. Alfred Tomatis's theory that different languages have distinct frequency ranges for specific phonemes.

Requirements

Installation

  1. Clone the repository:

    git clone https://github.com/ack2406/speech-analysis
    cd speech-analysis
  2. Install the required Python packages:

    pip install -r requirements.txt

Usage

This project uses a Makefile to streamline various tasks like downloading datasets, extracting features, and listing available options.

Download and Extract Data

To download speech data and extract features for specific languages in one step:

make download_and_extract LANGUAGES="pl en" DATA_SIZE=2 FEATURES="pitch mfcc"

Download Data

To only download speech data for specific languages:

make download LANGUAGES="pl en" DATA_SIZE=2

Extract Features

To only extract features from already downloaded data:

make extract LANGUAGES="pl en" FEATURES="pitch mfcc"

List Available Languages

To list all available languages from Mozilla Common Voice:

make list-languages

List Available Features

To list all available features that can be extracted:

make list-features

Directory Structure

License

This project is licensed under the MIT License - see the LICENSE file for details.