audio preprocessing - Githubissues

SR-71-group / birdanalysis

science-camp-project for migration birds calls

Apache License 2.0

0 stars 0 forks source link

Audio Preprocessing for Bird Call Extraction and Spectrogram Conversion

1. Bird Call Extraction

Goal: 🔸Isolation of the audio fragments from files that contain just the bird calls.

2. Zero Padding for Uniform Length

Goal: 🔸 Ensure all audio clips have a uniform length for model compatibility. implementation: finding the longest call duration and apply zero padding for others to make them same length

why? learning models like CNNs or RNNs, which expect inputs of fixed size

3. Spectrogram Conversion

Goal: 🔸Convert the processed audio clips into spectrograms for model input

Expected result:

🔸spectrograms of bird calls input with zero padding in .npy format in folder /data/preprocessed with exact names as a videofiles from which they was extracted.