Preprocess data - Githubissues

pmhalvor / public-data

One central location for my public facing datasets

https://pmhalvor.github.io/public-data/

GNU General Public License v3.0

0 stars 0 forks source link

Preprocess data #2

Closed pmhalvor closed 11 months ago

pmhalvor commented 11 months ago

Complete preprocess file plus add the data preprocessed.

Note: I decided to go with .pt format for mfcc and covariance. These were unfortunately too big to push to git, so their previous .parquet df implementations were added instead. The .pt are easily generated again by running preprocess with the GTZAN dataset downloaded locally