Closed SutirthaChakraborty closed 3 years ago
Hi @SutirthaChakraborty
You have two options:
1. Write your own torch.utils.data.Dataset which returns mel spectrograms and a corresponding vector of beat activations (see the Davies & Bock paper for how they represented beats in this vector).
2. Put your annotations in the format expected by the BallroomDataset class provided in this repo. The beat annotations are just text files with one line per beat and two space-separated values per line: the first is the beat time in seconds and the second is the beat number within the bar. Then you can just use the scripts provided to create spectrograms and train the model.
Hope that helps!
Ben
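For reference, here is a minimal sketch of both options using only the Python standard library. It is not code from this repo: the function names, the output file name, and the `frames_per_second` parameter are hypothetical. It also assumes a fixed number of beats per bar with the first detected beat as beat 1, because librosa's beat tracker reports beat times only and not bar positions. The binary target below is a simplification, so check the Davies & Bock paper for the exact representation they used.

```python
from pathlib import Path


def write_beat_annotations(beat_times, path, beats_per_bar=4, first_beat_number=1):
    """Write one line per beat: '<beat time in seconds> <beat number within bar>'.

    ASSUMPTION: the bar position is not known from librosa, so it is guessed
    by counting 1..beats_per_bar repeatedly, starting at first_beat_number.
    """
    lines = []
    for i, t in enumerate(beat_times):
        beat_number = (first_beat_number - 1 + i) % beats_per_bar + 1
        lines.append(f"{float(t):.6f} {beat_number}")
    Path(path).write_text("\n".join(lines) + "\n")


def beat_activation_vector(beat_times, num_frames, frames_per_second):
    """Return a frame-level target: 1.0 at frames containing a beat, else 0.0.

    ASSUMPTION: frames_per_second must match the frame rate of the
    spectrogram the target is paired with (sample rate / hop length).
    """
    target = [0.0] * num_frames
    for t in beat_times:
        frame = int(round(float(t) * frames_per_second))
        if 0 <= frame < num_frames:
            target[frame] = 1.0
    return target


# Usage (beat times in seconds, e.g. from
# librosa.beat.beat_track(y=y, sr=sr, units="time")):
# write_beat_annotations(beat_times, "my_song.beats")
# y_target = beat_activation_vector(beat_times, num_frames=3000, frames_per_second=100)
```

The first function covers option 2 (annotation files for the provided BallroomDataset class); the second gives the Y vector you would return alongside each spectrogram from a custom Dataset in option 1.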
Hi, thank you for this repo. I have a few songs, and I estimated the beats using librosa (as ground truth). How can I preprocess the data and convert it into a dataset (X and Y) for training the models? Cheers, Suti