ethman / slakh-utils

Utilities for interfacing with Slakh2100
MIT License
59 stars 15 forks source link

List of Stems with Errors in Slakh #17

Open greenbech opened 3 years ago

greenbech commented 3 years ago

After using Slakh for multi-instrument automatic music transcription, I've found several stems with different kinds of errors. At the time of this writing (2021-06-07), 22 stems are found. Most of the errors are only relevant for music transcription.

The errors are classified in the following classes:

white-noise

Something must have gone wrong with the audio rendering—the audio for the stem only consists of white noise

wrong-pitch

The pitch of the label and audio are not the same

wrong-octave

The octave of the label and audio are not the same

missing-audio

Not all the notes in the label are rendered

short-labels

Some of the notes in the MIDI file parsed with PrettyMIDI are shorter than the rendered audio

long-labels

Some of the notes in the MIDI file parsed with PrettyMIDI are longer than the rendered audio

A list of the errors is added below as well as in this GitHub repository (which might be updated if more errors are found). This list does not contain the systematic error of the highest octave in the label not being present in the audio as described in issue #18

List of stems with errors
```json { "Track00262": { "S01": "short-labels" }, "Track00357": { "S03": "white-noise" }, "Track00377": { "S07": "white-noise" }, "Track00385": { "S00": "white-noise" }, "Track00398": { "S00": "white-noise" }, "Track00400": { "S00": "white-noise" }, "Track00404": { "S03": "long-labels" }, "Track00496": { "S01": "wrong-pitch" }, "Track00629": { "S01": "white-noise" }, "Track00633": { "S01": "white-noise" }, "Track00737": { "S01": "long-labels" }, "Track00749": { "S01": "white-noise" }, "Track00893": { "S01": "long-labels" }, "Track01629": { "S00": "white-noise" }, "Track01876": { "S01": "missing-audio" }, "Track01908": { "S05": "missing-audio" }, "Track01918": { "S10": "wrong-pitch" }, "Track01929": { "S04": "wrong-octave" }, "Track01931": { "S01": "wrong-pitch" }, "Track01993": { "S01": "missing-audio" }, "Track01937": { "S03": "wrong-pitch" }, "Track02024": { "S13": "missing-audio" } } ```
ethman commented 3 years ago

Holy cow, this is great work! With the crunch of deadlines, I was unable to inspect every single file, but it looks like you found a nice way. Would you mind sharing the code you used to catch these issues?

I'll find a way to fix these in the upcoming Slakh release. Again, thank you so much!!!!

ethman commented 3 years ago

Hey bump on this. Would it be possible to share the code that you used to find these issues?