UAlbertaALTLab / recording-validation-interface

Maskwacîs recordings validation interface
https://speech-db.altlab.app/
Other
1 stars 1 forks source link

Missing sessions from maskwacis-recordings #447

Open fbanados opened 3 months ago

fbanados commented 3 months ago

There are several sessions that have wav and eaf data in our server but have not been included into speech-db. This is likely to have arisen from inconsistencies in the filenames: Although the import script tries in many ways to find a matching .wav file for a specific .eaf input, at some point the script gives up and if it can't find the .wav files, it won't import the session. Normalizing the filenames in the folders to expected conventions or updating the script to deal with all variations currently in the data would fix this issue.

Several candidates have been identified in a comment for #446

fbanados commented 2 months ago

This is an RA task: Figure out what sessions are really missing and should be added.

fbanados commented 2 months ago

As I go through the re-importing process I've found (and marked) several sessions that are on the Masters Recordings MetaData spreadsheet but not on speech-db. I've also found at least two sessions that have a folder in maskwacis-recordings and do not appear in the Masters Recordings Metadata spreadsheet (2017-02-02 downstairs sessions)