Open fbanados opened 3 months ago
This is an RA task: Figure out what sessions are really missing and should be added.
As I go through the re-importing process I've found (and marked) several sessions that are on the Masters Recordings MetaData spreadsheet but not on speech-db. I've also found at least two sessions that have a folder in maskwacis-recordings
and do not appear in the Masters Recordings Metadata spreadsheet (2017-02-02 downstairs sessions)
There are several sessions that have
wav
andeaf
data in our server but have not been included into speech-db. This is likely to have arisen from inconsistencies in the filenames: Although the import script tries in many ways to find a matching.wav
file for a specific.eaf
input, at some point the script gives up and if it can't find the.wav
files, it won't import the session. Normalizing the filenames in the folders to expected conventions or updating the script to deal with all variations currently in the data would fix this issue.Several candidates have been identified in a comment for #446