KeyError: "['length'] not in index" in the preprocess_reference_dir() call of scorer

usnistgov / ccu_validation_scoring

Other

5 stars 0 forks source link

In the latest code update, we see that the preprocess_reference_dir() function in the CCU_validation_scoring/score_submission.py file seeks for data type and length in addition to fileID, in line 685: index_df = index_df[["file_id", "type", "length"]]

What are the type and length? I can guess that type is about video/audio/text but I'm really not sure about length. At some point, I saw a LDC download in which all data files have some length of 10000 or sth (but is that how it should be?).

I tried looking at the index files in test/reference/ to get a sense, but those reference files of how latest index files should look like have not been updated.

Hence, help appreciated, thank you!

usnistgov / ccu_validation_scoring

KeyError: "['length'] not in index" in the preprocess_reference_dir() call of scorer #3