Open wlssyuu opened 12 months ago
Hi @wlssyuu,
According to the documentation, you should set your JSON file to the same structure as @YuanGongND 's JSON file.
Here is the author's JSON file example:
# this is just an sample, if you only use audio, 'video_id' and 'image' entries are not necessary.
{
"data": [
{
"video_id": "--4gqARaEJE",
"wav": "/data/sls/audioset/data/audio/eval/_/_/--4gqARaEJE_0.000.flac",
"image": "/data/sls/audioset/data/images/eval/_/_/--4gqARaEJE_5.000.jpg",
"labels": "/m/068hy,/m/07q6cd_,/m/0bt9lr,/m/0jbk"
},
{
"video_id": "--BfvyPmVMo",
"wav": "/data/sls/audioset/data/audio/eval/_/_/--BfvyPmVMo_20.000.flac",
"image": "/data/sls/audioset/data/images/eval/_/_/--BfvyPmVMo_25.000.jpg",
"labels": "/m/03l9g"
},
{
"video_id": "--U7joUcTCo",
"wav": "/data/sls/audioset/data/audio/eval/_/_/--U7joUcTCo_0.000.flac",
"image": "/data/sls/audioset/data/images/eval/_/_/--U7joUcTCo_5.000.jpg",
"labels": "/m/01b_21"
},
{
"video_id": "--i-y1v8Hy8",
"wav": "/data/sls/audioset/data/audio/eval/_/_/--i-y1v8Hy8_0.000.flac",
"image": "/data/sls/audioset/data/images/eval/_/_/--i-y1v8Hy8_4.500.jpg",
"labels": "/m/04rlf,/m/09x0r,/t/dd00004,/t/dd00005"
},
{
"video_id": "-0BIyqJj9ZU",
"wav": "/data/sls/audioset/data/audio/eval/_/0/-0BIyqJj9ZU_30.000.flac",
"image": "/data/sls/audioset/data/images/eval/_/0/-0BIyqJj9ZU_35.000.jpg",
"labels": "/m/07rgt08,/m/07sq110,/t/dd00001"
}
]
}
@p4vlos thanks so much for the clarification!
@wlssyuu hello. I am curious if you have solved this problem and run the AST code using the desired dataset. Also, I would like to use the dataset structure described in json. Can you share which dataset you used?
I tried to follow readme and train my own dataset, but I could not. If I'm not bothering you, let me know how to use my own dataset. My json file is structured like this with 62 classes of datas.