Open LZHgrla opened 6 months ago
Thanks for the reminder, I will resolve this issue as soon as possible.
Hi, @Espere-1119-Song
We found another two invalid tar file: movies/s01e08-1.tar (10.6 GB)
, movies/S01E2-4.tar (6.24 GB)
thanks, we are hurry to upload them
We upload the raw videos of the training set :)
Hi @Espere-1119-Song
I found some pairing issues between the JSON and TAR files in the MovieChat-1K_train dataset.
There are a total of 830 JSON files (json.txt) and 769 TAR files (tar.txt). They are mismatched. I checked and found that there are 74 missing TAR files (tar_missing.txt) and 13 extra TAR files (tar_extra.txt).
Additionally, there seem to be issues with
AWB-8.tar
andearth9-2.tar
files in HuggingFace hub, possibly due to the compression or upload failure. (AWB-8.tar
is an extra TAR file and can be deleted directly, whileearth9-2.tar
should be considered for re-uploading)