shard /mmu_nlp_ssd/wangxiao24/internvid/InternVid-10M-FLT_sub/_tmp/3.feather failed with error 'ResolutionSubsampler' object has no attribute 'encode_formats'
Traceback (most recent call last):
File "/mmu_nlp_ssd/wangxiao24/internvid/video2dataset/video2dataset/workers/download_worker.py", line 102, in __call__
self.download_shard(row)
File "/mmu_nlp_ssd/wangxiao24/internvid/video2dataset/video2dataset/workers/download_worker.py", line 161, in download_shard
writer_encode_formats["video"] = self.subsamplers["video"][0].encode_formats["video"]
AttributeError: 'ResolutionSubsampler' object has no attribute 'encode_formats'
Primary analysis
The errors come from the following code:
# The subsamplers might change the output format, so we need to update the writer
writer_encode_formats = self.encode_formats.copy()
if self.subsamplers["audio"]:
writer_encode_formats["audio"] = self.subsamplers["audio"][0].encode_formats["audio"]
if self.subsamplers["video"]:
writer_encode_formats["video"] = self.subsamplers["video"][0].encode_formats["video"]
I cannot understand the meaning of these codes, specifically why The subsamplers might change the output format.
Bug descriptions
I am using the following config:
I got the following errors:
Primary analysis
The errors come from the following code:
I cannot understand the meaning of these codes, specifically why
The subsamplers might change the output format
.