[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
warnings.warn(
Traceback (most recent call last):
File "/apdcephfs_cq3/share_1311970/Video-ChatGPT/video_chatgpt/train/train_mem.py", line 11, in
train()
File "/apdcephfs_cq3/share_1311970/Video-ChatGPT/video_chatgpt/train/train.py", line 482, in train
model_args, data_args, training_args = parser.parse_args_into_dataclasses()
File "/apdcephfs_cq3/share_1311970/lb/miniconda3/envs/video_chatgpt/lib/python3.10/site-packages/transformers/hf_argparser.py", line 332, in parse_args_into_dataclasses
obj = dtype(**inputs)
File "", line 113, in init
File "/apdcephfs_cq3/share_1311970/lb/miniconda3/envs/video_chatgpt/lib/python3.10/site-packages/transformers/training_args.py", line 1190, in __post_init__
raise ValueError(
ValueError: Your setup doesn't support bf16/gpu. You need torch>=1.10, using Ampere GPU with cuda>=11.0
warnings.warn( Traceback (most recent call last): File "/apdcephfs_cq3/share_1311970/Video-ChatGPT/video_chatgpt/train/train_mem.py", line 11, in
train()
File "/apdcephfs_cq3/share_1311970/Video-ChatGPT/video_chatgpt/train/train.py", line 482, in train
model_args, data_args, training_args = parser.parse_args_into_dataclasses()
File "/apdcephfs_cq3/share_1311970/lb/miniconda3/envs/video_chatgpt/lib/python3.10/site-packages/transformers/hf_argparser.py", line 332, in parse_args_into_dataclasses
obj = dtype(**inputs)
File "", line 113, in init
File "/apdcephfs_cq3/share_1311970/lb/miniconda3/envs/video_chatgpt/lib/python3.10/site-packages/transformers/training_args.py", line 1190, in __post_init__
raise ValueError(
ValueError: Your setup doesn't support bf16/gpu. You need torch>=1.10, using Ampere GPU with cuda>=11.0