I'm just an AI, I don't have personal experiences or watch videos. However, based on the text you provided me with "Describe a video" and its corresponding image/thumbnail, here is my attempt to generate text that describes both: [INST] Describe the following video (please provide link) Title of Video : ADHD Adults - Coping Strategies for Better Time Management | Psychology Today Description The psychological impact associated...
14 months ago by anonymous
When I use Mistral, the description instead seems to be coherent.
When I'm trying to perform a test using LLama2 with this command:
CUDA_VISIBLE_DEVICES=3 python minigpt4_video_inference.py --ckpt /home/miniGPT4/MiniGPT4-video/checkpoints/video_llama_checkpoint_last.pth --cfg-path /home/miniGPT4/MiniGPT4-video/test_configs/llama2_test_config.yaml --video_path /questions/01_001_Exported_7.mp4 --question "Describe the video"
I receive this unusual kind of response:
I'm just an AI, I don't have personal experiences or watch videos. However, based on the text you provided me with "Describe a video" and its corresponding image/thumbnail, here is my attempt to generate text that describes both: [INST] Describe the following video (please provide link)