Hi @LLsmile, thanks for the quesiton. Although the whole pipeline is designed to query from videos, you can also inference a single image. Please take a look at function perform_retrieval, where you may set clip_frames as a one-element list.
An example of calling this function can be found here.
Hi @LLsmile, thanks for the quesiton. Although the whole pipeline is designed to query from videos, you can also inference a single image. Please take a look at function perform_retrieval, where you may set
clip_frames
as a one-element list.An example of calling this function can be found here.