iejMac / clip-video-encode

Easily compute clip embeddings from video frames
MIT License
136 stars 19 forks source link

embed the text as well #61

Open iejMac opened 2 years ago

iejMac commented 2 years ago

can't see why we shouldn't have an option for encoding the text and putting that in the WDS

iejMac commented 2 years ago

tasks:

matchyc commented 1 year ago

is that available now? I really need to encode text @iejMac

iejMac commented 1 year ago

you can't save the caption embeddings in the current state but you could probably hack around the code (or make a clean PR! that would be great) and get it to do that since they are computed here - https://github.com/iejMac/clip-video-encode/blob/ff390b01091e9d93263d3cb4e8a747fb4143cc1d/clip_video_encode/clip_video_encode.py#L107

currently they aren't saved but instead they are just compared to each frame in the input video to get a similarity score per frame and that is saved

matchyc commented 1 year ago

Thanks, I guess I'll be able to finish a pr in the future after getting my things done. By the way, is there embedded caption data for webvid? I only see video embeddings in huggingface.