rom1504 / image_embeddings

Using efficientnet to provide embeddings for retrieval
https://rom1504.github.io/image_embeddings/
MIT License
154 stars 32 forks source link

Video datasets #21

Open Sebagam opened 3 years ago

Sebagam commented 3 years ago

Hello,

I would like to get the embeddings for each frame of a video dataset. I'm using CV2 to get the frames, so I end up with frames of (720, 1280, 3). Being the first two dimensions the resolution and 3 the color channels.

How can I get the embedding of each frame? I don't see a function to load new datasets in the .ipynb from scratch.

Thanks

rom1504 commented 3 years ago

you can use ffmpeg to generate images and then this repo to index them