tattle-made / feluda

A configurable engine for analysing multi-lingual and multi-modal content.
https://tattle.co.in/products/feluda/
GNU General Public License v3.0
14 stars 16 forks source link

Video Operator should process video of any length and size #323

Open aatmanvaidya opened 6 months ago

aatmanvaidya commented 6 months ago

A core component of Feluda are Operators , they are modules that help us analyse media items such a text, image, video, audio etc. One key operator is the video operator which takes in a video file as input and generates the vector representation of the file in output.

Link to the Video Vec Operator - https://github.com/tattle-made/feluda/blob/main/src/core/operators/vid_vec_rep_resnet.py

Currently, The video operator has a size limit of the videos it can process. The task is to modify the operator so that it can process large video files. When we say video of any length and size should be processed, what we mean is that the RAM usage should be constant and not increase linearly with video size/length

Expected Outcome

Files to look at

kes-edge commented 6 months ago

Mind if I have a look at this?

dennyabrain commented 6 months ago

@kes-edge Its not on our immediate roadmap, so we won't be looking at it. Feel free to work on it and share any insights/progress. Good luck

plon-Susk7 commented 1 month ago

Hey @aatmanvaidya , can I work on this issue if anyone isn't working?

dennyabrain commented 1 month ago

@plon-Susk7 thank you for your work so far. While you are here, take a look at this issue as well - https://github.com/tattle-made/feluda/issues/410. Its a prominent issue we are working on right now. Now that you are slighly familiar with feluda, its the kind of moderately complex issue that you should be able to take up :) Lets discuss details on that issue if its of interest.

plon-Susk7 commented 1 month ago

Hmm, the issue seems interesting. I'd be happy to work @dennyabrain .