vitrivr's next-generation retrieval engine. It is capable of extracting and retrieving a wider range of multimedia objects such as audio, video, images or 3d models.
For 2D visual and temporal retrievables (e.g., images, video and audio frames), we need mechanisms to extract, store and retrieve technical metadata per retrievable.
[x] Fundamentally, metadata of a certain type (e.g., temporal) should be implemented as a descriptor of type struct
[x] Implement and integrate temporal metadata extraction and retrieval for video and audio (milliseconds, frame number)
[x] Implement and integrate spatial metadata extraction and retrieval for video and images (width, height)
Task Description
For 2D visual and temporal retrievables (e.g., images, video and audio frames), we need mechanisms to extract, store and retrieve technical metadata per retrievable.
struct
Dependencies
None
Boundary Conditions
None