Closed vDawgg closed 5 months ago
Currently, all of the VLM's output for a keyframe is written into the same CSV column. Instead, the output should be separated and placed into different columns (e.g. Captions, Objects, etc.) to make building the metadata object easier.
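A minimal sketch of one way the split could look, not the project's actual implementation. It assumes the VLM response for a keyframe consists of labeled sections such as `Caption: ...` and `Objects: ...`; that format, the column names in `FIELDS`, and the helper names `split_vlm_output` and `keyframes_to_csv` are all hypothetical and would need to match the real prompts and pipeline.

```python
import csv
import io
import re

# Hypothetical column names; the real set depends on the VLM prompts used.
FIELDS = ["Caption", "Objects"]


def split_vlm_output(raw: str, fields=FIELDS) -> dict:
    """Split 'Label: text' sections of one VLM response into a dict keyed by label."""
    # Match any known label at the start of a line, followed by a colon.
    pattern = re.compile(
        r"^(%s):[ \t]*" % "|".join(map(re.escape, fields)),
        flags=re.MULTILINE | re.IGNORECASE,
    )
    # Labels missing from the response stay as empty strings.
    result = {name: "" for name in fields}
    matches = list(pattern.finditer(raw))
    for i, m in enumerate(matches):
        # A section runs until the next label, or to the end of the text.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(raw)
        # Map the matched label back to its canonical column name.
        key = next(f for f in fields if f.lower() == m.group(1).lower())
        result[key] = raw[m.end():end].strip()
    return result


def keyframes_to_csv(rows) -> str:
    """Build CSV text from an iterable of (keyframe_name, raw_vlm_output) pairs."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["Keyframe", *FIELDS], lineterminator="\n"
    )
    writer.writeheader()
    for keyframe, raw in rows:
        writer.writerow({"Keyframe": keyframe, **split_vlm_output(raw)})
    return buf.getvalue()
```

With one column per field, the metadata object can be built by reading each row with `csv.DictReader` and accessing `row["Caption"]` or `row["Objects"]` directly, rather than re-parsing a single combined cell.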