cvat-ai / cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
https://cvat.ai
MIT License
12.06k stars 2.93k forks

Idea: Annotating a sequence of images #7819

Open doantientai opened 3 months ago

doantientai commented 3 months ago

Discussed in https://github.com/cvat-ai/cvat/discussions/7818

Originally posted by **doantientai** April 29, 2024

I am working on a project in which we have to assign a status label to each object (a classification problem). However, we have to look at the object from multiple angles, so we took multiple images of each object. For the annotation step, we now have to display a sequence of images each time we label an object.

Surprisingly, it seems there is no native solution for this. So I am trying two workarounds, but both come with drawbacks:

- **Contextual images**: For each object, I display 1 main image + 12 contextual images to show the other angles. However, sometimes there are more than 13 images per object, so I will lose some images if I choose this solution. Plus, the main image is huge compared to the contextual ones.
- **Concatenating images**: I can merge all the views of each object into one single image. The problem is that the views are cropped to different sizes and shapes. Moreover, the number of views per object can differ, which makes it difficult to merge them into the same grid without resizing them.

What I would like to suggest is the ability to have more contextual images (like 50), and the ability to hide the main image so that it does not take up too much space in the interface.

Thank you!
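
For the concatenation workaround, one way to avoid resizing is to pad each view into a grid cell instead of scaling it. The following is only a rough sketch of that idea, not anything CVAT provides: the function name `padded_grid_layout` and its parameters are hypothetical, and it only computes where each view would be pasted, leaving the actual image handling to whatever imaging library is in use.

```python
from typing import List, Tuple

Size = Tuple[int, int]    # (width, height) in pixels
Offset = Tuple[int, int]  # (x, y) of a view's top-left corner on the canvas


def padded_grid_layout(sizes: List[Size], columns: int) -> Tuple[Size, List[Offset]]:
    """Return the canvas size and the paste offset of each view.

    Views are placed row by row. Each column is as wide as its widest view
    and each row as tall as its tallest view, so no view is resized;
    smaller views simply leave empty padding inside their cell.
    """
    if columns < 1:
        raise ValueError("columns must be at least 1")
    if not sizes:
        return (0, 0), []

    rows = (len(sizes) + columns - 1) // columns  # ceiling division
    col_widths = [0] * min(columns, len(sizes))
    row_heights = [0] * rows

    # Find the largest width per column and the largest height per row.
    for i, (w, h) in enumerate(sizes):
        r, c = divmod(i, columns)
        col_widths[c] = max(col_widths[c], w)
        row_heights[r] = max(row_heights[r], h)

    # The start of each column/row is the sum of the sizes before it.
    x_starts = [sum(col_widths[:c]) for c in range(len(col_widths))]
    y_starts = [sum(row_heights[:r]) for r in range(rows)]

    offsets = [(x_starts[i % columns], y_starts[i // columns])
               for i in range(len(sizes))]
    return (sum(col_widths), sum(row_heights)), offsets


# Example: three views of different sizes laid out in two columns.
canvas, offsets = padded_grid_layout([(100, 50), (80, 120), (60, 60)], columns=2)
```

The returned canvas size and offsets could then be used to create a blank image and paste each view at its offset (for example with Pillow's `Image.new` and `Image.paste`, assuming Pillow is the library at hand). This handles differing view sizes and a varying number of views per object, at the cost of some wasted padding when the sizes differ a lot.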
realtimshady1 commented 2 months ago

I am facing a similar problem: I'm trying to label the differences between two images, so being able to view the second image simultaneously would be very helpful.