Open dobkeratops opened 7 years ago
example - this is the kind of thing that springs to mind when I hear of a labelling application.. get the most value out of every image by labelling absolutely everything, and encourage users to get more precise. If you just uploaded this as an image of a 'building', later people could have come along and labelled the cars. And so on.. (this is why I found the initial narrow label, and single label per image rather too restrictive). Leave things open ended, because later users may know more. One person has a camera in an interesting place.. another is sat at home with time on their hands to do labelling
Just thought about this again when going through the site to do some image donation.. (the script is great but the site UI is what most people will use)
I'd suggest scene labels to give the most useful single label for an entire image; it would make sorting & searching easier too.
These would have overlap but you can chose on a case by case basis.. what single word best describes a particular scene? That would help whole image training?
existing scene labels
kitchen
forest
beach
new suggestions - from memory in the dataset..
street
residential area
landscape
town
rural area
urban area
city
city centre
town centre
village
park
garden
office
home
shop
living room
restaurant
cafe
shopping mall
workshop
garage
jungle
valley
industrial area
construction site
canal
(canal is actually similar to river but it's an artficial channel)
harbour
classroom
library
museum
hospital
farm
highway
(specialisation of road, often means it's going through landscape)
port
.. each scene label could guide the 'popular labels' in the 'add labels' view.. eg 'street' would suggest labels: road, pavement,tree,building,house,car,person,bin; 'park' would suggest labels foliage, tree trunk, bush,grass,path,wall. 'farm' would suggest fence, field,tractor,barn, ... and so on. You could even add those all as default tasks, and use the 'skip' to remove a specific label
Thanks for the labels list - those labels definitely make sense!
As you've mentioned the donate page: We've talked about the donate page a while ago and you suggested to make the < unlabeled > label the default one. I recently thought about that again and I am thinking whether we should go even a step further and remove the labels list completely from the donate page?
In my opinion the donate page is only useful in case you wanna donate a handful of images. If you wanna donate more than that, it's probably better to use a scripted approach (maybe we should cleanup the script a bit, create a commandline tool out of it, write a better documentation of the tool and add a reference to the donate page?).
I think with the current approach it's pretty easy to label an image wrongly (in case you forget to change the label in the label dropdown before uploading). I remember when I first posted about ImageMonkey at reddit (~1-2 years ago) there were quite a few donations from people who wanted to give ImageMonkey a shot. However, a significant portion of those images were obviously wrong labeled (tree vs car, etc) after upload. But to be fair: This was still the time where a new random label was chosen from the label dropdown after every upload...so maybe that was the reason for that. :D
Another problem I see is, when the user uploads a completely different set of images where we do not already have labels defined for that. In the best scenario, the user just selects < unlabeled >, in the worst scenario he could pick a label that's similar enough but not correct.
What do you think about that?
this is my highest priority suggetion: At the moment the site presents you with a very limited set of labels; as such the range of images you can submit is very restrictive.
How about adding some very broad categories for labels applying to entire scenes; these would not be suitable for area tagging, but you'd later go in and crop regions to distinguish the contained objects. This would instantly broaden the contributions to the database.
Scene labels still have training value, since different scenes contain different correlations of objects. , and it's possible to divide object recognition into multiple networks, e.g. the result of a 'broad classification' network selects one of several domain-specific 'narrow-classification networks' to use to refine. (e.g. you could have 4 networks.. 1='urban v domestic v nature', 2.1='urban objects', 2.2='domestic objects', '2.3=nature objects', and based on the result of '1', you then feed the image into 2.1,2.2, or 2.2)
Suggested Scene Labels: