ImageMonkey / imagemonkey-core

ImageMonkey is an attempt to create a free, public open source image dataset.
https://imagemonkey.io
47 stars 10 forks source link

Pre-annotate video frames? #35

Open bbernhard opened 7 years ago

bbernhard commented 7 years ago

I was playing a little bit with the idea of detecting object boundaries within video frames (idea originated in #32).

Currently my implementation looks like this:

Here is a GIF that shows the result (I used some random video from Youtube as input):

video_extraction_test

However I am not really happy with the result...it definitely got better after using the DBSCAN algorithm, but it's still not as good as I would have expected. Also tried a few different algorithms, but the "weighted moving mean" was creating the best results (at least for my test video).

Anyone an idea on how to improve the results? I am still a novice in the ML / image processing sector, so any help is really appreciated ;-)

dobkeratops commented 7 years ago

interesting experiment.. i think its worth it.. you can always manually clean up. one comment, maybe polygonal boundaries would be worthwhile (8 sided, whatever), so you don't get noise in rotations from frame to frame (it would localise the effect of the noise more)

bbernhard commented 7 years ago

Thanks for your advice, really appreciated!

maybe polygonal boundaries would be worthwhile (8 sided, whatever), so you don't get noise in rotations from frame to frame (it would localise the effect of the noise more)

Do you mean something like a convex hull?

dobkeratops commented 7 years ago

yes exactly a convex hull. maybe it's worth trying an aligned bounding box clipped with diagonals, like this definitions and tools k-dop an 8-dop in two dimensions .. or something more general. My theory is the aligned+diagonal version would be more stable between frames .. but I haven't tried anything like this

bbernhard commented 7 years ago

Coool, thanks :) Will definitely check that out!