isi-vista / adam

Abduction to Demonstrate an Articulate Machine
MIT License

2D symbolic paths from Pixel Data #1126

Open lichtefeld opened 2 years ago

lichtefeld commented 2 years ago

While implementing the action data loader I noticed that we have access to some positional information that a) we currently encode only for display purposes and b) could be used to construct a form of path traversal information. The current object feature set gives us a centroid x/y for the object in the pixel space of the RGB image. It would be possible to compute a linear path between the start and end positions and break it into its x and y components. This would give us two paths in this 'pixel space' feature set that we could evaluate, where a Path is defined by a discrete reference axis, a discrete direction, and a continuous distance.
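A minimal sketch of the decomposition described above, to make the proposal concrete. The names `AxisPath` and `pixel_paths` are hypothetical and do not exist in the ADAM codebase. The direction labels are also an assumption; in particular, this assumes the usual image convention in which y grows downward, so a "positive" y direction means the object moved toward the bottom of the frame.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class AxisPath:
    """Hypothetical container for one per-axis path in pixel space."""

    axis: str  # discrete reference axis: "x" or "y"
    direction: str  # discrete direction: "positive", "negative", or "none"
    distance: float  # continuous distance along the axis, in pixels


def _direction(delta: float) -> str:
    # Map a signed displacement onto a discrete direction label.
    if delta > 0:
        return "positive"
    if delta < 0:
        return "negative"
    return "none"


def pixel_paths(
    start: Tuple[float, float], end: Tuple[float, float]
) -> List[AxisPath]:
    """Split the straight line from start to end centroid into x and y paths."""
    dx = end[0] - start[0]
    dy = end[1] - start[1]
    return [
        AxisPath("x", _direction(dx), abs(dx)),
        AxisPath("y", _direction(dy), abs(dy)),
    ]
```

For example, `pixel_paths((10.0, 20.0), (10.0, 80.0))` yields an x path with no movement and a y path of 60 pixels in the positive direction, which under the assumed convention is the kind of signal a falling object would produce.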

There are a few assumptions about this interpretation:

@spigo900 and I believe that, while this feature may not always be useful, there may be situations (e.g. fall) where it is reasonably useful in combination with other features. Other actions may also be distinguishable if the visual appearance of the samples is fairly constrained.