Closed JisuHann closed 3 years ago
: Next-best-view planning scheme based on Supervised Learning
task: Fine-grained Object classification
: 3D attention model for active 3D object identification with multi-view depth acquisition
: select NBV for depth acquisition targeting at an object of interest, sequential NBV regression based on RNN
each step
Recurrent attention model for NBV regression Repeat
: concentrates on the discriminative regions in each view for part-based recognition
task: object recognition
train by 3D shape dataset --> give best views targeting an object of interest for recognizing it
differentiable rendering(depth image to be differentiable with respect to the viewing parameters) → loss backpropagation
Recurrent 3D attentional architecture
loss: cross-entropy loss
training
input: a view parameterized in the local spherical coordinate system(using a ray casting algorithm with a random initial view from our selected 50 views)
depth layer: generate depth image
aggregates information of past views on RNN hidden layer
FC class: prediction of the categorical label
FC loc: for NBV selection in 3D space, produce an update to the current view for future observations
Next Best View Prediction