Does this method also generalize well to multi-view predictions ? , i.e generating heatmaps from several views and training in this manner to predict directly the 3d poses as in voxel pose (but the data is only synthetic only camera poses known I'm advance but no actual pairs available)
Does this method also generalize well to multi-view predictions ? , i.e generating heatmaps from several views and training in this manner to predict directly the 3d poses as in voxel pose (but the data is only synthetic only camera poses known I'm advance but no actual pairs available)