RAI is a multi-vendor agent framework for robotics, utilizing Langchain and ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more.
Apache License 2.0
82
stars
8
forks
source link
Distance (or BBox3D) estimation using RGBD camera with GSAM or similar #194
Is your feature request related to a problem? Please describe. RAI agent could benefit from better spatial reasoning.
Describe the solution you'd like Image -> 2D segmentation mask (open set) -> Masked depth data -> Distance estimation
Relevant links: