RobotecAI / rai

RAI is a multi-vendor agent framework for robotics, utilizing Langchain and ROS 2 tools to perform complex actions, defined scenarios, free interface execution, log summaries, voice interaction and more.
Apache License 2.0
82 stars 8 forks source link

Distance (or BBox3D) estimation using RGBD camera with GSAM or similar #194

Open maciejmajek opened 2 weeks ago

maciejmajek commented 2 weeks ago

Is your feature request related to a problem? Please describe. RAI agent could benefit from better spatial reasoning.

Describe the solution you'd like Image -> 2D segmentation mask (open set) -> Masked depth data -> Distance estimation

Relevant links: