Adding more prompts - Githubissues

remyxai / VQASynth

Compose multimodal datasets 🎹

216 stars 13 forks source link

Closed smellslikeml closed 8 months ago

smellslikeml commented 8 months ago

Adding more of the prompts described in the Spatial VLM paper including the distinction between canonicalized point clouds.

Also improved depth estimation, switching to using GPU