issues
search
SiyuanHuang95
/
ManipVQA
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
40
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Datasets
#4
RussRobin
opened
3 weeks ago
5
Fine-tuning and inference of ManipVQA on less GPU resources
#3
hyang1974
closed
3 days ago
2
using the pretain model infer images
#2
PredyDaddy
closed
2 weeks ago
2
image_test
#1
GentlesJan
closed
1 month ago
2