yuxng / PoseCNN

A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes
https://rse-lab.cs.washington.edu/projects/posecnn/
MIT License
756 stars 243 forks source link

How to create a dataset? #57

Closed Kaju-Bubanja closed 5 years ago

Kaju-Bubanja commented 6 years ago

I was wondering whether this repo contains code that helps with the generation of a dataset? Can you or do you plan on open sourcing the code used to create a dataset? Or is it already in the package and I just didn't find it?

Abdul-Mukit commented 5 years ago

Hi, @Kaju-Bubanja. How make the dataset? I studied the article several times but didn't understand the 6D pose annotation part. The paragraph goes like this: "To avoid annotating all the video frames manually, we manually specify the poses of the objects only in the first frame of each video. Using Signed Distance Function (SDF) representations of each object, we refine the pose of each object in the first depth frame. Next, the camera trajectory is initialized by fixing the object poses relative to one another and tracking the object configuration through the depth video. Finally, the camera trajectory and relative object poses are refined in a global optimization step."

How were the 6D poses determined in the first frame? I don't get it. Also, I don't see any marker field for camera pose detection. Moreover the objects seem to be on top of each other in many frames and not on the base as in LINEMOD dataset.

If you fond the codes for pose annotation can you please mention which codes are these?

Kaju-Bubanja commented 5 years ago

There is no Code in the Repo to produce a Dataset. I wrote my own and used a vicon system and blender to generate a dataset. I will be able to tell you how exactly to do that in the future, but not right now.

RobbyHuang commented 4 years ago

Hi, @Kaju-Bubanja @Abdul-Mukit ,I want to creat a new datasets ,and I met the same problem that how to annotating the poses of the objects in the first frame of each video. Do you know how to get ground-truth annotations for object poses?

Abdul-Mukit commented 4 years ago

I moved on to DOPE and NDDS both from Nvidia research. NDDS is great for synthetic data generation.

Kaju-Bubanja commented 4 years ago

Feel free to have a look at my master thesis, especially the section 3.4 Dataset generation. You can find my thesis Textureless, Symmetric, Multiple Instances 6D Pose Estimation in Cluttered Scenes Using RGB-D Data here.

RobbyHuang commented 4 years ago

@Kaju-Bubanja thanks for your share @Abdul-Mukit thanks your advice I finally choose labelfusion and NDDS

THANKS

pra-dan commented 3 years ago

@Abdul-Mukit What was your last choice for 6D pose estimation ? PoseCNN or DOPE ? I am planning to train it for a single custon object

Abdul-Mukit commented 3 years ago

@pra-dan DOPE. I got the job done quickly with it. I am sure there are better and faster pose estimators out there now. I have been busy with other stuff. Can you let me know which estimator you end up working with later?