working with real-world images and complex backgrounds

threedle / GeoCode

GeoCode maps 3D shapes to a human-interpretable parameter space, allowing to intuitively edit the recovered 3D shapes from a point cloud or sketch input.

382 stars 25 forks source link

If you want to test the existing model with real images, then converting the images to sketches is the best option I can think of right now. To generate the sketches you could explore classical algorithms or recent papers related to the subject, perhaps a combination of the two. If these fail to produce good results, I would also think of a segmentation model to extract the object and then convert the object to a sketch. Given a dataset, I would start with an image encoder to see how it performs before exploring anything else. Creating such a dataset could be tricky. To preserve the labels of the shapes as much as possible, I would probably go the way of generating scenes involving the objects as synthetic data. You will have to explore ways to bridge the domain gap if such a dataset will not have a good performance on real images.

threedle / GeoCode

working with real-world images and complex backgrounds #10