Some questions about the 'make_imnet_dataset' function

QhelDIV / ShapeFormer

Official repository for the ShapeFormer Project

https://shapeformer.github.io/

190 stars 22 forks source link

Some questions about the 'make_imnet_dataset' function #11

Open LydJason opened 1 year ago

LydJason commented 1 year ago

Thank you very much for your great work! I am trying to use the full dataset of IMnet and HSP to train ShapeFormer. I discovered the make_imnet_dataset function in shapeformer/data/imnet_datasets/imnet_datasets.py write shape_vocab, vocab_idx, xbd and cat_id into the hdf5 file. However, the code didn't write Ytg to the hdf5 file, which was provided in the processed dataset from google drive. I was wondering maybe there are some other code that processes the original IMnet data to Ytg? Thank you very much!

https://github.com/QhelDIV/ShapeFormer/blob/3558e2a794c580795f465ca6ccd75a493850fda3/shapeformer/data/imnet_datasets/imnet_datasets.py#L350

QhelDIV commented 1 year ago

You can have a check at the Imnet2Dataset class at the file ShapeFormer/shapeformer/data/imnet_datasets/imnet_datasets.py. There are some snippets to decompress the shape_vocab back to Ytg. Basically shape_vocab and vocab_idx is the HSP's representation to compress the Ytg volume (256x256x256). The compression is actually really good and the storage is much smaller.

LydJason commented 1 year ago

I got it. Thank you very much!

saladcat commented 1 year ago

Thank you very much for your great work! I am trying to use the full dataset of IMnet and HSP to train ShapeFormer. I discovered the make_imnet_dataset function in shapeformer/data/imnet_datasets/imnet_datasets.py write shape_vocab, vocab_idx, xbd and cat_id into the hdf5 file. However, the code didn't write Ytg to the hdf5 file, which was provided in the processed dataset from google drive. I was wondering maybe there are some other code that processes the original IMnet data to Ytg? Thank you very much!

Thanks for your great works. Could you please provide the code for reducing the resolution from 256 to 64? I would like to keep my processing consistent with yours

dqj5182 commented 2 weeks ago

@QhelDIV I am also interested in code for reducing the resolution of Ytg from 256 to 64.

QhelDIV commented 2 weeks ago

Hi @saladcat and @dqj5182 , thanks for the interest! I am not sure if I can find the original code for this. But this should be simple as it is a mean-pooling process. You can use this (untested) code to do this with the einops package.

# Assuming Ytg has a shape of (256, 256, 256)
Ytg_x64 = einops.reduce(Ytg, '(x k1) (y k2) (z k3) -> x y z', 'mean', k1=8, k2=8, k3=8)

dqj5182 commented 2 weeks ago

Thanks for the clarification!