Open ArjunSridhar1 opened 3 years ago
Hello! Thanks for your attention to our research! In our experiments (https://arxiv.org/abs/2103.08157), we haven't tried with randomly initialized encoder yet. However, I'm guess your problem would be fixed if you pretrained the encoder with our method, and then provide the pretrained encoder weights to pretrain_model_path
. You can pretrain the encoder via CUDA_VISIBLE_DEVICES={device_index} python3 pretrain.py experiment=AngularLoss
, or whatever loss you want to use.
Can you please give us a step by step procedure?
Hello, @bhavna-gopal , sorry for the late reply.
embedder
, we only use the trunk
.python3 train.py pretrained_model_path=/path/to/your/weights/trunk-1.h5
.I'm closing the issue #28 , let's continue communicating here!
The is_valid check fails for all candidates produced in train.py resulting in an empty generated architectures list. Are there any things that could be causing this mainly: How is the encoder model from pretrain loaded into train? Is there some value that should be used in the .yaml for pretrain_model_path?
Any help would be greatly appreciated! Thank you so much for your time!