How to control the model parameters when jointly optimize 'alpha' and 'W' over the union of the training and validation sets using coordinate descent.? #140
The paper reported the model parameters is 3.1M if α and w are jointly optimized over the union of
the training and validation sets using coordinate descent.
I want to know that the model parameters directly searched out is 3.1M or after modifying the number of conv channels?
The paper reported the model parameters is 3.1M if α and w are jointly optimized over the union of the training and validation sets using coordinate descent.
I want to know that the model parameters directly searched out is 3.1M or after modifying the number of conv channels?
Look forward to your reply.