mlpnet input layer has one less feature

automl / Auto-PyTorch

Automatic architecture search and hyperparameter optimization for PyTorch

Apache License 2.0

2.37k stars 287 forks source link

mlpnet input layer has one less feature #26

Closed GautamSridhar closed 4 years ago

GautamSridhar commented 4 years ago

Hi,

I recently started using AutoPytorch for discovering normal mlpnets for the regression setting. Unfortunately, if my data has 21 features as in the example, the learned network actually takes only 20 input features. I'm not sure what I am doing wrong, but if I try to use the model for further predictions as a standard pytorch model, it gives errors because of the matrix dimension mismatch. Any solutions to this?

This is my discovered network -

LMZimmer commented 4 years ago

For featurized data there is a lot of preprocessing techniques implemented. One of them probably downsamples the data. Could you provide the full configuration?

However, it is probably a good idea to implement a method to extract the preprocessing pipeline in some form for further usage. We should do this in a future release

GautamSridhar commented 4 years ago

Thanks for the reply! Turns out it was a preprocessing technique that was downsampling the data. In my configuration, all the options were set active. Setting the preprocessing to none produced the right input size for the architecture. Is there any available documentation other than the example notebook which describes the role of all the options in the configuration?

LMZimmer commented 4 years ago

The instances of the autonet classes have a print_help() function which gives information about some options. This will become more detailed in the future. In terms of preprocessing options we adapted common terminology so that it should be possible to find documentation about it elsewhere (although it would be nice to have some small guide here).

GautamSridhar commented 4 years ago

Thanks! I'll look forward to more documentation then, but it's good enough to be used for my purposes right now. Closing the issue!

bunnyfighting commented 1 year ago

@GautamSridhar

Hi,

When I use the AutoPytorch for searching network, the data was downsampled. However, I used "include_components" to set the preprocessing none, like this "include_components={'preprocessing': ["NoPreprocessor"]}". But it was not work. So how did you set the preprocessing to none produced the right input size for the architecture?