I wanted to try out your HF implementation, but cannot get it to run in your example as given in the README.md.
Even after fixing a few small errors in deep_autoencoder (272: rbm.get_cost_updates() no longer takes weight_decay param, 401:402 valid_batch_size/test_batch_size params should be swapped), I still get nans in the CG code. There were also a bunch of warnings in the compilation of theano.function for unused inputs. This gist contains a subset of the output, basically the ridge just keeps growing until it reaches inf.
Hi,
I wanted to try out your HF implementation, but cannot get it to run in your example as given in the README.md.
Even after fixing a few small errors in deep_autoencoder (272: rbm.get_cost_updates() no longer takes weight_decay param, 401:402 valid_batch_size/test_batch_size params should be swapped), I still get nans in the CG code. There were also a bunch of warnings in the compilation of theano.function for unused inputs. This gist contains a subset of the output, basically the ridge just keeps growing until it reaches inf.