Closed flipdazed closed 7 years ago
RBM not learning data set Bad hyper-parameters or related to the cost issue of #2 ?
nodes
Here I tried [15000, 15000, 15000]
. I assumed I would need >10000
nodes in at least the first RBM layer to model cross-relationships between the data sets. I made this assumption from reading this paper by Geoffrey Hinton who seemed to arbitrarily just bump up the first number of nodes depending on the complexity of the task at hand. I’m not entirely sure how complex this particular task is though and how many nodes to include and how to distribute them.
learning rate, epochs, batch size, early stopping I don’t think this is the problem here as these would just cause the learning to be slower/faster rather than the wrong direction. Mini-batch size seems reasonable at 10 for a total sample size of 582. Early stopping is not part of the pre-training so can’t be responsible for the direction issues
k
, number of Gibbs steps in CD/PCD
Is this possibly responsible? 1
is the lowest this could possibly be!
A sample of the weights showing activity early on then nothing for the rest of the run time
closed as too general.
Issue The extension to the weather data is a boolean classification task. The affected models are both only returning
False
for all entries.Affected
Reproduction
Runlog (open in OS X terminal with
cat runlog.txt
to enable full colour output)