nitishsrivastava / deepnet

Implementation of some deep learning algorithms.
BSD 3-Clause "New" or "Revised" License
896 stars 438 forks source link

pbtxt parameters #7

Closed rdevon closed 11 years ago

rdevon commented 11 years ago

Hi, the pbtxt files are a bit difficult to configure. At least, at some parts it's intuitive given the examples, other times it is not. What are the defaults for each parameter, what are the available parameters (eg in the case of activation, "logistic", "relu", "hyperbolic tagent"?). Reading through the examples is easy, but incomplete. Reading through the code is incomplete but difficult.

Unfortunately there are some parameters for even a basic DBN which are not intuitive. "shape" is one of them: what is this parameter in terms of the bias or interaction in network, and why do i need it? Is these parameters that will be eventually used for optimization? There are two of them of the same name, and oftentimes they change with the same layer (say hidden1 in the dbn training). In addition, sometimes shape (first one) x shape (second one) = dimensions (number of hidden nodes?). Or is it shape^2 (the first one) and the second one is something else entirely?

Anyways, thanks in advance!

rdevon commented 11 years ago

Ah, the protos/deepnet.proto is a good resource for the parser, so scratch that first part.