Open woskii opened 6 years ago
The output of a node in the computational graph depends on what operation it represents and what inputs it was given. These inputs can be constants, input variables (raw data), or the outputs of previous nodes/layers, and so on. The Trainer takes the model, loss function, metric, etc., which are all defined in terms of other nodes/layers. You build the graph as you define operations. Here's an example:
```python
import cntk
from cntk.layers import Sequential, Dense

x = cntk.input_variable((2,), name='x')
y = cntk.input_variable((1,), name='y')
z = Sequential([
    Dense(3, name='dense_1'),
    Dense(1, name='dense_2')
])(x)
loss = cntk.squared_error(z, y, name='loss')
graph = cntk.logging.plot(loss, 'graph.png')
```
The arrows in the plot show the flow of information. W and b are the weights and bias of each dense layer. The input x is used to compute the output of the first dense layer; that output feeds the second dense layer, producing the model output z; and z, together with the target y, is used to compute the loss as their squared difference. Combined with a specified learner, gradients are computed and correctly propagated backwards to the appropriate nodes.
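This is why passing only the root node (the loss) to the Trainer is enough: every node holds references to its inputs, so the entire graph, including every layer's parameters, is reachable by walking backwards from the root. Here's a minimal pure-Python sketch of that idea (illustrative only, not CNTK's actual source; the `Node` class and helper are made up for this example):

```python
# Illustrative sketch: a graph node keeps references to its input
# nodes, so a single root node implicitly contains the whole graph.

class Node:
    def __init__(self, name, inputs=(), parameters=()):
        self.name = name
        self.inputs = list(inputs)          # upstream nodes
        self.parameters = list(parameters)  # trainable tensors owned by this node

def collect_parameters(root):
    """Walk the graph backwards from `root`, gathering every parameter."""
    seen, params, stack = set(), [], [root]
    while stack:
        node = stack.pop()
        if id(node) in seen:
            continue
        seen.add(id(node))
        params.extend(node.parameters)
        stack.extend(node.inputs)
    return params

# Mirror the example above: x -> dense_1 -> dense_2 -> loss
x = Node('x')
dense_1 = Node('dense_1', inputs=[x], parameters=['W1', 'b1'])
dense_2 = Node('dense_2', inputs=[dense_1], parameters=['W2', 'b2'])
y = Node('y')
loss = Node('loss', inputs=[dense_2, y])

print(sorted(collect_parameters(loss)))  # ['W1', 'W2', 'b1', 'b2']
```

CNTK exposes the same reachability directly: `loss.parameters` on a real CNTK Function returns all trainable parameters in the graph rooted at that node, which is what the Trainer uses.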
In the Python API, when creating a Trainer instance, we only need to pass it the output layer operation. I read an article that said the prior layers will be trained using the computational graph structure, but I can't find where the prior layers add themselves to the graph in the source code. Can someone tell me? Any help would be appreciated.