The most commonly-used neural networks are multilayer perceptrons, or MLPs. The
nodes in an MLP are arranged into layers : an input layer, then one or more hidden layers
and an output layer, where each layer is fully connected to the next layer in the network.
Connections between layers can only be directed forwards in the network: for instance,
connections can be made from a hidden layer to the output layer but connections from the
output layer back to a hidden layer are not allowed. Although not strictly necessary, the
same activation function is usually dened for all hidden and output nodes in an MLP.
Using an MLP consists of activating the input nodes, propagating that input between
the network’s layers and reading the output from its output nodes.
The most common MLP has one input layer, one hidden layer and one output layer
and is known as a two-layer network (the input layer is ignored when counting the number
of layers in a network). There are some variants of MLPs, for example a soft committee
network or soft committee machine [4] is a two-layer MLP whose hidden to output weights
are not allowed to vary, and are usually set to one, and output nodes whose activation
functions are or are proportional to a simple sum of their input activations. Soft committee
machines are used as example networks for much of this document and preserve most of
the characteristics of general two layer networks [20]. Reviews of other MLP variants and
their learning methods can be found in [5][9]).