|
Polynomial Neural Networks
Polynomial neural networks (PNN) are multilayer perceptrons of neuron-like units which produce high-order
multivariate polynomial mappings. These are tree-structured hierarchical cascades of first-order and second-order
activation polynomials in the nodes, and input variables passed from the leaves. The activation polynomial outcomes
are fed forward to their parent nodes, where partial polynomial models are made. This PNN topology follows
the construction of the multilayer GMDH [Ivakhnenko,A.G., 1979] and allows to produce high-order multivariate polynomials
by composing tractable activation polynomials in the hidden network nodes.
From a theoretical point of view, polynomial neural networks are attractive for nonlinear modelling because they are:
-universal approximators with which one may approximate any continuous function mapping to an arbitrary precision;
-mathematically tractable since they assume manipulations, like decompositions and reformulations,
which make them flexible for adaptive structural identification;
- statistically tractable as they can be directly analysed with the available statistical tools.
From a practical point of view, polynomial neural networks are:
-computationally efficient, for they allow parameter estimation by well known algorithms like least squares;
-open-box models, which are easy to understand and interpret.
The process of learning PNN from data is decomposed in two separate steps: 1) finding the network structure by
global population-based search using inductive genetic programming (IGP), and 2) further adjustment of the
polynomial weights by retraining using high-order backpropagation algorithms. These two steps make a coherent
and integrated methodology for polynomial identification that enables to identify accurate, parsimonious, and
predictive models.
Experimental results show that PNN exhibit superior accuracy than many global models such as: statistical learning
networks, multilayer perceptrons, and recurrent neural networks on multivariate non-linear regression,
time-series forecasting and classification problems [Nikolaev,N. and Iba,H., 2002].
|