Study of Improved BP Algorithm based on Gradient Descent and
Numerical Optimization
Qiuhong Sun
1, 2
, Weihong Bi
1
and Xinhang Xu
3
1
YanShan University, Qinhuangdao 066004, China
2
Hebei University of Science and Technology, Shijiazhuang 050000, China
3
State Grid Hebei Electric Power Research Institute, Shijiazhuang 050000, China
sunqiuhong@hebust.edu.cn
Keywords: Gradient Descent, Numerical Optimization, Improved BP Algorithm.
Abstract: Studied limitations exist in BP model, and discussed the proposed improved algorithm based on BP neural
network. Respectively, from the third of the aspects discussed based on improved gradient descent
algorithm and improved algorithm based on numerical optimization. Research results showed that the
comprehensive method is better than the standard BP algorithm in terms of the number of iterations, the
training time and the mean square error and the like, of additional momentum and adaptive outstanding
performance parameter method. Researches showed that the Marquardt-Levenberg algorithm neural
network convergence fastest training times at least.
1 INTRODUCTION
Along with the application of information
management system indifferent fields, the data are
continuously stored in the database. Among them,
people expect to find out potential knowledge that
will help them make decisions. The emergence and
development of data mining are just based on this
expectation. As a midpoint of different subject
studies, data mining involves a great deal of subjects
such as database, statistic, machine learning,
artificial intelligence, high performance computing,
pattern recognition and data visualization etc.
Among them, artificial neural network is widely
used due to its ability of inherent non-linear
processing, adaptive learning and high fault-
tolerance.
BP neural network is a feedforward network, the
most representative type of network. This class is a
multilayer neural network model that map neural
network, using a minimum variance of learning, is
one of the most widely used neural network model.
Multilayer Perceptron network is a kind of three or
more sectors of neural networks. A typical
multilayer perceptron network is three, feedforward
class network, namely: input layer, hidden layer
(also called intermediate layer), the output layer.
Each neuron between adjacent layers achieves full
connection, that is, each neuron and the next layer
on layer of each neuron to achieve full connection,
and each connection between neurons is not. In
practical applications, BP network can be used for
classification, regression and time series forecasting
and other data mining applications, and pattern
recognition problem research, nonlinear mapping
problem, such as handwriting recognition, image
processing, predictive control, function
approximation, data compression and so on.
2 LIMITATIONS OF THE
ALGORITHM
In data mining field, although BP network model is
currently the most widely used network model, gets
good benefits in practical application, and is
maturing in theory, but it also has its own limitations
and shortcomings, its uncertainty performance in the
training process. These limitations are mainly
concentrated in three areas: slow convergence and
easy to fall into local minima and completely unable
to train a network phenomenon.
In recent years, many researchers made many
useful improvements on BP network, put forward a
number of algorithms to improve the program, such
as rapid BP propagation algorithm, the extended
Kalman filter algorithm, the second-order
452