cosines as the activation function correspond to a spe-
cial case of Fourier networks that can approximate a
Fourier series for a given function. Cybenko deve-
loped a rigorous demonstration that MLPs with only
one hidden layer of processing elements is sufficient
to approximate any continuous function with support
in a hypercube (Cybenko, 1989).
The theorem is directly applied to MLP. The sig-
moid, radial basis and wavelets functions are a com-
mon choice for the network construction since it sa-
tisfies the conditions imposed in the theorem. The
theorem of function approximation provides a mathe-
matical basis that gives support to the approximation
of any continuous arbitrary function. Furthermore, it
defines for the case of MLP that a network composed
of only one hidden layer neurons is sufficient to com-
pute, in a given problem, a mapping from the input
space to the output space, based on a set of training
examples. However, with respect to training speed
and ease of implementation, the theorem does not pro-
vide any insight about the solutions developed. The
choice of activation functions and the learning algo-
rithm defines which particular network is used. In any
situation, the neurons operate as a set of functions that
generate an arbitrary basis for function approximation
which is defined based on the information extracted
from the input-output pairs. For training a feedfor-
ward network, the backpropagation algorithm is one
of the most frequently employed in practical applica-
tions and can be seen as an optimization.
3 WAVELET FUNCTIONS
Two categories of wavelet functions, namely, or-
thogonal wavelets and wavelet frames (or non-
orthogonal), were developed separately by different
interests. An orthogonal basis is a family of wavelets
that are linearly independent and mutually orthogo-
nal, this eliminates the redundancy in the representa-
tion. However, orthogonal wavelets bases are difficult
to construct because the wavelet family must satis-
fy stringent criteria (Daubechies, 1992; Chui, 1992).
This way, for these difficulties, orthogonal wavelets
is a serious drawback for their application to func-
tion approximationand process modeling (Oussar and
Dreyfus, 2000). Conversely, wavelet frames are con-
structed by simple operations of translation and di-
lation of a single fixed function called the mother
wavelet, which must satisfy conditions that are less
stringent than orthogonality conditions.
Let ϕ
j
(x) a wavelet, the relation:
ϕ
j
(x) = ϕ(d
j
.(x−t
j
))
where t
j
is the translation factors and d
j
is the dilation
factors ∈ R. The family of functions generated by ℧
can be defined as:
℧ =
ϕ(d
j
.(x−t
j
)),t
j
and d
j
∈ R
A family ℧ is said to be a frame of L
2
(R) if there
exist two constants c > 0 and C < ∞ such that for any
square integrable function f the following inequali-
ties hold:
ck fk
2
≤
∑
j
| < ϕ
j
, f > |
2
≤ Ck fk
2
where ϕ
j
∈ ℧, k fk denotes the norm of function f
and < ϕ
j
, f > the inner product of functions. Fa-
milies of wavelet frames of L
2
(R) are universal ap-
proximators (Zhang and Benveniste, 1992; Pati and
Krishnaprasad, 1993). In this work, we will show
that wavelet frames allow practical implementation of
multidimensional wavelets. This is important when
considering problems of large input and output di-
mension. For the modeling of multi-variable pro-
cesses, such as, the artificial neural networks bio-
logically plausible, multidimensional wavelets must
be defined. In the present work, we use multidi-
mensional wavelets constructed as linear combination
of sigmoid, denominated Polynomial Powers of Sig-
moid Wavelet (PPS-wavelet).
3.1 Sigmoidal Wavelet Functions
In (Funahashi, 1989) is showed that:
Let s(x) a function different of the constant func-
tion, limited and monotonically increase. For any
0 < α < ∞ the function created by the combination
of sigmoid is described in Equation 1:
g(x) = s(x+ α) − s(x− α) (1)
where g(x) ∈ L
1
(R), i.e,
Z
∞
−∞
g(x) < ∞
in particular, the sigmoid function satisfies this pro-
perty.
Using the property came from the Equation 1, in
(Pati and Krishnaprasad, 1993) boundary suggest the
construction of wavelets based on addition and sub-
traction of translated sigmoidal, which denominates
wavelets of sigmoid. In the same article show a pro-
cess of construction of sigmoid wavelet by the substi-
tution of the function s(x) by ϒ(qx) in the Equation 1.
So, the Equation 2 is the wavelet function created in
(Pati and Krishnaprasad, 1993).
ψ(x) = g(x+ r) − g(x− r) (2)
BIOSIGNALS 2008 - International Conference on Bio-inspired Systems and Signal Processing
262