How to Use an Adaptive High-gain Observer in Diagnosis Problems

Fr´ed´eric Lafont

1,2

, Jean-Franc¸ois Balmat

, Nathalie Pessel

1,2

and Jean-Paul Gauthier

1,2

Universit´e du Sud-Toulon-Var, LSIS, UMR CNRS 7296, B.P 20132, 83957 La Garde Cedex, France

Institut Universitaire de Technologie de Toulon, B.P 20132, 83957 La Garde Cedex, France

Keywords:

Observer, Diagnosis, Sensor.

Abstract:

This paper explains how to use an adaptive High-Gain observer in sensor diagnosis problems. This type of

observer allows to switch between a classical Extended Kalman Filter and High-Gain observer according to

an innovation function. Combined with a standard technique of residual generation, this approach is very

efﬁcient to determine fault occurence in the non-linear dynamical systems. We present the obtained results on

a wastewater treatment system.

1 INTRODUCTION

Nowadays, systems are more and more automated in

order to reduce the human intervention. So, these sys-

tems are composed of sensors and actuators. There-

fore, it involves to deﬁne a structure enable to detect

a sensor fault or a failing actuator. The aim of such

equipment is the diagnosis of failure to avoid the eco-

nomic losses and/or the environmental risks.

The present work deals the sensor diagnosis with

an observer for non-linear dynamical systems ap-

plied to a wastewater treatment system. There is a

lot of works on the synthesis of non-linear observers

for (bio)chemical processes (Alcaraz-Gonzalez et al.,

2002; Assis and Filho, 2000; Dochain, 2008; Meth-

nani et al., 2011; Nejjari et al., 2008; Sotomayor et al.,

2002). In this study, we choose an adaptive high-

gain observer, developed already as software sensor

(Boizot et al., 2010; Lafont et al., 2011), to solve a

sensor diagnosis problem. Transition from High-Gain

(HG) mode to Extended Kalman Filter (EKF) mode

is performed via an adaptation procedure based upon

the level of innovation. In the context of large transi-

tions, the HG observer guarantees theoretical conver-

gence with arbitrary rate, under certain observability

assumptions. For small enough error of initial esti-

mation, classical EKF is more or less optimal w.r.t.

noise.

Usually a changing coordinates is necessary in or-

der to obtain an observability canonical form. In some

cases, this change of coordinates is very complicated.

To avoid this step, we write our observer in the natural

coordinates. However, the counterpart of this choice

is that the Riccati equation of the Kalman ﬁlter has

not the standard form (Lafont et al., 2011).

A such observer is “robust” compared with ini-

tial conditions and measurement noises. Although the

generation of residues is standard, we show the capa-

bility of adaptive HG-EKF observer to detect a sensor

fault.

Section 2 summarizes sensor diagnosis problems

and observer-based residual generation. In Section 3,

we recall the structure of the adaptive high-gain ob-

server, which is the multi-output version developed in

the paper (Boizot et al., 2010). Also, the crucial con-

cept of innovation, which is used in order to switch

between the EKF and HG-EKF modes, is presented.

Section 4 is devoted to the application: A wastewater

treatment plant. Finally, in Section 5, we show simu-

lation results.

2 SENSOR DIAGNOSIS AND

OBSERVER

2.1 Sensor Diagnosis

We are interested at the problem of the bias and the

drift faults. These two faults are the most common

and the most repetitive.

An output with a bias fault is deﬁned by:

= y

+ b, (1)

with y

is the measured output, y

the real output and

b the constant offset value.

185

Lafont F., Balmat J., Pessel N. and Gauthier J..

How to Use an Adaptive High-gain Observer in Diagnosis Problems.

DOI: 10.5220/0003984501850190

In Proceedings of the 9th International Conference on Informatics in Control, Automation and Robotics (ICINCO-2012), pages 185-190

ISBN: 978-989-8565-22-8

 2012 SCITEPRESS (Science and Technology Publications, Lda.)

An output with a drift fault is deﬁned by:

= y

+ d(t), (2)

with y

is the measured output, y

the real output and

d(t) the time varying offset factor. d(t) can be rep-

resented by the function: d(t) = a t + b with a and b

two constant terms.

2.2 Observer-based Residual

Generation

The main problem for the diagnosis based on ob-

servers is to ﬁnd the residues. They are neglectable

in the absence of fault and signiﬁcantly affected when

some faults occur. One difﬁculty is to make the robust

observer w.r.t. disturbances which are no faults.

So, a non-linear system can be written:

= f (x, u),

y = h(x) = Cx,

(3)

where x is the state vector, y the measured outputs

and u the control variables.

The corresponding observer is deﬁned by:

d ˆx

= g( ˆx,u),

ˆy =

Cˆx,

(4)

The output estimation error is used to residual

generation. The residual is analysed to determine

fault occurence. We apply a standard method:

= |y

− ˆy

|. (5)

The output has a fault if r

> δ

. For each out-

put, we simulate off-line in nominal operating (with-

out fault) to determine the threshold level δ

. Then,

the method is applied on-line.

3 SYSTEMS UNDER

CONSIDERATION AND

OBSERVER EQUATIONS

3.1 The Observability Canonical Form

We consider a smooth non-linear system of the form

(3) which is mapped by a diffeomorphism ψ into the

following system:

dξ

= F (ξ, u) = A(t)ξ + b(ξ, u),

y = Cξ,

(6)

where ξ ∈ R

is the state vector in observable co-

ordinates (n the system order), where u are the control

variables belonging to a certain bounded subset of R

(p the number of the control variables) and the output

y ∈ R

the number of the outputs).

The matrices A(t), C and the vector b(ξ, u) have

a following form (all details can be found in (Boizot

et al., 2010)):

A(t) =







0 a

(t) 0 ··· 0

0 0 a

(t)

. ·· ·

. ·· · ·· · 0 a

(t)

0 0 ·· · ·· · 0







C = (a

(t), 0, ··· , 0) = (Id, 0, ·· · , 0),

(7)

where Id is an identity matrix of order d

b(ξ, u) =







(ξ

, u)

(ξ

, ξ

, u)

(ξ

, · ·· , ξ

, u)







. (8)

The state vector ξ(t) is assumed to have a “block”

structure ξ =



′

·· · ξ

′



′

, where ξ

∈ R

−1

the

size of i+ 1

“block”) with d

≥ d

≥ · · · ≥ d

k−1

. The

matrices a

(t) have dimension d

i−1

×d

and belong to

a compact subset K

of the set of d

i−1

× d

matrices of

maximum rank d

The f (x, u), a

(t) and b

(ξ, u) are assumed

smooth w.r.t. ξ, u and t. The b

depend on ξ in a

“block” triangular way and are compactly supported.

3.2 Observer Structure

Let Q (n × n) , R (d

× d

) be symmetric positive

deﬁnite matrices. Let θ be the high-gain parameter,

θ ≥ 1. For θ = 1 the observer will just be an ordinary

EKF.

Set ∆ = BD



, · ·· ,

k−1



, the block diagonal

matrix with diagonal blocks Id

, · ·· . Set Q

θ∆

−1

Q∆

−1

, R

= θ

−1

The equations of the system in observable coordi-

nates are:

dξ

= Tψ



−1

(ξ)





−1

(ξ), u



dξ

= F (ξ, u) .

(9)

The equations for the HG-EKF in the observable

coordinates are:

= F(

ξ, u) + PC

′

−1

(y−C

ξ), (10)

= TF(

ξ, u) P+ P TF(

ξ, u)

′

+ Q

−PC

′

−1

CP.

(11)

ICINCO2012-9thInternationalConferenceonInformaticsinControl,AutomationandRobotics

186

In the natural coordinates we have ˆx = ψ

−1

(

ξ) =

Φ( ˆx), where ˆx denotes the estimate of x. As shown in

(Lafont et al., 2011), the equations for the HG-EKF

become:

d ˆx

= f( ˆx, u) + pC

′

( ˆx, u)R

−1

(y− h( ˆx)), (12)

= T f ( ˆx, u)p+ pT f( ˆx, u)

′

+ q

( ˆx)

−pC

′

−1

+Tψ( ˆx)

−1

ψ( ˆx)

′

−1

(h( ˆx) − y)

+pD

ψ( ˆx)

′

−1

(h( ˆx) − y)

′



Tψ( ˆx)

−1



′

(13)

where

p = TΦ





P TΦ





′

(14)

and

( ˆx) = (Tψ( ˆx))

−1



(Tψ( ˆx))

−1



′

. (15)

TF denotes the tangent mapping to the mapping

F : x → F (x), R

→ R

i.e. its Jacobian matrix in co-

ordinates. Accordingly T

F denotes the double tan-

gent, a skew-symmetric bilinear mapping, R

-valued,

and for any u ∈ R

we deﬁne the matrix D

F (x){u}

by T

F (u, v) = D

F (x){u} · v.

3.3 Innovation

The function In

, introduced below, is called the inno-

vation. This function reﬂects the quality measurement

of the estimation error on a small moving time inter-

val of size d. The strategy is to adapt the High-gain

parameter θ according to In

. Due to the observabil-

ity properties of our system, if the ˆy is far from y then

θ will increase to High-gain mode. Contrarily, if ˆy

is close to y, the innovation will be small and θ will

decrease to 1 (Kalman ﬁltering mode). For this, the

variable θ will be subject to the differential equation

(19) deﬁned just below.

Let F

(θ) be deﬁned as follows:

(θ) =



∆T

if θ ≤ θ

∆T

(θ− 2θ

)

if θ > θ

(16)

where θ

max

and ∆T small enough is a constant.

The value θ

max

depends of the studied system and is

obtained by an heuristic approach. It is bounded and

the observer remains stable.

The innovation In

(t), with forgetting horizon d,

is:

(t) =

t− d

ky(τ) − ˆy(τ)k

dτ, (17)

where ˆy(τ) is the prediction from the initial state

ˆx(t − d).

Let us deﬁne

F (θ, In

) = µ(In

(θ) + (1− µ(In

))λ(1− θ),

(18)

for a λ > 0 and with µ(In

) a sigmoid function,

µ : ]−∞;+∞[→] 0;1[ , In

→

1+e

−β·

(

−m

)

. The equa-

tion for the HG parameter θ is:

θ = F (θ, In

). (19)

4 APPLICATION

The process under consideration is a real small-size

wastewater treatment plant (WWTP) composed of a

unique aeration tank equipped with surface aerators

which provide oxygen and mix the inﬂuent wastewa-

ter with biomass (Figure 1).

Figure 1: Wastewater treatment plant.

The model used is based upon the Activated

Sludge Model N

(ASM 1) (Henze et al., 1987). Then

the biodegradation model consists of 12 state vari-

ables (Table 1). Actually, we consider only biodegra-

dation.

The state variables describing the total alkalinity

being not included. The values of stoichiometric and

kinetic parameters, as well as the inﬂuent concentra-

tions can be found in (Lafont et al., 2011).

The complete set of equations and inﬂuent con-

ditions can be found on the International Water As-

sociation task group on benchmarking of control

strategies for wastewater treatment plants website

(http://www.benchmarkwwtp.org/, 2011).

The model is of the form ˙x = f (x, u), where the

control u consists of the state u

of the turbines and

the value Q

of the inﬂuent average ﬂow. The input

in (20) is a binary sequence switching between 0

and 1 and representing the state of turbines (off/on)

that aerate the plant. We make here the reasonable as-

sumptions of three measurements: S

, S

and S

Although the WWTP with these three outputs is ob-

servable, it is too complicated for our purpose. We

use a simpliﬁed model of lower dimension that has

been developped in (Chachuat, 2001).

4.1 The Reduced Model

The author proceeds as follow:

HowtoUseanAdaptiveHigh-gainObserverinDiagnosisProblems

187

Table 1: List of variables.

Deﬁnition Notation

1. Soluble inert organic matter S

2. Readily biodegradable substrate S

3. Particulate inert organic matter X

4. Slowly biodegradable substrate X

5. Active heterotrophic biomass X

B,H

6. Active autotrophic biomass X

B,A

7. Particulate products arising from biomass decay X

8. Oxygen S

9. Nitrate and nitrite nitrogen S

10. NH

+ NH

nitrogen S

11. Soluble biodegradable organic nitrogen S

12. Particulate biodegradable organic nitrogen X

- A single organic compound, denoted X

DCO

(DCO for “chemical oxygen demand”), is formed

by adding soluble and particulate organic compound

concentrations X

DCO

= S

+ X

- It is considered that the dynamics of X

, X

and X

are slow w.r.t. the others.

By removing the three unobservable variables X

and S

, we obtain a simpliﬁed model with 5 state

variables S

, S

, X

DCO

and S

. The three

variables S

, S

and S

are observables. All these

simpliﬁcations provide the following reduced model:



− S



+ α

DCO

O,H

+ er

(y) + u

· k

a·



max

− S



(20)



− S



+ α

DCO

O,H

+ er

(y)

(21)



− S



+ α

DCO



O,H

+ η

NO,g

O,H



+ er

(y) + α

(22)

DCO



DCO

−

DCO



+α

DCO



O,H

+η

NO,g

O,H



+ α

(23)



− S



− α

+ α

DCO



O,H

+ η

NO,h

O,H



(24)

DCO

= K

DCO

= K

DCO

B,H

(25)

(y) = α

NH,A

O,A

(y) = α

NH,A

O,A

(y) = −α

NH,A

O,A

(26)

The constant k

a is the oxygen transfer coef-

ﬁcient



a = 10 h

−1



and S

max

is the dissolved

oxygen saturation concentration



max

= 8 mgl

−1



The volume of the aeration tank (V) is equal to

6000 m

. The settler is a cylindrical tank where

the solids are either recirculated to the aeration tank



= 18446 m

day

−1



or extracted from the system



= 385 m

day

−1



. The parameter values α

, α

, K

and K

DCO

are given

in Table 2.

4.2 Change of Variables

We apply the developed observer to a simpliﬁed

model (ﬁve states, three outputs). The change of vari-

ables Ψ which relates natural coordinates to observer

coordinates is trivial. It consists of setting just :

]

DCO

+ X

DCO

. (27)

The state vector x = (S

DCO

)

′

changed for ξ =



]

DCO



′

, therefore

our system is almost naturally in observable coordi-

nates. The inverse Jacobian is trivial to compute.

The choice of parameters for the adaptation of in-

novation is presented in Table 3.

5 RESULTS

Simulations with the perturbed outputs are carried

out by an additive Orstein-Uhlenbeck process. The

ICINCO2012-9thInternationalConferenceonInformaticsinControl,AutomationandRobotics

188

Table 2: Constant coefﬁcients.

Coefﬁcient α

DCO

Value - 5892 - 875 - 1648 191 - 957 150 - 17855 830 561 574 296

Table 3: Parameters for the adaptation.

Parameter value

max

300

β 1664

m 1

∆T 0.01

λ 200

δ 0.01

d 0.1

alternative control u

has been chosen: “On” dur-

ing 15 minutes and “Off” during 5 minutes. The

simulations cover 14 days and the value of the in-

put ﬂow rate Q

come from the benchmark ﬁle

(http://www.benchmarkwwtp.org/, 2011). We have

three ﬁles: One for the dry weather, one for the storm

weather and one for the rain ﬁle.

To evaluate the performances of our observer we

have compared an ordinary EKF with our adaptive

HG-EKF presented in (Lafont et al., 2011). Consider-

ing the obtained results for this system, we propose to

use this adaptive HG-EKF observer for the diagnosis.

5.1 Threshold Level for each Output

For each output, we simulate the three ﬁles, without

fault, to determine the threshold level δ

(Table 4).

Table 4: Threshold level.

File/Output S

Dry 0.2505 0.9453 0.2466

Rain 0.2440 0.9306 0.2247

Storm 0.2576 0.9605 0.2200

We have selected one threshold level by output.

For S

, the taken threshold is 0.3, for S

, 1.0 and for

, 0.3. These levels must be valid for whatever ﬁle.

5.2 Faults

The bias and the drift faults are simulated. The bias

value is equal to 1.5 and 2. The drift fault is simulated

with two curve slopes: (t −t

) and 2∗(t−t

), t

is the

fault time and t is the simulation time.

Results are presented in Tables 5 and 6 where t

the detection time. Only the results for the fault time

equal to 3 with the dry ﬁle are presented in Table 5.

Indeed, the three ﬁles have the same seven ﬁrst days.

Moreover, Tables 5 and 6 present two interesting re-

sults:

- Whatever the fault time t

, the various faults are

detected,

- If the fault is more important, it is detected

quickly.

6 CONCLUSIONS AND FUTURE

WORKS

6.1 Conclusions

We have shown that an adaptive HG-EKF observer

is efﬁcient to detect sensor faults such as bias and

drift. The proposed method imposes to determine the

threshold levels with no fault. Thanks to the “robust-

ness” (compared with noise and initial conditions) of

this observer and thethreshold choice for the residues,

the adaptive HG-EKF observer is an interesting ap-

proach for the sensor diagnosis. Moreover, the resid-

ual generation is very easy.

To improve the method, we can work with the

eigenvalues of the matrix p. The calculation of the

trace permits to give a conﬁrmation:

Trace(p) =

∑

i=1

, (28)

where V

are the eigenvalues. When there is a sensor

fault, the trace has an abrupt change (Figure 2). This

result is a complementary information but it is not sat-

isfactory, because if the value Q

increases, the trace

becomes very big and the algorithm indicates a false

alarm.

2.97 2.98 2.99 3 3.01 3.02 3.03

x 10

Day

Trace

Figure 2: Trace with a fault at t

= 3 days.

6.2 Future Works

To improve this method, we can use the trace prop-

erties. The trace value is compared with the inﬂu-

HowtoUseanAdaptiveHigh-gainObserverinDiagnosisProblems

189

Table 5: Faults detection for the dry ﬁle.

Fault/Sensor S

Bias +1.5, t

= 3 3.0002 3.012 3.0004

Bias +1.5, t

= 12 12.0001 12.158 12.0004

Bias +2, t

= 3 3.0001 3.009 3.0003

Bias +2, t

= 12 12.0001 12.028 12.0002

Drift (t − t

), t

= 3 3.222 4.455 3.215

Drift (t − t

), t

= 12 12.150 12.739 12.197

Drift 2∗ (t −t

), t

= 3 3.092 3.539 3.090

Drift 2∗ (t −t

), t

= 12 12.067 12.959 12.115

Table 6: Faults detection for the rain and storm ﬁle.

Fault/Sensor S

File

Bias +1.5, t

= 12 12.0001 12.189 12.0004

Bias +2, t

= 12 12.0001 12.013 12.0003 Rain

Drift (t −t

), t

= 12 12.176 13.528 12.189

Drift 2∗ (t −t

), t

= 12 12.086 12.933 12.084

Bias +1.5, t

= 12 12.0001 12.201 12.0005

Bias +2, t

= 12 12.0001 12.012 12.0003 Storm

Drift (t −t

), t

= 12 12.175 13.813 12.203

Drift 2∗ (t −t

), t

= 12 12.089 12.941 12.084

ent ﬂow rate by developing a “black box” (neural net-

works for example) which select the peak level to no-

tify a fault.

REFERENCES

Alcaraz-Gonzalez, V., Harmand, J., Rapaport, A., Steyer, J.-

P., Gonzalez-Alvarez, V., and Pelayo-Ortiz, C. (2002).

Software sensors for highly uncertain wwtps : a new

approach based on interval observers. Water Research,

36:2515–2524.

Assis, A. and Filho, R. (2000). Soft sensors development

for on-line bioreactor state estimation. Computers and

Chemical Engineering, 24:1099–1103.

Boizot, N., Busvelle, E., and Gauthier, J.-P. (2010). An

adaptive high-gain observer for nonlinear systems.

Automatica, 469:1483–1488.

Chachuat, B. (2001). Methodology of dynamic optimisa-

tion and optimal control of small-size activated sludge

wastewater treatment plants. PhD, Institut National

Polytechnique de Lorraine, Nancy.

Dochain, D. (2008). Bioprocess control. volume ISBN

9781848210257. ISTE.

Henze, M., Grady, C., Gujer, W., Marais, G., and Matsuo, T.

(1987). Activated sludge model n1. In IAWQ, editor,

Technical Report 1. London.

http://www.benchmarkwwtp.org/ (2011).

Lafont, F., Busvelle, E., and Gauthier, J.-P. (2011). An

adaptive high-gain observer for wastewater treatment

systems. Journal of Process Control, 21:893–900.

Methnani, S., Gauthier, J.-P., and Lafont, F. (2011). Sen-

sor fault reconstruction and observability for unknown

inputs, with an application to wastewater treatment

plants. International Journal of Control, 84.4:822–

833.

Nejjari, F., Puig, V., Giancristofaro, L., and Koehler, S.

(July 6-11, 2008). Extended luenberger observer-

based fault detection for an activated sludge process.

Proceedings of the 17th World Congress The Interna-

tional Federation of Automatic Control, Seoul, Korea,

pages 9725–9730.

Sotomayor, O., Park, S., and Garcia, C. (2002). Software

sensor for on-line estimation of the microbial activ-

ity in activated sludge systems. ISA Transactions,

41:127–143.

ICINCO2012-9thInternationalConferenceonInformaticsinControl,AutomationandRobotics

190