Local Decay of Residuals in Dual Gradient Method Applied to MPC
Studied using Active Set Approach
Matija Perne, Samo Gerkšič and Boštjan Pregelj
Jožef Stefan Institute, Jamova cesta 39, Ljubljana, Slovenia
Keywords: Model Predictive Control, Gradient Method, Optimization.
Abstract: A dual gradient method is used for solving quadratic programs resulting from a model predictive control problem in a real-time control context. The evolution of iterates and residuals throughout the iterations of the method
is studied. In each iteration, the set of active inequality constraints whose corresponding components of the
Lagrange multiplier are non-zero can be defined. It is found that the set of active constraints tends to stay
constant through multiple iterations. Observing the decay of residuals for intervals where the set of active
constraints is constant leads to interesting findings. For such an interval, the dual residual can be expressed in a basis such that its components decay independently, uniformly, and at predictable rates. The basis and the rates only depend on the system matrices and the set of active constraints. The calculated decay rates match
the rates observed in numerical simulations of MPC control, which is shown for the AFTI-16 benchmark
example.
1 INTRODUCTION
Model predictive control (MPC) is traditionally lim-
ited to processes with relatively slow dynamics be-
cause of the computational complexity of online op-
timization (Qin and Badgwell, 2003). In the last
decade, a considerable advance has been made in the
field of fast online optimization (Ferreau et al., 2008;
Wang and Boyd, 2010; Mattingley et al., 2011; Mat-
tingley and Boyd, 2012; Domahidi et al., 2012; Hart-
ley et al., 2014; Ferreau et al., 2014).
The advantages of MPC appear promising for
the implementation of advanced plasma current and
shape control in a tokamak fusion reactor (Gerkšič and Tommasi, 2014). In particular, we are focusing
on fast online implementations of first-order methods
adapted for use with MPC (Richter, 2012; Gisels-
son, 2013; Kouzoupis, 2014; Giselsson and Boyd,
2015; Patrinos et al., 2015). Compared to active-set or
interior-point methods, they may require a consider-
able number of iterations to converge to the optimum.
However, each iteration is relatively simple, so that
the implementation is possible in restricted hardware,
and the methods were found to be computationally
efficient for the computation of quadratic programs
arising from MPC where a relatively low accuracy of
the solution is sufficient.
We are highly interested in complexity certifica-
tion, which proves that a solution with an acceptable
tolerance can be found in a certain maximum number
of iterations (limited time). However, useful certifica-
tion is currently not available for any of the relevant
methods in practical cases of control problems with
state constraints. Despite this, a number of methods
typically converge reasonably fast, hence our interest
in the practical rates of decay of residuals.
In this work, we examine the rates of decay of
residuals observed with a dual gradient method. In
our MPC simulations, we have observed very differ-
ent decay rates, which we found to be in close rela-
tion with the sets of constraints that were active in
the corresponding intervals. We present a theoretical
expression for the decay rates in intervals with a con-
stant set of active constraints, which can be computed
from the system matrices and the set of constraints.
The result is illustrated with MPC application to the
AFTI-16 control benchmark (Kapasouris et al., 1990;
Giselsson, 2013).
2 PROBLEM DESCRIPTION
2.1 MPC
Optimal control of a linear system with constraints
and with a quadratic cost in discrete time with finite
horizon N (Giselsson, 2013) is investigated. The dynamics is:
$$x(t+1) = A x(t) + B u(t), \quad (1)$$
where t is the time index, x is the system state, u is
the system input, the matrices A and B describe the
system dynamics. Value of x(0) is known. Possible
system states and inputs are constrained to x ∈ X, u ∈ U, where X ⊆ R^l and U ⊆ R^m are polyhedra. A cost function is defined:
$$J = \frac{1}{2} \sum_{k=0}^{N} \left[ (x_k - x_{\mathrm{ref}})^T Q (x_k - x_{\mathrm{ref}}) + (u_k - u_{\mathrm{ref}})^T R (u_k - u_{\mathrm{ref}}) \right] \quad (2)$$
where Q and R are symmetric positive semidefinite cost matrices. The constant vectors x_ref and u_ref are reference values.
The control is obtained by minimizing the cost
function J with respect to constraints
$$\begin{aligned}
J^* = \min_{x_0,\dots,x_N,\,u_0,\dots,u_N}\; & J(x_k, u_k) \\
\text{subject to}\quad & x_{k+1} = A x_k + B u_k, \\
& x_k \in X,\; u_k \in U, \\
& x_0 = x(0). \quad (3)
\end{aligned}$$
The problem of finding the optimizer (x_0, ..., x_N, u_0, ..., u_N) is a quadratic program (QP) (Boyd and Vandenberghe, 2004). With the receding-horizon implementation, u_0 is applied as the current value of the controller output u(0).
2.2 Quadratic Program
We use the condensed form of the QP, in which only the u_k are assembled into the optimization variable. The x_k-dependent terms of the cost function and the equality constraints corresponding to the system dynamics are substituted using (1) (Ullmann and Richter, 2012).
The QP (3) can thus be written as
$$\begin{aligned}
\text{minimize}\quad & \tfrac{1}{2} z^T H z + c^T z \\
\text{subject to}\quad & C z \le b. \quad (4)
\end{aligned}$$
The vector z ∈ R^{k·m} is the optimization variable that can be constructed as
$$z = \begin{bmatrix} u_0 \\ u_1 \\ \vdots \\ u_N \end{bmatrix}.$$
The Hessian H is symmetric positive semidefinite by
construction. We are particularly interested in exam-
ples with positive definite H, which is often the case.
Lagrange duality is used. Unconstrained optimization
$$\text{minimize}\quad \tfrac{1}{2} z^T H z + c^T z + v^T C z \quad (5)$$
for a constant value of the vector v leads to the solution of a related quadratic program with different constraints (Everett, 1963):
$$\begin{aligned}
\text{minimize}\quad & \tfrac{1}{2} z^T H z + c^T z \\
\text{subject to}\quad & C z \le b_0. \quad (6)
\end{aligned}$$
The Lagrange multiplier v can be adjusted until b_0 = b, so that the solution of (5) solves the original problem (4). The search for the correct value of the Lagrange multiplier is called solving the dual problem (Boyd and Vandenberghe, 2004).
2.3 Dual Proximal Gradient Method Algorithm
The indicator function of the feasible set for d ≤ b, g(d), is defined as
$$g(d) = \begin{cases} 0; & d \le b \\ \infty; & \text{otherwise.} \end{cases} \quad (7)$$
Its conjugate function g*(d) is defined as
$$g^*(d) = \sup_z \left( d^T z - g(z) \right). \quad (8)$$
The iteration scheme is best described using the proximity operator
$$\mathrm{prox}^L_\psi(d) = \arg\min_y \left( \psi(y) + \tfrac{L}{2}\, \| y - d \|^2 \right). \quad (9)$$
The algorithm of the dual gradient method is
$$y^k = \arg\min_z \left( \tfrac{1}{2} z^T H z + c^T z + (v^k)^T C z \right) \quad (10)$$
$$v^{k+1} = \mathrm{prox}^L_{g^*}\!\left( v^k + C y^k \right), \quad (11)$$
where the gradient of the Lagrange dual function of the objective function in (4) is Lipschitz continuous with the constant L. The highest eigenvalue of M = C H^{-1} C^T can always be used as L (Giselsson and Boyd, 2014). By scaling C, L = 1 can be achieved, so without loss of generality L = 1 is assumed and the notation prox^1_{g*}(d) = prox_{g*}(d) is used.
It follows from the Moreau decomposition (Rockafellar, 1970, Theorem 31.5) (Giselsson and Boyd, 2014) that the prox operator of the conjugate of an indicator function is
$$\mathrm{prox}_{g^*}(d) = d - \mathrm{prox}_g(d). \quad (12)$$
From the definition of the proximity operator it follows that prox_g(d) is the projection of d onto the feasible
set for d ≤ b, so it is a min operation and computa-
tionally inexpensive, rendering the whole (11) inex-
pensive. In addition, a closed form of the solution for
(10) exists:
$$y^k = -H^{-1}\left( C^T v^k + c \right) \quad (13)$$
for every positive definite Hessian. The dual proximal gradient method can thus be practically applied to solving QPs of the form discussed in (4).
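To make the iteration concrete, the following is a minimal numpy sketch of (10), (11), and (13), assuming a positive definite H and that C and b have already been scaled so that L = 1; the function name and structure are illustrative, not taken from any particular solver.

```python
import numpy as np

def dual_gradient_qp(H, c, C, b, iters=1000):
    """Sketch of the dual proximal gradient method (10)-(13).
    Assumes H is positive definite and the problem is scaled so that L = 1."""
    H_inv = np.linalg.inv(H)        # a factorization would be reused in practice
    v = np.zeros(C.shape[0])        # Lagrange multiplier, v >= 0
    y = None
    for _ in range(iters):
        y = -H_inv @ (C.T @ v + c)      # (13): primal iterate for the current multiplier
        d = v + C @ y
        v = d - np.minimum(d, b)        # (11) with (12): prox of g* = d - projection onto {d <= b}
    return y, v
```

In an embedded setting, H^{-1}C^T and H^{-1}c can be precomputed offline, so that each iteration reduces to matrix-vector products and componentwise operations.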
3 RATES OF DECREASE OF RESIDUALS
We define active constraints in step k as those whose corresponding components of v^k are positive. The set of these constraints is the active set in step k, labelled ω_k. The vector of the positive components of v^k is labelled v^k[ω_k]. The matrix formed from the rows of C that correspond to active constraints is labelled C[ω_k]. The matrix formed from the intersections of the rows and columns of M corresponding to active constraints is M[ω_k|ω_k] and is a principal submatrix of M. It is found that the eigenvalues of M[ω_k|ω_k] determine the rate of decrease of residuals.
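As a small illustration of these definitions (the helper names are ours, chosen for clarity), the active set and the corresponding principal submatrix can be extracted as follows:

```python
import numpy as np

def active_set(v):
    """Indices of active constraints: components of the multiplier v that are positive."""
    return np.flatnonzero(v > 0)

def principal_submatrix(M, omega):
    """M[omega|omega]: intersection of the rows and columns of M indexed by omega."""
    return M[np.ix_(omega, omega)]
```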
Consider three iterations, k, k+1, k+2, for which the active set remains constant (ω_k = ω_{k+1} = ω_{k+2} = ω). Let us define the dual residuals
$$\Delta^k = v^{k+1} - v^k \quad (14)$$
$$\Delta^{k+1} = v^{k+2} - v^{k+1}$$
and analyse the relationship between Δ^k and Δ^{k+1}.
From (11), we get
$$v^{k+1} = \mathrm{prox}_{g^*}\!\left( v^k + C y^k \right). \quad (15)$$
According to (12),
$$v^{k+1} = v^k + C y^k - \mathrm{prox}_g\!\left( v^k + C y^k \right). \quad (16)$$
Equation (16) combines with (13) into
$$v^{k+1} = v^k - M v^k - C H^{-1} c - \mathrm{prox}_g\!\left( v^k + C y^k \right). \quad (17)$$
The components of Δ^{k,k+1} that do not form Δ^{k,k+1}[ω] are equal to 0, so it is sufficient to study Δ^{k,k+1}[ω]. By the definition of ω, (17) turns into
$$v^{k+1}[\omega] = \left( v^k - M v^k - C H^{-1} c - b \right)[\omega], \quad (18)$$
where we have taken into account that prox_g(v^k + C y^k)[ω] = b[ω], following from the definition of ω.
Multiplying out,
$$v^{k+1}[\omega] = v^k[\omega] - M[\omega|\omega]\, v^k[\omega] - \left( C H^{-1} c + b \right)[\omega] \quad (19)$$
follows. Definition (14) leads to
$$\Delta^k[\omega] = v^{k+1}[\omega] - v^k[\omega]. \quad (20)$$
Equations (19) and (20) give
$$\Delta^k[\omega] = -M[\omega|\omega]\, v^k[\omega] - \left( C H^{-1} c + b \right)[\omega]. \quad (21)$$
Δ^{k+1} can be expressed with Δ^k:
$$\begin{aligned}
\Delta^{k+1}[\omega] &= -M[\omega|\omega]\, v^{k+1}[\omega] - \left( C H^{-1} c + b \right)[\omega] \\
\Delta^{k+1}[\omega] &= -M[\omega|\omega] \left( v^k[\omega] + \Delta^k[\omega] \right) - \left( C H^{-1} c + b \right)[\omega] \\
\Delta^{k+1}[\omega] &= \Delta^k[\omega] - M[\omega|\omega]\, \Delta^k[\omega] \\
\Delta^{k+1}[\omega] &= \left( I - M[\omega|\omega] \right) \Delta^k[\omega], \quad (22)
\end{aligned}$$
where I is the identity matrix of the same size as M[ω|ω].
Since M[ω|ω] is symmetric, its eigenvectors are orthogonal (Graselli, 1975, p. 83) and (22) can be conveniently diagonalized using the eigendecomposition into
$$d^{k+1} = (I - D)\, d^k. \quad (23)$$
Here, D = V^T M[ω|ω] V, d^{k,k+1} = V^T Δ^{k,k+1}[ω], V is orthogonal with its columns being eigenvectors of M[ω|ω], and D is diagonal with the eigenvalues λ^ω_i of M[ω|ω] on the diagonal.
In each iteration, the i-th component of d^k gets multiplied by 1 − λ^ω_i. If 0 < λ^ω_i < 2, the component is decreasing toward 0. It can be shown that the components of v^k that lie in the nullspace of M[ω|ω] do not affect y^k (see appendix). So, if λ^ω_i = 0, the corresponding component does not influence the primal solution y. Moreover, d^k_i = 0 for λ^ω_i = 0 if ω is a feasible active set, as shown in the appendix. For a given ω, the quadratic norm of the residual is thus decreasing toward 0 if 0 ≤ λ^ω_i < 2 for every λ^ω_i. The slowest-decaying component of the residual corresponds to the lowest non-zero λ^ω_i; components of the residual corresponding to higher λ^ω_i have faster dynamics. If the ω being studied is the final active set and will not change in subsequent iterations, the lowest non-zero λ^ω_i determines the convergence rate.
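A sketch of how the local rate can be evaluated numerically under these assumptions: the slowest residual component contracts by the factor 1 − λ_min per iteration, where λ_min is the smallest non-zero eigenvalue of M[ω|ω]; the tolerance used to discard numerically zero eigenvalues is an arbitrary choice.

```python
import numpy as np

def local_decay_factor(M, omega, tol=1e-12):
    """Per-iteration factor 1 - lambda_min of the slowest-decaying residual component,
    lambda_min being the smallest non-zero eigenvalue of M[omega|omega]."""
    M_omega = M[np.ix_(omega, omega)]
    lam = np.linalg.eigvalsh(M_omega)     # real eigenvalues of the symmetric submatrix
    lam_nonzero = lam[lam > tol]          # drop (numerically) zero eigenvalues
    return 1.0 - lam_nonzero.min()
```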
From Theorem 4.3.15 in (Horn and Johnson, 1990) it follows that the lowest eigenvalue of the symmetric matrix M is a lower bound for the eigenvalues of the principal submatrices M[ω|ω] of M, and the highest eigenvalue of M is an upper bound for the eigenvalues of M[ω|ω]. M is positive semidefinite, so its lowest eigenvalue is greater than or equal to 0. It has been assumed that the highest eigenvalue of M is set below or equal to 1 by multiplying C and b with a positive constant. This guarantees convergence of the method for
L = 1. Scaling the problem influences the local rate of decrease of the residual: enlarging M also enlarges each M[ω|ω] and its eigenvalues, among them each lowest non-zero eigenvalue that determines the local rate of decrease of the slowest component of the residual.
In MPC-related practical examples, the lowest
eigenvalue of M is typically equal to 0. The theo-
rem is thus not sufficient to derive a positive lower
bound for the lowest non-zero eigenvalue of the prin-
cipal submatrices M[ω|ω]. In particular, the lowest
non-zero eigenvalue of a principal submatrix M[ω|ω]
can be smaller than the lowest non-zero eigenvalue of
M.
It is worth noting that analysing the eigenvalues of M[ω|ω] for a given ω is sufficient for determining the local convergence rate. The convergence rate estimates only depend on system matrices that are independent of the current MPC system state and reference. The vector of constraint limits b and the gradient c do not influence the convergence rate directly, only through ω.
4 PRACTICAL EXTENSIONS
4.1 Preconditioning or Generalization
Use of the generalized prox operator in place of the
prox operator can improve convergence (Giselsson,
2013). The generalized prox operator is defined in
the following way:
$$\mathrm{prox}^{L_\mu}_\psi(d) = \arg\min_y \left( \psi(y) + \tfrac{1}{2}\, \| y - d \|^2_{L_\mu} \right), \quad (24)$$
where L_µ is chosen to be a diagonal positive definite matrix. Equation (11) is modified using the generalized prox operator to obtain
$$v^{k+1} = \mathrm{prox}^{L_\mu}_{g^*}\!\left( v^k + C y^k \right). \quad (25)$$
Consider the following quadratic program:
$$\begin{aligned}
\text{minimize}\quad & \tfrac{1}{2} z^T H z + c^T z \\
\text{subject to}\quad & \tilde{C} z \le \tilde{b}, \quad (26)
\end{aligned}$$
where C̃ stands for EC and b̃ for Eb, E being a diagonal positive definite matrix. Taking into account (Giselsson, 2013)
$$\mathrm{prox}^{L_\mu}_{g^*}(d) = d - L_\mu^{-1}\, \mathrm{prox}^{L_\mu^{-1}}_{g}\!\left( L_\mu d \right), \quad (27)$$
it follows that using (10, 11) to solve (26) is equivalent to solving (4) using (10, 25) if E = L_µ^{-1}.
The procedure of generalization or preconditioning consists of finding a suitable E and changing the QP in (4) into the one in (26). It is then solved using (10, 11). The matrix M is replaced by M̃ = C̃ H^{-1} C̃^T = EME. The highest eigenvalue of M̃ is set by scaling E and should be at most 1 for convergence to be guaranteed taking L = 1. Better choices of E lead to larger lowest positive eigenvalues of the encountered M̃[ω_k|ω_k].
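As a sketch of the preconditioning step, assuming a diagonal scaling E has already been chosen (e.g. by a tool such as QPgen), the constraint rows can be scaled and renormalized so that the largest eigenvalue of M̃ = EME equals 1; the function and variable names are illustrative.

```python
import numpy as np

def precondition(H, C, b, E_diag):
    """Scale the constraints by a given diagonal E and normalize so that the
    largest eigenvalue of M_tilde = E C H^{-1} C^T E is 1 (allowing L = 1)."""
    C_t = E_diag[:, None] * C           # E C (row-wise scaling)
    b_t = E_diag * b                    # E b
    M_t = C_t @ np.linalg.solve(H, C_t.T)
    scale = np.sqrt(np.linalg.eigvalsh(M_t).max())
    return C_t / scale, b_t / scale     # largest eigenvalue of the scaled M_tilde is now 1
```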
4.2 Upper and Lower Boundaries
In MPC, it is typical to have upper and lower bounds
on the same signals, which leads to the same linear
functionals of the optimization variable in QP having
both upper and lower bounds as well. If the QP is
given in the form of (4), C can thus be written as
$$C = \begin{bmatrix} C_1 \\ -C_1 \end{bmatrix}. \quad (28)$$
Then the inequality from (4) can be reformulated as
$$b_1 \le C_1 z \le b_2. \quad (29)$$
In the computational code, C is replaced with C_1, resulting in smaller matrices (among them M) and in halving the length of the Lagrange multiplier; the same component now corresponds to both the upper and the lower boundary, depending on its sign. It follows that g(d) becomes the indicator function of the feasible set for b_1 ≤ d ≤ b_2, which causes prox_g(d) to turn into a projection onto a box that can be implemented using a min and a max operation.
The modification does not change the theoretical behaviour of the system but serves to lower the computational cost. Importantly, rewriting (4) in the form of (28) halves the size of M while keeping the same eigenvalues of M̃[ω|ω] and preserving the relationship between M̃[ω|ω] and the local rate of decrease of the residual. In the modified system, Lagrange multiplier components can be negative, and constraints corresponding to negative components are included in ω.
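With the box form (29), a single multiplier per constraint pair is kept and the projection becomes a clip onto [b_1, b_2]; the following is a minimal sketch of the correspondingly modified dual update (11), with illustrative names.

```python
import numpy as np

def dual_update_box(v, y, C1, b1, b2):
    """Dual update (11) for b1 <= C1 z <= b2: the prox of the conjugate indicator
    is d minus the clipping of d onto the box [b1, b2]."""
    d = v + C1 @ y
    return d - np.clip(d, b1, b2)   # positive where the upper bound is active, negative for the lower one
```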
4.3 Soft Constraints
A QP resulting from an MPC problem with state constraints may not be feasible, meaning it does not necessarily have a solution. However, in practice we want the controller to produce a sensible output u also when the constraints cannot be satisfied. One way to achieve this is by relaxing the inequality state constraints in (4) with a slack variable which is penalized in the cost function. The QP expands to the form
$$\begin{aligned}
\text{minimize}\quad & \tfrac{1}{2} z^T H z + c^T z + w^T s \quad (30) \\
\text{subject to}\quad & C z \le b + s, \quad (31)
\end{aligned}$$
where s ∈ R^n_+ is the slack variable and w ∈ R^n_+ is its weight (Giselsson, 2013; Kouzoupis, 2014). If the QP in (4) has a feasible solution and if the weight w is large enough, the solution of the new problem is the solution of the original QP and s = 0. For smaller w or for an infeasible QP, s has non-zero components.
A way to efficiently implement soft constraints with a linear cost of constraint violation can be seen by comparing (5) to (30). If w is taken as an upper bound for the Lagrange multipliers v, the dual proximal gradient method solves the soft-constrained problem. The components of the dual residual corresponding to the violated soft constraints are 0; thus violated soft state constraints appear among the inactive constraints when calculating the local rate of decrease of the residual.
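One way this observation could be realized in code is to cap the multipliers at the soft-constraint weights after the standard update; the sketch below assumes the single-sided form Cz ≤ b + s and a weight vector w_soft with np.inf entries for hard constraints (both names are illustrative).

```python
import numpy as np

def dual_update_soft(v, y, C, b, w_soft):
    """Dual update (11) with linear soft constraints implemented by bounding
    the multipliers from above by their penalty weights (np.inf for hard ones)."""
    d = v + C @ y
    v_new = np.maximum(d - b, 0.0)    # standard update for Cz <= b
    return np.minimum(v_new, w_soft)  # multipliers of violated soft constraints saturate at w
```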
5 EXAMPLE
A discrete-time form of the AFTI-16 benchmark
model as in (Giselsson, 2013) has the system matri-
ces
$$A = \begin{bmatrix} 0.9993 & 3.0083 & 0.1131 & 1.6081 \\ 0.0000 & 0.9862 & 0.0478 & 0.0000 \\ 0.0000 & 2.0833 & 1.0089 & 0.0000 \\ 0.0000 & 0.0526 & 0.0498 & 1.0000 \end{bmatrix}, \qquad B = \begin{bmatrix} 0.0804 & 0.6347 \\ 0.0291 & 0.0143 \\ 0.8679 & 0.0917 \\ 0.0216 & 0.0022 \end{bmatrix}$$
in (1). The constraints are:
$$X = \left\{ x \in \mathbb{R}^4;\; -0.5 - s_1 \le x_2 \le 0.5 + s_1,\; -100 - s_2 \le x_4 \le 100 + s_2 \right\}$$
$$U = \left\{ u \in \mathbb{R}^2;\; -25 \le u_1 \le 25,\; -25 \le u_2 \le 25 \right\}. \quad (32)$$
The constraints on the components of x are soft, the linear weight on the components of the slack is w = [10^5, 10^5]^T, and w^T s(k) is added to the sum term in (2). The cost matrices are
$$Q = \mathrm{diag}(10^{-4},\, 10^2,\, 10^{-3},\, 10^2), \qquad R = \mathrm{diag}(10^{-2},\, 10^{-2}). \quad (33)$$
The reference x_ref is 0 in all components except for the first 50 time steps of the simulation, during which the component corresponding to x_4 is 10. The initial state at the beginning of the simulation is x(0) = [0, 0, 0, 0]^T.
A family of QPs in condensed form (4) corresponding to the MPC problem is formed for N = 10 using QPgen (QPgen, 2014; Giselsson and Boyd, 2014). The matrices C and H are constant, while the vectors b and c depend on the system state and the reference. The QPs are 20-dimensional (10 × 2 components of u) with 40 rows in C (limits on 10 × 2 input signals and 10 × 2 state components).
The preconditioning diagonal matrix E is chosen so as to minimize the condition number of the non-singular part of M while setting the highest eigenvalue of M to 1. QPgen finds E = diag(10.4796, 3.5413, 9.9973, 10.0080, 9.9987, 10.0005, 10.0000, 10.0037, 9.9990, 9.9997, 10.0001, 10.0033, 9.9989, 9.9979, 10.0003, 10.0036, 9.9999, 9.9965, 9.9972, 10.0034, 0.2058, 0.0918, 0.1003, 0.1000, 0.1007, 0.1001, 0.1005, 0.1000, 0.1004, 0.1001, 0.1007, 0.1000, 0.1004, 0.1001, 0.1009, 0.0999, 0.1004, 0.1000, 0.1013, 0.1000). The model is initially simulated in closed loop with 10^6 iterations in every time step for 100 time steps. The system state x is recorded at every time step and used as the initial state for observing convergence of the resulting QPs. The algorithm behaviour is analysed in all time steps for 1 to 3000 iterations.
Figure 1 shows the active sets through iterations
for the first 10 time steps. White columns are delim-
iters of time steps.
In Figure 2, convergence through the active set changes in time step 1 is shown graphically. First we analyse the behaviour with the final ω. The quadratic norm of the primal residual is 0.22805 in the 300th iteration and 8.74325 × 10^{-12} in the 1800th iteration, so the difference gets multiplied by 0.98414 in each iteration. This gives an estimate of 1 − 0.98414 = 0.01586 for the smallest non-zero eigenvalue of the relevant M[ω|ω]. In fact, 20 constraints are active during the considered iterations (and no soft ones are violated), 9 of them soft, corresponding to state constraints, and 11 hard, corresponding to input constraints. The matrix M[ω|ω] is non-singular and its lowest eigenvalue is 0.01587, showing good agreement with the numerical behaviour.
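The per-iteration factor quoted above can be recovered directly from two residual norms and compared with 1 − λ_min of M[ω|ω]; a small sketch reproducing the reported quantities for time step 1 (the helper local_decay_factor is the illustrative one sketched in Section 3):

```python
# Observed quadratic norms of the primal residual in iterations 300 and 1800 of time step 1
r_300, r_1800 = 0.22805, 8.74325e-12
factor = (r_1800 / r_300) ** (1.0 / (1800 - 300))   # per-iteration decay factor
print(factor)         # approx 0.98414
print(1.0 - factor)   # approx 0.01586, close to the lowest eigenvalue 0.01587 of M[omega|omega]
```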
There is another, longer interval in the first sample where the active set does not change, between the 24th and 153rd iteration. The set has 22 elements, 9 of which are soft constraints and 13 hard; no constraint is violated. If the dual residual is transformed into the eigenspace of M[ω|ω], one expects the components to decay at rates determined by their corresponding eigenvalues of M[ω|ω]. These components are plotted in Figure 3. It can be seen that the 22 lines decay with different slopes in the relevant region and that the ones listed higher in the legend decay more slowly. They are listed in the figure in ascending order with respect to the eigenvalues (the first 2 correspond to the eigenvalues that are 0 and stay constant).
[Figure 1 plot: horizontal axis "Active set number", vertical axis "Active set element".]
Figure 1: The list of active sets for the first 10 samples. The constraints are listed from the bottom to the top: the first 20
correspond to the input constraints on u from 1st to Nth time step, the following 20 are from constraints on x, again for 10
steps. Samples are listed from left to right with white columns separating them. For a given sample, the first active set is on
the left and every change of the active set results in a new column. Yellow fields mean active upper limit constraint. Blue
fields are for the lower limit. Green fields are inactive constraints or violated soft constraints.
[Figure 2 plot: horizontal axis "Iteration", vertical axis "Quadratic norm of residual", logarithmic scale.]
Figure 2: Convergence of the quadratic norm of the primal (blue) and the dual (red) solution in time step 1 as a function of
the iteration number. The yellow circles mark iterations in which the active set changes.
Similarly, the active set is constant from the 161st iteration on, has 20 elements, and M[ω|ω] has no eigenvalues equal to 0. The corresponding eigendecomposition of the dual residual is shown in Figure 4.
Similar results for time step 76 are shown in Figures 5 and 6, and in Figure 7 the non-transformed components of the dual residual are shown for comparison. For the majority of iterations, ω has 4 elements and the lowest eigenvalue of M[ω|ω] is 0.1266, so convergence is faster than in time step 1.
6 CONCLUSION
The different rates of convergence observed in numerical simulations of the dual gradient method are explained with the theoretical computation of the local rate of decrease of residuals.
[Figure 3 plot: horizontal axis "Iteration", vertical axis "Component of residual (iteration 24-153)", logarithmic scale.]
Figure 3: The components of the dual residual parallel to eigenvectors of M[ω|ω] between iterations 24 and 153 (red frame)
for time step 1 as a function of the iteration number. The components are listed in the order of ascending corresponding
eigenvalues.
[Figure 4 plot: horizontal axis "Iteration", vertical axis "Component of residual (iteration 161+)", logarithmic scale.]
Figure 4: The components of the dual residual parallel to eigenvectors of M[ω|ω] from iteration 161 on (red frame) for time
step 1 as a function of the iteration number. The components are listed in the order of ascending corresponding eigenvalues.
The decay rates of the residuals only depend on the system matrices and the active set of constraints and are independent of the current MPC system state, the reference, and the limit values of the constraints. For an active set ω, the decay rate is limited by the lowest non-zero eigenvalue of M[ω|ω]. The problem of certification can thus be seen as seeking a lower bound for the non-zero eigenvalues of M[ω|ω] for active sets ω that could possibly occur.
In further work, we intend to test lower bounds
of as many active sets as possible, although it is gen-
erally known that this problem is plagued with com-
binatorial complexity. We will also try to extend the
approach to the fast dual gradient method without and
with restarting and use the results in the precondition-
ing phase to speed up convergence.
[Figure 5 plot: horizontal axis "Iteration", vertical axis "Quadratic norm of residual", logarithmic scale.]
Figure 5: The quadratic norm of the primal (blue) and the dual (red) residual in time step 76 as a function of the iteration
number. The yellow circles mark iterations in which the active set changes.
[Figure 6 plot: horizontal axis "Iteration", vertical axis "Component of residual (iteration 6+)", logarithmic scale; legend entries data1-data4.]
Figure 6: The components of the dual residual parallel to eigenvectors of M[ω|ω] from iteration 6 on for time step 76 as a
function of the iteration number. The components are listed in the order of ascending corresponding eigenvalues.
ACKNOWLEDGEMENTS
Research supported by Slovenian Research Agency
(P2-0001). This work has been carried out within the
framework of the EUROfusion Consortium and has
received funding from the Euratom research and train-
ing programme 2014-2018 under grant agreement No
633053. The views and opinions expressed herein do
not necessarily reflect those of the European Commis-
sion.
REFERENCES
Boyd, S. and Vandenberghe, L. (2004). Convex Optimiza-
tion. Cambridge University Press, New York, NY,
USA.
Domahidi, A., Zgraggen, A. U., Zeilinger, M. N., Morari,
M., and Jones, C. (2012). Efficient Interior Point
Methods for Multistage Problems Arising in Reced-
ing Horizon Control. In Proceedings of the 51st IEEE
Conference on Decision and Control.
Everett, H. (1963). Generalized Lagrange multiplier method for solving problems of optimum allocation of resources. Oper. Res., 11(3):399–417.
[Figure 7 plot: horizontal axis "Iteration", vertical axis "Component of residual", logarithmic scale.]
Figure 7: The original components of the dual residual for time step 76 as a function of the iteration number.
Ferreau, H. J., Bock, H. G., and Diehl, M. (2008). An on-
line active set strategy to overcome the limitations of
explicit MPC. International Journal of Robust and
Nonlinear Control, 18(8):816–830.
Ferreau, H. J., Kirches, C., Potschka, A., Bock, H. G., and
Diehl, M. (2014). qpOASES: a parametric active-set al-
gorithm for quadratic programming. Math. Program.
Comput., 6(4):327–363.
Gerkšič, S. and Tommasi, G. D. (2014). Improving magnetic plasma control for ITER. Fusion Engineering and Design, 89(9-10):2477–2488. Proceedings of the 11th International Symposium on Fusion Nuclear Technology-11 (ISFNT-11), Barcelona, Spain, 15-20 September, 2013.
Giselsson, P. (2013). Improving Fast Dual Ascent for MPC
- Part II: The Embedded Case. ArXiv e-prints.
Giselsson, P. and Boyd, S. (2014). Preconditioning in fast
dual gradient methods. In 53rd IEEE Conference on
Decision and Control, CDC 2014, Los Angeles, CA,
USA, December 15-17, 2014, pages 5040–5045.
Giselsson, P. and Boyd, S. (2015). Metric selection in fast
dual forward-backward splitting. Automatica, 62:1–
10.
Graselli, J. (1975). Linearna algebra. In Višja matematika II. Državna založba Slovenije, Ljubljana, Slovenija.
Hartley, E. N., Jerez, J. L., Suardi, A., Maciejowski, J. M.,
Kerrigan, E. C., and Constantinides, G. A. (2014).
Predictive control using an FPGA with application to
aircraft control. IEEE Trans. Control Systems Tech-
nology, 22(3).
Horn, R. A. and Johnson, C. R. (1990). Matrix Analysis.
Cambridge University Press.
Kapasouris, P., Athans, M., and Stein, G. (1990). Design of
feedback control systems for unstable plants with sat-
urating actuators. In Proceedings of the IFAC Sympo-
sium on Nonlinear Control System Design, pages 302–307. Pergamon Press.
Kouzoupis, D. (2014). Complexity of First-Order Methods for Fast Embedded Model Predictive Control (Master Thesis). Eidgenössische Technische Hochschule, Zürich.
Mattingley, J. and Boyd, S. (2012). CVXGEN: a code gen-
erator for embedded convex optimization. Optimiza-
tion and Engineering, 13(1):1–27.
Mattingley, J., Wang, Y., and Boyd, S. (2011). Receding
horizon control, automatic generation of high-speed
solvers. IEEE Control Syst. Mag., 31:52–65.
Patrinos, P., Guiggiani, A., and Bemporad, A. (2015). A
dual gradient-projection algorithm for model predic-
tive control in fixed-point arithmetic. Automatica,
55(C):226–235.
Qin, S. and Badgwell, T. A. (2003). A survey of industrial
model predictive control technology. Control Engi-
neering Practice, 11(7):733–764.
QPgen (2014). QPgen. Accessed 24 January 2017.
Richter, S. (2012). Computational Complexity Certification of Gradient Methods for Real-Time Model Predictive Control (Dissertation). Eidgenössische Technische Hochschule, Zürich.
Rockafellar, R. T. (1970). Convex Analysis. Princeton Uni-
versity Press, Princeton, New Jersey.
Ullmann, F. and Richter, S. (2012). FiOrdOs MPC Ex-
ample. Automatic Control Laboratory, ETH Zurich,
Zurich.
Wang, Y. and Boyd, S. (2010). Fast model predictive control
using online optimization. IEEE Trans Control Syst
Techn, 18:267–278.
APPENDIX
Components of v^k Corresponding to the Nullspace of M[ω|ω] Do Not Influence y^k
Let M[ω|ω] v_ω[ω] = 0. By definition, M[ω|ω] = C[ω] H^{-1} (C[ω])^T. Since H^{-1} is positive definite, z^T H^{-1} z > 0 for every z different from 0 (Graselli, 1975, p. 90). Let z = (C[ω])^T v_ω[ω]. Then
$$0 = (v_\omega[\omega])^T M[\omega|\omega]\, v_\omega[\omega] = (v_\omega[\omega])^T C[\omega] H^{-1} (C[\omega])^T v_\omega[\omega] = z^T H^{-1} z.$$
Thus z = (C[ω])^T v_ω[ω] = 0. According to (13), the addition of v_ω[ω] to v^k[ω] does not affect y^k.
Dual Residual Is Perpendicular to the Nullspace of M[ω|ω] for Feasible ω
If the definition of M[ω|ω] is taken into account in (21), it follows:
$$\Delta^k[\omega] = -C[\omega] H^{-1} (C[\omega])^T v^k[\omega] - C[\omega] H^{-1} c - b[\omega]. \quad (34)$$
Let t be a vector from the nullspace of M[ω|ω]. Taking the result from the previous appendix into account, it follows that (C[ω])^T t = 0. Next, we calculate:
$$\begin{aligned}
t^T \Delta^k[\omega] &= -t^T C[\omega] H^{-1} (C[\omega])^T v^k[\omega] - t^T C[\omega] H^{-1} c - t^T b[\omega] \\
&= -\left( (C[\omega])^T t \right)^T H^{-1} (C[\omega])^T v^k[\omega] - \left( (C[\omega])^T t \right)^T H^{-1} c - t^T b[\omega] \\
&= -t^T b[\omega].
\end{aligned}$$
If ω is a feasible active set, there exists a vector u such that C[ω] u = b[ω]. Thus
$$t^T \Delta^k[\omega] = -t^T C[\omega] u = -\left( (C[\omega])^T t \right)^T u = 0. \quad (35)$$
An arbitrary vector t from the nullspace of M[ω|ω] is orthogonal to Δ^k[ω], so Δ^k[ω] is orthogonal to the nullspace of M[ω|ω].