SOLVING ILL-POSED PROBLEMS USING DATA ASSIMILATION

Application to Optical Flow Estimation

Dominique B´er´eziat

Universit´e Pierre et Marie Curie, LIP6, 4 place Jussieu, 75005 Paris, France

Isabelle Herlin

INRIA, CEREA, Joint Laboratory ENPC-EDF R&D, Universit´e Paris-Est

BP105 78153 Le Chesnay Cedex, France

Keywords:

Data Assimilation, Ill-posed Problem, Variational Formulation, Optical Flow, Missing Data.

Abstract: Data Assimilation is a mathematical framework used in environmental sciences to improve forecasts per-

formed by meteorological, oceanographic or air quality simulation models. Data Assimilation techniques

require the resolution of a system with three components: one describing the temporal evolution of a state

vector, one coupling the observations and the state vector, and one deﬁning the initial condition. In this article,

we use this framework to study a class of ill-posed Image Processing problems, usually solved by spatial and

temporal regularization techniques. A generic approach is deﬁned to convert an ill-posed Image Processing

problem in terms of a Data Assimilation system. This method is illustrated on the determination of optical

ﬂow from a sequence of images. The resulting software has two advantages: a quality criterion on input data

is used for weighting their contribution in the computation of the solution and a dynamic model is proposed

to ensure a signiﬁcant temporal regularity on the solution.

1 INTRODUCTION

In the research ﬁeld of Image Processing, most prob-

lems are ill-posed in the sense that it is not possible to

provide a unique solution (Hadamard, 1923). A com-

mon cause of ill-posedness is that the equations used

to model image properties, called Image Model, are

under-determined. An example is the famous “aper-

ture problem” occurring in the estimation of optical

ﬂow.

A possible strategy to solve ill-posed problems is

to provide the Image Model with additional informa-

tion. Two options are considered: 1) Providing ex-

plicit information: additional images are used to en-

large the set of input data. However, this is generally

not possible because other acquisitions having the re-

quested properties are often not available. 2) Provid-

ing implicit information: either hypotheses on image

properties or constraints on the solution can be used.

A usual constraint is to restrict the dimension or the

size of the space of admissible solutions: for instance

the result is searched among the functions having

bounded spatial variations (Tikhonov, 1963). Such

methods are called “Tikhonov regularization meth-

ods”. In the general case, these additional image

properties and constraints are expressed as equations,

which, added to the Image Model, lead to a new in-

vertible model.

Assuming an image sequence is available, the

Tikhonov regularization can be extended by consid-

ering a regularization acting both in spatial and tem-

poral domains. This has been proposed by Weick-

ert (Weickert and Schn¨orr, 2001) for determining op-

tical ﬂow. The interest is to produce a solution which

is regular in time and in space. This approach has

however several drawbacks. First, the temporal regu-

larization imposes an arbitrary regularity which can

not deal with complex dynamics. Second, if irrele-

vant input data values occur on large regions, they are

however taken into account in the estimation process

and introduce errors in the solution. These aberrant

values may be due to a sensor failure or an acquisi-

tion noise. We call them “missing data” in the remain

of this paper. Third, as the solution is looked for in the

spatio-temporal domain, this leads to a high complex-

ity in comparison with a pure spatial model, which is

a limiting factor for operational purposes.

An alternative to the Weickert’s approach is to

model the temporal evolution of images. Such a mod-

elling can be inferred, for instance, from a priori

595

Béréziat D. and Herlin I. (2009).

SOLVING ILL-POSED PROBLEMS USING DATA ASSIMILATION - Application to Optical Flow Estimation.

In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications, pages 595-602

DOI: 10.5220/0001792205950602

 SciTePress

knowledge on image acquisition. The challenge be-

comes to deﬁne an effective and pertinent model, and

to include it in the solution computation. Data qual-

ity has also to be evaluated in order to ignore missing

data.

In this paper, we propose to use the Data Assimi-

lation framework as a generic tool to solve ill-posed

problems in Image Processing. A Data Assimilation

method solves simultaneously an evolution equation,

describing the evolution of a state vector over time,

and an observation equation, modelling the links be-

tween the state vector and the observations. Each

equation can be inaccurate and a description of the

errors is stated in terms of a Gaussian noise charac-

terized by a covariance matrix. As it will be shown

in the paper, Data Assimilation is a suitable approach

to solve the three drawbacks of Weickert’s methods

because: the evolution model describes the images’

temporal dynamics; the covariance associated to the

observation equation error overcomes the problem of

missing data by weighting, at each pixel, the contribu-

tion of the observation equation in the computation of

the solution; Data Assimilation is a frame-by-frame

process allowing an affordable computational cost.

2 THE DATA ASSIMILATION

FRAMEWORK

The Data Assimilation framework aims to solve the

system (1,2,3) with respect to a state vector X(x,t),

depending on the spatial coordinate x and time t:

∂X

∂t

(x, t) + (X)(x,t) = ε

(x, t) (1)

X(x, 0) = X

(x) + ε

(x) (2)

(Y, X)(x,t) = ε

(x, t) (3)

Equation (1) describes the temporal evolution of

X. , called evolution model, is supposed differen-

tiable. As approximately describes the evolution

of the state vector, a model error ε

is introduced to

quantify inaccuracies. Equation (2) establishes the

initial condition of the state vector and the error is

expressed by the background error ε

. Equation (3),

called observation equation, describes the links, pos-

sibly complex and non-linear, between the observa-

tion vector, Y(x, t), and the state vector. The observa-

tion error ε

represents the imperfection of and the

measurement errors. ε

, ε

and ε

are assumed to be

Gaussian and fully characterized by their covariance

matrices Q, B and R.

Let X denote a Gaussian stochastic vector depend-

ing on a space-time coordinate (x, t). X = X(x, t) and

′

= X(x

′

) are the values on two locations. The co-

variance matrix Σ, computed for X and X

′

, measures

their dependency and is deﬁned by:

Σ(x,t, x

′

) =

(X − X)

′

− X

′

)dP

X,X

′

X,X

′

is the joint distribution of (X, X

′

) and de-

notes the expectation. P

X,X

′

is a Gaussian distribu-

tion whose covariance matrix depends on x, t, x

′

and

′

. The inverse of a covariance matrix is formally and

implicitly deﬁned (Oliver, 1998) by:

−1

(x, t, x

′′

)Σ(x

′′

, x

′

)dx

′′

δ(x− x

′

)δ(t −t

′

) (4)

with δ denoting the Dirac function.

We note A = Ω × [0, T], Ω being the spatial do-

main and [0, T] the temporal domain. In order to solve

equations (1), (2) and (3), we deﬁne the functional E,

to be minimized, as stated by equation (5), displayed

at the top of next page. If ε

, ε

and ε

are assumed

to be independent, E represents the log-likelihood of

X. The minimization is carried out using a calcu-

lus of variations: the differential of E with respect

to X, denoted

∂E

∂X

, is calculated and the associated

Euler-Lagrange equation is solved.

∂E

∂X

is obtained

by (i) computing the derivative of E in the direction

η:

∂E

∂X

(η) = lim

γ→0

dγ

(E(X+ γη)) (6)

(ii) introducing an auxiliary variable λ, called adjoint

variable in the literature:

λ(x, t) =

−1

(x, t, x

′

,t)



∂X

∂t

+ (X)



′

)dx

′

(7)

and (iii) writing the state vector X as X

+δX. Vectors

and δX are respectively called background vector

and incremental vector in the Data Assimilation liter-

ature. This leads to the following system of equations:

λ(x, T) = 0 (8)

−

∂λ

∂t



∂

∂X





∗

λ = −



∂

∂X





∗

−1

(9)



, Y) +

∂

∂X



(δX)



′

(x, 0) = X

(x) (10)

∂X

∂t

+ (X

) = 0 (11)

δX(x, 0) =

Ω

Bλ(x

′

, 0)dx

′

(12)

∂δX

∂t

∂

∂X



(δX) =

Qλ(x

′

)dx

′

(13)

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

596

E(X) =



∂X

∂t

+ (X)



(x, t)Q

−1

(x, t, x

′

)



∂X

∂t

+ (X)



′

)dxdtdx

′

(X, Y)

(x, t)R

−1

(x, t, x

′

) (X, Y)(x

′

)dxdtdx

′

Ω



X(x, 0) − X

(x)



−1

(x, x

′

)



X(x

′

, 0) − X

′

)



dxdx

′

(5)

The reader is referred to (Valur H´olm, 2003) for de-

tails on the determination of the differential of E and

Euler-Lagrange equations. The decomposition of the

state vector into background and incremental vectors

requires linearizing the operators and using a

ﬁrst-order Taylor development: (X) ≃ (X

) +

∂

∂X



(δX). The operator



∂·

∂X



∗

is called adjoint

operator and represents a compact notation for inte-

gration by parts of the differential of the operators

and . It is formally deﬁned by:



∂

∂X

(η), λ





η,



∂

∂X



∗

(λ)



(14)

Riesz’s theorem ensures the existence and unique-

ness of the adjoint operator. To solve the system (8)

to (13), equation (11) is ﬁrst solved using initial con-

dition (10) and provides X

. Equation (9) is solved

retrogradely to provide λ using initial condition (8).

The incremental vector is ﬁnally provided by solv-

ing (13) with (12) as initial condition. If and

are not linear, an iterative algorithm is used to sequen-

tially improve the solution of (8,9,12,13): at each iter-

ation, the value δX is computed from the background

vector X

and the value δX+ X

becomes the back-

ground vector for the next iteration.

3 IMAGE ASSIMILATION

This section explains how to solve ill-posed Image

Processing problems using the framework of Data

Assimilation and constitutes the core of the research

and the main contribution of the paper. Using Data

Assimilation to solve Image Processing Problem is

an emerging domain. Studies have been done about

curve tracking (Papadakis and M´emin, 2007) and de-

termination of optical ﬂow (Papadakis et al., 2007a;

Papadakis et al., 2007b). In (Huot et al., 2008), a

method is proposed to estimate the ocean surface cir-

culation from SST images by assimilating image data

in an image model for producing pseudo-observations

for the oceanographic model. In this paper, we con-

sider the general case of ill-posed problems, solved

using Tikhonov regularization methods, and we de-

ﬁne a method to convert them, in a generic way, from

Tikhonov regularization to Data Assimilation frame-

work.

First, state and observation vectors have to be de-

ﬁned. Obviously, the observations will be images or

processed images, and the content of the state vector

will strongly depends on the studied application. For

example, segmentation, denoising and restoration use

a state vector which is an image. Tracking, image reg-

istration and motion estimation are represented by a

vector ﬁeld. Active contours are modelled by curves.

Sometimes, the links between the state vector and the

observation are highly indirect. For instance, as it is

not possible to deduce the ocean circulation from sur-

face temperatures by a shallow-water model, an addi-

tional system has to be considered with relevant evo-

lution and observation equations (Huot et al., 2008).

To solve an Image Processing problem with the

4D-Var algorithm, as described in Section 2, a suit-

able evolution model describing the temporal evolu-

tion of the state vector has to be stated. Its associated

error must also be deﬁned. Next, an observation equa-

tion and its error should characterize the properties of

observations. Last, the initial condition and its back-

ground error have to be established. This is discussed

in the next Subsections.

3.1 The Evolution Model and Error

One ﬁrst choice of evolution model is to consider the

simple transport equation

= 0, or:

∂X

∂t

∂X

∂x

∂t

= 0 (15)

The evolution model is then obtained by identiﬁca-

tion: (X)(x,t) =

∂X

∂x

∂t

. Section 4 describes the

use of equation (15) for determining optical ﬂow.

Another possible choice is to express the transport

of the state vector as a diffusion, a physical law rul-

ing the transport of chemical species or temperature.

The general formulation is:

∂X

∂t

= ∇

(D∇X) with

∇ =



∂

∂x

∂

∂y



. D is a tensor characterizing the di-

rection and intensity of the diffusion. It is possible to

drive this diffusion according to the image character-

istics. A well-known example is the Perona & Malik

diffusion: the tensor matrix is equal to c(k ∇Xk)Id

with c a Gaussian function and Id the identity matrix.

SOLVING ILL-POSED PROBLEMS USING DATA ASSIMILATION - Application to Optical Flow Estimation

597

One property of this diffusion is to smooth the image

while preserving the contours. In this case, identiﬁca-

tion gives: (X) = − ∇

(D∇X).

The matrix Q of the model error ε

has to be spec-

iﬁed. It is ﬁrst important to understand the action of

−1

in the functional (5). Let us describe two possi-

ble choices for Q and analyze their impact in a func-

tional E(X) =

(X)Q

−1

F(X)dxdx

′

which has to

be minimized.

First, let Q being the Dirac covariance deﬁned

by Q(x, x

′

) = δ



x− x

′



. We have E(X) =

Ω

kF(X)k

dx: a Dirac covariance is corresponding

to a zero-order regularization.

A second choice is the exponential covariance de-

ﬁned by Q(x, x

′

) = exp



−

x− x

′



. We have E(X) =

2σ

kF(X)k

+ σ



∂F(X)

∂x



dx. The expo-

nential covariance is associated to a ﬁrst-order reg-

ularization.

In the general case, the inversion of a covariance ma-

trix is often non-trivial and usually inaccessible. Re-

strictive choices have to be made such as the previ-

ous ones. For further details, the reader is referred

to (Oliver, 1998; Tarantola, 2005; B´er´eziat and Her-

lin, 2008).

3.2 The Observation Equation and

Error

As previously pointed out in Section 2, the observa-

tion equation describes the links between the state

vector and the observations. As the regularization

part of the Tikhonov’s method does not contain ob-

servation values, the observation equation will be ex-

actly corresponding to the Image Model existing in

the Tikhonov’s approach.

The observation error has to be stated and its co-

variance matrix R speciﬁed. R has then to be inverted

(see equation (9)) and, as explained, this is only pos-

sible for simple cases. The Dirac covariance can be

used to express a null interaction between two space-

time locations. This allows observation errors to be

located and quantiﬁed in space and time. R is written

as:

R(x, t, x

′

) = r(x, t)δ



x− x

′





t − t

′



(16)

with r a real invertible matrix and:

−1

(x, t, x

′

) = δ



x− x

′





t − t

′



−1

(x, t) (17)

The matrix r

−1

characterizes the quality of the ob-

servation: a high value indicates that the observa-

tion is relevant. Assuming the availability of a func-

tion f measuring the conﬁdence in observation values

(f ∈ [0, 1], f = 0 for no conﬁdence), r

−1

can be de-

ﬁned by:

−1

(x, t) = r

(1− f(x, t) + r

f(x, t) (18)

−1

(x, t) will be close to a “minimal value” r

on re-

gions where conﬁdence is low and r

otherwise. r

and r

are constant and invertible matrices. For efﬁ-

ciency, f is modeled as:

f(x, t) = f

noise

(x, t) f

sensor

(x, t) f (x,t) (19)

noise

characterizes noise acquisition: it is close to 0

for large noise. f

sensor

is equal to 0 when data are not

acquired by the sensor. f measures the conﬁdence

in the observation model; it is close to 0 if the obser-

vation equation is not valid.

Observation values with a low conﬁdence will

then not be taken into account for the computation

of the state vector and for the solution of the Image

Processing problem.

4 DETERMINING OPTICAL

FLOW

Let I be a sequence of images on a spatial domain Ω

and W(x, t) be the velocity vector of a point x ∈ Ω at

time t. The velocity vector is expressed by the Optical

Flow Constraint equation (Horn and Schunk, 1981):

∇I

(x, t)W(x, t) +

∂I

∂t

(x, t) = 0 ∀x ∈ Ω (20)

This is an ill-posed problem: the velocity vector has

two components and the optical ﬂow equation is not

sufﬁcient to compute both. A solution can be ob-

tained using Tikhonov’s method by constraining spa-

tially (Horn and Schunk, 1981) or spatially and tem-

porally (Weickert and Schn¨orr, 2001) the solution.

Let us explain, advantages of the data assimilation

methods on this simple example.

4.1 Observation and Evolution Models

W(x, t) is considered as the state vector X(x, t) and

the image gradients (∇I(x, t), I

(x, t)) constitutes the

observation vector Y(x, t). By identiﬁcation in (3)

and (20), the observation model is:

(W, I) = ∇I(x, t)

W(x, t) + I

(x, t) (21)

To determine a suitable observation error co-

variance handling missing data, we use equa-

tions (17), (18) and (19) to deﬁne the inverse covari-

ance. f

noise

is assumed to be equal to 1 without further

information. f

sensor

is set to 0 where data are not ac-

quired and to 1 otherwise. f is chosen from the fol-

lowing remark: on a sequence of images with regions

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

598

of uniform grey level values, the spatio-temporal gra-

dient is null on these regions and equation (20) is de-

generated. For avoiding considering these points, f

is:

f (x,t) = 1− exp(−k∇

I(x, t)k

) (22)

where ∇

denotes the spatio-temporal gradient. The

observation model being scalar, r

and r

are scalar

and respectively set to ε and 1 − ε with ε ∼ 10

−6

The transport of the velocity is chosen as evolution

equation. The evolution model is:

(W) =



(W)





+VU

+VV



with (U,V)

= W.

Let us denote I

the 2×2 identity matrix. Q is cho-

sen as:

Q(x, t, x

′

) = I

exp(−

(kx− x

′

k + |t−t

′

|)) (23)

to ensure a ﬁrst-order spatial regularization.

We choose the Horn and Schunk’s algorithm to

compute the velocity ﬁeld on the ﬁrst image of the se-

quence. This initial condition is supposed to be with-

out errors: B(x, x

′

) = δ(x− x

′

4.2 Adjoint Operators

In order to determine the adjoint operators of

and , the directional derivatives must ﬁrst be estab-

lished.

Using the deﬁnition (6), we obtain:

∂

∂W

(η) =

∂

∂U

(η

) +

∂

∂V

(η

)

= Uη

+Vη

∂

∂W

(η) =

∂

∂U

(η

) +

∂

∂V

(η

)

= Uη

+Vη

with η =





. By deﬁnition (14) and consid-

ering border terms equal to zero, the adjoint operator

of is :



∂

∂W



∗

(λ) = −Uλ

−V

−Vλ



∂

∂W



∗

(λ) = −U

−Uλ

−Vλ

with λ =





The directional derivative of the observation operator

is:

∂

∂W

(η)(x,t) = ∇I

(x, t)η(x, t)

and the adjoint operator is:



∂

∂W



∗

(λ)(x,t) = ∇I(x,t)λ(x,t)

4.3 Discretization

Using the choices explained in Subsection 4.1, the

three PDEs (9,11,13) become:

∂W

∂t

+ W

∇W = 0 (24)

−

∂λ

∂t

− ∇λ

W− λ

∇

⊥

W = A⋆ L (25)

∂δW

∂t

+ W

∇δW+ ∇W

δW = Q⋆ λ (26)

with: A = −

2σ

∇IR

−1

, L = I

+ ∇I

(W + δW),

∇

⊥

W =



∇

⊥

U ∇

⊥



and ∇

⊥

U =



−U



These three equations are further approximated using

the ﬁnite difference technique.

As W(x, t) is a vector of

, equation (24) has

two components. The ﬁrst one combines a linear ad-

vection in direction y and a non linear in direction

x. A direct approximation leads to an unstable nu-

merical scheme. In (Papadakis et al., 2007a), the

scheme is stabilized by introducing a diffusive term

with the drawback to smooth the solution. We pro-

pose an alternative to stabilize the numerical scheme

associated to the original equation, as explain in the

following. The ﬁrst component of equation (24) is

ﬁrst expressed as a two-equation system using a split-

ting method (Verwer and Sportisse, 1998):

∂U

∂t

+UU

= 0 (27)

∂U

∂t

+VU

= 0 (28)

Equation (27) is rewritten using the Lax-Friedrich

method (Sethian, 1996) in the following form

∂U

∂t

∂F(U)

∂x

= 0 with F(u) =

. This new equation is

approximated by the explicit scheme:

k+1

i, j

i+1, j

i−1, j

) −

△t

i+1, j

− F

i−1, j

)

with U

i, j

= U(x

, y

), F

i, j

= F(U(x

, y

)). The

term

i+1, j

+ U

i−1, j

) stabilizes the numerical

scheme (by adding a diffusive effect) when △t has

a low value (the Courant-Friedrich-Levy condition).

The linear advection (28) is approximated using a

shock explicit scheme (Sethian, 1996):

k+1

i, j

= U

i, j

− △tS

)

i, j

SOLVING ILL-POSED PROBLEMS USING DATA ASSIMILATION - Application to Optical Flow Estimation

599

with S the shock operator, deﬁned in the x-direction

by: S

(U,V)

i, j

= max(V

i, j

, 0)(U

i, j

− U

i−1, j

) +

min(V

i, j

, 0)(U

i+1, j

−U

i, j

It can be seen that the second component of (24)

contains a linear advection term in direction x and a

non linear advection term in direction y. The same

strategy is then used to perform the discretization

process.

Equation (25) combines a linear advection, a term

of reaction and a forcing term. It also has two com-

ponents. The ﬁrst is −

∂λ

∂t

− Uλ

− V

− Vλ

= A

⋆ L with A

= −

2σ

−1

. It is split into

two parts. The ﬁrst one contains the linear advection

in direction x and the reaction term −

∂λ

∂t

− Uλ

−

= 0 and is approximated in the same way as (28)

with an explicit shock scheme. However, the equation

is retrograde as the initial condition is given at time T:

(λ

)

k−1

i, j



i, j+1

−V

i, j−1

)



(λ

)

i, j

△tS

((λ

)

The second part contains the linear advection term

in direction y and the forcing term: −

∂λ

∂t

− Vλ

= A

⋆ L. Again, a shock explicit scheme is used:

(λ

)

k−1

i, j

= (λ

)

i, j

−

△t



i, j+1

−U

i, j−1



(λ

)

i, j

−

△t(A

⋆ L)

i, j

− △tS

((λ

)

,V)

Having the same structure as the ﬁrst component, the

second component of (25) is approximated using the

same method

The last equation, (26), is similar to equation (25)

and the same discretization technique is applied.

4.4 Results

The “taxi” sequence and a synthetic sequence have

been chosen for discussing results. For both cases, the

initial condition is provided by the Horn and Schunk

method applied on the two-ﬁrst frames of the se-

quence. Image gradients, providing observations, are

computed with a convolution method and a derivative

Gaussian kernel. The parameter σ of the matrix Q is

set to 1. The number of iterations is set to 5.

The taxi sequence displays several cars with a

slow and quasi uniform motion.

In a ﬁrst experiment, we compute the optical ﬂow

on the sequence using the Data Assimilation method

(D.A.). The Horn & Schunk method (H.&S.) is also

applied and results, on the ﬁfth frame, are displayed

Figure 1: Result on frame 5: D.A. (top), H&S (bottom).

for comparison purposes on Figure 1. Results of both

methods are similar in this case because observation

values are available on the whole sequence obviously.

Having chosen H.&S. or another method does not re-

ally matter. The issue is not comparing optical ﬂow

method but proving the efﬁciency of D.A. for missing

data or complex dynamics. Then, a second experi-

ment is designed to prove the capability of D.A. to

manage missing data. To simulate a sensor failure,

a large region around the white car is set to zero in

a frame and the function f

sensor

returns zero on these

pixels. Image gradients are then computed. Figure 2

shows the results for D.A. and H.&S. methods. This

latter obviously fails to provide acceptable velocity

vectors over the missing data region while D.A. suc-

cesses due to the eviction of aberrant values in the

computation. Even a whole frame of observation can

be missing: we force image gradient values to zero

on the ﬁfth frame of the taxi sequence and f ≡ 0

on this frame. Figure 3 displays results obtained with

D.A. with and without image gradients observation on

frame 5. Results are similar, due to the fact that the

evolution model correctly describes the temporal dy-

namics of the sequence and compensate missing data.

H.&S. method is no more able to provide any result in

that case, conducting to a discontinuity in the output.

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

600

Figure 2: Missing data: D.A. (top), H.&S. (bottom).

A third experiment is dedicated to the demonstra-

tion that the evolution model improves the estimation

of the image dynamics. For that purpose, we build

a synthetic sequence with two squares, one moving

horizontally and one vertically. At the end of the se-

quence, they meet each other. Figure 4 shows the re-

sult of both methods at the beginning and the end of

the sequence. H.&S.’s method fails to estimate a cor-

rect velocity direction when the squares meet. This is

a common drawback of image processing method due

to over regularization. D.A. however provides accu-

rate results: the evolution model correctly describes

the squares dynamics and decreases the effect of spa-

tial regularization.

5 CONCLUSIONS

In this paper we described a framework to solve ill-

posed Image Processing problems using Data Assim-

ilation algorithms. This approach is an alternative

to the Weickert’s method, which constrains the solu-

tion’s variations in space and time.

General principles to choose a suitable evolution

model, an observation equation, and their matrices

of covariance were discussed in the paper. The ob-

Figure 3: Result with gradients set to zero on frame 5 (top)

and with gradient available (bottom).

servation equation corresponds to the Image Model

of the Tikhonov regularization method. The evolu-

tion equation constrains the solution in time and the

model error constrains it in space. As an illustration,

we proved that an exponential covariance model er-

ror is equivalent to a ﬁrst order Tikhonov regulariza-

tion. The three drawbacks of the Weickert’s method,

pointed out in the Introduction, are overcome. First,

the state vector’s dynamics is described by a suitable

evolution model. Second, a covariance R built from

the conﬁdence on the observation values weights the

inﬂuence of these latter: observations having a low

quality, such as missing data, are not taken into ac-

count in the computation of the solution. And last,

the algorithm processes a sequence of images frame

by frame, which allows an implementation with low

memory requirements. The choice of the evolution

model remains a key point as it greatly depends on

the applicative context. We described it in the case

of determining optical ﬂow: image’s grey level values

are assimilated in an evolution model representing a

transport equation of the velocity and a stable numer-

ical scheme, approximating this equation, has been

established.

The main perspective of the work is to deﬁne a

SOLVING ILL-POSED PROBLEMS USING DATA ASSIMILATION - Application to Optical Flow Estimation

601

(a) frame 4

(b) frame 10

Figure 4: Result with D.A. (left) and H.&S. (right).

generic form of the evolution model which will then

be speciﬁed depending of the Image Processing ap-

plication. Diffusion laws, modelling the transport of

the state vector using the physical conservative prin-

ciples, are good candidates for a number of reasons.

First, their approximation in a discrete space is robust

as stable numerical schemes exist. Second, the adjoint

of the model operator is known explicitly. Third, sev-

eral types of diffusion can be considered depending

on their impact on the spatial and temporal regulariza-

tion of the solution. An isotropic diffusion is equiva-

lent to a uniform regularization but has the drawback

to smooth the solution while an adaptive diffusion,

driven by the state vector, allows discontinuities to be

preserved. As the spatial regularization may also be

performed by the covariance matrix Q, it could be in-

teresting to compare the two paradigms. Moreover,

the diffusion can also be driven by the observation

values or by their conﬁdence. For instance, a strong

diffusion on the state vector may operate when the

observation conﬁdence is low which is an alternative

way to deal with missing data. Again, a comparison

between such a diffusion and the covariance matrix R

built from the observation conﬁdence must be led.

REFERENCES

B´er´eziat, D. and Herlin, I. (2008). Solving ill-posed image

processing problems using data assimilation. Applica-

tion to optical ﬂow. Research Report 6477, INRIA.

Hadamard, J. (1923). Lecture on Cauchy’s Problem in Lin-

ear Partial Differential Equations. Yale University

Press, New Haven.

Horn, B. and Schunk, B. (1981). Determining optical ﬂow.

Artiﬁcial Intelligence, 17:185–203.

Huot, E., Herlin, I., and Korotaev, G. (2008). Assimila-

tion of sst satellite images for estimation of ocean

circulation velocity. In Proceedings of IEEE Inter-

national Geoscience and Remote Sensing Symposium

(IGARSS), Boston, Massachusetts, U.S.A.

Oliver, D. (1998). Calculation of the inverse of the covari-

ance. Mathematical Geology, 30(7):911–933.

Papadakis, N., Corpetti, T., and M´emin, E. (2007a). Dy-

namically consistent optical ﬂow estimation. In Pro-

ceedings of International Conference on Computer Vi-

sion, Rio de Janeiro, Brazil.

Papadakis, N., H´eas, P., and M´emin, E. (2007b). Im-

age assimilation for motion estimation of atmospheric

layers with shallow-water model. In Proceedings of

Asian Conference on Computer Vision, pages 864–

874, Tokyo, Japan.

Papadakis, N. and M´emin, E. (2007). Variational optimal

control technique for the tracking of deformable ob-

jects. In Proceedings of International Conference on

Computer Vision, Rio de Janeiro, Brazil.

Sethian, J. (1996). Level Set Methods. Cambridge Univer-

sity Press.

Tarantola, A. (2005). Inverse Problem Theory and Methods

for Model Parameter Estimation. Society for Indus-

trial and Applied Mathematics.

Tikhonov, A. N. (1963). Regularization of incorrectly posed

problems. Sov. Math. Dokl., 4:1624–1627.

Valur H´olm, E. (2003). Lectures notes on assimilation

algorithms. Technical report, European Centre for

Medium-Range Weather Forecasts Reading, U.K.

Verwer, J. and Sportisse, B. (1998). A note on operator

splitting in a stiff linear case. Technical Report MAS-

R9830, Center voor Wiskunde en Informatica.

Weickert, J. and Schn¨orr, C. (2001). Variational optic ﬂow

computation with a spatio-temporal smoothness con-

straint. Journal of Mathematical Imaging and Vision,

14:245–255.

VISAPP 2009 - International Conference on Computer Vision Theory and Applications

602