machine (Balter and Cao 2007). Such devices
provide information about local anatomy in the form
of fluoroscopic images and/or cone beam CT
(CBCT), enabling tracking and measurement of
tumour motion (Shirato et al 1999, Marchant et al
2008, Poulsen et al 2008).
There are essentially two methodologies for
tracking tumours in the kV images. The first is using
directly the image intensity patterns to estimate
tumour mass position without any implanted
markers (Cui et al 2007) but this approach however
is still considered to be under development. The
second methodology tracks one or more implanted
radio-opaque fiducial markers which are treated as a
reliable tumour surrogate. The RTRT (Shirato et al
1999) and IRIS (Berbeco et al 2004) are examples of
the hardware platforms proposed to solve this
problem. Both these systems use multiple pairs of
diagnostic x-ray tubes and imagers to determine the
3D marker position. The RTRT system uses a simple
template matching tracking algorithm to track a
single spherical marker. Tang et al (2007) proposed
a tracking algorithm capable of tracking multiple
cylindrical markers in fluoroscopic images acquired
from a monoscopic system. Their method uses
template matching in conjunction with a prediction
stage and multiple hypotheses to improve robustness
of the tracker in a presence of image clutter. More
recently Matuszewski et al (2010) proposed tracking
algorithm with multi-component score functions to
select the most likely position of the marker from a
set of generated marker position hypotheses. In
comparison to the algorithm proposed in (Tang et al
2007) the method uses: (i) the mean shift algorithm
instead of template matching, which provides higher
accuracy due to explicit sub-pixel accuracy of
marker position estimation and dynamic implicit
estimation of markers appearance (ii) use of random
sampling for hypothesis generation instead of
deterministic evaluation of all possible marker
locations in the predefined size window, enabling
efficient marker search in a much bigger region, and
maintaining track of possibly widely spatially
separated positional hypotheses. Additionally
contrary to the method described in (Tang et al
2007) the method proposed in (Matuszewski et al
2010) can operate even when: (i) average intensity
of the markers changes significantly; (ii) apparent
marker shape changes significantly; (iii) the
breathing pattern changes. The method does not
assume posterior distribution to be Gaussian, indeed
due to image clutter and presence of other proximate
markers the likelihood function could be highly non-
Gaussian – with multiple significant modes.
The method described in this paper can be seen
as an extension of the method proposed in
(Matuszewski et al 2010) where all the marker
position hypothesis are used in a batch processing
mode in a single combinatorial optimisation process.
The batch processing rather than real-time
tracking can be justified for some applications, for
example in CBCT motion correction. The tracking
of fiducial markers in such data is a challenging
problem. There are a number of reasons for this
including: a high level of noise due to scatter and a
low radiation dose delivered during a single CB
projection image acquisition; markers changing
shape and size for different projection angles;
occlusions and clutter caused by possible presence
of the foreign objects; markers overlapping with
each other or being masked by anatomical
structures; significant variations of the marker and
background intensities with projection angle.
Additionally apparent marker displacement in two
consecutive images could be quite significant as it is
a superposition of an intrinsic motion caused, for
example, by respiration and an extrinsic motion
induced by the sensor rotation.
The rest of the paper is organised as follows: in
section 2 the cone beam CT projection images are
introduced, section 3 briefly summarises the
algorithm proposed in (Matuszewski et al 2010),
whereas section 4 describes in details proposed
extensions of the method. The experimental results
are presented in section 5 with conclusions drawn in
section 6.
2 CONE BEAM CT PROJECTION
IMAGES
CB projection images shown in this paper were
acquired using Electra Synergy (XVI 3.5, Elekta,
Crawley, UK). This system has a kV imager fixed to
the rotating gantry, mounted orthogonally to the MV
treatment beam. Projection images were captured
over 360
o
of rotation at a frame rate of 5.5Hz with
640 projections. Projection images were acquired
using a 512x512 matrix with square pixel of size s =
0.8 mm at the detector. The geometrical
configuration of the rotating gantry with kV and MV
sources and kV imager is shown in Figure 1.
Assuming that the position (x,y,z) of a marker in
3D space is fixed its apparent motion in the
projection images as a function of the gantry angle is
given by (Marchant 2008):
MARKER TRACKS POST-PROCESSING FOR ACCURATE FIDUCIAL MARKER POSITION ESTIMATION IN
CONE BEAM CT PROJECTION IMAGES
523