2. Global Affine Registration;
3. Full image registration using the previous calcu-
lated transform as seed;
4. Detect features in the original images;
5. Correct the spots’ coordinates using the transform
calculated in step 3;
6. Match features with corrected coordinates;
After the results of the affine global and full reg-
istrations, the features in the original images are de-
tected using the approach proposed in (dos Anjos
et al., 2011). As mentioned before, position, area,
volume and other descriptors are extracted from the
spots. Nevertheless, only the coordinates of the de-
formed image are corrected by using the total dis-
placement field calculated from the image registration
process.
The total displacement u
t
(~x) for the position of
each detected spot is the following:
u
t
(~x) =~x− ((A~x+ b) + u(~x)) (11)
where (A~x + b) is the affine transform calculated as
described in Section 5.1, and u(~x) is the displacement
for each coordinate as calculated in (Knut Conradsen,
1992) with the proposed improvements.
This way, position can be given a much higher
weight in the cost function as it is supposed to be more
reliable. Another improvement, relatively to the pre-
vious presented approach is the replacement of the de-
tected volume by the volume of the spot weighed by
a Gaussian centered at the spot. Therefore, instead of
relying only on the detected volume, one relies on the
volume and on the context defined by the spot’s im-
mediate surroundings. Therefore, even having lower
representation, the information surrounding the spot
is included, contributing for a better match. Addition-
ally, a threshold t
ed
was created in order to immedi-
ately exclude the possibility of matches that are at a
very far position from the correspondingspot. Finally,
spot matches not respecting this threshold are set to ∞.
Only then, the cost matrix is processed by the HA.
7 RESULTS
All the parameters were the same for the complete set
of images. Distance and (new) volume, for the weigh-
ing function were set to: α
d
= 1.0; α
vo2
= 1.0. All the
other weight coefficients were set to zero. Moreover,
t
d
= 0.35;t
c
= 0.35;t
ed
= 35, where t
ed
is not normal-
ized as it is used during the construction of the cost
matrix.
As it will be demonstrated, one of the great advan-
tages of this approach is that it is very difficult to find
(a) 200 spots found. (b) 217 spots found.
Figure 6: Figures 1(a) and 1(b). Valid matches: 155.
a false match between the markers presented as valid
markers, meaning that the presence of false matches
is extremely low, or non-existent. Only markers pre-
sented in blue are matches that respect all the defined
thresholds. Green markers have found a match in the
HA but do not respect either t
d
or t
c
. Finally, the red
markers did not find any match in the HA, being con-
sidered as outliers.
Figure 6 shows the result of the suggested match-
ing process used to match the spots of Figures 1(a)
and 1(b).
A very popular approach used in feature matching
is Shape Context (Belongie and Malik, 2000). It ba-
sically consists of analyzing the spacial relationship
between points.
Shape contexts use mainly four parameters. The
first parameter defines the number of radial bins for
the creation of the histograms (set to 10), the second is
the number of theta bins that defines how many slices
should the histograms be divided in (set to 24), and
the minimum and maximum widths of the bins (set to
1 and 100, respectively).
Table 1 presents a comparison of the results of the
matches using the proposed approach and matching
with shape contexts.
It is clear that the proposed approach is doing a
better job than the shape context approach. The num-
ber of false matched spots is extremely low when us-
ing the proposed approach. This is highly desirable
because it avoids the necessity of a laborious manual
intervention to correct the results.
PDQuest is considered as one of the best software
packages in matching 2-DE gel images (Rosengren
et al., 2003). Despite not directly comparable with
PDQuest, or with any other commercial software, be-
cause the detected spots are not exactly the same as
the detected by our approach, results from matching
the same images used in Table 1 are presented in Ta-
ble 2 using PDQuest Version 8.0.1 build 055.
Although the number of false matched spots is
BIOINFORMATICS 2012 - International Conference on Bioinformatics Models, Methods and Algorithms
116