Figure 1: Displacement of edge. True edge fits between the
2 rows of pixels, however edge must conform to the pixel
grid, therefore when maintaining single pixel edge thick-
ness, this results in 2 correct edge locations, which needs to
be accounted for during performance evaluation.
edge points or non edge points. The output binary
edge map can be objectively compared against each of
these points using a variety of methods (Savitzky and
Golay, 1964; Abdou and Pratt, 1979; Bowyer et al.,
2001; Prieto and Allen, 2003). The advantages of
using a ground truth image for reference allows for
the number of correctly detected edge points (True
Positives) missed edge points (False Negatives), in-
correct edge points (False Positives), to be measured.
These values can used to grade the output through a
variety of metrics and performance evaluation meth-
ods. Several objective performance algorithms such
as the Pratt figure of Merit (PFOM) (Abdou and Pratt,
1979), Probabilistic Rand Index (PRI) of Savitzsky
and Golay (Savitzky and Golay, 1964), Pixel Corre-
spondence Metric (PCM) of Prieto and Allen (Prieto
and Allen, 2003), Receiver Operating Characteristic
(ROC) Curves of Bowyer et al citeBowyer2001, the
Precision (P), Recall (R) and F-measures (F) of Mar-
tin et al (Martin et al., 2004), and also the Variation
of Information (VI) measure by Meil
˘
a (Meil
˘
a, 2005).
Each of these methods produce an objective metric
which grades how accurately the result image corre-
lates to the ground truth image.
While suitable for reference based objective mea-
sures, the above techniques can result in errors be-
tween the overall performance score and the visual
edge and surface detection results. Notably some ob-
jective measures do not account for a pixel (2-D) or
voxel (3-D) shift between the detected interface and
the ideal interface (Fig 1) or do not account for the
fragmentation of the detected edges which can re-
sult in incorrect assumptions about the quality of the
detection (Williams et al., 2008). Methods which
aim to evaluate the fragmentation and displacement
in detected edges often apply a one-to-many corre-
spondence match whereby for each candidate pixel in
the ground truth image, multiple candidates may be
matched in the result image as is the case with the
PFOM (Abdou and Pratt, 1979). This significantly
affects the reliability of the performance method and
leads to inaccuracies between the objective results and
the visual results (Williams et al., 2008).
To solve this problem, one-to-one correspondence
matching is required. In order to achieve optimal
one-to-one correspondence, the Hungarian algorithm
(Kuhn, 1955) for optimising solutions to the assign-
ment problem can be employed. Although the Hun-
garian algorithm provides the optimal solution to ref-
erence based edge detection performance evaluation,
the Hungarian algorithm introduces a high computa-
tional cost and is therefore impractical for large 2-D
image data sets and 3-D edge detection evaluation
such as Brats (et al., 2015). Acquiring more pro-
cessing power to solve the assignment task is not al-
ways practical, thus an efficient method for perform-
ing accurate one-to-one performance evaluation is de-
sirable, and such methods prior to this paper are lack-
ing in the literature .
The rest of the paper is structured as follows, sec-
tion 2 presents the overall problem associated with
pair matching in image edge detection evaluation.
Section 3 then presents an Efficient Paring Strategy
(EPS) algorithm for pair matching, detailing the step
by step functionality of this technique. Section 4 then
presents a comparative analysis of this against three
alternative approaches (CSA,CDM,PFOM). The ac-
curacy of the performance measure with respect to
optimal correspondence matching a Pearson pairwise
comparison is given for each of the methods against
the optimal Hungarian algorithm (Kuhn, 1955). We
then present the cost efficiency of the EPS method
compared against alternate methods, when applied to
a large data set of both 2-D and 3-D images. Fi-
nally section 5 presents the overall conclusions and
outcomes from this work and proposes the potential
for the EPS to be applied in efficient edge and surface
detection evaluation situations.
2 POINT CORRESPONDENCE IN
PERFORMANCE MEASURES
Digital images are comprised of discrete data, thus
the location of an edge (2-D) or surface (3-D) point is
constrained by the pixel or voxel resolution of the im-
age. Since these points are interfaces between regions
the true position of a region cannot be accurately rep-
resented by a discrete pixel or voxel point in an im-
age. Therefore an edge or surface detection algorithm
must position the result in accordance with the dis-
crete framework of the image and this introduces lo-
cation error (see Fig. 1 and Fig. 2c). Therefore, when
assessing the performance of these algorithms against
a ground truth an allowance for displacement should
be available to account for these localisation errors,
since detected, connected boundaries even with a dis-
placement are of value (Williams et al., 2008).
Efficient One-to-One Pair Matching for 2-D and 3-D Edge Detection Evaluation
591