Table 1: Required theoretical iteration number of RAN-
SAC (Fischler and Bolles, 1981) combined with minimal
methods (columns) with confidence set to 95% on different
outlier levels (rows).
Confidence 95%
Outl. 6 7 8
50% 190 382 765
80% ∼ 10
4
∼ 10
5
∼ 10
6
95% ∼ 10
8
∼ 10
9
∼ 10
10
99% ∼ 10
12
∼ 10
14
∼ 10
16
In order to overcame the approximative nature of
the proposed approximating six-point technique we
combined it with a recent variant of locally optimized
RANSAC (Chum et al., 2003). We chose Graph-Cut
RANSAC (Barath and Matas, 2017) (GC-RANSAC)
as robust estimator since it can be considered as state-
of-the-art and its source code is publicly available
1
.
Briefly, it replaces the local optimization step of LO-
RANSAC with graph-cut applied to the current best
model. In the local optimization step, we used the
normalized eight-point algorithm. Thus the approxi-
mated fundamental matrix is used only as an initial es-
timate to determine a set of inliers, then the obtained F
is refined iteratively exploiting a set of corresponden-
ces, i.e. the inliers. We used the same parameters as
the authors proposed: the inlier-outlier threshold was
set to 0.31, the iteration limit to 5000 and the weight
of the spatial coherence term was 0.14.
We used the AdelaideRMF, Kusvod2, Multi-H,
and Strecha datasets (see Fig. 2) to evaluate the propo-
sed method on real world data. AdelaideRMF, Kus-
vod2 and Multi-H contains image pairs of size from
455 × 341 to 2592 ×1944 and manually annotated
point correspondences (assigned to the outlier or in-
lier classes) for each pair. Since the reference points
do not contain rotation components we detected and
matched points applying ORB feature detector (Ru-
blee et al., 2011). ORB features contain the orienta-
tion and the point coordinates.
Strecha dataset contains image sequences toget-
her with projection matrices. Each image is of resolu-
tion 3072 ×2048. The fundamental matrices are esti-
mated for all possible image pairs in every sequence.
Correspondences were obtained by ORB detector and
the ground truth fundamental matrices were calcula-
ted from the given projection matrices (Hartley and
Zisserman, 2003). All detected point pairs were con-
sidered as reference points for which the symmetric
epipolar distance (Hartley and Zisserman, 2003) from
the ground truth F was smaller than 1.0 pixels. To dis-
card not stable image pairs, the minimum reference
point number was set to 10. Thus every image pair
1
https://github.com/danini/graph-cut-ransac
for which less than 10 correspondences were closer
to the ground truth F than one pixels was not used in
the evaluation.
We used the reference point sets to validate the
estimated fundamental matrices. The reported geo-
metric errors were computed as the mean symmetric
epipolar distance as
1
2
∑
(p
1
,p
2
)∈P
R
Fp
1
q
(Fp
1
)
2
1
+ (Fp
1
)
2
2
+
p
T
2
F
q
(p
T
2
F)
2
1
+ (p
T
2
F)
2
2
,
(11)
where P
R
is the set consisting of the reference point
correspondences.
The competitor methods, i.e. the minimal sol-
vers combined with GC-RANSAC, were the normali-
zed eight- and seven-point algorithms
2
. In the least-
squares model re-fitting step of GC-RANSAC, the
normalized eight-point method was applied using the
current inlier set.
Table 3 reports the mean result of 100 runs on
every pair from the Strecha dataset. The first column
denotes the name of the sequence, the second one is
the number of the image pairs used – the ones for
which more than 10 reference points were kept. The
next two blocks, each consisting of three columns,
shows the results of the methods if the confidence
of GC-RANSAC is set to 99% (first block) and for a
strict time limit (60 FPS; second block). The reported
properties are the mean and median geometric errors
of the estimated fundamental matrices (Eq. 11) w.r.t.
the reference point sets, and the number of the sam-
ples, i.e. iterations, drawn by GC-RANSAC. It can
be seen that for no time limit (first three columns),
the seven-point algorithm obtains the most accurate
results on average. This is not surprising since it is
a consistent estimator (for no noise the error is zero)
and “inifite” time is given to get the most accurate re-
sult. It can also be seen that if there is a strict time
limit to achieve real time performance (last three co-
lumns), the proposed method yields the most accurate
results.
Table 2 shows the mean results on AdelaideRMF,
Kusvod2 and Multi-H datasets (first column) if the
confidence is set to 99% (third – sixth column). The
last three columns report the results if there is a strict
60 frames-per-seconds (FPS) time limit. The same
property can be seen as for the Strecha dataset: (i) for
no time limit, the best is the seven-point algorithm.
(ii) For 1/60 FPS, the proposed approximating six-
point method combined with GC-RANSAC leads to
the most accurate fundamental matrix estimates.
2
The implementation provided in OpenCV is used for
the eight- and seven-point algorithms.
VISAPP 2018 - International Conference on Computer Vision Theory and Applications
468