to create a height map of a given image with no prior
information (Zhou et al., 2017).
However, using a downwards facing camera on
a drone enables the advantage of stereo correspon-
dence between sequential images. Previous work
has successfully implemented photogrammetric ae-
rial depth triangulation to estimate elevation or heig-
hts using vision (Schenk, 1997), (Hadjutheodorou,
1963), (Matthies et al., 1989), (Choi and Lee, 2012).
Using the stereo principle on sequential images from
the drone seems feasible to calculate the distance to
the ground based on the motion of the drone, as has
previously been done on rotor-drones (Campos et al.,
2016). This paper explores the possibility of enabling
automatic free height estimation using optical flow
and stereo vision principles and calculate a reliability
measure for the operator on the ground. Investigating
similar work where computer vision has been used
for obstacle avoidance or mapping, the most common
methods used are optical flow or SLAM based appro-
aches (Lu et al., 2018). Feature-based methods have
rarely been used despite its potential low cost. One
example of feature-based previous work is the use of
SIFT to detect obstacles by tracking size expansion
ratios in sequential images (Al-Kaff et al., 2016).
We therefore propose to evaluate feature-based
height estimation using a single monoscopic RGB ca-
mera in order to assess potential shortcomings and
error sources of such a system. A feature-based sy-
stem would not be affected by changing illumination,
which is an unavoidable aspect in the outdoors con-
text, and is more robust in handling sudden changes in
speed or rotation. The system will be suited to drones
operating above 100 meters, and assuming a relatively
constant flight level. To this end, the existing sensors
on the drone can be used to retrieve the missing va-
riables, available from the drone’s inertial measure-
ment unit (IMU) and GPS. Based on these findings,
an evaluation should suggest potential pitfalls for fu-
ture works, and assess the viability of estimating free
height of a drone using a single RGB camera.
3 METHODS
We explore the use of the stereo equation in combi-
nation with feature detection and matching to calcu-
late the height of the drone and the reliability thereof
using sequential images from a downwards facing ca-
mera mounted to the drone. This section provides an
overview of the applied feature tracking, height esti-
mation, and reliability measure methods. The evalua-
tion approach and test methodology is also described
to clarify aspects such as drone data retrieval and ot-
her issues encountered through the process.
3.1 Feature Detection and Tracking
The system requires a method to detect robust fe-
atures that can be matched between sequential fra-
mes. Oriented FAST and Rotated BRIEF (ORB), was
chosen based on the speed of computation and rela-
tive robustness. Even though other methods such as
Scale Invariant Feature Transform (SIFT) are scale
invariant, ORB outperforms them in execution time
and with comparable accuracy allowing for real-time
tracking (Rublee et al., 2011).
ORB is a feature detector made for real-time
computations and low-power devices. It builds on
the Features from Accelerated Segment Test (FAST)
keypoint detector and a variant of the Binary Ro-
bust Independent Elementary Features (BRIEF) des-
criptor. The FAST method performs well in high
speed corner detection, by considering a circle of a
set amount pixels around a corner candidate. If the
brightness of these pixels are darker or brighter than
the candidate with a given threshold it is considered
a corner (Rosten et al., 2010). The BRIEF procedure
allows a shortcut in finding the binary strings without
having to find the descriptors. It takes a smoothened
part of an image and finds a chosen amount of loca-
tion pairs. A pixel intensity comparison are done to
these pairs.
ORB employs a Harris corner filter to discard ed-
ges. FAST keypoints are computed with an orienta-
tion component, while BRIEF descriptors are consi-
dered rotation-invariant using steering according to
the orientation of the keypoints. According to the
authors, ORB performs as good as SIFT and better
than SURF on their evaluation data while being up
to factor-2 faster. Furthermore, ORB is derived with
the purpose of running in real-time or for low-power
systems. However, an issue with ORB, compared to
other feature detectors, is that it is not scale invari-
ant. As the drone used in this test is flying at relati-
vely constant altitude between two frames, we assume
only little accuracy is missed from this. For applica-
tions where scale invariance is a necessity, computa-
tionally heavier solutions might be needed, such as
SIFT or SURF. As this is intended as an on-line solu-
tion for constant flight level, the system benefits from
the drone’s constant altitude by using less computati-
onally heavy feature detection.
With these arguments we hypothesize that ORB
is sufficient to detect and track features for reliable
disparity calculations for use in height estimation.
VISAPP 2019 - 14th International Conference on Computer Vision Theory and Applications
752