2.1 Conventional Approaches
The conventional approaches for vehicle detection
commonly adopt the image feature extraction tech-
niques and classification algorithms.
Vertical and Horizontal Edges. The edge has al-
ways been an important feature in computer vi-
sion applications. Srinivasa (Srinivasa, 2002) and
Sun em et al. (Sun et al., 2002) used this char-
acteristic to identify front vehicles. However, the
problem with the edge methods is how to set a
suitable threshold. Different thresholds must be
used for different application scenarios to obtain
the best results. Although the methods can be
improved in an adaptive manner, when the color
of the vehicle body is similar to the surrounding
environment, the edges cannot be detected easily
and correctly.
Shadow. Extracting features directly from the ob-
jects is sometimes not the best solution for ob-
ject detection. The information around the object
might be used effectively. One good example is
the shadow under the vehicle. Based on the ob-
servation, the darkest part of the image has a very
high probability of being located at the bottom of
the vehicle region. According to this characteris-
tic, Tzomakas and von eelen (Tzomakas and von
Seelen, 1998) proposed a way to effectively de-
termine the gray value threshold to detect the ve-
hicle. However, this method is prone to misjudge
the shadow regions in the night scenes.
Tail Light. To deal with the problem of detecting ve-
hicles at night, OMalley et al. (O’Malley et al.,
2008) used tail lights to identify the front vehicle
location. The core idea is that the tail lights are
red and easy to recognize. Since some vehicles
do not have red tail lights or have the lights on,
this does not guarantee that all true positives can
be considered.
Support Vector Machine. Through the characteris-
tics of vehicles, such as the simple features de-
scribed above, we are able to detect the vehicle
location under normal conditions. For more com-
plicated situations, it is difficult to detect the ve-
hicles correctly. Some previous works proposed
to use Haar transform or HOG to extract textures,
and use support vector machines or simple neu-
ral networks for vehicle detection and verification
(Ortega et al., 2013; Sun et al., 2006a). The re-
sults obtained from this approaches are generally
better, but the computation is the key issue. The
operation time is dramatically increased due to the
bounding box extraction using a sliding window.
It is thus fatal for real-time applications.
2.2 Convolutional Neural Networks
Since the year of 2012, the convolutional neural net-
works used for object detection and classification
have gained a great success. The object detection
research is mainly divided into two categories: one-
stage and two-stage detectors.
Two-Stage Detector. The two-stage detector is the
first development with a significant detection rate
based on the recent deep learning framework. Its
architecture mainly consists of two parts: region
proposal and prediction of the content of the de-
tected bounding box (Girshick et al., 2014; Gir-
shick, 2015; Dai et al., 2016). Most region pro-
posals are designed to be very large in pursuit of
good results, and thus it slows down the comput-
ing speed of the network. In order to mitigate this
issue, Li et al. (Li et al., 2017) refer to the idea
of (Szegedy et al., 2015; Chollet, 2016) to reduce
the amount of calculation while keeping the re-
sults satisfied.
One Stage Detector. Even the two-stage detectors
have very good recognition rates, they are not suit-
able for many application scenarios in terms of the
execution speed. To deal with this problem, Red-
mon and Farhadi divide the input image into an
n ×n grid and use them to generate several dif-
ferent sizes of bounding boxes instead of using
the network with region proposal (Redmon and
Farhadi, 2016). In the proposed method, each grid
only predicts one object and the major cost is that
it does not work well for the detection of small
objects. To reduce the cost, Liu et al. propose a
method to forecast at different scales and this is
able to deal with the small size objects fairly well
(Liu et al., 2016).
3 METHOD
In the proposed technique, we first use a machine
learning algorithm to detect the vehicles, and then
SVM is adopted as a verification mechanism to ef-
fectively remove the non-vehicle bounding boxes. Fi-
nally, a tracking algorithm is carried out to reduce the
overall computation time. The real-time system can
be realized by adjusting the detection and tracking
rate. Figure 1 shows the flowchart of the proposed
method.
VEHITS 2019 - 5th International Conference on Vehicle Technology and Intelligent Transport Systems
298