Design of Audio based Accident and Crime Detection using Simple
Architecture of Neural Network
Afis Asryullah Pratama
1
, Sritrusta Sukaridhoto
1
, Mauridhi Hery Purnomo
2
, Achmad Basuki
3
,
Vita Lystianingrum
4
and Rizqi Putri Nourma Budiarti
5
1
Department of Electronic Engineering, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia
2
Department of Computer Engineering, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
3
Department of Creative Multimedia Technology, Politeknik Elektronika Negeri Surabaya, Surabaya, Indonesia
4
Department of Electrical Engineering, Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia
5
Engineering Department, Universitas Nahdlatul Ulama Surabaya, Surabaya, Indonesia
Keywords: Audio Recognition, Mel Spectrogram, CNN, RNN, Surveillance System.
Abstract: The accident and crime happened on the road still increasing nowadays. Those two events were considered
as emergency event that need a quick response. In this research, a method to detect accident and crime were
proposed. The proposed method uses audio data as input and extracting the Mel spectrogram as the feature,
which later be fed to our simple neural network architectures. We classify our dataset into engine_idling,
car_crash, and gun_shot classes to represent normal, accident, and crime condition on the road. Our simple
CNN architecture obtains accuracy of 95.31% and 93.75% with 200ms and 1000ms segment duration
respectively, and our simple RNN architecture obtains 86.67% and 58.67% by using 200ms and 1000ms
segment duration respectively. We can conclude that the best simple architecture was performed by CNN
architecture with 200ms segment duration.
1 INTRODUCTION
The transportation technology was being developed
day by day, this has an impact not only to the system
of the vehicles but also the number of vehicles and its
passengers, in Indonesia there were 146,858,759
vehicles which include passenger cars, buses, freight
cars, and motorcycles in 2018 (BPS, 2018b;
Mahfuzhon and Setyawan, 2018). The increment of
vehicles and its passengers also increase the number
of accidents happened. In 2018, there are 109,215
accidents and 29,472 deaths were recorded in
Indonesia (BPS, 2018a). Most of the death cases from
car accident were happened due to the late treatment
for the casualties (Kattukkaran, George, and Haridas,
2017).
Other than car accidents, crime also an emergency
event that needs a quick response. In 2018 there are
8,423 mugs and 90.757 snitches happened in
Indonesia. But according to statistics but only 23.44%
in 2017 and 23.99% in 2018 was reported (Badan
Pusat Statistik, 2019). The low reporting rate of
crimes mostly caused by the lack of awareness and
information about where to report it.
Therefore, we need a system to detect accidents and
crimes with capability to deliver the emergency events
happened on the road. In this research, we propose a
method to detect car crash, idling engine and gunshot
as representatives of accident and crime sounds. we use
audio data of mentioned events for surveillance
purpose. We use several audio segmentation
parameters and neural network architectures to find the
best result for the case we focused on.
2 SYSTEM DESIGN
There are many methods for audio recognition such
as analyzing both time and frequency domain of
sample audio gives an accuracy of 65%-82%
(Sammarco and Detyniecki, 2018), or by extracts the
audio feature using MFCC and inferenced with DNN
which gives an accuracy of 98.4% (Arslan and
Canbolat, 2018). The MFCC and DNN method could