Visualization of the Spread Covid-19 Spread in East Nusa Tenggara
using the K-Means Data Mining Classification Method
Lita A. Ndoloe, Petrisia W. Sudarmadji and Christa E. B. Bire
Department of Informatic Engineering, Kupang State Polytechnic, Jalan Adji Sucipto, Indonesia
Keywords: Covid19, Mining Data, Kmeans, Web Applications.
Abstract: Covid-19 is a contagious disease, and is characterized by acute respiratory symptoms (SARS-CoV-2). The
COVID-19 is easily transmitted from one human to another through coughing or sneezing droplets (droplets).
The spread of COVID-19 cases in Indonesia is categorized as quite fast and has a negative impact on all fields.
The large area of the State of Indonesia is a problem in tracking the spread rate of Covid-19 in each province.
K-Means is a clustering algorithm that is used to group data into several groups by maximizing the similarity
of the data in a cluster. The implementation of k-means in a web-based application system aims to facilitate
the analysis of the spread of Covid-19 in East Nusa Tenggara Province. With this convenience, not only the
system can display information on the spread rate of Covid-19 informatively but also attract readers' interest
through a graphic display that makes it easier for readers to get complete information in one view.
1
INTRODUCTION
Covid-19 is a contagious disease, and characterized
by acute respiratory symptoms (SARSCoV-2). The
Covid-19 is easily transmitted from one human to
another through coughing/sneezing droplets
(droplets) originating from an infected human body.
Droplets containing the covid-19 virus can stick to
objects that are often touched. Consequently, humans
can get infected by touching the surface of the object
and then touching the face parts (For example, eyes,
nose, and mouth). Due to the ease of transmission, as
of April 3rd, 2021, the number of positive cases in
Indonesia has reached 1.5 million cases with 40
thousand deaths (Asroni, 2012).
The spread of Covid-19 cases that is evenly
distributed throughout Indonesia is a fairly rapid
spread and has a negative impact on all fields. The
vast territory of Indonesia allows the need for
grouping parts based on provinces in Indonesia.
Information regarding the level of spread of Covid-19
cases in each province of Indonesia can be found on
online media or news. However, the current delivery
of information tends to be textual, making it less
informative. With the large number of levels of the
spread of The COVID-19 Virus, the delivery of
information is textually irrelevant and does not attract
readers’ interests.
K-Means is one of the clustering algorithms
included in the unsupervised learning group which is
used to group data into several groups with a partition
system. This algorithm groups the data based on the
cluster center point (centroid) closest to the data. The
purpose of k-means is to group data by maximizing
the similarity of data in one cluster and minimizing
the similarity of data among clusters (Fitri Larasati,
2017).
The application of k-means in cases of the Covid-
19 spread can be used as a way to measure the level
of the Covid-19 virus spread in each province in
Indonesia. With accurate information on the level of
the Covid-19 spread, it can provide efforts for those
who take action to prevent the spread of this virus. It
also becomes a reference for the public to avoid places
that are the centre of the Covid-19 virus spread.
2
RESEARCH METHODS
In this study, the author uses the k-means algorithm
method to analyze the results of data on the spread of
the Covid-19 virus and the stages of the System
Development Life Cycle (SDLC) for system
development (Muhammad Ikbal, 2021).