Analysis of Centroid Cluster in X-Means Clustering in Data
Classification: Power Absorb Oxygen
Sardo Pardingotan Sipayung
1
, Poltak Sihombing
1
and Sutarman
2
1
Department of Computer Science, Faculty of Computer Science and Information Technology, Universitas Sumatera Utara,
Medan, Indonesia
2
Department of Information Technology, Faculty of Computer Science and Information Technology, Universitas Sumatera
Utara
Keywords: Oxygen, Cluster, Centroid, X-Mean.
Abstract: On gardens city of Medan, there are type different trees. On every tree have power absorbency oxygen and
work issue oxygen every day. It will be grouping the tree data that issued oxygen with use X-Means method
on Clustering algorithm. Then in research, an analysis to centroid that is point data center inside process
grouping, then need t o a n analysis centroid in determining gift value early to process the beginning
of clustering. So that data was used as point center cluster on process X-Means clustering algorithm.
1 INTRODUCTION
Centroid cluster selected in a manner random through
a number of K-cluster. Algorithm share the data
provided to in K-cluster, respectively have
membership cluster own and set every data point to
center mass closest. Then compile reset it centroid use
association cluster when this and if grouping not
fused, the process will be repeat to several times. X-
means clustering is variation from K-means
clustering treat allocation cluster with try partition
over and over and keep separation optimal results,
arrive some criteria achieved. X-mean cluster with do
grouping intrinsic in a data set that is not labeled.
Giving fast way and efficient for grouping data that
doesn't structure, usage concurrency with speed up
process model and construction use.
Point center cluster or centroid is a point early
start grouping in the cluster on algorithm K-Means.
Data grouping is done with calculating distance
closest with point center initial cluster as point central
information every group or cluster. However on its
application, determination point center initial cluster
this is what become weakness from algorithm K-
Means. This caused because not there is an approach
used to choose and determine point center cluster.
Point center cluster selected in a manner just any or
random from a set of data. The results clustering from
algorithm K-Means often less optimal and not
maximum in every experiment conducted. By
because that, can say it that well bad the results
clustering, very depend on point center cluster or
centroid beginning (Baswade, 2013).
Some researchers have looked for the problem of
k-means clustering and some have taken many
approaches to accelerate k-means. But several
methods have been introduced to scalability and
reduce the time complexity of the k-means algorithm.
(Pelleg, 2000) has proposed a method called X-
means. The purpose of this method is to divide
several centroids into two to match the data reached.
The X-means algorithm has proven to be more
efficient than k-means. This method does not have
any disadvantages, based on the BIC (Bayesian
Information Criterion) on the separation of many
centroid selections when the data is not completely
spherical.
2 RESEARCH METHODS
2.1 Clustering
Clustering is method classify or partition data inside
a dataset. On basically clustering are something
method for looking for and group data that has
similarity characteristic (similarity) between one data
with other data (Bhusare, 2014). The Cluster is a
group data objects that have similarity one each other
Sipayung, S., Sihombing, P. and Sutarman, .
Analysis of Centroid Cluster in X-Means Clustering in Data Classification: Power Absorb Oxygen.
DOI: 10.5220/0008547601350137
In Proceedings of the International Conference on Natural Resources and Technology (ICONART 2019), pages 135-137
ISBN: 978-989-758-404-6
Copyright
c
2019 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved
135