Authors: Ilias K. Savvas 1 and M-Tahar Kechadi 2

Affiliations: 1 T.E.I. of Larissa, Greece ; 2 UCD, Ireland

Keyword(s): MapReduce, HDFS, Hadoop, Clustering, K-means Algorithm.

Related Ontology Subjects/Areas/Topics: Cloud Application Architectures ; Cloud Application Scalability and Availability ; Cloud Computing ; Cloud Middleware Frameworks ; Energy and Economy ; Load Balancing in Smart Grids ; Platforms and Applications ; Smart Grids

Abstract: The Apache Hadoop software library is a framework for the distributed processing of large data sets, HDFS is a distributed file system that provides data-driven applications with high-throughput access to data, and MapReduce is a software framework for distributed computing on large data sets. Huge collections of raw data require a fast and accurate mining process in order to extract useful knowledge. One of the most popular data mining techniques is the K-means clustering algorithm. In this paper, we develop a distributed version of the K-means algorithm using the MapReduce framework on the Hadoop Distributed File System. Both the theoretical analysis and the experimental results demonstrate the efficiency of the technique.
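
The abstract does not give the details of the distributed algorithm, so the following is only a minimal single-process sketch of how one K-means iteration is commonly split into map and reduce steps, not the authors' implementation. It assumes points are numeric tuples, uses Euclidean distance, and stands in for Hadoop's shuffle phase with an in-memory dictionary. All function names (`map_point`, `reduce_cluster`, `kmeans_iteration`, `kmeans`) are hypothetical.

```python
from collections import defaultdict
from math import dist  # Euclidean distance, Python 3.8+


def map_point(point, centroids):
    """Map step: emit (index of nearest centroid, (point, 1))."""
    nearest = min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))
    return nearest, (point, 1)


def reduce_cluster(values):
    """Reduce step: average all points assigned to one centroid."""
    total, count = None, 0
    for point, n in values:
        total = list(point) if total is None else [a + b for a, b in zip(total, point)]
        count += n
    return tuple(s / count for s in total)


def kmeans_iteration(points, centroids):
    """One map/shuffle/reduce round (assumption: everything fits in memory)."""
    groups = defaultdict(list)  # stands in for the shuffle phase
    for p in points:
        key, value = map_point(p, centroids)
        groups[key].append(value)
    # A centroid that attracted no points keeps its previous position.
    return [
        reduce_cluster(groups[i]) if i in groups else centroids[i]
        for i in range(len(centroids))
    ]


def kmeans(points, centroids, max_iter=100, tol=1e-9):
    """Repeat iterations until no centroid moves by more than tol."""
    for _ in range(max_iter):
        updated = kmeans_iteration(points, centroids)
        if all(dist(a, b) <= tol for a, b in zip(updated, centroids)):
            return updated
        centroids = updated
    return centroids


if __name__ == "__main__":
    data = [(0, 0), (0, 2), (10, 10), (10, 12)]
    print(kmeans(data, [(0, 0), (10, 10)]))
```

On a real cluster each round would typically be a separate MapReduce job, with the current centroids distributed to every mapper; how the paper handles that step is not stated in the abstract.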

CC BY-NC-ND 4.0

Paper citation in several formats:
Savvas, I. K. and Kechadi, M-T. (2012). MINING ON THE CLOUD - K-means with MapReduce. In Proceedings of the 2nd International Conference on Cloud Computing and Services Science - CLOSER; ISBN 978-989-8565-05-1; ISSN 2184-5042, SciTePress, pages 413-418. DOI: 10.5220/0003927204130418

@conference{closer12,
author={Ilias K. Savvas and M{-}Tahar Kechadi},
title={MINING ON THE CLOUD - K-means with MapReduce},
booktitle={Proceedings of the 2nd International Conference on Cloud Computing and Services Science - CLOSER},
year={2012},
pages={413-418},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003927204130418},
isbn={978-989-8565-05-1},
issn={2184-5042},
}

TY - CONF

JO - Proceedings of the 2nd International Conference on Cloud Computing and Services Science - CLOSER
TI - MINING ON THE CLOUD - K-means with MapReduce
SN - 978-989-8565-05-1
IS - 2184-5042
AU - Savvas, Ilias K.
AU - Kechadi, M-Tahar
PY - 2012
SP - 413
EP - 418
DO - 10.5220/0003927204130418
PB - SciTePress