Clustering Big Data

Michele Ianni; Elio Masciari; Giuseppe M. Mazzeo; Carlo Zaniolo

Topics: Big Data Search and Mining

In Proceedings of the 7th International Conference on Data Science, Technology and Applications (DATA), Volume 1, pages 276-282, 2018, Porto, Portugal

Authors: Michele Ianni ¹; Elio Masciari ²; Giuseppe M. Mazzeo ³; Carlo Zaniolo ⁴

Affiliations: ¹ DIMES, University of Calabria, Rende (CS), Italy; ² ICAR-CNR, Rende (CS), Italy; ³ Facebook, Menlo Park, U.S.A.; ⁴ UCLA, Los Angeles, U.S.A.

Keyword(s): Clustering, Big Data, Spark.

Abstract: The need to support advanced analytics on Big Data is driving data scientists' interest toward massively parallel distributed systems and the software platforms, such as Map-Reduce and Spark, that make their scalable utilization possible. However, when complex data mining algorithms are required, their fully scalable deployment on such platforms faces a number of technical challenges that grow with the complexity of the algorithms involved. Thus, algorithms that were originally designed for sequential execution must often be redesigned in order to use the distributed computational resources effectively. In this paper, we explore these problems and then propose a solution that has proven to be very effective on the complex hierarchical clustering algorithm CLUBS+. By using four stages of successive refinement, CLUBS+ delivers high-quality clusters of data grouped around their centroids, working in a totally unsupervised fashion. Experimental results confirm the accuracy and scalability of CLUBS+ on Map-Reduce platforms.
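
As a rough illustration of the redesign the abstract refers to, the sketch below computes cluster centroids in a map-reduce style: each partition is summarised independently, and the partial summaries are then merged. This is not CLUBS+ and not the authors' implementation. It assumes cluster assignments are already known, uses plain Python rather than Spark, and all function names (map_partition, merge, centroids) are hypothetical.

```python
from functools import reduce

def map_partition(points):
    """Map step: summarise one partition as {cluster_id: (coordinate_sums, count)}."""
    partial = {}
    for cluster_id, coords in points:
        sums, count = partial.get(cluster_id, ([0.0] * len(coords), 0))
        partial[cluster_id] = ([s + c for s, c in zip(sums, coords)], count + 1)
    return partial

def merge(a, b):
    """Reduce step: combine two partial summaries (associative and commutative)."""
    merged = dict(a)
    for cluster_id, (sums_b, count_b) in b.items():
        if cluster_id in merged:
            sums_a, count_a = merged[cluster_id]
            merged[cluster_id] = (
                [x + y for x, y in zip(sums_a, sums_b)],
                count_a + count_b,
            )
        else:
            merged[cluster_id] = (sums_b, count_b)
    return merged

def centroids(partitions):
    """Merge all per-partition summaries, then divide sums by counts."""
    total = reduce(merge, map(map_partition, partitions), {})
    return {cid: [s / n for s in sums] for cid, (sums, n) in total.items()}

# Two hypothetical partitions of (cluster_id, coordinates) pairs.
partitions = [
    [(0, (0.0, 0.0)), (1, (10.0, 10.0))],
    [(0, (2.0, 4.0)), (1, (12.0, 14.0))],
]
result = centroids(partitions)
print(result)
```

Because the merge step is associative and commutative, the per-partition summaries can be combined in any order, which is what allows this kind of computation to be spread across workers.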

CC BY-NC-ND 4.0

Paper citation in several formats:

Ianni, M., Masciari, E., Mazzeo, G. M. and Zaniolo, C. (2018). Clustering Big Data. In Proceedings of the 7th International Conference on Data Science, Technology and Applications - DATA; ISBN 978-989-758-318-6; ISSN 2184-285X, SciTePress, pages 276-282. DOI: 10.5220/0006858702760282

@conference{data18,
author={Michele Ianni and Elio Masciari and Giuseppe M. Mazzeo and Carlo Zaniolo},
title={Clustering Big Data},
booktitle={Proceedings of the 7th International Conference on Data Science, Technology and Applications - DATA},
year={2018},
pages={276--282},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006858702760282},
isbn={978-989-758-318-6},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 7th International Conference on Data Science, Technology and Applications - DATA
TI - Clustering Big Data
SN - 978-989-758-318-6
IS - 2184-285X
AU - Ianni, M.
AU - Masciari, E.
AU - Mazzeo, G. M.
AU - Zaniolo, C.
PY - 2018
SP - 276
EP - 282
DO - 10.5220/0006858702760282
PB - SciTePress