ROBUST CENTROID-BASED CLUSTERING USING DERIVATIVES OF PEARSON CORRELATION

Marc Strickert; Nese Sreenivasulu; Thomas Villmann; Barbara Hammer

Topics: Biometrics; Computational Intelligence; Medical Signal Acquisition, Analysis and Processing; Neural Networks; Pattern Recognition

In Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing - Volume 2: BIOSIGNALS, 197-203, 2008, Funchal, Madeira, Portugal

Authors: Marc Strickert ¹ ; Nese Sreenivasulu ¹ ; Thomas Villmann ² and Barbara Hammer ³

Affiliations: ¹ Leibniz Institute of Plant Genetics and Crop Plant Research Gatersleben, Germany ; ² Clinic for Psychotherapy, University of Leipzig, Germany ; ³ Institute of Computer Science, University of Clausthal, Germany

Keyword(s): Centroid-based clustering, correlation, quantization cost optimization.

Related Ontology Subjects/Areas/Topics: Applications ; Applications and Services ; Artificial Intelligence ; Biomedical Engineering ; Biomedical Signal Processing ; Biometrics ; Biometrics and Pattern Recognition ; Computational Intelligence ; Computer Vision, Visualization and Computer Graphics ; Data Manipulation ; Health Engineering and Technology Applications ; Human-Computer Interaction ; Medical Image Detection, Acquisition, Analysis and Processing ; Methodologies and Methods ; Multimedia ; Multimedia Signal Processing ; Neural Networks ; Neurocomputing ; Neurotechnology, Electronics and Informatics ; Pattern Recognition ; Physiological Computing Systems ; Sensor Networks ; Signal Processing ; Soft Computing ; Telecommunications ; Theory and Methods

Abstract: Modern high-throughput facilities provide the basis of -omics research by delivering extensive biomedical data sets. Mass spectra, multi-channel chromatograms, and cDNA arrays are examples of such data sources, for which accurate analysis is desired. Centroid-based clustering provides helpful data abstraction by representing sets of similar data vectors by characteristic prototypes placed in high-density regions of the data space. In this way, specific modes can be detected, for example, in gene expression profiles or in lists of protein and metabolite abundances. Despite their widespread use, k-means and self-organizing maps (SOM) often produce only suboptimal centroids: the final clusters depend strongly on the initialization, and they do not quantize the data as accurately as possible, particularly if a measure other than the Euclidean distance is chosen for data comparison. Neural gas (NG) is a mathematically rigorous clustering method that optimizes the centroid positions by minimizing their quantization errors. Originally formulated for the Euclidean distance, NG is mathematically generalized in this work to give accurate and robust results for the Pearson correlation similarity measure. The benefits of the new NG for correlation (NG-C) are demonstrated for sets of gene expression data and mass spectra.
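
The abstract does not state the NG-C update rules, so the following is only a minimal sketch of the general idea, not the authors' formulation. It assumes the standard neural gas scheme (prototypes ranked per sample, updates weighted by exp(-rank/lambda)) and replaces the Euclidean step with gradient ascent on the Pearson correlation r(x, w), using the derivative dr/dw_k = xc_k / (|xc| |wc|) - r * wc_k / |wc|^2, where xc and wc are the mean-centered vectors. The function names, the learning rate `eps`, and the annealing schedule for `lam` are hypothetical choices made for illustration.

```python
import math
import random


def center(v):
    """Subtract the mean so that Pearson correlation becomes a cosine."""
    m = sum(v) / len(v)
    return [a - m for a in v]


def pearson(x, w):
    """Pearson correlation r(x, w); undefined for constant vectors."""
    xc, wc = center(x), center(w)
    num = sum(a * b for a, b in zip(xc, wc))
    den = math.sqrt(sum(a * a for a in xc) * sum(b * b for b in wc))
    return num / den


def pearson_grad(x, w):
    """Gradient of r(x, w) with respect to w.

    dr/dw_k = xc_k / (|xc| |wc|) - r * wc_k / |wc|^2
    """
    xc, wc = center(x), center(w)
    nx = math.sqrt(sum(a * a for a in xc))
    nw = math.sqrt(sum(b * b for b in wc))
    r = sum(a * b for a, b in zip(xc, wc)) / (nx * nw)
    return [a / (nx * nw) - r * b / (nw * nw) for a, b in zip(xc, wc)]


def neural_gas_correlation(data, n_prototypes=2, epochs=50,
                           eps=0.1, lam0=1.0, seed=0):
    """Rank-based prototype updates that increase correlation (illustrative)."""
    rng = random.Random(seed)
    # Initialise prototypes as slightly perturbed random data vectors.
    protos = [[a + rng.gauss(0, 0.01) for a in rng.choice(data)]
              for _ in range(n_prototypes)]
    for epoch in range(epochs):
        # Shrink the neighbourhood range over time (hypothetical schedule).
        lam = lam0 * (0.01 / lam0) ** (epoch / max(1, epochs - 1))
        for x in rng.sample(data, len(data)):
            # Rank prototypes from most to least correlated with x.
            order = sorted(range(n_prototypes),
                           key=lambda i: -pearson(x, protos[i]))
            for rank, i in enumerate(order):
                h = math.exp(-rank / lam)  # neighbourhood weight by rank
                g = pearson_grad(x, protos[i])
                protos[i] = [w + eps * h * gk for w, gk in zip(protos[i], g)]
    return protos


# Toy profiles: three rising and three falling, with varied offset and scale.
data = [[1, 2, 3, 4], [2, 4, 6, 8.5], [10, 11, 12, 13.2],
        [4, 3, 2, 1], [8, 6, 4, 1.5], [5, 4.1, 3, 2]]
prototypes = neural_gas_correlation(data)
for p in prototypes:
    print([round(pearson(x, p), 2) for x in data])
```

Because correlation ignores offset and scale, the gradient has zero mean and is orthogonal to the centered prototype, so each step changes the shape of a prototype rather than its level. As with any prototype method, the result of this sketch still depends on the random initialization.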

CC BY-NC-ND 4.0

Paper citation in several formats:

Strickert, M., Sreenivasulu, N., Villmann, T. and Hammer, B. (2008). ROBUST CENTROID-BASED CLUSTERING USING DERIVATIVES OF PEARSON CORRELATION. In Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2008) - Volume 2: BIOSIGNALS; ISBN 978-989-8111-18-0; ISSN 2184-4305, SciTePress, pages 197-203. DOI: 10.5220/0001062601970203

@conference{biosignals08,
author={Marc Strickert and Nese Sreenivasulu and Thomas Villmann and Barbara Hammer},
title={ROBUST CENTROID-BASED CLUSTERING USING DERIVATIVES OF PEARSON CORRELATION},
booktitle={Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2008) - Volume 2: BIOSIGNALS},
year={2008},
pages={197-203},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001062601970203},
isbn={978-989-8111-18-0},
issn={2184-4305},
}

TY - CONF
JO - Proceedings of the First International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2008) - Volume 2: BIOSIGNALS
TI - ROBUST CENTROID-BASED CLUSTERING USING DERIVATIVES OF PEARSON CORRELATION
SN - 978-989-8111-18-0
IS - 2184-4305
AU - Strickert, M.
AU - Sreenivasulu, N.
AU - Villmann, T.
AU - Hammer, B.
PY - 2008
SP - 197
EP - 203
DO - 10.5220/0001062601970203
PB - SciTePress