loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Aalaa Mojahed 1 and Beatriz de la Iglesia 2

Affiliations: 1 University of East Anglia and King Abdulaziz University, United Kingdom ; 2 University of East Anglia, United Kingdom

Keyword(s): Heterogeneous Data, Distance Measure, Fusion, Clustering, Uncertainty.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Computational Intelligence ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems

Abstract: In this paper, we introduce heterogeneous data as data about objects that are described by different data types, for example, structured data, text, time series, images etc. We provide an initial definition of a heterogeneous object using some basic data types, namely structured and time series data, and make the definition extensible to allow for the introduction of further data types and complexity in our objects. There is currently a lack of methods to analyse and, in particular, to cluster such data. We then propose an intermediate fusion approach to calculate distance between objects in such datasets. Our approach deals with uncertainty in the distance calculation and provides a representation of it that can later be used to fine tune clustering algorithms. We provide some initial examples of our approach using a real dataset of prostate cancer patients including visualisation of both distances and uncertainty. Our approach is a preliminary step in the clustering of such heterog eneous objects as the distance between objects produced by the fusion approach can be fed to any standard clustering algorithm. Although further experimental evaluation will be required to fully validate the Fused Distance Matrix approach, this paper presents the concept through an example and shows its feasibility. The approach is extensible to other problems with objects represented by different data types, e.g. text or images. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.138.105.31

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Mojahed, A. and de la Iglesia, B. (2014). A Fusion Approach to Computing Distance for Heterogeneous Data. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR; ISBN 978-989-758-048-2; ISSN 2184-3228, SciTePress, pages 269-276. DOI: 10.5220/0005083702690276

@conference{kdir14,
author={Aalaa Mojahed. and Beatriz {de la Iglesia}.},
title={A Fusion Approach to Computing Distance for Heterogeneous Data},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR},
year={2014},
pages={269-276},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005083702690276},
isbn={978-989-758-048-2},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2014) - KDIR
TI - A Fusion Approach to Computing Distance for Heterogeneous Data
SN - 978-989-758-048-2
IS - 2184-3228
AU - Mojahed, A.
AU - de la Iglesia, B.
PY - 2014
SP - 269
EP - 276
DO - 10.5220/0005083702690276
PB - SciTePress