loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Diogo Pratas ; Armando J. Pinho and Sara P. Garcia

Affiliation: University of Aveiro, Portugal

Keyword(s): Normalized-compression distance, Finite-context models, Human chromosomal similarity.

Related Ontology Subjects/Areas/Topics: Algorithms and Software Tools ; Bioinformatics ; Biomedical Engineering ; Sequence Analysis

Abstract: A compression-based similarity measure assesses the similarity between two objects using the number of bits needed to describe one of them when a description of the other is available. For being effective, these measures have to rely on “normal” compression algorithms, roughly meaning that they have to be able to build an internal model of the data being compressed. Often, we find that good “normal” compression methods are slow and those that are fast do not provide acceptable results. In this paper, we propose a method for measuring the similarity of DNA sequences that balances these two goals. The method relies on a mixture of finite-context models and is compared with other methods, including XM, the state-of-the-art DNA compression technique. Moreover, we present a comprehensive study of the inter-chromosomal similarity of the human genome.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.145.186.173

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Pratas, D.; J. Pinho, A. and P. Garcia, S. (2012). COMPUTATION OF THE NORMALIZED COMPRESSION DISTANCE OF DNA SEQUENCES USING A MIXTURE OF FINITE-CONTEXT MODELS. In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2012) - BIOINFORMATICS; ISBN 978-989-8425-90-4; ISSN 2184-4305, SciTePress, pages 308-311. DOI: 10.5220/0003780203080311

@conference{bioinformatics12,
author={Diogo Pratas. and Armando {J. Pinho}. and Sara {P. Garcia}.},
title={COMPUTATION OF THE NORMALIZED COMPRESSION DISTANCE OF DNA SEQUENCES USING A MIXTURE OF FINITE-CONTEXT MODELS},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2012) - BIOINFORMATICS},
year={2012},
pages={308-311},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003780203080311},
isbn={978-989-8425-90-4},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2012) - BIOINFORMATICS
TI - COMPUTATION OF THE NORMALIZED COMPRESSION DISTANCE OF DNA SEQUENCES USING A MIXTURE OF FINITE-CONTEXT MODELS
SN - 978-989-8425-90-4
IS - 2184-4305
AU - Pratas, D.
AU - J. Pinho, A.
AU - P. Garcia, S.
PY - 2012
SP - 308
EP - 311
DO - 10.5220/0003780203080311
PB - SciTePress