A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES

Taysir H. A. Soliman; Tarek F. Gharib; Alshaimaa Abo-Alian; Mohammed Alsharkawy

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES

Topics: Datamining

In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS, 435-441, 2008 , Barcelona, Spain

Authors: Taysir H. A. Soliman ¹ ; Tarek F. Gharib ² ; Alshaimaa Abo-Alian ² and Mohammed Alsharkawy ²

Affiliations: ¹ Faculty of Computer and Information, Assiut University, Egypt ; ² Faculty of Computer and Information Sciences, Ain Shams University, Egypt

Keyword(s): Lossless Compression Algorithm, encoding, approximate repeats, palindrome.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Business Analytics ; Data Engineering ; Data Mining ; Databases and Information Systems Integration ; Datamining ; Enterprise Information Systems ; Health Information Systems ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Homology search is the seed for both genomics and proteomics research. However, the increase of the amount of DNA sequences requires efficient computational algorithms for performing sequence comparison and analysis. This is due to the fact that standard compression algorithms are not able to compress DNA sequences because they do not consider special characteristics of DNA sequences (i.e. DNA sequences contain several approximate repeats and complimentary palindromes are frequent in DNA). Recently, new algorithms have been proposed to compress DNA sequences, often using detection of long approximate repeats. The current work proposes a Lossless Compression Algorithm (LCA), providing a new encoding method. LCA achieves a better compression ratio than that of existing DNA-oriented compression algorithms, when compared to GenCompress and DNACompress, using nine different datasets.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.219

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

H. A. Soliman, T., F. Gharib, T., Abo-Alian, A. and Alsharkawy, M. (2008). A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES. In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS; ISBN 978-989-8111-37-1; ISSN 2184-4992, SciTePress, pages 435-441. DOI: 10.5220/0001683504350441

@conference{iceis08,
author={Taysir {H. A. Soliman} and Tarek {F. Gharib} and Alshaimaa Abo{-}Alian and Mohammed Alsharkawy},
title={A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS},
year={2008},
pages={435-441},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001683504350441},
isbn={978-989-8111-37-1},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS
TI - A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES
SN - 978-989-8111-37-1
IS - 2184-4992
AU - H. A. Soliman, T.
AU - F. Gharib, T.
AU - Abo-Alian, A.
AU - Alsharkawy, M.
PY - 2008
SP - 435
EP - 441
DO - 10.5220/0001683504350441
PB - SciTePress