Authors: Taysir H. A. Soliman 1 ; Tarek F. Gharib 2 ; Alshaimaa Abo-Alian 2 and Mohammed Alsharkawy 2

Affiliations: 1 Faculty of Computer and Information, Assiut University, Egypt ; 2 Faculty of Computer and Information Sciences, Ain Shams University, Egypt

Keyword(s): Lossless Compression Algorithm, encoding, approximate repeats, palindrome.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Business Analytics ; Data Engineering ; Data Mining ; Databases and Information Systems Integration ; Datamining ; Enterprise Information Systems ; Health Information Systems ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Homology search underpins both genomics and proteomics research. However, the growing volume of DNA sequence data requires efficient computational algorithms for sequence comparison and analysis. Standard compression algorithms cannot compress DNA sequences effectively because they do not account for the special characteristics of DNA, namely that sequences contain many approximate repeats and that complementary palindromes are frequent. Recently, new algorithms have been proposed to compress DNA sequences, often by detecting long approximate repeats. The current work proposes a Lossless Compression Algorithm (LCA) that provides a new encoding method. On nine different datasets, LCA achieves a better compression ratio than the existing DNA-oriented compression algorithms GenCompress and DNACompress.
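
To make the two DNA-specific properties named in the abstract concrete, the following is a minimal Python sketch of what a complementary palindrome and an approximate repeat are. It is not the LCA encoding, which this page does not describe. The function names and the `max_mismatches` threshold are hypothetical, and approximate repeats are simplified here to substitution-only matches between equal-length strings, whereas DNA compressors may also allow insertions and deletions.

```python
# Illustrative sketch only: these helpers are assumptions for explanation,
# not the method proposed in the paper.

# Watson-Crick base pairing: A<->T, C<->G.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def is_complementary_palindrome(seq: str) -> bool:
    """True if the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


def hamming_distance(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def is_approximate_repeat(a: str, b: str, max_mismatches: int) -> bool:
    """Simplified test: b repeats a with at most max_mismatches substitutions."""
    return len(a) == len(b) and hamming_distance(a, b) <= max_mismatches
```

For example, `GAATTC` reads the same as its own reverse complement, so a compressor aware of this property could in principle refer back to an earlier occurrence on the opposite strand instead of storing the bases again. A general-purpose compressor that only looks for exact forward repeats would miss both kinds of redundancy.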

CC BY-NC-ND 4.0

Paper citation in several formats:
Soliman, T. H. A.; Gharib, T. F.; Abo-Alian, A. and Alsharkawy, M. (2008). A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES. In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS; ISBN 978-989-8111-37-1; ISSN 2184-4992, SciTePress, pages 435-441. DOI: 10.5220/0001683504350441

@conference{iceis08,
author={Taysir H. A. Soliman and Tarek F. Gharib and Alshaimaa Abo{-}Alian and Mohammed Alsharkawy},
title={A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS},
year={2008},
pages={435-441},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001683504350441},
isbn={978-989-8111-37-1},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 6: ICEIS
TI - A LOSSLESS COMPRESSION ALGORITHM FOR DNA SEQUENCES
SN - 978-989-8111-37-1
IS - 2184-4992
AU - Soliman, T. H. A.
AU - Gharib, T. F.
AU - Abo-Alian, A.
AU - Alsharkawy, M.
PY - 2008
SP - 435
EP - 441
DO - 10.5220/0001683504350441
PB - SciTePress