loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: George Volis ; Christos Makris and Andreas Kanavos

Affiliation: University of Patras, Greece

ISBN: 978-989-758-186-1

Keyword(s): Searching and Browsing, Web Information Filtering and Retrieval, Text Mining, Indexing Structures, Inverted Files, Index Compression-Gram Indexing, Sequence Analysis and Assembly.

Related Ontology Subjects/Areas/Topics: Searching and Browsing ; Web Information Systems and Technologies ; Web Interfaces and Applications

Abstract: The number and size of genomic databases have grown rapidly the last years. Consequently, the number of Internet-accessible databases has been rapidly growing .Therefore there is a need for satisfactory methods for managing this growing information. A lot of effort has been put to this direction. Contributing to this effort this paper presents two algorithms which can eliminate the amount of space for storing genomic information. Our first algorithm is based on the classic n-grams/2L technique for indexing a DNA sequence and it can convert the Inverted Index of this classic algorithm to a more compressed format. Researchers have revealed the existence of repeated and palindrome patterns in DNA of living organisms. The main motivation of this technique is based on this remark and proposes an alternative data structure for handling these sequences. Our experimental results show that our algorithm can achieve a more efficient index than the n-grams/2L algorithm and can be adapted by any algorithm that is based to n-grams/2L The second algorithm is based on the n-grams technique. Perceiving the four symbols of DNA alphabet as vertex of a square scheme imprint a DNA sequence as a relation between vertices, sides and diagonals of a square. The experimental results shows that this second idea succeed even more successfully compression of our index structure. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.227.240.143

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Volis, G.; Makris, C. and Kanavos, A. (2016). Two Novel Techniques for Space Compaction on Biological Sequences.In Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-186-1, pages 105-112. DOI: 10.5220/0005801101050112

@conference{webist16,
author={George Volis. and Christos Makris. and Andreas Kanavos.},
title={Two Novel Techniques for Space Compaction on Biological Sequences},
booktitle={Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2016},
pages={105-112},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005801101050112},
isbn={978-989-758-186-1},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - Two Novel Techniques for Space Compaction on Biological Sequences
SN - 978-989-758-186-1
AU - Volis, G.
AU - Makris, C.
AU - Kanavos, A.
PY - 2016
SP - 105
EP - 112
DO - 10.5220/0005801101050112

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.