Search of Periodicity Regions in the Genome A.thaliana

E. V. Korotkov, F. E. Frenkel, M. A. Korotkova

2017

Abstract

A mathematical method was developed in this study to detect tandem repeats in a DNA sequence. A multiple alignment of periods was calculated by direct optimization of the position-weight matrix (PWM), without using pairwise alignments or searching for similarity between periods. Random PWMs were used to develop a new mathematical algorithm for periodicity search. The algorithm was applied to the DNA sequences of the A.thaliana genome, and 13997 regions with a periodicity of 2 to 50 bases in length were found. The average distance between regions with periodicity is ~9000 nucleotides. A significant portion of the revealed regions have periods of 2 nucleotides, 10-11 nucleotides, or approximately 30 nucleotides. No more than ~30% of the periods found had been reported previously. The sequences found were collected in a data bank available at: http://victoria.biengi.ac.ru/cgi-in/indelper/index.cgi. The origin of periodicity with insertions and deletions is also discussed.
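
To illustrate what a period-indexed matrix looks like, the following Python sketch counts nucleotides at each phase of a candidate period and scores how far the columns depart from the overall base composition. It is not the authors' algorithm: it assumes a fixed phase with no insertions or deletions, does not optimize a PWM, and the function names `period_count_matrix` and `information_score` are hypothetical.

```python
import math
from collections import Counter


def period_count_matrix(seq, period):
    """Count nucleotides at each phase (index modulo period) of the sequence.

    Assumption: no insertions or deletions, so phase is simply i % period.
    """
    columns = [Counter() for _ in range(period)]
    for i, base in enumerate(seq):
        columns[i % period][base] += 1
    return columns


def information_score(seq, period, alphabet="ACGT"):
    """Count-weighted information (in bits) of the columns relative to
    the overall base composition of the sequence.

    Higher values mean the columns are more biased, which is what a
    true period of this length would be expected to produce.
    """
    background = Counter(seq)
    total = len(seq)
    score = 0.0
    for col in period_count_matrix(seq, period):
        n = sum(col.values())
        for base in alphabet:
            c = col[base]
            if c:
                score += c * math.log2((c / n) / (background[base] / total))
    return score


if __name__ == "__main__":
    seq = "ACGT" * 10  # a perfect tandem repeat with period 4
    for p in (2, 3, 4, 5):
        print(p, round(information_score(seq, p), 3))
```

On a perfect repeat such as `"ACGT" * 10`, the score at period 4 is higher than at period 3. Assessing statistical significance and handling indels, which the abstract addresses through random PWMs and direct PWM optimization, are outside the scope of this sketch.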

Paper Citation


in Harvard Style

Korotkov E., Frenkel F. and Korotkova M. (2017). Search of Periodicity Regions in the Genome A.thaliana. In - BIOINFORMATICS, (BIOSTEC 2017) ISBN , pages 0-0. DOI: 10.5220/0006106000001488


in Bibtex Style

@conference{bioinformatics17,
author={E. V. Korotkov and F. E. Frenkel and M. A. Korotkova},
title={Search of Periodicity Regions in the Genome A.thaliana},
booktitle={ - BIOINFORMATICS, (BIOSTEC 2017)},
year={2017},
pages={},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006106000001488},
isbn={},
}


in EndNote Style

TY - CONF

JO - - BIOINFORMATICS, (BIOSTEC 2017)
TI - Search of Periodicity Regions in the Genome A.thaliana
SN -
AU - Korotkov E.
AU - Frenkel F.
AU - Korotkova M.
PY - 2017
SP - 0
EP - 0
DO - 10.5220/0006106000001488