PREDICTION OF SIGNIFICANT CRUCIFORM STRUCTURES FROM SEQUENCE IN TOPOLOGICALLY CONSTRAINED DNA - A Probabilistic Modelling Approach

Matej Lexa, Karel Nejedlý, Lucie Navrátilová, Marie Brázdová

2012

Abstract

Sequence-dependent secondary DNA structures, such as cruciform or triplex DNA, are implicated in regulation of gene transcription and other important biological processes at the molecular level. Sequences capable of forming these structures can readily be identified in entire genomes by appropriate searching techniques. However, not every DNA segment containing the proper sequence has equal probability of forming an alternative structure. Calculating the free energy of the potential structures provides an estimate of their stability in vivo, but there are other structural factors, both local and non-local, not taken into account by such simplistic approach. In is paper we present the procedure we currently use to identify potential cruciform structures in DNA sequences. The procedure relies on identification of palindromes (or inverted repeats) and their evaluation by a nucleic acid folding program (UNAFold). We further extended the procedure by adding a modelling step to filter the predicted cruciforms. The model takes into account superhelical density of the analyzed segments of DNA and calculates the probability of cruciforms forming at several locations of the analyzed DNA, based on the sequences in the stem and loop areas of the structures and competition among them.

References

  1. Brazda V., Laister R.C., et al. (2011). Cruciform structures are a common dna feature important for regulating biological processes. BMC Molecular Biology, 12:33.
  2. Lexa M., Martinek T., et al. (2011). A dynamic programming algorithm for identification of triplex-forming sequences. Bioinformatics, 27:2510-2517.
  3. Lilley, D. (1989). Structural isomerization in dna: The formation of cruciform structures in supercoiled dna molecules. Chemical Society Reviews, 18:53-83.
  4. Markham, N. and Zuker, M. (2008). Unafold: software for nucleic acid folding and hybridization. Methods in Molecular Biology, 453:3-31.
  5. Martinek, T. and Lexa, M. (2008). Hardware acceleration of approximate palindrome searching. In The International Conference on Field-Programmable Technolog., pages 65-72.
  6. Neidle, S. (2002). Nucleic acid structure and recognition. Oxford University Press.
  7. Palecek E., Vlk D., et al. (1997). Tumor supressor protein p53 binds preferentially to supercoiled dna. Oncogene, 15:2201-2209.
  8. Palecek E., Brazda V., et al. (2004). Enhancement of p53 sequence-specific binding by dna supercoiling. Oncogene, 23:2119-2127.
  9. Pennacchio L.A., Ahituv N., et al. (2006). In vivo enhancer analysis of human conserved non-coding sequences. Nature, 444:499-502.
  10. Sinden, R. (1994). DNA structure and function. Academic Press.
  11. Singleton, C. and Wells, R. (1982). Relationship between superhelical density and cruciform formation in plasmid pvh51. The Journal of Biological Chemistry, 257:6292-6295.
  12. Url 1: http://www.fi.muni.cz/˜lexa/cruciform/index.html Visited Oct 2011.
Download


Paper Citation


in Harvard Style

Lexa M., Brázdová M., Navrátilová L. and Nejedlý K. (2012). PREDICTION OF SIGNIFICANT CRUCIFORM STRUCTURES FROM SEQUENCE IN TOPOLOGICALLY CONSTRAINED DNA - A Probabilistic Modelling Approach . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012) ISBN 978-989-8425-90-4, pages 124-130. DOI: 10.5220/0003705701240130


in Bibtex Style

@conference{bioinformatics12,
author={Matej Lexa and Marie Brázdová and Lucie Navrátilová and Karel Nejedlý},
title={PREDICTION OF SIGNIFICANT CRUCIFORM STRUCTURES FROM SEQUENCE IN TOPOLOGICALLY CONSTRAINED DNA - A Probabilistic Modelling Approach},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012)},
year={2012},
pages={124-130},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003705701240130},
isbn={978-989-8425-90-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012)
TI - PREDICTION OF SIGNIFICANT CRUCIFORM STRUCTURES FROM SEQUENCE IN TOPOLOGICALLY CONSTRAINED DNA - A Probabilistic Modelling Approach
SN - 978-989-8425-90-4
AU - Lexa M.
AU - Brázdová M.
AU - Navrátilová L.
AU - Nejedlý K.
PY - 2012
SP - 124
EP - 130
DO - 10.5220/0003705701240130