A PRELIMINARY STUDY ON THE DETECTION OF TRANSCRIPTION FACTOR BINDING SITES

Erola Pairo, Santiago Marco, Alexandre Perera

Abstract

Transcription starts when multiple proteins, known as transcription factors recognize and bind to transcription start site in DNA sequences. Since mutation in transcription factor binding sites are known to underlie diseases it remains a major challenge to identify these binding sites. Conversion from symbolic DNA to numerical sequences and genome data make it possible to construct a detector based on a numerical analysis of DNA binding sites. A subspace model for the TFBS is built. TFBS will show a very small distance to this particular subspace. Using this distance binding sites are distinguished from random sequences and from genome data.

References

  1. Anastassiou, D. (2001). Genomic signal processing. Signal Processing Magazine, IEEE, 18(4):8-20.
  2. Bailey, T. and Elkan, C. (2006). Meme:discovering and analizing dna and protein sequence motifs. Nucleic acids research, 34:W369-W373.
  3. Bulyk, M. (2003). Computational prediction of transcription-factor binding site locations. Genome Biology, 5(1):201.
  4. Cristea, P. (2005). Genomic Signal processing and statistics, chapter Representation and analysis of DNA sequences. Hindawi Publishing Corporation.
  5. Edgar, R. (2004). Muscle: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res, 32(5):1792-1797.
  6. Hannenhali, S. (2008). Eukaryotic transcription factor binding sites- modeling and integrative search methods. Bioinformatics, 24.
  7. Neuwald, A., Liu, J., and Lawrence, C. (1995). Gibbs motif sampling: Detection of bacterial outer membrane protein repeats. Protein Sci., 4:1618-1632.
  8. Pavesi, G., Mauri, G., and Pesole, G. (2001). An algorithm for finding signals of unknown length in dna sequences. Bioinformatics, 17:207-214.
  9. Pavesi, G., Mauri, G., and Pesole, G. (2004). In silico representation and discovery of transcription factor binding sites. Brief Bioinform, 5(3):217-236.
  10. Schneider, T. (1997). Information content of individual genetic sequences. J. Theor. Biol., 189:427-441.
  11. Silverman, B. and Linske, R. (1986). A measure of dna periodicity. Journal of Theoretical Biology, 118:295- 300.
Download


Paper Citation


in Harvard Style

Pairo E., Marco S. and Perera A. (2009). A PRELIMINARY STUDY ON THE DETECTION OF TRANSCRIPTION FACTOR BINDING SITES . In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2009) ISBN 978-989-8111-65-4, pages 506-509. DOI: 10.5220/0001550305060509


in Bibtex Style

@conference{biosignals09,
author={Erola Pairo and Santiago Marco and Alexandre Perera},
title={A PRELIMINARY STUDY ON THE DETECTION OF TRANSCRIPTION FACTOR BINDING SITES},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2009)},
year={2009},
pages={506-509},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001550305060509},
isbn={978-989-8111-65-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing - Volume 1: BIOSIGNALS, (BIOSTEC 2009)
TI - A PRELIMINARY STUDY ON THE DETECTION OF TRANSCRIPTION FACTOR BINDING SITES
SN - 978-989-8111-65-4
AU - Pairo E.
AU - Marco S.
AU - Perera A.
PY - 2009
SP - 506
EP - 509
DO - 10.5220/0001550305060509