Systematic Analysis of Structure of Multiple Tandem Repeat Arrays in the Human Genome

Woo-Chan Kim, Dong-Ho Cho

Abstract

Repetitive elements constitute the vast majority of the human genome and form many complex but highly-ordered patterns. Tandem repeats whose repeat units are placed next to each other particularly form very highly structured patterns in the human genome when homologous multiple tandem repeats are close together. In this paper, the structure of the multiple tandem repeat array (MTRA) is analyzed based on systematic analysis. The proposed system for analyzing MTRA derives the original tandem repeat units by using the characteristics of homology of MTRA and represents diagram model to show the structure of MTRA easily. The analysis results of the four MTRAs in the human genome are shown and the proposed algorithm is proved to be very efficient for analyzing MTRA by the comparison of three conventional algorithms.

References

  1. Benson, G. (1999). Tandem repeats finder: a program to analyze dna sequences. Nucleic Acids Research, 27(2):573-580.
  2. Brodzik, A. (2007). Quaternionic periodicity transform: an algebraic solution to the tandem repeat detection problem. Bioinformatics, 23(6):694-700.
  3. Buchner, M. and Janjarasjitt, S. (2003). Detection and visualization of tandem repeats in dna sequences. IEEE Transactions on Signal Processing, 51(9):2280-2287.
  4. Christian, M., Dennis, J., and John, M. (2001). Strbase: a short tandem repeat dna database for the human identity testing community. Nucleic Acids Research, 29(1):320-322.
  5. Chung, B., Lee, K., Shin, K., Kim, W., Kwon, D., You, R., Lee, Y., Cho, K., and Cho, D. (2011). Reminer: a tool for unbiased mining and analysis of repetitive elements and their arrangement structures of large chromosomes. Genomics, 98(5):381-389.
  6. Edgar, R. and Myers, E. (2005). Piler: identification and classification of genomic repeats. Bioinformatics, 21(Suppl. 1):i152-i158.
  7. Hauth, A. and Joseph, D. (2002). Beyond tandem repeats: complex pattern structures and distant regions of similarity. Bioinformatics, 18(Suppl. 1):S31-S37.
  8. Humberto, C. and David, L. (1998). The multiple sequence alignment problem in biology. SIAM Journal on Applied Mathematics, 48(5):1073-1082.
  9. Just, W. (2001). Computational complexity of multiple sequence alignment with sp-score. Journal of Computational Biology, 8(6):615-623.
  10. Kazazian, H. (2004). Mobile elements: drivers of genome evolution. Science, 303(5664):1626-1632.
  11. Kim, W., Lee, K., Shin, K., You, R., Lee, Y., Cho, K., and Cho, D. (2012). Reminer-ii: A tool for rapid identification and configuration of repetitive element arrays from large mammalian chromosomes as a single query. Genomics, 100(3):131-140.
  12. Lipman, D., Altschul, S., and Kececioglu, J. (1989). A tool for multiple sequence alignment. Proceedings of the National Academy of Sciences of the United States of America, 86(12):4412-4415.
  13. Prak, E. and Kazazian, H. (2000). Mobile elements and the human genome. Nature Reviews Genetics, 1(2):134- 144.
  14. Sharma, D., Issac, B., Raghava, G., and Ramaswamy, R. (2004). Spectral repeat finder (srf): identification of repetitive sequences using fourier transformation. Bioinformatics, 20(9):1405-1412.
  15. Sinden, R. (1999). Biological implications of the dna structures associated with disease-causing triplet repeats. American Journal of Human Genetics, 64(2):346- 353.
  16. Wang, L. and Jiang, T. (1994). On the complexity of multiple sequence alignment. Journal of Computational Biology, 1(4):337-348.
  17. Zhang, Z., Schwartz, S., Wagner, L., and Miller, W. (2000). A greedy algorithm for aligning dna sequences. Journal of Computational Biology, 7(1-2):203-214.
Download


Paper Citation


in Harvard Style

Kim W. and Cho D. (2013). Systematic Analysis of Structure of Multiple Tandem Repeat Arrays in the Human Genome . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013) ISBN 978-989-8565-35-8, pages 44-52. DOI: 10.5220/0004243300440052


in Bibtex Style

@conference{bioinformatics13,
author={Woo-Chan Kim and Dong-Ho Cho},
title={Systematic Analysis of Structure of Multiple Tandem Repeat Arrays in the Human Genome},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)},
year={2013},
pages={44-52},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004243300440052},
isbn={978-989-8565-35-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)
TI - Systematic Analysis of Structure of Multiple Tandem Repeat Arrays in the Human Genome
SN - 978-989-8565-35-8
AU - Kim W.
AU - Cho D.
PY - 2013
SP - 44
EP - 52
DO - 10.5220/0004243300440052