The Symmetry of Oligonucleotide Distance Distributions in the Human Genome

Ana Helena Tavares, Vera Afreixo, João M. O. S. Rodrigues, Carlos A. C. Bastos

2015

Abstract

The inter-oligonucleotide distance is defined as the distance to the next occurrence of the same oligonucleotide. In this work, we use the inter-oligonucleotide distance to develop new methods for evaluating the lack of homogeneity in symmetric word pairs (pairs of reverse-complement oligonucleotides) and in equivalent composition groups. We apply these methods to the human genome and conclude that the distance distributions of symmetric oligonucleotides are strongly similar. We also conclude that exceptional distance symmetry is present in several equivalent composition groups: there is a strong lack of homogeneity within the group and strong homogeneity within the symmetric word pairs it contains. This suggests a stronger parity rule than Chargaff’s: in the human genome, symmetric oligonucleotides have equivalent occurrence frequencies and, additionally, similar distance distributions.
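
As an illustration of the definition above, the following minimal Python sketch computes the inter-oligonucleotide distances of a word and of its reverse complement on a toy sequence. The function names, the toy sequence, the treatment of the sequence as linear, and the counting of overlapping occurrences are all assumptions made for this example; the abstract does not specify them, and the homogeneity measures developed in the paper are not reproduced here.

```python
from collections import Counter

# Watson-Crick base pairing used to build the reverse complement.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(word):
    """Return the reverse complement of a DNA word, e.g. 'AC' -> 'GT'."""
    return "".join(COMPLEMENT[base] for base in reversed(word))


def inter_word_distances(sequence, word):
    """Distances between consecutive occurrences of `word` in `sequence`.

    Assumptions of this sketch: the sequence is linear (not circular)
    and overlapping occurrences are counted.
    """
    positions = []
    start = sequence.find(word)
    while start != -1:
        positions.append(start)
        start = sequence.find(word, start + 1)
    return [later - earlier for earlier, later in zip(positions, positions[1:])]


def distance_distribution(sequence, word):
    """Relative frequency of each observed distance (empty dict if none)."""
    distances = inter_word_distances(sequence, word)
    if not distances:
        return {}
    counts = Counter(distances)
    return {d: c / len(distances) for d, c in sorted(counts.items())}


# Hypothetical toy sequence, far too short for any statistical conclusion.
toy = "ACGTACGTTACGACGT"
word = "AC"
partner = reverse_complement(word)

print(word, distance_distribution(toy, word))
print(partner, distance_distribution(toy, partner))
```

On a real chromosome, the two distributions printed at the end would be the objects compared for a symmetric word pair; how that comparison is scored is left to the paper's own methods.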



Paper Citation


in Harvard Style

Tavares A., Afreixo V., Rodrigues J. M. O. S. and Bastos C. A. C. (2015). The Symmetry of Oligonucleotide Distance Distributions in the Human Genome. In Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM, ISBN 978-989-758-077-2, pages 256-263. DOI: 10.5220/0005223102560263


in Bibtex Style

@conference{icpram15,
author={Ana Helena Tavares and Vera Afreixo and João M. O. S. Rodrigues and Carlos A. C. Bastos},
title={The Symmetry of Oligonucleotide Distance Distributions in the Human Genome},
booktitle={Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM},
year={2015},
pages={256-263},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005223102560263},
isbn={978-989-758-077-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM
TI - The Symmetry of Oligonucleotide Distance Distributions in the Human Genome
SN - 978-989-758-077-2
AU - Tavares A.
AU - Afreixo V.
AU - Rodrigues J. M. O. S.
AU - Bastos C. A. C.
PY - 2015
SP - 256
EP - 263
DO - 10.5220/0005223102560263