Prediction of Protein Tertiary Structure Class from Synchrotron Radiation Circular Dichroism Spectra

Andreas Procopiou, Nigel M. Allinson, Gareth R. Jones, David T. Clarke

Abstract

A new approach to predict the tertiary structure class of proteins from synchrotron radiation circular dichroism (SRCD) spectra is presented. A protein’s SRCD spectrum is first approximated using a Radial Basis Function Network (RBFN) and the resulting set is used to train different varieties of Support Vector Machine (SVM). The performance of three well known multi-class SVM schemes are evaluated and a method presented that takes into account the properties of spectra for each of the structure classes.

References

  1. Wallace, B.A. and R.W. Janes, Synchrotron radiation circular dichroism spectroscopy of proteins: secondary structure, fold recognition and structural genomics. Current Opinion in Chemical Biology, 2001. 5(5): p. 567-571.
  2. Whitford, D., Proteins : structure and function. 2005, Chichester, West Sussex, England ; Hoboken, NJ: John Wiley & Sons c2005.
  3. Fasman Gerald, D., Circular dichroism and the conformational analysis of biomolecules. 1996, New York ; London: Plenum Press.
  4. Manavalan, P. and W.C. Johnson, Sensitivity of Circular-Dichroism to Protein Tertiary Structure Class. Nature, 1983. 305(5937): p. 831-832.
  5. Venyaminov, S.Y. and K.S. Vassilenko, Determination of Protein Tertiary Structure Class from Circular Dichroism Spectra. Analytical Biochemistry, 1994. 222(1): p. 176.
  6. Scholkopf, B. and A.J. Smola, Learning with kernels : support vector machines, regularization, optimization, and beyond. Adaptive computation and machine learning. 2002, Cambridge, MA ; London: MIT Press.
  7. Branden, C. and J. Tooze, Introduction to protein structure. 1999, New York: Garland c1999.
  8. Murzin, A.G., et al., SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures. Journal of Molecular Biology, 1995. 247(4): p. 536.
  9. Pearl, F., et al., The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Nucleic Acids Research, 2005. 33(Supp): p. D247-D251.
  10. Broomhead, D.S. and D. Lowe, Multi-Variable Function Interpolation and Adaptive network. Complex System, 1988: p. 2:321.
  11. Bishop, C.M., Neural networks for pattern recognition. 1995, Oxford: Clarendon Press c1995.
  12. Orr, M.J.L., Introduction to Radial Basis Function Networks. 1996, University of Edinburgh: Edinburgh, Scotland, UK.
  13. Orr, M.J.L. Optimising the Widths of RBFs Radial Basis Functions. in Fifth Brazilian Symposium on Neural Networks. 1998. Belo Horizonte, Brazil.
  14. Vapnik, V., The nature of statistical learning theory. 1995, New York ; London: Springer.
  15. Shawe-Taylor, J. and N. Cristianini, Kernel methods for pattern analysis. 2004, Cambridge, UK ; New York: Cambridge University Press.
  16. Friedman, J., Another approach to polychotomous classification. Technical report Stanford University, UA, 1996.
  17. Platt, N.C.J. and J. Shawe-Taylor, Large magin dags for multiclass classification. Technical report, Microsoft Research, Redmond, US, 1999.
  18. Chapelle, O., et al., Choosing Multiple Parameters for Support Vector Machines. Machine Learning, 2001. 46(1/3): p. 131-160.
Download


Paper Citation


in Harvard Style

Procopiou A., M. Allinson N., R. Jones G. and T. Clarke D. (2006). Prediction of Protein Tertiary Structure Class from Synchrotron Radiation Circular Dichroism Spectra . In 6th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2006) ISBN 978-972-8865-55-9, pages 58-67. DOI: 10.5220/0002478200580067


in Bibtex Style

@conference{pris06,
author={Andreas Procopiou and Nigel M. Allinson and Gareth R. Jones and David T. Clarke},
title={Prediction of Protein Tertiary Structure Class from Synchrotron Radiation Circular Dichroism Spectra},
booktitle={6th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2006)},
year={2006},
pages={58-67},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002478200580067},
isbn={978-972-8865-55-9},
}


in EndNote Style

TY - CONF
JO - 6th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2006)
TI - Prediction of Protein Tertiary Structure Class from Synchrotron Radiation Circular Dichroism Spectra
SN - 978-972-8865-55-9
AU - Procopiou A.
AU - M. Allinson N.
AU - R. Jones G.
AU - T. Clarke D.
PY - 2006
SP - 58
EP - 67
DO - 10.5220/0002478200580067