NOVEL APPROACHES FOR RETRIEVING PROTEIN 3D STRUCTURES

Georgina Mirceva, Ivana Cingovska, Zoran Dimov, Slobodan Kalajdziski, Danco Davcev

2009

Abstract

To understand the structure-to-function relationship, life sciences researchers and biologists need to retrieve similar structures from protein databases and classify them into the same protein fold. As technology advances, the number of known protein structures grows every day, so retrieving structurally similar proteins with current algorithms may take hours or even days. Improving the efficiency of protein structure retrieval has therefore become an important research issue. In this paper, we propose three novel approaches for retrieving protein 3D structures, all of which rely on the 3D structure of the proteins. In the first approach, the Discrete Fourier Transform is applied to the protein structure, and some properties of the primary and secondary structure of the protein are additionally taken into account. In the second approach, a modification of the ray-based descriptor is applied to the backbone of the protein molecule. In the third approach, wavelet transforms are applied to the distance matrix of the protein. We have compared our approaches with several existing approaches. In general, our approaches give higher precision than the existing approaches that were used in this research. The results show that the proposed ray-based descriptor gives the best average retrieval accuracy (92.95%), while being much simpler and faster than the other approaches.
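As a rough illustration of the third approach, the following Python sketch builds a distance matrix from backbone coordinates and reduces it with a Haar-style approximation. It is only a minimal sketch under stated assumptions, not the method evaluated in the paper: the abstract does not specify the wavelet family, the number of decomposition levels, how proteins of different lengths are normalized, or the similarity measure. The function names (`resample_backbone`, `distance_matrix`, `haar_approximation`, `descriptor`, `descriptor_distance`), the resampling to 64 points, and the Euclidean comparison are all hypothetical choices made for this example.

```python
import numpy as np


def resample_backbone(coords, n_points=64):
    """Linearly resample an (N, 3) backbone trace to n_points points.

    Assumption: a fixed number of points is used so that proteins of
    different lengths yield descriptors of equal size.
    """
    coords = np.asarray(coords, dtype=float)
    src = np.linspace(0.0, 1.0, len(coords))
    dst = np.linspace(0.0, 1.0, n_points)
    return np.column_stack([np.interp(dst, src, coords[:, k]) for k in range(3)])


def distance_matrix(coords):
    """Pairwise Euclidean distances between backbone points: (N, 3) -> (N, N)."""
    coords = np.asarray(coords, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def haar_approximation(matrix, levels=2):
    """Average non-overlapping 2x2 blocks, once per level.

    This equals the approximation band of a 2D Haar transform up to a
    constant scale factor per level.
    """
    out = np.asarray(matrix, dtype=float)
    for _ in range(levels):
        rows, cols = out.shape
        out = out[: rows - rows % 2, : cols - cols % 2]  # drop an odd last row/column
        out = (out[0::2, 0::2] + out[0::2, 1::2]
               + out[1::2, 0::2] + out[1::2, 1::2]) / 4.0
    return out


def descriptor(coords, n_points=64, levels=2):
    """Fixed-length feature vector for one protein backbone."""
    resampled = resample_backbone(coords, n_points)
    return haar_approximation(distance_matrix(resampled), levels).ravel()


def descriptor_distance(a, b):
    """Euclidean distance between two descriptors (assumed similarity measure)."""
    return float(np.linalg.norm(np.asarray(a) - np.asarray(b)))


# Usage with synthetic coordinates (not real protein data): two helix-like traces.
t = np.linspace(0.0, 4.0 * np.pi, 50)
helix = np.column_stack([np.cos(t), np.sin(t), 0.5 * t])
stretched = np.column_stack([np.cos(t), np.sin(t), 1.5 * t])
score = descriptor_distance(descriptor(helix), descriptor(stretched))
```

Because the descriptor is computed from pairwise distances only, it does not change when the structure is rotated or translated, so no structural alignment is needed before two descriptors are compared.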



Paper Citation


in Harvard Style

Mirceva G., Cingovska I., Dimov Z., Kalajdziski S. and Davcev D. (2009). NOVEL APPROACHES FOR RETRIEVING PROTEIN 3D STRUCTURES. In - KDIR, (IC3K 2009) ISBN , pages 0-0


in Bibtex Style

@conference{kdir09,
author={Georgina Mirceva and Ivana Cingovska and Zoran Dimov and Slobodan Kalajdziski and Danco Davcev},
title={NOVEL APPROACHES FOR RETRIEVING PROTEIN 3D STRUCTURES},
booktitle={ - KDIR, (IC3K 2009)},
year={2009},
pages={},
publisher={SciTePress},
organization={INSTICC},
doi={},
isbn={},
}


in EndNote Style

TY - CONF
JO - - KDIR, (IC3K 2009)
TI - NOVEL APPROACHES FOR RETRIEVING PROTEIN 3D STRUCTURES
SN -
AU - Mirceva G.
AU - Cingovska I.
AU - Dimov Z.
AU - Kalajdziski S.
AU - Davcev D.
PY - 2009
SP - 0
EP - 0
DO -