detection, at the authors knowledge there are not
packages specialized for segmenting the SES on the
basis of protuberances. Among the quoted packages
the only one that was available and directly
applicable has been CASTp. We compared the
results obtained with this package to our one. The
number of pocket selected has been 29 and 25
respectively for CASTp and our own. Let us first
point out that all the main pockets of the quoted
protein (the bigger and deeper ones) are detected in
both cases. Moreover, for the main pockets, almost
the same set atoms at the border of the SES
delimiting the pockets. Nevertheless, in general the
number of these atoms is higher in our solution (up
to 20% in a few cases) and seems to cover in a
complete way the pocket concavity. An example of
this case is given in figure 8.
Figure 8: Wireframe of the main binding site of PDB ID
1MK5. In red atoms detected by CASTp. Our software
detects both the red and green atoms.
The results differ more for what concerns the
smallest pockets. This is due to the thresholds to
accept the concavity with a short travel depth as a
possible active site. We have two thresholds on the
basis of the travel depth and of a minimum
concavity volume. Generally speaking CASTp
accept more small concavities as pockets, but
sometimes there are cases in which our volume
constraint is satisfied and the concavity is not
accepted by CASTp. This must not be a critical
issues because (Liang, 1998) the binding sites are
usually the pockets having the greatest volume.
While CASTp includes empty volume internal to the
protein, in our approach these are identified but not
classified as pockets.
Referring to computational performance, our
algorithm runs on an Intel Q6600 Processor with 4
GB of Ram. The analysis of pockets and
protuberances on 1MK5 protein as been done in 58
seconds starting from the PDB file (this include the
operations of creating the 3D matrix, the Convex
Hull, all mathematical morphology operations, the
triangulation of the voxels surface of each
pocket/protuberance with a marching cube/mesh
smoothing algorithm and so on). In fact besides the
segmentation process for each detected segment
(pocket or protuberance) a rich parameter set is
computed to guide the analysis of possible match,
such as volume, surface to volume ratio, mouth
(base) aperture, travel depth, and many others (a full
list is given in (Cantoni, 10b). It is important to note
that all the algorithms presented in this paper are
already thought to be simply implemented into
parallel architectures.
5 CONCLUSIONS
In this paper we present a new approach for the
segmentation of SES of proteins in order to support
the search of dual active sites (i.e. pockets and
protuberances) that can be morphologically arranged
together. This is a preliminary step for important
structural biology application. The results achieved
look very promising and in comparison to others
solutions presented in literature it seems to add
something not only from the computational point of
view. Now we have started an extensive
experimentation phase to validate our solution from
the best practice point of view.
REFERENCES
Barber, C. B., Dobkin, D. P., and Huhdanpaa H., 1996.
The Quickhull Algorithm for Convex Hull. ACM
Transactions on Mathematical Software, Vol. 22(4):
469–483.
Binkowski, A. T., Naghibzadeh, S., and Liang, J., 2003.
Castp: Computed atlas of surface topography of
proteins. Nucl. Acids Res.,31(13): 3352- 3355.
Bock, M. E., Garutti C., Guerra C., 2007. Effective
labeling of molecular surface points for cavity
detection and location of putative binding sites. Proc.
of CSB, San Diego, Vol. 6: 263-744.
Borgefors, G. and Sanniti di Baja, G., 1996. Analyzing
Nonconvex 2D and 3D Patterns. Computer Vision and
image Understanding, 63(1): 145– 157.
Brady, G. P., Stouten, P. F. W., 2000. Fast prediction and
visualization of protein binding pockets with PASS. J
Comput-Aided Mol Des, 14: 383–401.
Cantoni, V., Gatti, R., Lombardi, L., 2010. Segmentation
of SES for Protein Structure Analysis. In Proceedings
of the 1st International Conference on Bioinformatics.
BIOINFORMATICS 2011 - International Conference on Bioinformatics Models, Methods and Algorithms
20