-metry by B.K.P. Horn (Horn, 1984) in the years '80
and has been extended by K. Ikeuci (the Complex-
EGI) (Kang, 1993) in the years '90 to overcome the
ambiguity rised in the representations by the convex
parts. Later other improvements have been
introduced in sequence: the More Extended
Gaussian Image (MEGI) in 1994, the Multi-Shell
Extended Gaussian Image (MSEGI) and the
Adaptive Volumetric Extended Gaussian Image (A-
VEGI) in 2007, and finally the Enriched Complex
Extended Gaussian Image (EC-EGI) in 2010.
They have not up-to-now been applied on
proteomics; starting from these, we propose a
representation suitable for describing the matching
between the ligand (here the protuberance) and the
protein (the cavity under analysis).
Extended Gaussian Image (EGI). The EGI of a
3D object or shape is an orientation histogram that
records the distribution of surface area with respect
to surface orientation. Each surface patch is mapped
to a point on the unit Gaussian sphere according to
its surface normal. The weight for each surface
normal (represented by a point on the Gaussian
sphere) is the total sum of area of all the surface
patches that are of that surface normal. Being a
distribution related to surface orientation, EGI is in
principle invariant to translation.
Complex EGI (Kang, 1991). CEGI encodes each
surface patch’s signed perpendicular distance from
the reference coordinate center.
It uses a complex number, as opposed to a scalar
in EGI, as the weight for the corresponding point on
the Gaussian sphere. The magnitude and phase of
the complex number are the area and signed
perpendicular distance of the patch (from the origin
of the reference coordinate frame), respectively. The
use of complex numbers allows the area and position
information to be decoupled. Furthermore, the
translation component of the pose can be determined
more readily.
More Extended Gaussian Image (MEGI)
(Matsuo, 1994). The MEGI model consists of a set
of position vectors X
i
for surfaces originating from
an object center and their normal vectors p
i
. Each
length of a normal vector also corresponds with
surface area, as in the EGI. Also this model is shift-
invariant since it is expressed by an object-oriented
coordinate. The MEGI model is an extended EGI
modeling which is able to represent concave objects.
Multi-Shell Extended Gaussian Image (MSEGI)
(Wang, 2007) or Volumetric Extended Gaussian
Image (VEGI) (Zhang, 2006). The VEGI captures
the volumetric distribution of a triangulated 3D
model by connecting the vertices of each triangle
with the geometry centroid of the object to form a
tetrahedron as the elementary volume unit. Then the
3D model is decomposed into a number of N
s
concentric spheres. Each sphere surface is
subdivided in cells, each one identified through their
polar and longitudinal angles (θ
i
, ϕ
j
). The quantized
volume of each tetrahedron and its associated
direction (the outward surface normal) are mapped
to the corresponding cell of the concentric sphere
with radius ρ, obtaining N
s
spherical distribution
functions η (ρ, θ, ϕ). These functions are expanded
into spherical harmonics to achieve a features
vector. The VEGI and this representation, without
canonical alignment, maintains the property of
translation, scaling, rotation invariance and facilitate
multiple scale approximation. An improvement to
fix the irregular sampling of the polar and
longitudinal coordinate system (in the poles there is
a higher sampling density than in the equator) has
been proposed with the Adaptive Volumetric
Extended Gaussian Image (A-VEGI) (Wang, 2007).
Enriched CEGI (Hu, 2010). The EC-EGI
encodes each surface patch’s signed with its 3D
position. It uses three complex numbers, as the
weight for the corresponding point on the Gaussian
sphere. The resultant weight at the point is then the
sum of the contributions of all surface patches that
are of the corresponding surface normal referred to
each one of the coordinate planes. The magnitude
part of the EC-EGI representation is translation-
invariant. This is an important property that allows
the rotation part of pose, in the pose estimation
application, to be determined separately from the
translation. The EC-EGI can be viewed as three
independent Gaussian spheres, each encoding the 3D
position information along the x-, y- and z-axes,
respectively.
In this paper we propose the adoption of this last
EC-EGI for ligand and cavity to evaluate
quantitatively the matching between candidate active
sites and ligand. ρ
3 CONSTRUCTION OF CE-EGI
A given 3D molecule, modeled through its Solvent
Excluded Surface in a triangular mesh, is described
by the set of triangles:
=
{
,…,
}
,
⊂
(1)
where each T
l
consists of a set of three vertices:
={
,
,
,
,
,
}
(2)
GEOMETRICAL CONSTRAINTS FOR LIGAND POSITIONING
205