tive internal degree rk
in
i
of the vertex: k
in
i
/Max
s
(k
in
),
where k
in
i
represents the internal degree of vertex
i and Max
s
(k
in
) is the maximum internal degree in
community s. However, we observed that vertices
with a medium number of internal connections had
a score that was too low and thus we applied the loga-
rithmic formula in equation (7). In a weighted graph,
we can use the relative internal weight rw
in
i
, where
we replace the internal degree with internal weight in
equation (7).
rk
in
i
=
log(k
in
i
+ 1)
log(Max
s
(k
in
) + 1)
(7)
We also observed that there are vertices with a
relatively small number of connections, but with a
high significance. Looking at those vertices we re-
alized that they were connected to other vertices in
the same community that had a high internal degree,
so they were less likely to leave the community when
the graph was perturbed. A good example is vertex
15 in the Karate network which has only two connec-
tions in its community, so judging solely by relative
internal degree (considering that the maximum inter-
nal degree in that community is 14) one would think it
has a weak connection to the community. The signifi-
cance of vertex 15 is however high at 0.9, most likely
because it is connected to the two most connected ver-
tices in the community: 33 and 34. Considering this,
it seems natural that not only the internal degree, but
also the internal neighborhood (neighbors within the
community) of the vertex is an important factor for its
connection strength. It is important to note that the
internal neighborhood should only have a positive ef-
fect on the score of the vertex: a well-connected ver-
tex should not have its score lowered simply because
it is connected to low-degree vertices.
Similar to the previous point, we found that we
also have to look at the external neighborhood of a
vertex: a vertex that is well connected inside its own
community but has connections with well-connected
vertices in other communities will have a higher like-
lihood of leaving the community. A good example for
such a vertex is 32: it has an internal degree of 5, an
external degree of only 1 and is connected to vertices
33 and 34, so one would expect a high significance.
The significance is actually quite low, at 0.63, because
it is connected to vertex 1, which is the most con-
nected vertex in another community. One can view
highly connected vertices as attractors that exercise a
pulling force on their weaker-connected neighbors, so
neighbors both inside and outside of the community
have to be considered.
To summarize, we have identified the following
factors which should be considered for estimating
how strongly a vertex belongs to its own community
(i.e. measuring commitment):
• The internal degree
• The internal degree of its internal neighbors
• The internal degree of its external neighbors
Considering the fact that embeddedness does not
represent a sufficiently expressive metric for estimat-
ing the commitment of a vertex, and that significance
is computationally intensive, we propose a new mea-
sure for quantifying vertex commitment, which we
call Relative Commitment. We define the internal
score of a vertex to be the sum of the relative inter-
nal degrees of its internal neighbors and the external
score, the sum of the relative internal degrees of its
external neighbors. Relative Commitment is the ratio
between the internal score and the total score (internal
+ external), multiplied by the relative internal degree
of the vertex in question. Thus, we obtain the formula
in equation (8), where rk
in
represents the relative in-
ternal degree of the vertex and i and j denote the inter-
nal and external neighbors of the vertex, respectively.
rc = rk
in
∗
∑
i
rk
in
i
∑
i
rk
in
i
+
∑
j
rk
in
j
(8)
For a weighted graph, we use relative internal
weight instead of degree and additionally we use the
connection weight to each neighbor in the internal
and external score (9), where w
i
and w
j
represent the
weight of the connection with internal neighbor i and
external neighbor j, respectively.
rcw = rw
in
∗
∑
i
w
i
∗ rw
in
i
∑
i
w
i
∗ rw
in
i
+
∑
j
w
j
∗ rw
in
j
(9)
Figure 7 shows histograms comparing the Relative
Commitment to embeddedness and significance for
both Karate and Netscience networks. In the Karate
network the Relative Commitment values are on aver-
age lower than significance, while in the Netscience
network, we have more values of Relative Commit-
ment between 0.95 and 1.0.
Figure 8 shows Relative Commitment compared
to significance for both networks. As we can see,
there is a clear correlation between the two measures
for many vertices. For both networks, the correlation
coefficient r is approximately 0.59. This means that
the factors identified previously are indeed important
for the significance of the vertices. However, we can
see that there are still differences, vertices that have
a high Relative Commitment but a low significance or
vice-versa, so there are still aspects of significance we
have not covered in Relative Commitment. Still, even
in its current form, Relative Commitment represents a
KDIR 2015 - 7th International Conference on Knowledge Discovery and Information Retrieval
52