The next step in the SNA is to try and isolate po-
tential subgroups within the network. With a topic
as large as pediatric pain there may be evidence of
subgroups in which actors are more active around a
certain topic of interest. If the network were broken
into groups, one would expect a lot of communication
within groups, and relatively little communication be-
tween groups.
3.1.2 Structural Equivalence
The goal of structural equivalence (SE) is to identify
nodes that occupy similar roles within the network.
Formally, two nodes are SE if they have the same ties
to all other nodes in the network. If two nodes are SE
then one can replace the other without interfering with
the flow of information in the network. In reality true
SE is rare, so approximate SE needs to be measured.
A simple measure would be to count the proportion of
matching ties, or the number of tie changes required
to make two nodes SE. There are several measures
available, but for this project a simple count of the
number of similar ties is used.
Regardless of which SE measure is used, a SE ma-
trix is developed, which records the SE between all
the actors. This matrix is used to group similar actors
using a hierarchical clustering algorithm. The result
is a binary tree, or dendogram, depicting a hierarchi-
cal ranking of similarities, as in figure 7. Cutting the
tree off at a particular level results in partitions being
created from the clusterings. The red blocks in figure
7 represent the cutpoint at which the clusters are cre-
ated. Assigning the actors to these groups creates a
blockmodel.
A blockmodel is a partitioning of the network into
exclusive, non-overlapping groups, such that nodes
within the group are approximately SE. For a block-
model there tends to be a lot of communication within
the blocks and relatively little between them. Once
the optimal block model is determined the active
blocks can be further investigated to determine the
content that makes certain blocks unique.
3.2 Network Visualization
The visualization of networks is a key component
of SNA, and as such there is a rich literature base
describing methods of presenting networks visually.
Linton Freeman (Freeman, 1999) documents the his-
tory of social network visualization from a socio-
logical perspective, including theories on node lay-
out (both information-based and algorithmic theories)
along with the use of colour, size and shape to encode
network information. There are many current tools
for analytic network visualization, including UCINet
(Borgatti et al., 2002) and an extension for the R sta-
tistical language called statnet (Handcock et al.,
2003).
Previous work on social network visualization
has also been directed towards network navigation.
Examples include ContactMap (Nardi et al., 2002)
for identifying community groups within email con-
tacts, PieSpy (Mutton, 2004), which provides a real-
time visualization of social networks for Internet Re-
lay Chat (IRC) members, and Vizster (Heer and
boyd, 2005), a tool for exploring the Friendster
(www.friendster.com) social networking site. These
tools are all designed for 1-mode networks, for ex-
ample, the nodes in the Vizster program all represent
users of Friendster, and the ties represent friendship
links between them. In contrast, this project is visual-
izing a 2-mode network, where the first class of nodes
represent mailing list members and the second rep-
resents threads, and the links between a node and a
thread indicate that a certain list member has commu-
nicated on that thread.
The software being used to implement this project
is the prefuse toolkit in Java (Heer et al., 2005).
Prefuse was chosen because it provides a full Java
library, and previous implementations of prefuse, in-
cluding the Vizster program, have proven successful.
4 VeCON System
4.1 Visualization
The purpose of the visualization is to first provide a
tool for visually exploring social networks, and sec-
ondly to provide some insight into the underlying so-
cial structure of the network. This section will outline
the graph-theoretic layout decisions for the network,
and then explain the visualization tools implemented
to help the exploration of the mailing list.
4.1.1 Graph Structure
The network is laid out using a force-directed lay-
out, in which the nodes repel one another and the
edges act as “springs” that hold the nodes together.
Because of this spring effect, the layout is also
sometimes referred to as a “spring embedding” al-
gorithm. Prefuse implements the Barnes-Hut algo-
rithm (Barnes and Hut, 1986) which allows for real
time calculation of spring-embedding forces. The al-
gorithm is an iterative process, and following the lead
of Vizster, this project chooses to not limit the num-
ber of iterations of the algorithm, resulting in a visu-
alization in which the nodes migrate to their optimal
UNDERSTANDING MEDICINE 2.0 - Social Network Analysis and the VECoN System
73