imizing downside of not finding optimal solutions.
The paper continues as follows: In Section 2, we
summarize related work in the literature about net-
work alignments. In the 3rd section, we define our
methods and give some visual examples about them.
Section 4 gives information about the evaluations and
the strengths of the algorithm. In final section, we
conclude the paper.
2 RELATED WORK
There are many algorithms used on global and local
network alignment problems in the literature. Global
network alignment problems are solved via various
techniques such as integer programming (Li et al.,
2007), spectral clustering (Liao et al., 2009) and mes-
sage passing (Zaslavskiy et al., 2009). While lo-
cal network alignment problem is solved by differ-
ent kinds of algorithms and applications. PathBLAST
(Kelley et al., 2003), Network-BLAST (Scott et al.,
2006), QPath (Shlomi et al., 2006), PathMatch and
GraphMatch (Yang and Sze, 2007) are generally used
in finding conservative structures in biological net-
works. Most used algorithm in literature is Path-
BLAST which is an efficient algorithm for aligning
two Protein Protein Interaction (PPI) networks. This
algorithm looks for high-scoring pathway alignments
by considering the homology between aligned pro-
teins. When PPI data are noisy, it can allow gaps and
mismatches to handle variations (Kelley et al., 2003).
Local network alignment algorithms are generally
good at finding small sized common substructures in
given networks but they have same failures about find-
ing larger substructures because of complexity issues.
Also, some of these algorithms do not give chance to
node insertions and deletions in the alignment pro-
cess. In order to handle mentioned issues, Hidden
Markov Models (HMMs) based local network align-
ment is introduced in (Qian and Yoon, 2009). HMMs
has ability to combine node similarities and interac-
tion reliabilities (transition probabilities) to compare
aligned paths and they can also overcome the path iso-
morphism. In (Qian and Yoon, 2009), the researchers
adopt the HMMs framework to find optimal and bio-
logically significant paths in general biological net-
works. Their main goal is to find conserved paths
in two or more biological networks which have sim-
ilarities. They used a scoring scheme to find align-
ments and they search for top k alignments of homol-
ogous paths with the highest scores. Their extended
algorithms has polynomial complexity and it is de-
pendent on the length of aligned paths and the num-
ber of interactions (edges) between each networks.
Aligned paths may have insertions and/or deletions.
After finding high scoring paths, we will attempt to
combine overlapped ones to form the conserved sub-
networks in general network structure.
3 METHODS
In this section we will present an extension to the al-
gorithm described in (Qian and Yoon, 2009) for find-
ing the conserved substructures of varying sizes. The
algorithm presented in (Qian and Yoon, 2009) uses
HMMs for solving local network alignment problem
to find the optimal paths of fixed length.
The details how to do pairwise local alignment in
study (Qian and Yoon, 2009) is stated in 3.1.
3.1 Pairwise Local Alignment
We assume that we have two graphs as G1= (u,d)
and G2=(v,e) representing two biological networks.
The graph G1 has 10 nodes represented as u1, u2
. . . u10 respectively. Also, it has edges between the
nodes which shows the interaction between each en-
tity (node). The graph G2 has 9 nodes named from v1
to v9 and it has again the edges between the nodes
interacted each other. The example of two graphs
shown in Figure 1. These networks are undirected
that means there are same relations between node i to
node j and node j to node i.
For example, when G1 represents a PPI network,
each ui corresponds to a protein, and the edge be-
tween ui and uj shows that these proteins can interact
to each other. When we look at the interacting pair
nodes ie. (ui,uj), the interaction reliability is defined
as w1(ui, uj). It can be accepted as a weight of edge
between node ui and node uj (dij).Similarly, the in-
teraction reliability between two nodes vi and vj in
the graph G2 can be stated as w2(vi,vj).
Finally, the similarity between two nodes ui from
G1 and vj from G2 in the shown networks is defined
as h(ui,vj) and it is found by sequence similarity be-
tween two nodes The aim is to find the best matching
pair of paths from two networks maximizing defined
path alignment score that uses w1(ui, uj), w2(vi,vj),
h(ui,vj) and penalty gap scores. The main strength of
the algorithm is to find pairs of paths have fixed length
size.
Figure 2 shows an example of an alignment be-
tween two similar paths s and q, where s belongs to
G1 and q belongs to G2. The dashed lines in Figure
1 connect two nodes ui and vj indicate that there exist
significant similarities between the connected nodes.
BIOINFORMATICS 2016 - 7th International Conference on Bioinformatics Models, Methods and Algorithms
254