struction with Maximum Parsimony. Confronted
to other existing implementations our method
does not allow to find trees with a better score
which is the aim of Path-Relinking but represents
an interesting tool to compare the topologies of
the source and guiding trees. The bottom-up ite-
rative implementation that we have described is
faster than the top-down recursive implementa-
tions and can serve as a measure of distance be-
tween trees and could be applied to any other con-
text.
• Simulated annealing: we have presented an im-
proved Simulated Annealing algorithm to find
near-optimal solutions for the MP problem un-
der the optimality criterion of Fitch. In the
experiments our algorithm was carefully com-
pared with an existing Simulated Annealing
implementation (LVB) (Barker, 2003; Barker,
2012), and other three state-of-the-art algorithms
GA+PR+LS, TNT and Hydra. The results show
that our SA is able to consistently improve the
best results produced by LVB, obtaining in cer-
tain instances important reductions in the parsi-
mony score. Compared with the state-of-the-art
algorithm called Hydra (Go
¨
effon, 2006) our SA
algorithm was able to improve on 9 previous best-
known solutions and to equal these results on the
other 11 selected benchmark instances. Further-
more, it was observed that the solution cost found
by our SA presents a relatively small standard de-
viation, which indicates the precision and robust-
ness of the proposed approach.
As future work we suggest to integrate the CUDA
evaluation technique into our implementation of Sim-
ulated Annealing for MP an use the Path-Relinking
technique to determine local optima during the de-
crease of the temperature in order to avoid unsuccess-
ful evaluations of many trees.
7 AVAILABILITY
The C++ source code for the SA algorithm and Path-
Relinking can be found on sourceforge.net under the
biosbl project. The code for the GPU implementation
is freely available from the website of Jean-Michel
Richer. It should run under all Unix/Linux platforms
(http://www.info.univ-angers.fr/pub/richer/rec.php).
REFERENCES
Allen, B. J. and Steel, M. (2001). Subtree transfer opera-
tions and their induced metrics on evolutionary trees.
Annals of Combinatorics, 5(1):1–15.
Bader, D. A., Chandu, V. P., and Yan, M. (2006). Exactmp:
An efficient parallel exact solver for phylogenetic tree
reconstruction using maximum parsimony. In Parallel
Processing, 2006. ICPP 2006. International Confer-
ence on, pages 65–73. IEEE.
Barker, D. (2003). LVB: parsimony and simulated annea-
ling in the search for phylogenetic trees. Bioinformat-
ics, 20(2):274–275.
Barker, D. (2012). LVB homepage.
Chase, M. W., Soltis, D. E., Olmstead, R. G., Morgan,
D., Les, D. H., Mishler, B. D., Duvall, M. R., Price,
R. A., Hills, H. G., Qiu, Y., Kron, K. A., Rettig,
J. H., Conti, E., Palmer, J. D., Manhart, J. R., Sytsma,
K. J., Michaels, H. J., Kress, W. J., Karol, K. G.,
Clark, W. D., Hedren, M., Gaut, B. S., Jansen, R. K.,
Kim, K., Wimpee, C. F., Smith, J. F., Furnier, G. R.,
Strauss, S. H., Xiang, Q., Plunkett, G. M., Soltis, P. S.,
Swensen, S. M., Williams, S. E., Gadek, P. A., Quinn,
C. J., Eguiarte, L. E., Golenberg, E., Learn, G. H.,
Graham, S. W., Barrett, S. C. H., Dayanandan, S., and
Albert, V. A. (1993). Phylogenetics of seed plants:
An analysis of nucleotide sequences from the plastid
gene rbcl. Annals of the Missouri Botanical Garden,
80(3):528–580.
Felsenstein, J. (2003). Inferring phylogenies. Sinauer As-
sociates.
Fitch, W. (1971). Towards defining course of evolution:
minimum change for a specified tree topology. Sys-
tematic Zoology, 20:406–416.
Foulds, L. R. and Graham, R. L. (1982). The steiner pro-
blem in phylogeny is np-complete. Advances in Ap-
plied Mathematics, 3(1):43–49.
Gladstein, D. S. (1997). Efficient incremental character op-
timization. Cladistics, 13(1-2):21–26.
Glover, F., Laguna, M., and Mart, R. (2000). Fundamen-
tals of scatter search and path relinking. Control and
Cybernetics, 39:653–684.
Go
¨
effon, A. (2006). Nouvelles heuristiques de voisinage
et m
´
em
´
etiques pour le probl
`
eme maximum de parci-
monie. PhD thesis, LERIA, Universit
´
e d’Angers.
Goloboff, P. (1999). Analyzing large data sets in reason-
able times: solutions for composite optima. Cladis-
tics, 15:415–428.
Goloboff, P. (2002). Techniques for analyzing large data
sets. In DeSalle R., G. G. and Wheeler W., e., editors,
Techniques in Molecular Systematics and Evolution,
page 7079. Brikhuser Verlag, Basel.
Goloboff, P. A. (1993). Nona, version 2.0. Computer Pro-
gram and Manual Distributed by the Author.
Goloboff, P. A., Farris, J. S., and Nixon, K. C. (2008). Tnt,
a free program for phylogenetic analysis. Cladistics,
24(5):774–786.
Gusfield, D. (1997). Algorithms on strings, trees, and se-
quences: Computer science and computational biol-
ogy. Cambridge University Press, 1st. edition.
Hartigan, J. A. (1973). Minimum mutation fits to a given
tree. Biometrics, 29:53–65.
Strategies for Phylogenetic Reconstruction - For the Maximum Parsimony Problem
235