empirical evidence to support our conjecture, and pro-
vide an approximate formula that estimates the sur-
vival rate of a program as a function of its semantic
metrics. We envision the following extensions to our
work:
• Because the dependent variable is normalized (a
percentage), we argue that the independent vari-
ables ought to be normalized as well; hence we
envision to redo the statistical analysis using nor-
malized metrics (most can be normalized to the
entropy of the declared state of the program).
• Building a larger base of programs, involving
larger size programs.
• Validate the new regression formula on a realistic
set of programs, distinct from the programs with
which we build the predictive model.
• Automating the calculation of the semantic met-
rics, ideally by the generation of a compiler that
analyzes the source code of the program to com-
pute its semantic metrics.
• Consider the oracle that is used in identify-
ing equivalent mutants, and integrate the non-
determinacy of the oracle in the statistical model.
• Consider a possible standardization of the test
data we use to test mutants: currently, we are re-
lying on the test data provided by the Commons
Math Library. It is possible that some test classes
are more thorough than others; yet in order for our
statistical study to be meaningful, all test classes
need to be equally thorough; we need to define
standards across classes.
REFERENCES
Aadamopoulos, K., Harman, M., and Hierons, R. (2004).
How to overcome the equivalent mutant problem and
achieve tailored selective mutation using co-evolution.
In Proceedings, Genetic and Evolutionary Computa-
tion, volume 3103 of LNCS, pages 1338–1349.
Coles, H. (2017). Real world mutation testing. Technical
report, pitest.org.
Csiszar, I. and Koerner, J. (2011). Information Theory:
Coding Theorems for Discrete Memoryless Systems.
Cambridge University Press.
Debroy, V. and Wong, W. E. (2010). Using mutations to
automatically suggest fixes for faulty programs. In
Proceedings, ICST, pages 65–74.
Debroy, V. and Wong, W. E. (2013). Combining mutation
and fault localization for automated program debug-
ging. Journal of Systems and Software, 90:45–60.
Etzkorn, L. H. and Gholston, S. (2002). A semantic entropy
metric. Journal of Software Maintenance Evolution:
research and Practice, 14:293–310.
Gall, C. S., Lukins, S., Etzkorn, L., Gholston, L., Farring-
ton, P., Utley, D., Fortune, J., and Virani, S. (2008).
Semantic software metrics computed from natural lan-
guage design specifications. IET Software, 2(1).
Inozemtseva, L. and Holmes, R. (2014). Coverage is not
strongly correlated with test suite effectiveness. In
Procedings, 36th International Conference on Soft-
ware Engineering. ACM Press.
Just, R., Ernst, M. D., and Fraser, G. (2013). Using state
infection conditions to detect equivalent mutants and
sped up mutation analysis. In Proceedings, Dagstuhl
Seminar 13021: Symbolic Methods in Testing.
Ma, Y.-S., Offutt, J., and Kwon, Y.-R. (2005). Mujava : An
automated class mutation system. Journal of Software
Testing, Verification and Reliability, 15(2):97–133.
Mili, A., Jaoua, A., Frias, M., and Helali, R. G. (2014).
Semantic metrics for software products. Innovations
in Systems and Software Engineering, 10(3):203–217.
Morell, L. J. and Voas, J. M. (1993). A framework for defin-
ing semantic metrics. Journal of Systems and Soft-
ware, 20(3):245–251.
Nica, S. and Wotawa, F. (2012). Using constraints for equiv-
alent mutant detection. In Andres, C. and Llana, L.,
editors, Second Workshop on Formal methods in the
Development of Software, EPTCS, pages 1–8.
Papadakis, M., Delamaro, M., and LeTraon, Y. (2014).
Mitigating the effects of equivalent mutants with mu-
tant clasification strategies. Science of Computer Pro-
gramming, 95(P3):298–319.
Schuler, D. and Zeller, A. (2010). (un-)covering equiva-
lent mutants. In Proceedings, International Confer-
ence on Software Testing, Verification and Validation,
pages 45–54.
V., D. and W.E., W. (2010). Using mutation to automatically
suggest fixes to faulty programs. In Proceedings, ICST
2010, pages 65–74.
Voas, J. M. and Miller, K. (1993). Semantic metrics for
software testability. Journal of Systems and Software,
20(3):207–216.
Yao, X., Harman, M., and Jia, Y. (2014). A study of equiv-
alent and stubborn mutation operators using human
analysis of equivalence. In Proceedings, ICSE.
Zem
´
ın, L., Gutti
´
errez, S., de Rosso, S. P., Aguirre, N., Mili,
A., Jaoua, A., and Frias, M. (2015). Stryker: Scaling
specification-based program repair by pruning infeasi-
ble mutants with sat. Technical report, ITBA, Buenos
Aires, Argentina.
Estimating the Survival Rate of Mutants
213