Improving Proceeding Test Case Prioritization with Learning Software Agents
Sebastian Abele and Peter Göhner
Institute of Industrial Automation and Software Engineering, University of Stuttgart, Stuttgart, Germany
Keywords:
Machine Learning, Test Case Prioritization, Test Suite Optimization, Software Agents.
Abstract:
Test case prioritization is an important technique to improve the planning and management of a system test. The system test itself is an iterative process that accompanies a software system during its whole life cycle. Usually, a software system is altered and extended continuously. Test case prioritization algorithms find and order the most important test cases to increase the test efficiency within the limited test time. Generally, the knowledge about a system's characteristics grows throughout the development. With more experience and more empirical data, the test case prioritization can be optimized to raise the test efficiency. This article introduces a learning agent-based test case prioritization system, which improves the prioritization automatically by drawing conclusions from actual test results.
1 INTRODUCTION
Developing high-quality systems is an important factor for success in the market. Strong competition leads to a decreasing time-to-market; hence, the development process must be carried out efficiently. One of the major parts of the test process is the system test. The system test accompanies the whole development process and cannot be considered in isolation. With about 80%, the largest share of the total test expenditure is spent on regression testing (Chittimalli and Harrold, 2009). In regression testing, already available test cases are executed repeatedly to find faults that may have been newly introduced with changes to the system. Over time, the test suites for regression testing may grow into very large repositories with thousands of test cases. Executing all the test cases takes a vast amount of time and resources, which are often not available due to the short time-to-market.
The planning of a test run, including the selection of appropriate test cases, is a complex and time-intensive task. A lot of data has to be considered to achieve an efficient test plan. In order to address these challenges, computer-aided test case selection and prioritization techniques are used to find the most important test cases for the available time slots automatically. Test case selection techniques reduce the test suites by identifying only relevant test cases, for example based on the coverage of changes to the source code since the last test run. An overview of different test selection techniques can be found in (Engström et al., 2010). Prioritization techniques order the test cases by their expected benefit for the test. Unlike with test case selection, a test run that is executed on the basis of prioritized test cases may be interrupted at any time while still yielding the maximum benefit possible up to the time of interruption. (Yoo and Harman, 2012) describe test case selection and prioritization techniques that have been developed in the past decades. The test case prioritization techniques have in common that they calculate the test case order at specific points in time before a new test run starts.
The knowledge about the tested system grows
from test run to test run. More and more data, like
fault histories, are collected and evaluated to gener-
ate the test case order for the next test run. Not only the collected data, but also the knowledge about the tested system and the experience of the developers and testers is growing. The tested software may show some unexpected behavior in the test, which the test engineers understand better and better over time. Classic test case prioritization techniques are usually not adapted to this growing knowledge and experience, so the test case prioritization may not be optimal with respect to the new knowledge.
To improve the test case prioritization with the knowledge gained during the test process, we propose a test case prioritization system that uses machine learning approaches. With machine learning, the test
case prioritization algorithm is updated every test run
with the actual test results. The test case prioritiza-
tion algorithm and the machine learning approaches
are developed using software agents. Chapter 2 de-
scribes the agent-based test case prioritization and the
used prioritization algorithm. This approach is extended by a machine learning approach in chapter 3. Finally, chapters 4 and 5 describe further research
plans and ideas.
2 AGENT-BASED TEST CASE PRIORITIZATION
The paradigm of agent-based software development is well suited to addressing the challenges posed by the boundary conditions of a system test. Agent systems are software systems that consist of various agents. An agent is a piece of software that is capable of acting largely independently to fulfill its given goals. In an
agent system, different agents cooperate to achieve a
superior goal. Agent systems and their development
are described in (Wooldridge and Jennings, 1995) and
(Mubarak, 2008).
In the field of prioritizing test cases, the agents
collect and integrate information about the tested sys-
tem. Using this information they provide a list of pri-
oritized test cases. An agent-based test case prioriti-
zation system has been developed by (Malz and Göh-
ner, 2011). Each software module and each test case
is represented by one agent. The prioritization pro-
cess consists of two main steps: First, the module agents predict the fault-proneness of the software modules for the next test run. In the second step, the test case agents calculate the fault-revealing probability of the test cases: for every module, each test case agent calculates how probable it is that the test case it represents reveals faults in that module. The priority value of a test case is then obtained as the mean of its fault-revealing probabilities, weighted by the fault-proneness of the modules.
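The following minimal sketch illustrates this weighting step. It is an illustration only; the function and variable names are assumptions and not taken from the agent implementation described in (Malz and Göhner, 2011).

```python
# Minimal sketch of the priority calculation (assumed names, not the agents' code).
# fault_proneness: predicted fault-proneness per module (from the module agents).
# reveal_prob:     fault-revealing probability of one test case per module
#                  (from the corresponding test case agent).

def test_case_priority(fault_proneness, reveal_prob):
    """Mean of the fault-revealing probabilities, weighted by module fault-proneness."""
    total_weight = sum(fault_proneness.values())
    if total_weight == 0:
        return 0.0
    return sum(fault_proneness[m] * reveal_prob.get(m, 0.0)
               for m in fault_proneness) / total_weight

# Example: a test case that mainly exercises the "Player" module.
fault_proneness = {"Underground": 0.3, "Player": 0.8, "Enemy": 0.5}
reveal_prob = {"Underground": 0.1, "Player": 0.9, "Enemy": 0.2}
print(test_case_priority(fault_proneness, reveal_prob))  # higher value = execute earlier
```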
The fault-proneness is calculated by evaluating
different metrics for each software module. Fault-
proneness prediction is described for example in (Kim
et al., 2007). (Bellini et al., 2005) compare different
models to estimate the fault-proneness. The metrics
are differentiated into white-box and black-box metrics. White-box metrics are based on the analysis of the source code, e.g. to cover changes since the last test run. Especially when the software system is developed jointly by different departments or even different companies, access to the source code may be restricted and white-box metrics may not be available. Black-box metrics represent information about the modules that is available without direct access to the source code; they include fault and change histories as well as developer-given criticality and complexity values.
This kind of agent-based system is well suited to running in a distributed environment. Information about a software system under test is usually distributed in such an environment. Every department or company holds information about the part of the software it develops. Agent-based test case prioritization makes it possible to integrate the distributed information to generate the test case order for the system test. By operating the relevant agents locally, departments or companies keep control of their data. The agents deliver only the data that is necessary for the current prioritization task. A concept for an agent-based information retrieval system was described by (Pech and Goehner, 2010).
The agent-based test case prioritization system
uses fuzzy-logic rules, which reflect expert knowl-
edge about the relation of the metrics to the fault-
proneness and the fault-revealing probability. A de-
scription of the fuzzy logic rules for test case prior-
itization can be found in (Malz et al., 2012). The
rules mainly state common relations like "a fault-
prone module in the past will be fault-prone in the
future", "complex modules are more fault prone than
simple ones" or "test cases, which found a lot of faults
in the past, will find a lot of faults in the future".
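To make the flavor of such a weighted rule base more concrete, the following sketch shows one possible, strongly simplified encoding. The membership function, rule set and weights are illustrative assumptions, not the actual fuzzy inference of the prioritization system.

```python
# Strongly simplified sketch of a weighted fuzzy rule base (illustrative only).
# All metric values are assumed to be normalized to the range 0..1.

def high(x):
    """Membership of a normalized metric value in the fuzzy set 'high'."""
    return max(0.0, min(1.0, (x - 0.5) / 0.5))

# Each rule: (description, weight, activation over a module's metrics); all rules
# here conclude on "fault-prone", so a weighted average of activations is used.
rules = [
    ("fault-prone in the past -> fault-prone", 1.0, lambda m: high(m["fault_history"])),
    ("complex -> fault-prone",                 0.8, lambda m: high(m["complexity"])),
    ("changed a lot -> fault-prone",           0.5, lambda m: high(m["change_rate"])),
]

def fault_proneness(metrics):
    """Weighted average of rule activations; chapter 3.1 describes how these weights are tuned."""
    return (sum(w * act(metrics) for _, w, act in rules)
            / sum(w for _, w, _ in rules))

print(fault_proneness({"fault_history": 0.9, "complexity": 0.7, "change_rate": 0.2}))
```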
As (Fenton and Neil, 1999) state, expert knowledge is often only expert opinion when it is used to predict fault-proneness. This applies to the fuzzy-logic rules, but also to some of the evaluated parameters, such as the complexity and criticality values obtained from the developers. With additional knowledge about the system, e.g. the faults actually found during the test, the fault-proneness prediction can be evaluated and improved. Therefore, learning mechanisms are integrated into the agents.
3 IMPROVING PRIORITIZATION BY LEARNING AGENTS
In the classic agent-based test case prioritization approach, the prioritization is determined using fuzzy logic. Rules are formalized that reflect common expert knowledge, which is usually valid for all software development projects. In reality, software development projects deviate from each other because the size, purpose, development team and many other factors differ from project to project. Adapting
ICAART2014-InternationalConferenceonAgentsandArtificialIntelligence
294
the prioritization algorithm manually to the project specifics is nearly impossible, because the specifics are usually unknown and hard to identify. By using learning algorithms, the prioritization system is able to adapt itself using the knowledge from actually performed tests.
3.1 Learning Fault-proneness Prediction
As a first step towards a learning agent-based test
case prioritization system, the prediction of the fault-
proneness has been extended by a genetic algorithm.
The usage of the genetic algorithm is depicted in fig-
ure 1. The classic approach uses fuzzy logic rules
to estimate the fault-proneness out of the parame-
ters also shown in the figure. The calculated fault-
proneness value is compared with the number of ac-
tually found faults in the test. The difference between
the predicted fault-proneness and the actually found
faults is given to the genetic algorithm as a correction
value for the next test run.
Figure 1: Usage of a Genetic Algorithm to improve the
fault-proneness prediction.
The genetic algorithm optimizes the fuzzy logic rules such that the fault-proneness prediction would have yielded a value much closer to the number of actually found faults. For the next test run, the fuzzy logic will use the optimized rule set and provide a better prediction of the fault-proneness value. With the better fault-proneness prediction, a better test case prioritization is achieved.
The genetic algorithm has been applied to optimize the weights of the individual rules. A rule weight reflects the influence of a parameter on the fault-proneness: the more the fault-proneness depends on a single parameter, the higher the weight of the rules representing this parameter. The weights of all rules are combined into a chromosome for the genetic algorithm. With inheritance and random mutation, the genetic algorithm searches for a new, evolved chromosome that is better suited to calculate the fault-proneness. To do so, it creates a number of chromosomes and compares their fitness. The fitness
is determined by comparing the prediction result of
the fuzzy logic rules using the current chromosome as
rule weights with the number of actually found faults.
The smaller the difference, the higher the fitness.
Since not every rule is necessarily used for each fault-proneness calculation, only the weights of rules that are actually used should be changed, to avoid unwanted side effects. If a mutation of a weight has no influence on the result, this weight is locked and not changed anymore. The chromosome with the best fitness is given to the fuzzy logic, which adapts the weights of its rules for the next fault-proneness prediction.
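A minimal sketch of this optimization loop is given below. The representation of chromosomes as weight vectors, the mutation step size and the selection scheme are illustrative assumptions; the actual agent implementation is not specified at this level of detail.

```python
import random

# Sketch of the weight-optimizing genetic algorithm (illustrative assumptions only).
# predict(weights) -> fault-proneness predicted by the fuzzy rules with these weights.
# actual_faults    -> (normalized) number of faults actually found in the last test run.
# locked           -> indices of weights whose mutation showed no influence on the result.

def fitness(weights, predict, actual_faults):
    """The smaller the prediction error, the higher the fitness."""
    return -abs(predict(weights) - actual_faults)

def evolve(weights, predict, actual_faults, locked, pop_size=20, generations=50):
    population = [list(weights) for _ in range(pop_size)]
    for _ in range(generations):
        # Random mutation of one weight per chromosome, skipping locked weights.
        for chrom in population:
            i = random.randrange(len(chrom))
            if i not in locked:
                chrom[i] = min(1.0, max(0.0, chrom[i] + random.uniform(-0.1, 0.1)))
        # Keep the fitter half and refill the population by copying it (simple inheritance).
        population.sort(key=lambda c: fitness(c, predict, actual_faults), reverse=True)
        population = population[:pop_size // 2]
        population += [list(c) for c in population]
    return max(population, key=lambda c: fitness(c, predict, actual_faults))

# Toy usage: here the "prediction" is simply the mean of the weights.
best = evolve([0.8, 0.5, 0.3], predict=lambda w: sum(w) / len(w),
              actual_faults=0.7, locked={2})
```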
3.2 Evaluation of the Learning Fault-proneness Prediction
To evaluate the learning fault-proneness prediction, it is compared against the classic approach that doesn't learn. As an example project, a computer game was chosen that was developed during a 24-hour programming competition. The boundary conditions of this competition ensure that the developer is time-constrained in implementation and testing. Additionally, we assume that the development process in such a short-time programming competition is fundamentally different from classic software development. Therefore, the classic rules may not be optimal and can be improved using the learning approach.
The game was divided into five modules, which
have been investigated: Underground, Obstacle,
Player, Enemy and Background. Before the develop-
ment started, the developer assigned complexity and
criticality values to the modules (see Table 1). As a third parameter, the relative development effort spent on the individual modules has been recorded as an indicator of the changes made to the modules in the development phases (see Table 3). Additionally, the faults that were
revealed were recorded for further analysis (see Ta-
ble 2). The fault-proneness has been calculated three
times during the 24 hours, once at the beginning of
the project, once in the middle and once shortly be-
fore the deadline.
Table 1: Parameters given by the developer.

Module       Complexity  Criticality
Underground  10          6
Obstacle     1           5
Player       8           10
Enemy        6           5
Background   7           7
To assess the quality of both fault-proneness pre-
diction methods, a reference value has been calcu-
lated. The reference value is composed of the actual number of revealed faults, the severity of the faults
ImprovingProceedingTestCasePrioritizationwithLearningSoftwareAgents
295
Figure 2: Comparison between classic prediction and learning prediction. One panel per module (Underground, Obstacle, Player, Enemy, Background); each panel plots the actual value, the classic prediction and the learning prediction over the early, mid and end phases (value range 0 to 12).
Table 2: Faults actually found in the early, mid and late phases.

Module       Early  Mid  Late
Underground  3      7    3
Obstacle     1      3    4
Player       0      2    5
Enemy        0      3    8
Background   1      5    2
Table 3: Effort spent in the early, mid and late phases.

Module       Early  Mid   Late
Underground  49%    12%   6%
Obstacle     13%    10%   31%
Player       27%    36%   22%
Enemy        6%     22%   32%
Background   5%     20%   9%
and a post-mortem estimate by the developer of which module would have been the most important to test. Figure 2 shows the results and the comparison between the two approaches. By analyzing the results and the given scenario, the following observations were made:
- The developer's judgment of the criticality and complexity values did not fit very well. He overestimated the criticality of the Underground module, which may lead to an excessively high predicted fault-proneness.
- For the Obstacle module, the developer underestimated the complexity. In reality, that module was more fault-prone than expected.
- The trend of the found faults follows the order in which the modules were developed. The timing of the development has a big impact on the fault-proneness in this scenario.
- For all modules, the prediction that uses the genetic algorithm is closer to the actual value than the classic prediction.
These results show that the scenario in which the software is developed has a large impact on the fault-proneness and its prediction. Especially in such a short-term development project, classic factors like criticality and fault history are quite weak indicators for the fault-proneness calculation. However, the results show that the genetic algorithm used is capable of correcting the unfavorably selected parameters and weights for the calculation. The impact of the unfortunately chosen criticality values is weakened.
3.3 Critique of the Approach
Despite the results of the study, there are some things that must be considered. The fault-proneness prediction is optimized using the faults actually found in the performed test. This brings two prerequisites: First, the test cases that test a software module must be good enough to find a representative number of faults; assessing the quality of test cases is a very complex task in itself. Secondly, at least some of the test cases must have been executed in the test run. If the test cases are not executed due to time limitations, then no optimization is possible. The algorithm should not be run if the number of actually found faults is too small to be representative.
The currently implemented genetic algorithm only optimizes the weight factors of the individual fuzzy logic rules. There may be other relations that make it necessary to change the rules themselves, add new rules, or change the fuzzification and defuzzification functions. This will be investigated in further research.
4 PLAUSIBILITY CHECKING
Another conclusion from the results presented in chapter 3.2 is that the algorithm itself is not the only possible focus for a learning agent system. Other data, like the complexity value, can also be corrected with actual test results. In that case, the wrong complexity would be corrected instead of eliminating its influence on the fault-proneness. The correction of data brings a lot of challenges for further work. It is necessary to distinguish between wrong parameter data and a wrongly estimated fault-proneness. The first step towards improving the input data of the fault-proneness prediction and test case prioritization is to check the data plausibility. The plausibility can be checked on two levels. Firstly, the consistency of all input data is ensured. Secondly, further conclusions are generated to find faults or uncertainties in parameters that do not violate the consistency.
4.1 Consistency Checking
All parameters that are evaluated to generate the test case prioritization must be consistent in order to draw valid conclusions. Consistency rules can be established for this purpose. A lot of consistency checking is already done by state-of-the-art test management tools. In particular, the creation of invalid dependencies is prevented and traceability is ensured. Nevertheless, the current consistency checking needs to be extended by incorporating actual test results and other feedback from the test. When an inconsistency occurs, the test management system should be capable of giving the user detailed information about the reason, or even of fixing the inconsistency automatically. To generate this information, more evaluation needs to be done. One objective of further research is to generate consistency rules and to add further data evaluation that helps to fix inconsistencies. The following list gives some examples of consistency rules and the conclusions that can be drawn (a small code sketch follows the list):
- A test case can only find faults in software modules that are covered or that are dependent on covered modules. If the test case finds a fault in another module, either the coverage information is wrong or a dependency is missing.
- The interrelations between requirements, functionalities and modules must be consistent with the test case coverage. If a fault is revealed in a module that does not implement the tested functionality, the dependencies are set wrongly.
- Especially when modules have been added or deleted, it must be ensured that all relations and dependencies are updated as well.
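As an illustration of the first rule in the list, a consistency check could be sketched as follows; the data structures and names are hypothetical and not part of the agent-based implementation.

```python
# Sketch of the first consistency rule above (hypothetical data structures).
coverage = {"TC_7": {"Underground"}}       # modules covered by each test case
depends_on = {"Player": {"Underground"}}   # Player depends on Underground

def check_fault_location(test_case, faulty_module):
    """A test case may only find faults in covered modules or in modules
    that depend on a covered module."""
    covered = coverage.get(test_case, set())
    dependent = {m for m, deps in depends_on.items() if deps & covered}
    if faulty_module not in covered | dependent:
        return (f"Inconsistency: {test_case} found a fault in {faulty_module}; "
                "either the coverage is wrong or a dependency is missing.")
    return None

print(check_fault_location("TC_7", "Enemy"))    # reports an inconsistency
print(check_fault_location("TC_7", "Player"))   # None: Player depends on Underground
```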
(Rauscher and Göhner, 2013) developed an agent-
based approach to ensure the consistency of a set of
concurrent system models of a mechatronic system.
They use an agent and an ontological description for
each system model. Based on the ontological repre-
sentation of the models, they define consistency rules,
which must be fulfilled. The rules are verified by
the agents automatically when one or more models
are changed. The consistency checking approach is
adapted and evolved to be used inside the agent-based
test management system. Consistency rules will be
generated and verified using information from actu-
ally performed tests.
4.2 Further Recommendations to Improve the Data
Faulty or poorly chosen parameters that don't violate plausibility rules are harder to find. Nevertheless, recommendations can be generated that indicate that the data should be checked manually. In the evaluation study, the developer misjudged the complexity of several software modules. After applying the genetic algorithm, the weight of the complexity rules was lowered significantly. Instead of lowering the weight, the misjudged values could be corrected. If a lot of faults are found in a module that has a quite low complexity value, this value might be reassessed and changed.
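Such a recommendation could be generated roughly as sketched below, under the assumption that complexity values and fault counts are normalized to a comparable range; the threshold is an arbitrary illustrative choice.

```python
# Sketch: flag complexity values that do not match the observed fault data.
# Complexity values and fault counts are assumed to be normalized to 0..1.

def complexity_recommendations(complexity, faults_found, threshold=0.4):
    hints = []
    for module, comp in complexity.items():
        if faults_found.get(module, 0.0) - comp > threshold:
            hints.append(f"Module '{module}': many faults despite a low stated "
                         f"complexity ({comp:.1f}); please reassess this value.")
    return hints

# Example loosely resembling the Obstacle module from the study (values illustrative).
print(complexity_recommendations({"Obstacle": 0.1, "Player": 0.8},
                                 {"Obstacle": 0.7, "Player": 0.6}))
```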
5 SWARM LEARNING
The test case prioritization system consists of a set of individual software agents. Currently, the agents act largely independently in order to calculate the fault-proneness or the fault-revealing probability. The implemented learning algorithm improves the fault-proneness calculation for each agent individually. By adding swarm intelligence features to the agents, they will be capable of generating further information collectively. By comparing the learning results for a parameter, the agents can distinguish whether a learned characteristic is valid in the whole project, in a specific part of the software or only in a single module. If the characteristic of a single module differs from all others, this is a hint at a possibly wrong parameter value.
Characteristics that are common to the whole project can be abstracted and used as basic knowledge for newly introduced modules or test cases. The agents that represent these modules or test cases don't need to learn the common characteristics and can provide a better result earlier. The comparison
of the learning results also helps to identify wrongly
learned relations. Strong deviations and fluctuations
can be detected and corrected.
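How such a collective comparison could work is sketched below under simple assumptions: each agent's learned characteristic is reduced to a single rule weight, and a median-based deviation test marks outliers. Neither of these choices is prescribed by the approach.

```python
import statistics

# Sketch: compare a learned rule weight across module agents to spot outliers.
learned_weights = {          # e.g. learned weight of the "complexity" rule per module
    "Underground": 0.35, "Obstacle": 0.30, "Player": 0.40,
    "Enemy": 0.38, "Background": 0.95,
}

def outlier_modules(weights, tolerance=0.3):
    """Modules whose learned value deviates strongly from the swarm's median."""
    median = statistics.median(weights.values())
    return [m for m, w in weights.items() if abs(w - median) > tolerance]

common = statistics.median(learned_weights.values())  # usable as prior knowledge
print(outlier_modules(learned_weights))               # ['Background'] -> check its parameters
print(common)                                         # starting weight for new module agents
```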
6 CONCLUSIONS
In this article, we introduced a learning agent-based
test case prioritization system. The system uses a ge-
netic algorithm to improve the test case prioritization
with growing knowledge from the proceeding devel-
opment and test process. Our analysis with a learning fault-proneness calculation as a basis for the test case prioritization showed that the learning agents are able to improve the prioritization significantly. Especially if the evaluated information is wrong or imprecise, e.g. because the developers misjudged the complexity of a module, the learning algorithm helps to reduce the effect of these inaccurate parameters.
In our future work, we will extend the genetic algorithm to the fault-revealing calculation of the test cases. In parallel, we investigate further techniques that may help to improve the test case prioritization, for example the realization of the consistency checking and swarm intelligence described in chapters 4 and 5.
REFERENCES
Bellini, P., Bruno, I., Nesi, P., and Rogai, D. (2005).
Comparing fault-proneness estimation models. In
10th IEEE International Conference on Engineering
of Complex Computer Systems (ICECCS’05), pages
205–214.
Chittimalli, P. and Harrold, M.-J. (2009). Recomputing cov-
erage information to assist regression testing. IEEE
Transactions on Software Engineering, 35(4):452–
469.
Engström, E., Runeson, P., and Skoglund, M. (2010). A sys-
tematic review on regression test selection techniques.
Information and Software Technology, 52(1):14–30.
Fenton, N. and Neil, M. (1999). A critique of software de-
fect prediction models. IEEE Transactions on Soft-
ware Engineering, 25(5):675–689.
Kim, S., Zimmermann, T., Whitehead Jr., E. J., and Zeller,
A. (2007). Predicting faults from cached history. In
Proceedings of the 29th International Conference on
Software Engineering, pages 489–498, Los Alamitos.
IEEE Computer Society.
Malz, C. and Göhner, P. (2011). Agent-based test case pri-
oritization. In IEEE Fourth International Conference
on Software Testing, Verification and Validation Work-
shops (ICSTW), pages 149–152.
Malz, C., Jazdi, N., and Göhner, P. (2012). Prioritiza-
tion of test cases using software agents and fuzzy
logic. In 2012 IEEE Fifth International Conference on
Software Testing, Verification and Validation (ICST),
pages 483–486.
Mubarak, H. (2008). Developing flexible software using
agent-oriented software engineering. IEEE Software,
25(5):12–15.
Pech, S. and Goehner, P. (2010). Multi-agent information
retrieval in heterogeneous industrial automation envi-
ronments. In Agents and Data Mining Interaction,
volume 5980 of Lecture Notes in Computer Science,
pages 27–39. Springer, Berlin and Heidelberg.
Rauscher, M. and Göhner, P. (2013). Agent-based con-
sistency check in early mechatronic design phase.
In Proceedings of the 19th International Conference
on Engineering Design (ICED13), Design for Har-
monies, volume 9, pages 289–396. Design Society,
Seoul.
Wooldridge, M. and Jennings, N. R. (1995). Intelligent
agents: theory and practice. The Knowledge Engi-
neering Review, 10(02):115–152.
Yoo, S. and Harman, M. (2012). Regression testing mini-
mization, selection and prioritization: a survey. Soft-
ware Testing, Verification and Reliability, 22(2):67–
120.
ICAART2014-InternationalConferenceonAgentsandArtificialIntelligence
298