Reproducibility Practices of Software Engineering Controlled
Experiments: Survey and Prospective Actions
André F. R. Cordeiro (https://orcid.org/0000-0001-6719-2826) and Edson Oliveira Jr (https://orcid.org/0000-0002-4760-1626)
Informatics Department, State University of Maringá, Colombo Avenue - 5790, Maringá, Brazil
Keywords:
Reproducibility, Research Opportunities, Review, Software Engineering Research, Systematic Mapping,
Controlled Experimentation.
Abstract:
Reproducibility can be described as a characteristic that contributes to expanding knowledge in science. This paper investigates the reproducibility of experiments in Software Engineering (SE) in a context where the literature points to challenges in verifying experimental results. The central problem addressed is the difficulty of reproducing SE experiments due to different factors, such as artifact sharing and management. We aimed to identify the factors necessary to achieve reproducibility in SE experiments, characterizing these factors in terms of the reproducibility crisis, experimental workflows, research practices, the application of the FAIR principles, and reproducibility improvements. We planned and conducted a survey with 16 participants, who answered a questionnaire with 33 questions. The results show that most participants perceive a reproducibility crisis in the field and point to factors such as the lack of public data and incomplete information on methods and experimental setups as the main causes. Furthermore, the results highlight the importance of sharing data, metadata, and information about research teams. We also point to possible actions to improve reproducibility in SE experiments. The contributions include a detailed analysis of the challenges to reproducibility in SE, as well as the identification of practices and measures that can improve it.
1 INTRODUCTION
The literature contains different studies describing ex-
perimental activities in software engineering (SE).
An assessment of methods for verifying experimental
findings is presented in Juristo and Vegas (2010). The
results indicate that reanalysis, replication, and repro-
duction are frequently considered in SE research.
Reproducibility, often referred to as reproduction, is a fundamental principle for advancing knowledge across scientific fields (Juristo and Vegas, 2010). It is achieved when identical results are obtained using the same methodology, even when experiments are conducted in different laboratories, by different operators, and with varying equipment (Kitchenham et al., 2020).
Controlled experiments in SE often face chal-
lenges related to reproducibility. These challenges en-
compass aspects such as availability, standardization,
review processes, and the generation and evolution of
experimental artifacts (Solari et al., 2018). To address these challenges, this paper surveys SE researchers who have experience with experimentation. The survey aims to deepen the understanding of reproducibility issues in the field.
The structure of this study is as follows. Section
2 discusses the background and related work. Sec-
tion 3 outlines the research methodology. Section 4
presents the findings, followed by a discussion in Sec-
tion 5. Section 6 discusses the validity evaluation of
this study. Prospective actions are proposed in Sec-
tion 7, and final remarks are provided in Section 8.
2 BACKGROUND AND RELATED
WORK
This section outlines the theoretical foundation for the
study, organized into controlled experiments in soft-
ware engineering, reproducibility in software engi-
neering, and related research.
2.1 Controlled Experimentation in
Software Engineering
Experimental Software Engineering (ESE) is a re-
search area focused on investigating best practices for
conducting experiments in SE (Wohlin et al., 2012).
A controlled experiment in SE can be described as a
process that involves several phases, including defi-
nition, planning, and execution (Wohlin et al., 2012).
Figure 1 illustrates the experimental process.
Figure 1: Experimental process (Wohlin et al., 2012).
During the definition phase, the scope of the ex-
periment is established, and specific objectives are
determined. The planning phase involves organiz-
ing the experiment by selecting the context, formulat-
ing hypotheses, identifying variables, choosing par-
ticipants (if required), deciding on an experimental
design, preparing instrumentation, and assessing va-
lidity.
The operation phase is where the experiment is ex-
ecuted according to the protocol defined during plan-
ning. Afterward, the collected data is analyzed and in-
terpreted to determine whether the hypothesis can be
rejected. All experimental artifacts, including data,
should be carefully packaged for presentation, dis-
semination, and potential future reproduction. Fi-
nally, detailed experimental reports are prepared to
document findings and methodologies (Wohlin et al.,
2012).
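To make the flow of these phases concrete, the sketch below models the experimental process as a minimal Python pipeline. It is an illustration of ours only; the names (Experiment, define, plan, operate, analyze_and_package) are hypothetical and are not an API from Wohlin et al. (2012) or from this paper.

# Minimal sketch of the controlled-experiment process described above.
# All names are hypothetical illustrations, not an API from the literature.
from dataclasses import dataclass, field

@dataclass
class Experiment:
    scope: str                                        # definition phase
    hypotheses: list = field(default_factory=list)    # planning phase
    design: str = ""                                  # planning phase
    raw_data: list = field(default_factory=list)      # operation phase
    conclusions: list = field(default_factory=list)   # analysis phase

def define(scope: str) -> Experiment:
    """Definition phase: establish scope and objectives."""
    return Experiment(scope=scope)

def plan(exp: Experiment, hypotheses: list, design: str) -> Experiment:
    """Planning phase: hypotheses, variables, design, instrumentation."""
    exp.hypotheses, exp.design = hypotheses, design
    return exp

def operate(exp: Experiment, collected: list) -> Experiment:
    """Operation phase: execute the protocol and collect data."""
    exp.raw_data = collected
    return exp

def analyze_and_package(exp: Experiment) -> dict:
    """Analysis and packaging: interpret data and bundle all artifacts
    so the experiment can later be reproduced."""
    exp.conclusions = [f"evaluated: {h}" for h in exp.hypotheses]
    return {"scope": exp.scope, "design": exp.design,
            "data": exp.raw_data, "conclusions": exp.conclusions}

package = analyze_and_package(
    operate(plan(define("compare code review techniques"),
                 ["H0: no difference in defect detection"], "paired design"),
            [42, 37, 51]))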
2.2 Reproducibility in Software
Engineering
Reproducibility in SE can be examined across various
levels, contexts, or subareas (Li, 2021; Kitchenham
et al., 2020). The study of Liu et al. (2021) investi-
gates the urgency and importance of reproducibility
and replicability in the application of Deep Learning
(DL) in SE.
Despite the availability of various solutions, in-
cluding processes and tools, the reproducibility prob-
lem persists, particularly in the context of empirical
studies (Li, 2021; Kitchenham et al., 2020). This
challenge is especially perceived in controlled exper-
iments (Liu et al., 2021; Anchundia et al., 2020).
2.3 Related Work
Kitchenham et al. (2020) provides an assessment of
the reproducibility and validity of experimental re-
sults in SE. The study involved a systematic re-
view of research published in SE journals. Similarly,
Li (2021) investigates issues in the Evidence-Based
Software Engineering (EBSE) subarea, focusing on
challenges such as the reuse of search strings and the
lack of reproducibility in automatic searches. From
these studies, it was possible to observe evidence of
the reproducibility problem in SE, at least in terms of
experimental results and secondary studies.
An analysis of tools to enhance reproducibility
in SE experiments is presented in Anchundia et al.
(2020). It also explores the role of community ac-
ceptance in adopting tools and practices. This study shaped our understanding of how research tools and practices can influence the achievement of reproducibility.
To better understand reproducibility challenges
across scientific fields, Samuel and König-Ries (2021) conducted a survey focusing on planning and
execution practices. The study examined the con-
cept of a reproducibility crisis, the application of
FAIR principles, and measures to ensure reproducibil-
ity. Key areas covered include Chemistry, Biology,
and Computer Science. The findings of Samuel and König-Ries (2021) highlight that reproducibility is-
sues are not limited to SE but represent a widespread
problem across multiple disciplines.
The perceptions resulting from the aforementioned studies influenced the definition of the scope of this study within the SE field.
3 RESEARCH METHODOLOGY
The survey described in this paper considers a
methodology based on goals, research questions, tar-
get audience, population, sampling, instrument and
evaluation, and data sharing. We structured the survey
according to the guidelines specific to SE, presented
in the literature (Linåker et al., 2015).
3.1 Goal and Research Questions
This study aims to identify the factors needed to achieve the reproducibility of SE experiment results, with the purpose of characterizing such factors with respect to the reproducibility crisis, experiment workflows and research practices, measures to ensure the reproducibility of results, the introduction of the FAIR data principles, and research practices for improving reproducibility, from the perspective of SE researchers and practitioners, in the context of the SE research community.
Therefore, the main research question that guided this study was: “How is reproducibility understood and applied in software engineering?”. To answer this question, we defined the following secondary research questions (SRQ):
SRQ1. What leads to a reproducibility crisis in
software engineering?
SRQ2. What are the different experiment work-
flows and research practices followed in software
engineering?
SRQ3. What are the current measures taken in
software engineering to ensure the reproducibility
of results?
SRQ4. Has the introduction of FAIR data princi-
ples influenced the research practices in software
engineering?
SRQ5. Which research practices could improve
reproducibility in software engineering?
SRQ1 assumes the existence of a reproducibility
crisis in SE, as suggested by studies such as Li (2021)
and Kitchenham et al. (2020). This assumption is
based on the observed lack of reproducibility efforts
and a limited number of reproductions in SE studies.
To explore this, researchers are asked about their per-
spectives on this potential crisis and the factors that
may contribute to reduced reproducibility.
SRQ2 investigates the workflows and practices
employed in SE research. These workflows involve
the management of artifacts, data, and storage de-
vices, which play a crucial role in scientific research
processes.
SRQ3 posits that specific measures can help en-
sure the reproducibility of experimental results in SE.
Examples include the ease of retrieving experimental
data and metadata, the ability to reproduce published
results, and the effective execution of reproductions.
These factors are considered critical in assessing the
assurance of reproducibility.
SRQ4 explores the role of the FAIR Data Princi-
ples in promoting reproducibility. It seeks to deter-
mine whether these principles are known and applied
by SE researchers. The assumption is that familiarity
with and application of the FAIR principles signifi-
cantly influences research practices in SE.
SRQ5 focuses on identifying research practices
that can improve reproducibility in SE. It assumes
that sharing data, metadata, and information about re-
search teams can contribute to enhanced reproducibil-
ity.
3.2 Target Audience and Population
This study considered researchers and practitioners in
the SE area who are knowledgeable about experimen-
tation.
3.3 Sampling
This section presents the profile of the 16 participants in this study. Details on the participants' areas of activity and study in SE are presented in Figure 2. Most of the participants (11) work as university professors. In addition to these professionals, students (undergraduate and graduate) and researchers (research group participants and leaders) also took part, as did a professional who works as a developer.
Figure 2: Software Engineering Participants' Areas (bar chart; x-axis: amount of participants; areas: Software Processes, Software Requirements, Software Architecture, Software Testing, Sociotechnical Systems, Software Reliability, Software Reuse, Software Project Management, Software Quality, and Other).
Figure 2 presents the SE areas considered by the
participants. Each participant was allowed to register
one or more areas of study. As the figure shows, some areas stand out, such as Software Processes, Software Testing, and Software Quality. Practitioners who investigate other areas not listed as options in the form also participated.
These professionals investigate problems addressed in
the areas of SE Experimentation (1), SE Education
(2), Human-Computer Interaction in SE context (1),
Search-Based SE (1), Human Aspects of SE (1), and
Software Accessibility.
3.4 Instrument and Evaluation
We adapted a questionnaire for this survey, with 33 questions related to the reproducibility of experiments in SE. The base questionnaire, considered in the adaptation, was used in the study of Samuel and König-Ries (2021). We built this instrument with Google Forms (https://www.google.com/intl/pt-BR/forms/about/). Initially, we contacted the participants and, after they agreed to participate, sent them the access link by email.

Figure 3: Factors related to poor reproducibility (bar chart; factors: lack of sufficient metadata regarding the experiment, lack of publicly available data, lack of complete information in the methods/standard operating procedures, poor experimental design, lack of information on the settings used in the original experiment, difficulty in understanding laboratory notebook records, pressure to publish, lack of knowledge or training on reproducible research practices, lack of time to follow reproducible research practices, and data privacy).
During the development of the instrument, we estimated the response time of the questions. We evaluated the instrument in a pilot project with three researchers who were not part of the sample. After the pilot project, no changes were suggested; thus, we kept the instrument as it was.
We presented the questions of the instrument in
different formats, such as multiple choice, selection
box, open box, and checkbox grid. To facilitate inter-
pretation, we prepared optional questions with short
statements.
3.5 Study Data Availability
Data from this paper is available at https://zenodo.org/
records/14888972.
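For readers who want to retrieve the package programmatically, the sketch below queries the public Zenodo REST API for this record and lists its files. The record identifier comes from the URL above; the response field names (metadata, files, key, links) follow the general Zenodo API as we understand it and are an assumption, not something documented in this paper.

# Sketch: list the files of the survey's Zenodo record via the public
# REST API. Field names are assumed from the general Zenodo API.
import requests

RECORD_ID = "14888972"  # from https://zenodo.org/records/14888972
resp = requests.get(f"https://zenodo.org/api/records/{RECORD_ID}", timeout=30)
resp.raise_for_status()
record = resp.json()

print(record["metadata"]["title"])
for f in record.get("files", []):
    # Each entry typically carries a file name and a download link.
    print(f["key"], f["links"]["self"])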
4 RESULTS
This section presents the results of our survey regard-
ing the five research questions from Section 3.1.
4.1 SRQ1 - Reproducibility Crisis
The reproducibility crisis refers to the growing belief
that the results of many scientific studies are difficult
or impossible to reproduce after further investigation,
either by independent researchers or by the original
researchers themselves (Kitchenham et al., 2020).
Fourteen participants registered a perception that this problem exists. These results reflect a perception previously observed in studies such as Li (2021) and Kitchenham et al. (2020). Considering the existence of reproducibility difficulties, Figure 3 presents possible factors associated with them.
Among the factors that may explain low repro-
ducibility and that were considered in this study, we
can mention the “lack of data that is publicly avail-
able for use”, registered by eleven participants.
4.2 SRQ2 - Experiment Workflows and
Research Practices
Experiment workflows and research practices can be
understood as guidelines and tools for conducting re-
search in SE.
Considering the locations used for storing experi-
mental data and metadata, Figures 4 and 5 present the
details, respectively.

Figure 4: Locations for storage of the experimental data files (bar chart; options: personal devices (e.g., computer), a local server provided at the workplace, removable storage devices (e.g., USB, hard disk, CD drive), version-controlled repositories (e.g., GitHub, GitLab, Figshare, Zenodo), data management platforms, and other).

Figure 4 shows that some of
the experimental data storage locations cited include personal devices and version-controlled repositories. Regarding metadata, Figure 5 presents the main current storage options, such as “handwritten lab notebooks”, “electronic notebooks”, and “data management platforms”. Six participants recorded that they write scripts and programs; four participants recorded that they do not.

Figure 5: Locations for storage of the experimental metadata (grouped bar chart; options: handwritten lab notebooks, electronic notebooks, data management platforms, and other; grouped by primary source, secondary source, and other).
In addition to the workflows and practices presented in Figures 4 and 5, participants also reported the importance of detailed documentation of the infrastructure used in the experiment, with the objective of maintaining traceability and facilitating reproducibility.
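As a concrete illustration of such infrastructure documentation, the sketch below records basic environment metadata alongside the experimental data. It is a minimal example of ours, assuming a JSON file is an acceptable format; it is not a tool reported by the participants.

# Sketch: capture basic infrastructure metadata for traceability.
# A minimal illustration, not a tool reported by the survey participants.
import datetime
import json
import platform

environment = {
    "captured_at": datetime.datetime.now(datetime.timezone.utc).isoformat(),
    "os": platform.platform(),            # operating system and kernel
    "machine": platform.machine(),        # hardware architecture
    "python_version": platform.python_version(),
}

# Store the record next to the experimental data for later reproduction.
with open("experiment_environment.json", "w", encoding="utf-8") as fh:
    json.dump(environment, fh, indent=2)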
4.3 SRQ3 - Measures to Ensure
Reproducibility of Results
Initially, there is a supposition that different actions can be taken to ensure reproducibility. Figures 6 and 7 present details related to possible actions. Figure 6 presents, for different data sets, the ease of retrieving experimental data in the context of a researcher who participated in the planning and execution of an experiment.

Figure 6: Ease of finding all the experimental data (ratings from very easy to very difficult for input data, metadata about the methods, metadata about the steps, metadata about the experimental setup, and results).

Considering the context
of a new participant in a research team without any
instruction, Figure 7 presents the results of the eval-
uation of the ease of retrieval of experimental data.
Regarding the reproduction of published results from others, it was found that eight participants reported that they “never tried to reproduce other published results”. Despite possible difficulties with the reproduction of published studies, most participants did not experience problems related to the reproduction of their own studies. Only two participants reported having been contacted about reproducing published studies.

Figure 7: Ease with which a newcomer to a research team could obtain all the experimental data without any instructions (ratings from very easy to very difficult for input data, metadata about the methods, metadata about the steps, metadata about the experimental setup, and results).
Considering the reproduction of their own experiments to verify results, eight participants reported that they sometimes perform such reproductions. Five participants reported that they do not consider this activity.
4.4 SRQ4 - FAIR Data Principles
The FAIR principles can be applied to artifacts in general, data sets being one example. The acronym FAIR stands for Findable, Accessible, Interoperable, and Reusable. Twelve participants are aware of the FAIR principles. Although this knowledge is positive, it was also observed that six participants had heard about these principles but did not know what they meant. Regarding the application of the principles, Figure 8 presents the details.
Figure 8: Application of the FAIR principles in the research (frequency from always to never for the Findable, Accessible, Interoperable, and Reusable principles).
Considering the Findable, Accessible, Interoperable, and Reusable principles, it can be seen that some principles are applied more frequently than others.
4.5 SRQ5 - Research Practices to
Improve Reproducibility
Initially, there is a premise that different factors are important for understanding a scientific experiment in SE and thus enabling reproducibility. Figures 9, 10, 11, and 12 present details of the sharing of data and metadata, and of the knowledge of the research team involved, as potential factors that contribute to reproducibility.
Figure 9: Sharing of experimental data.
Regarding the sharing of experimental data, Figure 9 shows that participants classified as “absolutely essential” the sharing of “raw data”, “processed data”, “negative results”, “measurements”, and “scripts/code/programs”.
Figure 10: Sharing of metadata regarding settings (importance ratings from not important at all to absolutely essential for instrument settings, experiment environment conditions, and publications used).
Figure 10 presents the evaluations of the sharing of metadata related to experiment settings. The majority of participants also evaluated as “absolutely essential” the sharing of “instrument settings”, “experiment environment conditions”, and “publications used”.
Figure 11 presents results on the sharing of metadata related to the steps and plans of the experiment. The sharing of metadata was considered “absolutely essential” in terms of “methods”, “activities/steps”, “order of activities/steps”, “validation methods”, and “quality control methods”.
Figure 11: Sharing of metadata regarding all the steps and plans (importance ratings for laboratory protocols, methods, activities/steps, order of activities/steps, validation methods, and quality control methods).
Figure 12: Sharing of the intermediate and final results of each step of the experiment (importance ratings from not important at all to absolutely essential for final results and intermediate results).
Regarding the sharing of experiment results, Figure 12 presents different evaluations. The final results were evaluated as “absolutely essential”, while the intermediate results were evaluated as of “average importance”.
In addition to sharing experimental data and meta-
data and knowledge about the research team (Figures
9, 10, 11, 12), participants also reported the impor-
tance of sharing detailed descriptions of experiment
limitations, as well as justifications for methodologi-
cal choices, previous versions of scripts and data for
traceability, cross-validation reports, and third-party
reanalysis.
5 DISCUSSION OF RESULTS
This section discusses the results presented in Section
4.
5.1 Reproducibility Crisis
Considering the results presented in Section 4, an apparent reproducibility crisis in SE is observed. Different factors may explain the difficulty of reproducing an experiment, among them the lack of publicly available data, the lack of complete information on the methods employed, and the lack of information on the settings used in the experiment.
5.2 Experiment Workflows and
Research Practices
Different kinds of artifacts can be considered in experiments, such as scripts, diagrams, and code. Different types of data are also considered, such as tabular data, measurements, metrics, and graphs. As for data storage locations, personal devices, version control repositories, and institutional repositories were mentioned.
5.3 Measures to Ensure Reproducibility
of Results
In general, the researchers responsible for the experiment considered it easy to recover experimental data, both input data and results, as well as the metadata about the methods, steps, and experimental setup. When data retrieval is performed by a novice researcher without any instructions, however, it was observed that obtaining metadata about methods, steps, and experimental setup is difficult.
Regarding the reproduction of experiments to ver-
ify results, we observed that the majority of partici-
pants reported that they did not perform this activity,
which is a concerning factor.
5.4 FAIR Data Principles
Considering knowledge about the FAIR principles, we found that a considerable number of participants know of the principles, even if they do not know exactly what they mean. Regarding application, we observed that some principles are applied more frequently than others. This is the case for the Findable and Accessible principles, which are essential characteristics of effective research data.
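As an illustration of what applying these principles can look like in practice, the sketch below builds a minimal machine-readable metadata record for an experimental data set. The field names are our assumptions, loosely inspired by common repository metadata; they are not a standard mandated by FAIR, and the identifier and URL are placeholders.

# Sketch: a minimal metadata record oriented toward the FAIR principles.
# Field names, DOI, and URL are illustrative placeholders, not a standard.
import json

record = {
    # Findable: a persistent identifier and a rich description
    "identifier": "doi:10.5281/zenodo.0000000",   # placeholder DOI
    "title": "Controlled experiment on code review techniques",
    "keywords": ["software engineering", "controlled experiment"],
    # Accessible: where and how the data can be retrieved
    "access_url": "https://example.org/dataset",  # hypothetical URL
    "access_protocol": "HTTPS",
    # Interoperable: open, well-defined formats
    "formats": ["text/csv", "application/json"],
    # Reusable: license and provenance for confident reuse
    "license": "CC-BY-4.0",
    "provenance": "Collected in a lab session; see experiment protocol",
}

print(json.dumps(record, indent=2))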
5.5 Research Practices to Improve
Reproducibility
The sharing of experimental data is considered absolutely essential by participants. Regarding the sharing of metadata about materials, settings, time, duration, location, steps, and software used in the experiment, the participants also considered it absolutely essential. For the sharing of results, intermediate results were evaluated as of average importance, whereas the sharing of final results is considered absolutely essential.
6 VALIDITY EVALUATION
In this section, we discuss the main threats to the
validity of our survey based on the guidelines by
Linåker et al. (2015) regarding face, content, crite-
rion, and construct. To ensure face validity, the sur-
vey form was reviewed by three researchers during a
pilot project. We focused on achieving content va-
lidity by conducting an unstructured interview with
researchers experienced in experimentation in SE to
review the questionnaire.
To address criterion validity, we organized the
questionnaire into distinct sections, each correspond-
ing to a specific research question. To assess con-
struct validity, we conducted activities during the in-
strument pilot project, the interviews, and the litera-
ture review.
7 PROSPECTIVE ACTIONS
The observed results and respective discussions
present clear evidence that reproducibility must be ad-
dressed in prospective investigations. Therefore, we propose some actions to be taken regarding the topic discussed.
Different data and metadata storage options should be analyzed, assessing their advantages and disadvantages. These might include personal devices, version-controlled repositories, local servers, data management platforms, and electronic or physical lab notebooks. This analysis can aid in identifying best practices for ensuring data accessibility and integrity, facilitating the reproducibility of experiments.
Creating frameworks for sharing data and metadata, including data repositories, tools for metadata management, and best-practice guidelines, might encourage data sharing. Such frameworks can help to overcome the lack of available data and of complete information on methods and settings, which were identified as the main obstacles to reproducibility.
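The sketch below illustrates one possible component of such a framework: a completeness check that warns when a lab package lacks the items the survey participants rated as absolutely essential to share. The function name and the expected package layout are assumptions of ours, not a published tool.

# Sketch: check that a lab package contains the items the survey
# participants rated as essential to share. The expected layout is
# an assumption of ours, not a published framework.
from pathlib import Path

ESSENTIAL_ITEMS = [
    "raw_data",        # raw experimental data
    "processed_data",  # processed data and measurements
    "scripts",         # scripts/code/programs used
    "metadata.json",   # settings, environment, steps, team info
    "results",         # final (and ideally intermediate) results
]

def check_lab_package(package_dir: str) -> list[str]:
    """Return the essential items missing from the package directory."""
    root = Path(package_dir)
    return [item for item in ESSENTIAL_ITEMS if not (root / item).exists()]

missing = check_lab_package("experiment_package")
if missing:
    print("Package incomplete; missing:", ", ".join(missing))
else:
    print("All essential items present.")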
8 FINAL REMARKS
Based on the results presented and discussed in the
paper, it is clear that there is a need to address the is-
sue of reproducibility in future research in the field
of SE. This study corroborated an apparent repro-
ducibility crisis in the field, caused mainly by the lack
of public data and incomplete information about the
methods used and the experiment settings. This crisis not only makes it difficult to verify research results but also hinders the expansion of knowledge in the field.
We also identified several factors that influence re-
producibility in SE experiments, such as the different
types of artifacts used, the types of data considered,
and the storage locations of these data and metadata.
Our survey emphasized the importance of sharing raw
and processed data, negative results, measurements,
scripts/code/programs, as well as metadata related to
materials and settings.
We acknowledge essential points to be investi-
gated for future actions, such as analysis of the ad-
vantages and disadvantages of the various forms of
data and metadata storage; and development of frame-
works to facilitate the sharing of data and metadata
and promote reproducibility in SE experiments.
ACKNOWLEDGMENTS
The authors thank the participants for their collab-
oration with the research. Edson OliveiraJr thanks
CNPq/Brazil Grant #311503/2022-5.
REFERENCES
Anchundia, C. E. et al. (2020). Resources for reproducibil-
ity of experiments in empirical software engineering:
Topics derived from a secondary study. IEEE Access,
pages 8992–9004.
Juristo, N. and Vegas, S. (2010). Replication, reproduction
and re-analysis: Three ways for verifying experimen-
tal findings. In RESER.
Kitchenham, B., Madeyski, L., and Brereton, P. (2020).
Meta-analysis for families of experiments in software
engineering: a systematic review and reproducibility
and validity assessment. EMSE, 25:353–401.
Li, Z. (2021). Stop building castles on a swamp! the cri-
sis of reproducing automatic search in evidence-based
software engineering. In ICSE-NIER, pages 16–20.
IEEE.
Linåker, J., Sulaman, S. M., Maiani de Mello, R., and Höst, M. (2015). Guidelines for conducting surveys in software engineering.
Liu, C., Gao, C., Xia, X., Lo, D., Grundy, J., and Yang,
X. (2021). On the reproducibility and replicability
of deep learning in software engineering. TOSEM,
31(1):1–46.
Samuel, S. and König-Ries, B. (2021). Understanding ex-
periments and research practices for reproducibility:
an exploratory study. PeerJ, 9:e11140.
Solari, M., Vegas, S., and Juristo, N. (2018). Content and
structure of laboratory packages for software engi-
neering experiments. IST, 97:64–79.
Wohlin, C., Runeson, P., Höst, M., Ohlsson, M. C., Regnell, B., and Wesslén, A. (2012). Experimentation in
software engineering. Springer.