something that is, in itself coded (Cruzes and Dyba,
2011) (Ellsberg and Heise, 2005).
This experiment will provide us inputs to
improve CVM-TD, resulting a new release of the
contextualized vocabulary. We also intend to
develop new features in eXcomment. This feature is
associated with the new vocabulary to quickly
support the interpretation of comments (7).
After this study, we planned to perform a controlled
experiment (8). In this experiment, besides evaluating
the new release of the contextualized vocabulary and
tool, we expect to compare the overall accuracy when
classifying candidate comments between two groups,
one using the tool to analyze comments and another
one without the tool.
In FindTD V (9), we expected to perform an
exploratory study in the software industry. In this
study, we purpose to compare patterns and TD items
identified into open source code and closed code
developed in an industrial environment.
The last one is the FindTD VI (10). We intend to
compare our approaches to different tools that use
metrics extracted from the source code to identify
TD items.
Our methodology might set some limitations on
what can be experimented. The first considers the
power of the proposed vocabulary. It is possible that
the set of terms and combinations used by our model
and vocabulary are simply too many to be studied.
An alternative would be to limit the studies to a very
specific context and software. Other risk involves
the effort to carry out all studies because of the
difficulty of performing experiments in this area.
5 EXPECTED OUTCOME
In the context of our empirical investigation, we are
interested in findings that help us to comprehend
how code comments analysis can support the
identification and management of different TD
types, considering the developers’ point of view.
We hope to develop a rich contextualized
vocabulary and a tool to support the TD
identification through comment analysis. We believe
this approach can improve methods of identifying
and classifying TD items, analyzing code comments.
6 STAGE OF THE RESEARCH
In accordance with proposed methodology described
in Section 4 and shown in Figure 1, we purpose a
systematic mapping study and a family of
experiment in order to discuss our goals. Figure 1 is
broken down into two parts. The tasks that have
already performed were organized on top of the
figure (part A), and the tasks that will be performed
in the future were organized on bottom of the figure
(part B).
In this sense, we have: (i) performed the
systematic mapping study, (ii) developed a
Contextualized Vocabulary Model, (iii) performed
the first exploratory study and its analysis, (iv)
performed the first controlled experiment and its
analysis, and (v) designed the FindTD III and carried
out the experiment. Currently, we are analyzing data
from this study, using qualitative methods.
REFERENCES
Alves, N.S.R. et al., 2016. Identification and Management
of Technical Debt: A Systematic Mapping Study.
Information and Software Technology, 70, pp.100–
121.
Alves, N.S.R. et al., 2014. Towards an Ontology of Terms
on Technical Debt. In Sixth International Workshop
on Managing Technical Debt (MTD). pp. 1–7.
Basili, V.R., Shull, F. and Lanubile, F., 1999. Building
knowledge through families of experiments. IEEE
Transactions on Software Engineering, 25(4), pp.456–
473.
Campbell, D.T. and Fiske, D.W., 1959. Convergent and
discriminant validation by the multitrait-multimethod
matrix. Psychological Bulletin, 56(2), pp.81–105.
Cruzes, D.S. and Dyba, T., 2011. Recommended Steps for
Thematic Synthesis in Software Engineering. 2011
International Symposium on Empirical Software
Engineering and Measurement, (7491), pp.275–284.
Davis, C.G. and Bowen, L.L., 2001. The language of
comments in computer software : A sublanguage of
English. , 166(00), pp.1731–1756.
Ellsberg, M. and Heise, L., 2005. Researching Violence
Against Women. A PRACTICAL GUIDE FOR
RESEARCHERS AND ACTIVISTS, Washington:
World Health.
Farias, M. et al., 2015. A Contextualized Vocabulary
Model for Identifying Technical Debt on Code
Comments. In Seventh International Workshop on
Managing Technical Debt. pp. 25–32.
Farias, M.A. de F., Novais, R., et al., 2016. A Systematic
Mapping Study on Mining Software Repositories. In
ACM SAC.
Farias, M.A. de F., Silva, A.B., et al., 2016. Investigating
the Use of a Contextualized Vocabulary in the
Identification of Technical Debt : A Controlled
Experiment. In 18Th International Conference on
Enterprise Information System - ICEIS (Accepted).
Freitas, J.L., Da Cruz, D. and Henriques, P.R., 2012. A
comment analysis approach for program