ACKNOWLEDGEMENT
This research was supported in part by the Euro-
pean Union, co-financed by the European Social Fund
(EFOP-3.6.3-VEKOP-16-2017-00002). The Min-
istry of Human Capacities, Hungary grant 20391-
3/2018/FEKUSTRAT is acknowledged.
REFERENCES
Antoniol, G., Canfora, G., Casazza, G., De Lucia, A., and
Merlo, E. (2002). Recovering traceability links be-
tween code and documentation. IEEE Transactions
on Software Engineering, 28(10):970–983.
Baltes, S., Treude, C., and Diehl, S. (2019). SOTorrent:
Studying the Origin, Evolution, and Usage of Stack
Overflow Code Snippets. In MSR ’19 Proceedings of
the 16th International Conference on Mining Software
Repositories.
Barua, A., Thomas, S. W., and Hassan, A. E. (2014). What
are developers talking about? An analysis of topics
and trends in Stack Overflow. Empirical Software En-
gineering.
Bazelli, B., Hindle, A., and Stroulia, E. (2013). On the
Personality Traits of StackOverflow Users. In 2013
IEEE International Conference on Software Mainte-
nance, pages 460–463.
Beyer, S., Macho, C., Pinzger, M., and Di Penta, M. (2018).
Automatically classifying posts into question cate-
gories on stack overflow. In Proc. of the 26th Con-
ference on Program Comprehension, pages 211–221.
Csuvik, V., Kicsi, A., and Vidács, L. (2019). Source code
level word embeddings in aiding semantic test-to-code
traceability. In 10th International Workshop at the
41st International Conference on Software Engineer-
ing (ICSE) – SST 2019. IEEE.
Dai, A. M., Olah, C., and Le, Q. V. (2015). Document Em-
bedding with Paragraph Vectors.
Deerwester, S., Dumais, S., and Landauer, T. (1990). Index-
ing by latent semantic analysis. Journal of the Amer-
ican Society for Information Science and Technology,
41(6):391–407.
DeFronzo, R. A., Lewin, A., Patel, S., Liu, D., Kaste, R.,
Woerle, H. J., and Broedl, U. C. (2015). Combination
of empagliflozin and linagliptin as second-line ther-
apy in subjects with type 2 diabetes inadequately con-
trolled on metformin. Diabetes Care, 38(3):384–393.
Ginsca, A. L. and Popescu, A. (2013). User profiling
for answer quality assessment in Q&A communities.
In Proc. of the 2103 workshop on Data-driven user
behavioral modelling and mining from social media,
pages 25–28.
Guo, J., Cheng, J., and Cleland-Huang, J. (2017). Se-
mantically Enhanced Software Traceability Using
Deep Learning Techniques. In Proceedings - 2017
IEEE/ACM 39th International Conference on Soft-
ware Engineering, ICSE 2017, pages 3–14. IEEE.
Kaushik, N., Tahvildari, L., and Moore, M. (2011). Recon-
structing Traceability between Bugs and Test Cases:
An Experimental Study. In 2011 18th Working
Conference on Reverse Engineering, pages 411–414.
IEEE.
Kicsi, A., Tóth, L., and Vidács, L. (2018). Exploring the
benefits of utilizing conceptual information in test-to-
code traceability. Proceedings of the 6th International
Workshop on Realizing Artificial Intelligence Syner-
gies in Software Engineering, pages 8–14.
Kicsi, A., Vidács, L., Beszédes, A., Kocsis, F., and Kovács,
I. (2017). Information retrieval based feature analysis
for product line adoption in 4gl systems. In Proceed-
ins of the 17th International Conference on Compu-
tational Science and Its Applications – ICCSA 2017,
pages 1–6. IEEE.
Le, Q. V. and Mikolov, T. (2014). Distributed Representa-
tions of Sentences and Documents.
Marcus, A., Maletic, J. I., and Sergeyev, A. (2005). Re-
covery of Traceability Links between Software Doc-
umentation and Source Code. International Journal
of Software Engineering and Knowledge Engineering,
pages 811–836.
Mathieu, N. and Hamou-Lhadj, A. (2018). Word embed-
dings for the software engineering domain. Proceed-
ings of the 15th International Conference on Mining
Software Repositories - MSR ’18, pages 38–41.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., and Dean,
J. (2013). Distributed Representations of Words and
Phrases and their Compositionality. NIPS’13 Pro-
ceedings of the 26th International Conference on Neu-
ral Information Processing Systems, 2:3111–3119.
Nasehi, S. M., Sillito, J., Maurer, F., and Burns, C. (2012).
What makes a good code example?: A study of pro-
gramming q a in stackoverflow. In 2012 28th IEEE
International Conference on Software Maintenance
(ICSM), pages 25–34.
Nguyen, T. D., Nguyen, A. T., Phan, H. D., and Nguyen,
T. N. (2017). Exploring API embedding for API
usages and applications. In Proceedings - 2017
IEEE/ACM 39th International Conference on Soft-
ware Engineering, ICSE 2017, pages 438–449. IEEE.
Philipp Bouillon, Jens Krinke, Nils Meyer, F. S. (2007).
EzUnit: A Framework for Associating Failed Unit
Tests with Potential Programming Errors. In Agile
Processes in Software Engineering and Extreme Pro-
gramming, volume 4536, pages 101–104. Springer
Berlin Heidelberg.
Qusef, A., Bavota, G., Oliveto, R., De Lucia, A., and Bink-
ley, D. (2011). SCOTCH: Test-to-code traceability us-
ing slicing and conceptual coupling. In IEEE Interna-
tional Conference on Software Maintenance, ICSM,
pages 63–72. IEEE.
Qusef, A., Bavota, G., Oliveto, R., De Lucia, A., and Bink-
ley, D. (2014). Recovering test-to-code traceability
using slicing and textual analysis. Journal of Systems
and Software, 88:147–168.
Rehurek, R. and Sojka, P. (2010). Software Framework for
Topic Modelling with Large Corpora. Proceedings of
the LREC 2010 Workshop on New Challenges for NLP
Frameworks, pages 45–50.
Exploration and Mining of Source Code Level Traceability Links on Stack Overflow
345