ral language is well known. While our precision and
recall for identifying the role, objective, and benefits
are high, we do not know if the information we wish
to extract is advantageous. Therefore, we believe a
larger-scale evaluation with practitioners may be re-
quired to determine the subjectivity of our algorithm.
Internal Validity. This category of threat in related to
experimental design. We manually retrieved the user
story role, objective, and benefits components from
these datasets. Since we utilised numerous human ex-
tractors, we feel that human error is unlikely to have
led to inaccurate analysis.
6 CONCLUSION
We propose an automated approach for extracting
roles, goals, and benefits from user stories and vi-
sualizing them as knowledge graphs. Our approach
leverages NLP techniques and Neo4j’s querying ca-
pabilities, increasing interactiveness and facilitating
communication between stakeholders. Evaluation of
our approach on 22 user story datasets showed 100%
recall and average precision of > 96% for extracting
parts. A user study with eight participants confirmed
the usefulness of the graph model but suggested a
more user-friendly GUI. Future work includes de-
veloping a more user-friendly interface, adding more
story properties, and conducting further user studies.
ACKNOWLEDGEMENTS.
Ladeinde, Kanij and Grundy are supported by ARC
Laureate Fellowship FL190100035.
REFERENCES
Abrahamsson, P., Fronza, I., Moser, R., Vlasenko, J., and
Pedrycz, W. (2011). Predicting development effort
from user stories. In 2011 International Sympo-
sium on Empirical Software Engineering and Mea-
surement. IEEE.
Abualhaija, S., Arora, C., Sabetzadeh, M., Briand, L. C.,
and Traynor, M. (2020). Automated demarcation
of requirements in textual specifications: a machine
learning-based approach. Empirical Software Engi-
neering, 25(6):5454–5497.
Ali, R., Dalpiaz, F., and Giorgini, P. (2010). A goal-based
framework for contextual requirements modeling and
analysis. Requirements Engineering, 15(4):439–458.
Arora, C., Sabetzadeh, M., Briand, L., and Zimmer, F.
(2016). Extracting domain models from natural-
language requirements: approach and industrial eval-
uation. In Conference on Model Driven Engineering
Languages and Systems.
Arora, C., Sabetzadeh, M., Nejati, S., and Briand, L. (2019).
An active learning approach for improving the accu-
racy of automated domain model extraction. ACM
TOSEM, 28(1):1–34.
Casamayor, A., Godoy, D., and Campo, M. (2012). Min-
ing textual requirements to assist architectural soft-
ware design: a state of the art review. Artificial In-
telligence Review, 38(3):173–191.
Cohn, M. (2004). User stories applied: For agile software
development. Addison-Wesley Professional.
Dalpiaz, F. (2018). Requirements data sets (user stories).
Mendeley Data, v1.
Dalpiaz, F., Van Der Schalk, I., Brinkkemper, S., Aydemir,
F. B., and Lucassen, G. (2019). Detecting terminolog-
ical ambiguity in user stories: Tool and experimenta-
tion. Information and Software Technology, 110:3–16.
Elallaoui, M., Nafil, K., and Touahni, R. (2018). Automatic
transformation of user stories into uml use case dia-
grams using nlp techniques. Procedia computer sci-
ence, 130:42–49.
G
¨
unes¸, T. and Aydemir, F. B. (2020). Automated goal model
extraction from user stories using nlp. In 28th Re-
quirements Engineering Conference (RE). IEEE.
Horkoff, J., Aydemir, F. B., Cardoso, E., Li, T., Mat
´
e, A.,
Paja, E., Salnitri, M., Piras, L., Mylopoulos, J., and
Giorgini, P. (2019). Goal-oriented requirements engi-
neering: an extended systematic mapping study. Re-
quirements engineering, 24(2):133–160.
Lu, H., Hong, Z., and Shi, M. (2017). Analysis of film
data based on neo4j. In 2017 IEEE/ACIS 16th In-
ternational Conference on Computer and Information
Science (ICIS), pages 675–677. IEEE.
Lucassen, G., Dalpiaz, F., van der Werf, J. M. E., and
Brinkkemper, S. (2016). The use and effectiveness
of user stories in practice. In International working
conference on requirements engineering: Foundation
for software quality, pages 205–222. Springer.
Lucassen, G., Robeer, M., Dalpiaz, F., Van Der Werf, J.
M. E., and Brinkkemper, S. (2017). Extracting con-
ceptual models from user stories with visual narrator.
Requirements Engineering, 22(3):339–358.
Mich, L., Mariangela, F., and Pierluigi, N. I. (2004). Mar-
ket research for requirements analysis using linguis-
tic tools. Requirements Engineering Journal (RE J),
9(1):40–56.
Nayak, A., Kesri, V., and Dubey, R. K. (2020). Knowledge
graph based automated generation of test cases in soft-
ware engineering. In 7th ACM IKDD CoDS and 25th
COMAD. ACM IKDD CoDS.
Negro, A. (2021). Graph-powered machine learning. Si-
mon and Schuster.
Neo4j (2012). Neo4j - the world’s leading graph database.
Nguyen, T. H., Grundy, J., and Almorsy, M. (2015). Rule-
based extraction of goal-use case models from text. In
Joint Meeting on Found. of Software Engineering.
Pohl, K. (2010). Requirements Engineering. Springer.
Extracting Queryable Knowledge Graphs from User Stories: An Empirical Evaluation
691