
future research initiatives are dedicated to the refine-
ment of GPT-4’s capabilities, encompassing strategies
such as fine-tuning and few-shot learning. Our ulti-
mate goal is the reduction of time complexity while
upholding the precision of the generated goal models.
REFERENCES
Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent
dirichlet allocation. Journal of machine Learning re-
search, 3(Jan):993–1022.
Cámara, J., Troya, J., Burgueño, L., and Vallecillo, A.
(2023). On the assessment of generative ai in model-
ing tasks: an experience report with chatgpt and uml.
Software and Systems Modeling, pages 1–13.
Chen, B., Chen, K., Hassani, S., Yang, Y., Amyot, D.,
Lessard, L., Mussbacher, G., Sabetzadeh, M., and
Varró, D. (2023). On the use of GPT-4 for creating
goal models: An exploratory study. In 2023 IEEE
31st International Requirements Engineering Confer-
ence Workshops (REW), pages 262–271. IEEE.
Chen, N., Lin, J., Hoi, S. C., Xiao, X., and Zhang, B.
(2014). AR-miner: mining informative reviews for
developers from mobile app marketplace. In Proc. of
the 36th International Conference on Software Engi-
neering, pages 767–778. ACM.
D ˛abrowski, J., Letier, E., Perini, A., and Susi, A. (2023).
Mining and searching app reviews for requirements
engineering: Evaluation and replication studies. In-
formation Systems, page 102181.
Devine, P., Tizard, J., Wang, H., Koh, Y. S., and Blin-
coe, K. (2022). What’s inside a cluster of software
user feedback: A study of characterisation methods.
In 2022 IEEE 30th International Requirements Engi-
neering Conference (RE), pages 189–200. IEEE.
Ding, S. and Ito, T. (2023). Self-agreement: A frame-
work for fine-tuning language models to find agree-
ment among diverse opinions. arXiv preprint
arXiv:2305.11460.
Dragoni, M., Federici, M., and Rexha, A. (2019). An un-
supervised aspect extraction strategy for monitoring
real-time reviews stream. Information processing &
management, 56(3):1103–1118.
Guzman, E. and Maalej, W. (2014). How do users like this
feature? a fine grained sentiment analysis of app re-
views. In 2014 IEEE 22nd international requirements
engineering conference (RE), pages 153–162.
Hajjem, M. and Latiri, C. (2017). Combining ir and lda
topic modeling for filtering microblogs. Procedia
Computer Science, 112:761–770.
Hofmann, H. F. and Lehner, F. (2001). Requirements engi-
neering as a success factor in software projects. IEEE
software, 18(4):58–66.
Jiang, H., Zhang, J., Li, X., Ren, Z., Lo, D., Wu, X.,
and Luo, Z. (2019). Recommending new features
from mobile app descriptions. ACM Transactions
on Software Engineering and Methodology (TOSEM),
28(4):1–29.
Johann, T., Stanik, C., Maalej, W., et al. (2017). SAFE: A
simple approach for feature extraction from app de-
scriptions and app reviews. In Proc. of the 2017 IEEE
25th international requirements engineering confer-
ence (RE), pages 21–30. IEEE.
Licorish, S. A., Tahir, A., Bosu, M. F., and MacDonell,
S. G. (2015). On satisfying the android os commu-
nity: User feedback still central to developers’ portfo-
lios. In Proc. of the 2015 24th Australasian Software
Engineering Conference, pages 78–87. IEEE.
Ma, S., Wang, S., Lo, D., Deng, R. H., and Sun, C. (2015).
Active semi-supervised approach for checking app be-
havior against its description. In Proc. of the 2015
IEEE 39Th annual computer software and applica-
tions conference, volume 2, pages 179–184. IEEE.
Maalej, W. and Pagano, D. (2011). On the socialness of
software. In Proc. of the 2011 IEEE Ninth Interna-
tional Conference on Dependable, Autonomic and Se-
cure Computing, pages 864–871. IEEE.
Malik, H., Shakshuki, E. M., and Yoo, W.-S. (2020). Com-
paring mobile apps by identifying ‘hot’features. Fu-
ture Generation Computer Systems, 107:659–669.
Nakagawa, H. and Honiden, S. (2023). MAPE-K loop-
based goal model generation using generative ai. In
13th International Workshop on Model-Driven Re-
quirements Engineering (MoDRE). IEEE.
OpenAI (2023). Chatgpt. https://chat.openai.com/
chat.
Oriol, M., Stade, M., Fotrousi, F., Nadal, S., Varga, J., Seyff,
N., Abello, A., Franch, X., Marco, J., and Schmidt, O.
(2018). FAME: supporting continuous requirements
elicitation by combining user feedback and monitor-
ing. In Proc. of the 2018 ieee 26th international re-
quirements engineering conference (re), pages 217–
227. IEEE.
Pagano, D. and Maalej, W. (2013). User feedback in
the appstore: An empirical study. In Proc. of the
2013 21st IEEE international requirements engineer-
ing conference (RE), pages 125–134. IEEE.
Papadimitriou, C. H., Raghavan, P., Tamaki, H., and Vem-
pala, S. (2000). Latent semantic indexing: A prob-
abilistic analysis. Journal of Computer and System
Sciences, 61(2):217–235.
Ren, S., Nakagawa, H., and Tsuchiya, T. (2022). Goal
model structuring based on semantic correlation of
user reviews. Intelligent Decision Technologies,
16(4):737–748.
Seyff, N., Graf, F., and Maiden, N. (2010). Using mobile
re tools to give end-users their own voice. In Proc. of
the 2010 18th IEEE International Requirements Engi-
neering Conference, pages 37–46. IEEE.
Szmrecsanyi, B. (2012). Grammatical variation in British
English dialects: A study in corpus-based dialectom-
etry. Cambridge University Press.
Zhong, X., ShafieiBavani, E., and Jimeno Yepes, A. (2020).
Image-based table recognition: data, model, and eval-
uation. In Proc. of the European Conference on Com-
puter Vision, pages 564–580. Springer.
Zowghi, D. (2018). "Affects" of user involvement in soft-
ware development. In Proc. of the 2018 1st Interna-
tional Workshop on Affective Computing for Require-
ments Engineering (AffectRE), pages 13–13. IEEE.
ICAART 2024 - 16th International Conference on Agents and Artificial Intelligence
392