Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering

Bianca Vieira, Rogério Garcia

2024

Abstract

The software repositories store data and metadata about the project development, including commits, which record user modifications to projects and their metadata, such as the user responsible for the commit, date, time, and others. The programmer can register a comment to inform the modification content, its purpose, requester, motivation, and useful data. Focusing on those comments, this paper proposes using comments to group the commits and construct a sentiment analysis regarding the messages. The main purpose is to analyze those messages, both by the groups and the sentiments expressed, to understand them (what sort of sentiment they express). Opinions are central to almost all human activities and are key influences on our behaviors. Beliefs, perceptions of reality, and choices made are conditioned upon sentiments. Therefore, understanding how the developers, especially programmers, feel about a task might be useful in analyzing progress and interaction among people and artifacts (source code). In this paper, we present initial analyses of data and metadata from the twenty most popular software repositories, written in five popular programming languages. We stated five research questions and answered them, pointing out further investigations.

Download


Paper Citation


in Harvard Style

Vieira B. and Garcia R. (2024). Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering. In Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS; ISBN 978-989-758-692-7, SciTePress, pages 297-304. DOI: 10.5220/0012633400003690


in Bibtex Style

@conference{iceis24,
author={Bianca Vieira and Rogério Garcia},
title={Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering},
booktitle={Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2024},
pages={297-304},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012633400003690},
isbn={978-989-758-692-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 26th International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - Exploring Popular Software Repositories: A Study on Sentiment Analysis and Commit Clustering
SN - 978-989-758-692-7
AU - Vieira B.
AU - Garcia R.
PY - 2024
SP - 297
EP - 304
DO - 10.5220/0012633400003690
PB - SciTePress