The Lifecycle of Data Clumps: A Longitudinal Case Study in Open-Source Projects

Nils Baumgartner; Elke Pulvermüller

Topics: Model Quality Assessment; Model-Based Software Development; Software and Systems Engineering

In Proceedings of the 12th International Conference on Model-Based Software and Systems Engineering MODELSWARD - Volume 1, 15-26, 2024, Rome, Italy

Authors: Nils Baumgartner and Elke Pulvermüller

Affiliation: Research Group Software Engineering, Institute of Computer Science, Department of Mathematics and Computer Science, University of Osnabrück, Osnabrück, Germany

Keyword(s): Design Smell, Code Smell Dataset, Class Diagram, Data Clumps, Code Analysis, Reporting Format.

Abstract: This study explores the characteristics of data clumps, a specific type of code smell, in software projects. Code smells are characteristics in source code that indicate a deeper problem. Data clumps are identical groups of variables in different parts of the code. The lack of datasets for data clumps can make it difficult to identify and manage these sets in software projects. We developed a tool to parse source code projects into an abstract syntax tree, facilitating detailed analysis of data clumps. Our findings reveal a notable presence of data clumps forming clusters, complicating manual refactoring. In this paper, we propose a unified reporting format for data clump detection and provide a granular dataset for data clumps. Additionally, we outline a detection methodology that can be applied across different programming languages and frameworks. We also provide a first look into the lifecycle and evolution of data clumps, showing that data clumps either remain in projects or accumulate over time. This work provides a foundation for further research aimed at enhancing software quality through identifying and refactoring data clumps, offering a starting point for discussions and improvements in this domain.
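
Illustration: the abstract defines data clumps as identical groups of variables recurring in different parts of the code and mentions analysis over an abstract syntax tree. The following is a minimal sketch of that idea, not the authors' tool or reporting format. It assumes Python source, considers only function parameters, and uses a hypothetical threshold of three shared names (`min_size`); the function name `find_parameter_clumps` and the sample code in `SOURCE` are invented for this example.

```python
import ast
from itertools import combinations

# Hypothetical sample code: two functions repeat the same group of
# parameters (first_name, last_name, email), a third one does not.
SOURCE = '''
def create_user(first_name, last_name, email, age):
    pass

def update_user(user_id, first_name, last_name, email):
    pass

def log_event(message, level):
    pass
'''


def find_parameter_clumps(source, min_size=3):
    """Return (func_a, func_b, shared_names) for every pair of functions
    that share at least `min_size` parameter names.

    `min_size=3` is an assumed threshold for this sketch.
    """
    tree = ast.parse(source)
    # Map each function name to the set of its positional parameter names.
    params = {
        node.name: {arg.arg for arg in node.args.args}
        for node in ast.walk(tree)
        if isinstance(node, ast.FunctionDef)
    }
    clumps = []
    ordered = sorted(params.items(), key=lambda item: item[0])
    for (name_a, args_a), (name_b, args_b) in combinations(ordered, 2):
        shared = args_a & args_b
        if len(shared) >= min_size:
            clumps.append((name_a, name_b, sorted(shared)))
    return clumps


clumps = find_parameter_clumps(SOURCE)
print(clumps)
# [('create_user', 'update_user', ['email', 'first_name', 'last_name'])]
```

A pairwise comparison like this also hints at why clusters of clumps complicate manual refactoring: once several functions share overlapping groups, each pair yields its own report entry.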

CC BY-NC-ND 4.0

Paper citation in several formats:

Baumgartner, N. and Pulvermüller, E. (2024). The Lifecycle of Data Clumps: A Longitudinal Case Study in Open-Source Projects. In Proceedings of the 12th International Conference on Model-Based Software and Systems Engineering - MODELSWARD; ISBN 978-989-758-682-8; ISSN 2184-4348, SciTePress, pages 15-26. DOI: 10.5220/0012313900003645

@conference{modelsward24,
author={Nils Baumgartner and Elke Pulvermüller},
title={The Lifecycle of Data Clumps: A Longitudinal Case Study in Open-Source Projects},
booktitle={Proceedings of the 12th International Conference on Model-Based Software and Systems Engineering - MODELSWARD},
year={2024},
pages={15-26},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012313900003645},
isbn={978-989-758-682-8},
issn={2184-4348},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Model-Based Software and Systems Engineering - MODELSWARD
TI - The Lifecycle of Data Clumps: A Longitudinal Case Study in Open-Source Projects
SN - 978-989-758-682-8
IS - 2184-4348
AU - Baumgartner, N.
AU - Pulvermüller, E.
PY - 2024
SP - 15
EP - 26
DO - 10.5220/0012313900003645
PB - SciTePress