Authors: Israel Herraiz 1 ; Daniel M. German 2 and Ahmed E. Hassan 3

Affiliations: 1 Technical University of Madrid, Spain ; 2 University of Victoria, Canada ; 3 Queen’s University, Canada

Keyword(s): Mining software repositories, Software size estimation, Open source.

Related Ontology Subjects/Areas/Topics: Business Analytics ; Communication and Software Technologies and Architectures ; Data Engineering ; Data Warehouses and Data Mining ; e-Business ; Enterprise Information Systems ; Enterprise Software Technologies ; Programming Languages ; Software Economics ; Software Engineering

Abstract: Source code size is an estimator of software effort, and it is often used to calibrate the models and equations that estimate software cost. The literature has reported that source code file sizes follow a lognormal distribution. In this paper, we measure the size of a large collection of software (the Debian GNU/Linux distribution, version 5.0.2) and find that its source code file sizes follow a double Pareto distribution. Large files therefore occur more often than the lognormal distribution predicts, so the previously proposed models underestimate the cost of software.
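
The abstract's tail argument can be illustrated with a minimal sketch that compares the probability of exceeding a given file size under a lognormal and under a double Pareto distribution. The double Pareto form used here (power-law exponent `beta` below a scale `x0`, exponent `alpha` above it) is one common parameterization and may differ from the exact form fitted in the paper. All parameter values are hypothetical and chosen only for illustration; they are not taken from the paper or from the Debian data.

```python
import math


def lognormal_tail(x, mu, sigma):
    """P(X > x) for a lognormal with log-mean mu and log-standard-deviation sigma."""
    return 0.5 * math.erfc((math.log(x) - mu) / (sigma * math.sqrt(2.0)))


def double_pareto_tail(x, x0, alpha, beta):
    """P(X > x) for a double Pareto with scale x0.

    Assumed form: log(X / x0) = E1 / alpha - E2 / beta, with E1 and E2
    independent standard exponential variables. This gives a power law
    with exponent beta below x0 and exponent alpha above it.
    """
    if x >= x0:
        return beta / (alpha + beta) * (x / x0) ** (-alpha)
    return 1.0 - alpha / (alpha + beta) * (x / x0) ** beta


# Hypothetical parameters: both distributions have their median at 100 lines.
MU, SIGMA = math.log(100.0), 1.0
X0, ALPHA, BETA = 100.0, 1.5, 1.5

if __name__ == "__main__":
    for size in (1e3, 1e4, 1e5, 1e6):
        ln_p = lognormal_tail(size, MU, SIGMA)
        dp_p = double_pareto_tail(size, X0, ALPHA, BETA)
        print(f"size > {size:>9.0f}: lognormal {ln_p:.3e}, double Pareto {dp_p:.3e}")
```

Under these assumed parameters the double Pareto tail decays polynomially while the lognormal tail decays faster than any power law, which is the mechanism behind the claim that a lognormal model undercounts very large files.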

CC BY-NC-ND 4.0


Paper citation in several formats:
Herraiz, I.; German, D. and Hassan, A. (2011). ON THE DISTRIBUTION OF SOURCE CODE FILE SIZES. In Proceedings of the 6th International Conference on Software and Database Technologies - Volume 2: ICSOFT; ISBN 978-989-8425-77-5; ISSN 2184-2833, SciTePress, pages 5-14. DOI: 10.5220/0003426200050014

@conference{icsoft11,
author={Israel Herraiz and Daniel M. German and Ahmed E. Hassan},
title={ON THE DISTRIBUTION OF SOURCE CODE FILE SIZES},
booktitle={Proceedings of the 6th International Conference on Software and Database Technologies - Volume 2: ICSOFT},
year={2011},
pages={5-14},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003426200050014},
isbn={978-989-8425-77-5},
issn={2184-2833},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Software and Database Technologies - Volume 2: ICSOFT
TI - ON THE DISTRIBUTION OF SOURCE CODE FILE SIZES
SN - 978-989-8425-77-5
IS - 2184-2833
AU - Herraiz, I.
AU - German, D.
AU - Hassan, A.
PY - 2011
SP - 5
EP - 14
DO - 10.5220/0003426200050014
PB - SciTePress