Internal Quality Evolution of Open-Source Software Systems
Abstract
1. Introduction
2. Related Work
3. Internal Quality of Software Systems
4. Research Methodology
- RQ1: How does open-source software evolution affect the Line of Code (LOC)?
- RQ2: Is there a statistically significant relationship between size and other source code attributes (e.g., complexity, coupling, inheritance, and cohesion)?
Data Sets
- Lucene is an Apache search engine software library. The collected versions are from 5.3.2 to 8.6.0. The archives can be downloaded from [42]
- JabRef is a cross-platform citation and reference management software. The collected versions are from 3.0 to 5.2. The archives can be downloaded from [43]
- PMD is a cross-language static code analyzer. The collected versions are from 5.3.7 to 6.26.0. The archives can be downloaded from [44]
- Struts is a web-based application framework that is used for the designing of Java EE web applications. The collected versions are from 2.3.25 to 2.5.26. The archives can be downloaded from [45]
- Commons Lang is a Java utility package for java.lang API. The collected versions are from 3.3.1 to 3.12. The archives can be downloaded from [46]
5. Results and Analysis
5.1. Correlation Analysis
5.2. Regression Analysis
6. Discussion
7. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Madhavji, N.H.; Fernandez-Ramil, J.; Perry, D. Software Evolution and Feedback: Theory and Practice; John Wiley & Sons: Hoboken, NJ, USA, 2006. [Google Scholar]
- Reussner, R.; Goedicke, M.; Hasselbring, W.; Vogel-Heuser, B.; Keim, J.; Märtin, L. Managed Software Evolution; Springer: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
- Sousa, B.L.; Bigonha, M.A.; Ferreira, K.A. Analysis of Coupling Evolution on Open Source Systems. In Proceedings of the XIII Brazilian Symposium on Software Components, Architectures, and Reuse, Salvador, Brazil, 23–27 September 2019; pp. 23–32. [Google Scholar]
- Mens, T.; Demeyer, S. Software Evolution, 1st ed.; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
- Neamtiu, I.; Xie, G.; Chen, J. Towards a better understanding of software evolution: An empirical study on open-source software. J. Softw. Evol. Process. 2013, 25, 193–218. [Google Scholar] [CrossRef]
- Alenezi, M.; Khellah, F. Architectural stability evolution in open-source systems. In Proceedings of the The International Conference on Engineering & MIS 2015, Istanbul, Turkey, 24–26 September 2015; pp. 1–5. [Google Scholar]
- Alenezi, M.; Almustafa, K. Empirical analysis of the complexity evolution in open-source software systems. Int. J. Hybrid Inf. Technol. 2015, 8, 257–266. [Google Scholar] [CrossRef]
- Gezici, B.; Tarhan, A.; Chouseinoglou, O. Internal and external quality in the evolution of mobile software: An exploratory study in open-source market. Inf. Softw. Technol. 2019, 112, 178–200. [Google Scholar] [CrossRef]
- Tripathy, P.; Naik, K. A Practitioner’s Approach, Software Evolution and Maintenance; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2014. [Google Scholar]
- Chatzigeorgiou, A.; Melas, G. Trends in object-oriented software evolution: Investigating network properties. In Proceedings of the 2012 34th International Conference on Software Engineering (ICSE), Zurich, Switzerland, 2–9 June 2012; pp. 1309–1312. [Google Scholar]
- Yu, L.; Mishra, A. An Empirical Study of Lehman’s Law on Software Quality Evolution. Int. J. Softw. Inform. 2013, 7, 469–481. [Google Scholar]
- Counsell, S.; Hassoun, Y.; Johnson, R.; Mannock, K.; Mendes, E. Trends in Java code changes: The key to identification of refactorings? In Proceedings of the 2nd International Conference on Principles and Practice of Programming in Java, Kilkenny, Ireland, 16–18 June 2003; pp. 45–48. [Google Scholar]
- Bagherzadeh, M.; Kahani, N.; Bezemer, C.P.; Hassan, A.E.; Dingel, J.; Cordy, J.R. Analyzing a decade of Linux system calls. Empir. Softw. Eng. 2018, 23, 1519–1551. [Google Scholar] [CrossRef]
- Ivers, J.; Ozkaya, I.; Nord, R.L.; Seifried, C. Next generation automated software evolution refactoring at scale. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, New York, NY, USA, 8–13 November 2020; pp. 1521–1524. [Google Scholar]
- Paulson, J.W.; Succi, G.; Eberlein, A. An empirical study of open-source and closed-source software products. IEEE Trans. Softw. Eng. 2004, 30, 246–256. [Google Scholar] [CrossRef]
- Israeli, A.; Feitelson, D.G. The Linux kernel as a case study in software evolution. J. Syst. Softw. 2010, 83, 485–501. [Google Scholar] [CrossRef]
- Tu, Q.; Godfrey, M.W. Evolution in open source software: A case study. In Proceedings of the 2000 International Conference on Software Maintenance, San Jose, CA, USA, 11–14 October 2000; pp. 131–142. [Google Scholar]
- Capiluppi, A. Models for the evolution of OS projects. In Proceedings of the International Conference on Software Maintenance, Amsterdam, The Netherlands, 22–26 September 2003; pp. 65–74. [Google Scholar]
- Antoniol, G.; Gueheneuc, Y.G.; Merlo, E.; Tonella, P. Mining the lexicon used by programmers during sofware evolution. In Proceedings of the 2007 IEEE International Conference on Software Maintenance, Paris, France, 2–5 October 2007; pp. 14–23. [Google Scholar]
- Gonzalez-Barahona, J.M.; Robles, G.; Michlmayr, M.; Amor, J.J.; German, D.M. Macro-level software evolution: A case study of a large software compilation. Empir. Softw. Eng. 2009, 14, 262–285. [Google Scholar] [CrossRef]
- Herraiz, I. A statistical examination of the evolution and properties of libre software. In Proceedings of the 2009 IEEE International Conference on Software Maintenance, Edmonton, AB, Canada, 20–26 September 2009; pp. 439–442. [Google Scholar]
- Capiluppi, A.; Ramil, J.F. Studying the evolution of open source systems at different levels of granularity: Two case studies. In Proceedings of the 7th International Workshop on Principles of Software Evolution, Kyoto, Japan, 6–7 September 2004; pp. 113–118. [Google Scholar]
- Thomas, L.; Schach, S.R.; Heller, G.Z.; Offutt, J. Impact of release intervals on empirical research into software evolution, with application to the maintainability of Linux. IET Softw. 2009, 3, 58–66. [Google Scholar] [CrossRef]
- Gonzalez-Barahona, J.M.; Robles, G.; Herraiz, I.; Ortega, F. Studying the laws of software evolution in a long-lived FLOSS project. J. Softw. Evol. Process 2014, 26, 589–612. [Google Scholar] [CrossRef] [PubMed]
- Chatzimparmpas, A.; Bibi, S.; Zozas, I.; Kerren, A. Analyzing the Evolution of Javascript Applications. In Proceedings of the ENASE 2019 14th International Conference on Evaluation of Novel Approaches to Software Engineering, Crete, Greece, 4–5 May 2019; pp. 359–366. [Google Scholar]
- Stamelos, I.; Angelis, L.; Oikonomou, A.; Bleris, G.L. Code quality analysis in open source software development. Inf. Syst. J. 2002, 12, 43–60. [Google Scholar] [CrossRef]
- Meirelles, P.; Santos, C., Jr.; Miranda, J.; Kon, F.; Terceiro, A.; Chavez, C. A study of the relationships between source code metrics and attractiveness in free software projects. In Proceedings of the 2010 Brazilian Symposium on Software Engineering, Salvador, Brazil, 27 September–1 October 2010; pp. 11–20. [Google Scholar]
- Goeminne, M.; Mens, T. Towards the analysis of evolution OSS ecosystems. In Proceedings of the 8th BElgian-NEtherlands Software eVOLution Seminar (BENEVOL 2009), Louvain-la-Neuve, Belgium, 17–18 December 2009; pp. 30–35. [Google Scholar]
- Xie, G.; Chen, J.; Neamtiu, I. Towards a better understanding of software evolution: An empirical study on open source software. In Proceedings of the 2009 IEEE International Conference on Software Maintenance, Edmonton, AB, Canada, 20–26 September 2009; pp. 51–60. [Google Scholar]
- Morasca, S. A probability-based approach for measuring external attributes of software artifacts. In Proceedings of the 2009 3rd International Symposium on Empirical Software Engineering and Measurement, Lake Buena Vista, FL, USA, 15–16 October 2009; pp. 44–55. [Google Scholar]
- Fenton, N.; Bieman, J. Software Metrics: A Rigorous and Practical Approach; CRC Press: Boca Raton, FL, USA, 2014. [Google Scholar]
- Lee, Y.; Chang, K.H. Reusability and maintainability metrics for object-oriented software. In Proceedings of the 38th Annual on Southeast Regional Conference, Clemson, SC, USA, 7–8 April 2000; pp. 88–94. [Google Scholar]
- Briand, L.C.; Morasca, S.; Basili, V.R. Property-based software engineering measurement. IEEE Trans. Softw. Eng. 1996, 22, 68–86. [Google Scholar] [CrossRef]
- Abran, A. Software Metrics and Software Metrology; John Wiley & Sons: Hoboken, NJ, USA, 2010. [Google Scholar]
- Fenton, N.E.; Neil, M. Software Metrics: Roadmap. In Proceedings of the Conference on the Future of Software Engineering, Limerick, Ireland, 4–11 June 2000; pp. 357–370. [Google Scholar]
- Halstead, M.H. Elements of Software Science. 1977. Available online: https://dl.acm.org/doi/book/10.5555/540137 (accessed on 1 May 2021).
- McCabe, A. Complexity Measure. IEEE Trans. Softw. Eng. 1976, 4, 308–320. [Google Scholar] [CrossRef]
- Albrecht, A.J. Measuring Application Development Productivity. In Proceedings of the Joint SHARE/GUIDE/IBM Application Development Symposium, Monterey, CA, USA, 14–17 October 1979. [Google Scholar]
- DeLone, W.H.; McLean, E.R. Information systems success: The quest for the dependent variable. Inf. Syst. Res. 1992, 3, 60–95. [Google Scholar] [CrossRef]
- Gorton, I.; Liu, A. Software component quality assessment in practice: Successes and practical impediments. In Proceedings of the 24th International Conference on Software Engineering, Orlando, FL, USA, 23–25 May 2002; pp. 555–558. [Google Scholar]
- Abreu, R.; Premraj, R. How developer communication frequency relates to bug introducing changes. In Proceedings of the Joint International and Annual ERCIM Workshops on Principles of Software Evolution (IWPSE) and Software Evolution (Evol) Workshops, Amsterdam, The Netherlands, 24–25 August 2009; pp. 153–158. [Google Scholar]
- Lucene. Lucene Repository. Available online: https://archive.apache.org/dist/lucene/java/ (accessed on 1 May 2021).
- JabRef. JabRef Repository. Available online: https://github.com/JabRef/jabref/releases (accessed on 1 May 2021).
- PMD. PMD Repository. Available online: https://github.com/pmd/pmd/releases (accessed on 1 May 2021).
- Struts. Struts Repository. Available online: https://github.com/apache/struts/releases (accessed on 1 May 2021).
- Lang, C. Common Lang Repository. Available online: https://github.com/apache/commons-lang/releases (accessed on 1 May 2021).
- Azevedo, M. o3smeasures-tool. Available online: https://github.com/mariazevedo88/o3smeasures-tool/ (accessed on 1 May 2021).
- Nuñez-Varela, A.S.; Pérez-Gonzalez, H.G.; Martínez-Perez, F.E.; Soubervielle-Montalvo, C. Source code metrics: A systematic mapping study. J. Syst. Softw. 2017, 128, 164–197. [Google Scholar] [CrossRef]
- Lanza, M.; Marinescu, R. Object-Oriented Metrics in Practice: Using Software Metrics to Characterize, Evaluate, and Improve the Design of Object-Oriented Systems; Springer: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
- Chidamber, S.R.; Kemerer, C.F. A metrics suite for object oriented design. IEEE Trans. Softw. Eng. 1994, 20, 476–493. [Google Scholar] [CrossRef]
- Succi, G.; Pedrycz, W.; Djokic, S.; Zuliani, P.; Russo, B. An empirical exploration of the distributions of the chidamber and kemerer object-oriented metrics suite. Empir. Softw. Eng. 2005, 10, 81–104. [Google Scholar] [CrossRef]
- Fenton, N.E.; Ohlsson, N. Quantitative analysis of faults and failures in a complex software system. IEEE Trans. Softw. Eng. 2000, 26, 797–814. [Google Scholar] [CrossRef]



| Attribute | Metric | Source | 
|---|---|---|
| Number of Classes | [49] | |
| Size | Line of code (LOC) | [49] | 
| Complexity | Cyclomatic Complexity (CC) demonstrates different number of paths that are present in a method plus one. | [37] | 
| Inheritance | Depth of Inheritance Tree (DIT) denotes the measure of a total number of ancestors present in a class. | [50] | 
| Coupling | Coupling Between Objects (CBO) signifies the number of classes that are coupled to a particular class. | [50] | 
| Cohesion | Lack of Cohesion of Methods (LCOM) can be understood as the total number of pairs, present in the member functions, without any shared instance variables, minus the number of member functions’ pairs with shared instance variables. | [50] | 
| Source | DF | Mean Square | F | Partial Eta Squared | 
|---|---|---|---|---|
| Corrected Model | 4 | 13,923,329,927.650 | 445.203 | 0.975 | 
| Intercept | 1 | 136,464,462,738.000 | 4363.494 | 0.990 | 
| Systems | 4 | 13,923,329,927.650 | 445.203 | 0.975 | 
| # Classes | LOC | Complexity | Inheritance | Coupling | Cohesion | |
|---|---|---|---|---|---|---|
| # Classes | 1 | 0.978 | 0.042 | 0.988 | 0.98 | 0.974 | 
| LOC | 1 | 0.215 | 0.957 | 0.959 | 0.934 | |
| Complexity | 1 | −0.049 | 0.022 | 0.004 | ||
| Inheritance | 1 | 0.977 | 0.968 | |||
| Coupling | 1 | 0.958 | ||||
| Cohesion | 1 | 
| Unstandardized Coefficients | Standardized Coefficients | Collinearity Statistics | |||||
|---|---|---|---|---|---|---|---|
| Model | B | Std. Error | Beta | t | Sig. | Tolerance | VIF | 
| Constant | −9078.904 | 1171.597 | −7.749 | 0.000 | |||
| # Classes | 54.919 | 8.764 | 0.709 | 6.266 | 0.000 | 0.013 | 79.879 | 
| Complexity | 0.814 | 0.061 | 0.210 | 13.370 | 0.000 | 0.652 | 1.533 | 
| Inheritance | 30.131 | 6.433 | 0.491 | 4.684 | 0.000 | 0.015 | 68.618 | 
| Coupling | 1.323 | 4.219 | 0.021 | 0.314 | 0.755 | 0.036 | 27.999 | 
| Cohesion | −6.802 | 1.523 | −0.253 | −4.466 | 0.000 | 0.050 | 20.000 | 
| Unstandardized Coefficients | Standardized Coefficients | ||||
|---|---|---|---|---|---|
| Model | B | Std. Error | Beta | t | Sig. | 
| Constant | −11,560.023 | 1499.943 | −7.707 | 0.000 | |
| Complexity | 1.004 | 0.072 | 0.259 | 14.015 | 0.000 | 
| Inheritance | 58.155 | 6.290 | 0.949 | 9.245 | 0.000 | 
| Coupling | 8.794 | 5.505 | 0.140 | 1.597 | 0.117 | 
| Cohesion | −3.192 | 1.918 | −0.119 | −1.664 | 0.103 | 
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. | 
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alenezi, M. Internal Quality Evolution of Open-Source Software Systems. Appl. Sci. 2021, 11, 5690. https://doi.org/10.3390/app11125690
Alenezi M. Internal Quality Evolution of Open-Source Software Systems. Applied Sciences. 2021; 11(12):5690. https://doi.org/10.3390/app11125690
Chicago/Turabian StyleAlenezi, Mamdouh. 2021. "Internal Quality Evolution of Open-Source Software Systems" Applied Sciences 11, no. 12: 5690. https://doi.org/10.3390/app11125690
APA StyleAlenezi, M. (2021). Internal Quality Evolution of Open-Source Software Systems. Applied Sciences, 11(12), 5690. https://doi.org/10.3390/app11125690
 
         
                                                

 
       