Modeling and Assessing Software Reliability in Open-Source Projects

Maria T. Vasileva; Georgi Penchev

doi:10.3390/computation13090214

and

Faculty of Mathematics and Informatics, University of Plovdiv Paisii Hilendarski, 24 Tzar Asen, 4000 Plovdiv, Bulgaria

^*

Author to whom correspondence should be addressed.

Computation2025, 13(9), 214;https://doi.org/10.3390/computation13090214

This article belongs to the Section Computational Engineering

Version Notes

Order Reprints

Abstract

One of the key components of the software quality model is reliability. Its importance has grown with the increasing use and reuse of open-source components in software development. Software reliability growth models are commonly employed to address this aspect by predicting future failure rates and estimating the number of remaining defects throughout the development process. This paper investigates two software reliability growth models derived from the Verhulst model, with a particular focus on a structural property known as Hausdorff saturation. We provide analytical estimates for this characteristic and propose it as an additional criterion for model selection. The models are evaluated using four open-source datasets, where the Hausdorff saturation metric supports the conclusions drawn from standard goodness-of-fit measures. Furthermore, we introduce an interactive software reliability assessment tool that integrates with GitHub, enabling expert users to analyze real-time issue-tracking data from open-source repositories. The tool facilitates model comparison and enhances practical applicability. Overall, the proposed approach contributes to more robust reliability assessment by combining theoretical insights with actionable diagnostics.

Keywords:

software reliability growth model; non-homogeneous Poisson process (NHPP); Hausdorff saturation; open-source projects; reliability assessment tool; GitHub

MSC:

68N30; 90B25; 41A46

1. Introduction

Reliability is one of the key attributes of a quality model according to ISO/IEC 25010:2023 (Systems and software engineering—Systems and software Quality Requirements and Evaluation (SQuaRE)—Product quality model) [1]. Moreover for software it is a critical quality metric that reflects a system’s resistance to failure during operation. Traditional reliability assessment methods often require extensive testing or production deployment data. One basic approach of software reliability prediction is using software reliability growth models (SRGMs). Their main aim is supporting the reliability estimation in the process of software development, especially in the phases of testing and validation.

The development of new software reliability growth models constitutes an important topic in the scientific literature. Each of them is based on distinct assumptions concerning the software fault detection process. For instance, some models assume perfect debugging, where faults are immediately and completely removed without introducing new errors. However, this is often unrealistic, as the difficulty of correcting faults tends to increase as testing progresses. In contrast, other models incorporate imperfect debugging, acknowledging that the fault correction process may be incomplete and may introduce additional ones. Some new models for imperfect debugging can be found in [2,3,4,5,6,7].

Open-source software (OSS) projects are developed collaboratively by geographically dispersed developers without centralized management. Unlike traditional software, the testing phase is not clearly demarcated; it is carried out in parallel with coding. It relies on users to identify defects rather than dedicated test teams, so all bugs cannot be fully detected. Given the dynamic and distributed nature of OSS development, assessing its reliability is essential, particularly as such software is increasingly used in new systems. Studying OSS reliability helps ensure quality despite the absence of formal testing phases. Our objective is to investigate the reliability of open-source projects by applying an SRGM. While this is not a novel technique, it remains a meaningful approach for understanding and predicting the fault-finding process. Other advanced methods and applications of the methodology are extensively explored in various studies, such as [8,9,10,11,12].

Most software reliability growth models assume that software faults follow a nonhomogeneous Poisson process (NHPP). They are characterized by a model-specific mean value function

m (t)

used to estimate the expected number of faults detected over time. Based on

m (t)

, the reliability function for a NHPP model can be expressed as the conditional probability

R (x | t)

, which represents the probability that no software failure will occur in the time interval

[t, t + x]

, where

t \geq 0

and

x > 0

. Formally, this can be written as

R (x | t) = exp (- [m (t + x) - m (t)]) .

Table 1 presents summary of some basic software reliability growth models including the definitions of their mean value functions and their corresponding types (concave or S-shaped).

Table 1. Summary of some basic software reliability models.

In 2018, Pham [20] proposed estimating the mean value function

m (t)

using a logistic growth model, also known as the Verhulst model, defined by

\frac{d m (t)}{d t} = b (t) m (t) (1 - \frac{m (t)}{a (t)}), m (0) \neq 0,

(1)

where

a (t)

is fault content function and

b (t)

is software fault detection rate per unit of time.

Haque and Ahmad [21] proposed the software reliability model that takes into account the improvement of the testing team’s skills over time. Its mean value function is expressed by

m_{H A} (t) = \frac{a}{1 + β exp (- \frac{b t^{α}}{α})} .

(2)

They used (1) with

a (t) = a

,

b (t) = b t^{α - 1}

and

m (0) = \frac{a}{1 + β}

. Function

b (t)

is the same as in the Generalized Goel Model and represents an initial increase followed by a decrease in the failure rate.

Song, Kim and Chang [22] consider a software reliability model that considers the number of finite faults and dependent faults with the mean value function defined by

m_{S K C} (t) = \frac{a}{1 + (\frac{a}{k} - 1) (1 + b t) exp (- b t)} .

(3)

It is obtained from (1) with

a (t) = a

,

b (t) = \frac{b^{2} t}{b t + 1}

and

m (0) = k

. This model is assumed to be used for open-source software.

Some other SRGM models obtained by the logistic growth model can be found in [23,24,25].

In this study, we have two main aims. First, we introduce an intrinsic characteristic known as Hausdorff saturation for software reliability models. It provides a quantitative measure of how close the software is to completing its failure discovery process. In Section 2 we present a detailed analysis for two software reliability models—Haque–Ahmad

m_{H A}

and Song–Kim–Chang

m_{S K C}

. We prove precise estimates of Hausdorff distance to the horizontal asymptote. Additionally, we utilize four datasets from an open-source platforms to perform a comparative analysis between three classical and two proposed software reliability growth models. We apply standard goodness-of-fit tests and Hausdorff saturation to determine the most appropriate approximation model. Our second goal is assessing the reliability of open-source software (OSS) projects by analyzing the patterns of issue creation and resolution over time. By focusing on the trends in opened and closed issues, we aim to determine the maturity and reliability of OSS projects. Section 3 describes the development of a software reliability assessment tool designed for this purpose. The proposed tool implements five software reliability growth models within a dynamic Wolfram Language environment that directly interfaces with the GitHub API. The presented process enables real-time reliability assessment and provides comparative analysis of model fits to determine the most appropriate for a given software project.

2. Approximation Results

2.1. Hausdorff Saturation

The main aim of this subsection is investigation of the intrinsic characteristic named Hausdorff saturation. It presents the distance between the mean value function

m (t)

of software reliability growth model and a

Γ

-shaped curve connecting its endpoints. This metric can be applied to models with a horizontal asymptote, which typically represents the theoretical maximum number of defects that can be detected over time. By evaluating the proximity of the model’s predictions to this asymptote, Hausdorff saturation provides insight into the maturity and completeness of the software testing process. This characteristic can aid in selecting the most suitable software reliability model for approximating a given dataset.

The advantages of the Hausdorff metric for approximating functions and point sets in the plane, compared to the uniform metric, have been highlighted in numerous publications and monographs over the past 70 years. Recently, this classical theory has been applied in the study of dynamic models in various fields, including growth theory such as investigations into epidemic models, particularly those related to the COVID-19 pandemic [26], the synthesis and analysis of antenna patterns and filters [27], the approximation of data related to computer virus propagation [28], debugging and test theory [29,30,31,32,33].

We consider Euclidean space with the maximum norm, namely, for the points

A = (t_{A}, x_{A})

and

B = (t_{B}, x_{B})

in

R^{2}

, we have

∥ A - B ∥ = max (| t_{A} - t_{B} |, | x_{A} - x_{B} |)

. Hence we can define the Hausdorff distance [34,35].

Definition 1.

The Hausdorff distance (the H-distance)

ρ (f, g)

between two interval functions

f, g

on

Ω \subseteq R

is the distance between their completed graphs

F (f)

and

F (g)

considered as closed subsets of

Ω \times R

. More precisely,

ρ (f, g) = max {sup_{A \in F (f)} inf_{B \in F (g)} ∥ A - B ∥, sup_{B \in F (g)} inf_{A \in F (f)} ∥ A - B ∥} .

Definition 2.

The saturation of a software reliability model on

[a, b)

,

- \infty < a < b \leq \infty

is the Hausdorff distance between the completed graph of the mean value function

m (\cdot)

with asymptote h and the Γ-shaped curve consisting of two lines—one vertical between the points

(a, 0)

and

(a, h)

and another horizontal between

(a, h)

and

(b, m (b))

.

Corollary 1.

The saturation of an SRGM model with mean value function

m (t)

to the horizontal asymptote h at the “median level” satisfies the following nonlinear equation:

m (t_{0} + d) = h - d,

(4)

where

m (t_{0}) = \frac{1}{2} h

.

Proof.

The proof follows immediately from Definitions 1 and 2. Equation (4) has a unique root since the function

f (d) = m (t_{0} + d) - h + d

is increasing and continuous. □

According to Definitions 1 and 2, we have that Hausdorff saturation d is the smallest square (box) ball touching the graph of the mean value function of the software reliability model. The Hausdorff metric is an approximation error between the sigmoidal (mean value) function and a step function. Figure 1 presents a graphical representation of the problem.

Figure 1. Graphical representation of Hausdorff saturation d of mean value function

m (t)

.

A small Hausdorff distance indicates that the model’s predicted failure counts are approaching the theoretical upper bound, suggesting that the majority of detectable defects have been identified and resolved. This reflects a mature testing process and a high level of software reliability. Conversely, a large Hausdorff distance implies a significant gap between current failure predictions and the model’s asymptote, indicating that the software remains in the reliability growth phase and is not yet suitable for high-assurance deployment.

The next theorem gives some useful estimates for Hausdorff saturation of the software reliability model.

Theorem 1.

The Hausdorff saturation d of Haque–Ahmad software reliability model

m_{H A}

defined by (2) satisfies

d_{l} = \frac{a}{Δ_{1}} < d < \frac{a ln (Δ_{1})}{Δ_{1}} = d_{r},

where

Δ_{1} = 2.1 * (1 + \frac{1}{4} a α log (β) {(\frac{b}{α log (β)})}^{1 / α}),

(5)

with

Δ_{1} > e^{1.05}

.

Proof of Theorem 1.

First we note that

{lim}_{t \to \infty} m_{H A} (t) = a

and

m_{H A} (t_{0}) = \frac{1}{2} a

for

t_{0} = {(\frac{b}{α log (β)})}^{- 1 / α}

. We apply Corollary 1 as we consider the function

f_{1} (d) = m_{H A} (t_{0} + d) - a + d .

We apply the Taylor series for function

f (d) = m (t_{0}) - a + (1 + m^{'} (t_{0})) d + O (d^{2}) = - \frac{a}{2} + (1 + m^{'} (t_{0})) d + O (d^{2}) .

With simple calculations, we have

m_{H A}^{'} (t) = {(\frac{a}{1 + β exp (- \frac{b t^{α}}{α})})}^{'} = \frac{a b β t^{α - 1} e^{- \frac{b t^{α}}{α}}}{1 + {(β e^{- \frac{b t^{α}}{α}})}^{2}} .

Hence,

m_{H A}^{'} (t_{0}) = \frac{Δ_{1}}{2.1} - 1

. From here we can conclude that

g_{1} (d) - f_{1} (d) = O (d^{2})

, where

g_{1} (d) = - \frac{a}{2} + \frac{Δ_{1}}{2.1} d

with

Δ_{1}

defined by (5). Therefore function

g_{1} (d)

approximates

f_{!} (d)

for

d \to 0

with

O (d^{2})

. Function

g_{1} (d)

is also monotonically increasing since

g_{1}^{'} (d) > 0

. If

Δ_{1} > e^{1.05}

, then

g (d_{l}) < 0

and

g (d_{r}) > 0

, which completes the proof. □

Theorem 2.

The Hausdorff saturation d of the Song–Kim–Chang software reliability model

m_{S K C}

defined by (3) satisfies

d_{l} = \frac{a}{Δ_{2}} < d < \frac{a ln (Δ_{2})}{Δ_{2}} = d_{r},

where

Δ_{2} = 2.1 * (1 + \frac{a b (- a e^{W_{- 1} (\frac{k}{e (k - a)}) + 1} + k e^{W_{- 1} (\frac{k}{e (k - a)}) + 1} + k)}{4 k}),

(6)

with

Δ_{2} > e^{1.05}

, and

W_{k} (z)

is the Lambert-W function.

Proof of Theorem 2.

Regarding analogues, first we compute

{lim}_{t \to \infty} m_{S K C} (t) = a

. Moreover

m_{S K C} (t_{0}) = \frac{1}{2} a

for

t_{0} = - \frac{W_{- 1} (\frac{k}{e (k - a)}) + 1}{b}

. We apply Corollary 1 as we consider the function

f_{2} (d) = m_{S K C} (t_{0} + d) - a + d .

We consider the analogue from the Taylor series, and

m_{S K C}^{'} (t) = \frac{a b^{2} k t (a - k) e^{b t}}{{((a - k) (b t + 1) + k e^{b t})}^{2}},

and we obtain that

m_{S K C}^{'} (t_{0}) = \frac{Δ_{2}}{2.1} - 1

. Hence

g_{2} (d) - f_{2} (d) = O (d^{2})

, where

g_{2} (d) = - \frac{a}{2} + \frac{Δ_{2}}{2.1} d

with

Δ_{2}

defined by (6). Therefore function

g_{2} (d)

approximates

f_{2} (d)

for

d \to 0

with

O (d^{2})

. We have analogue proof of

g_{2} (d_{l}) < 0

and

g_{1} (d_{r}) > 0

, which ends the proof. □

2.2. Numerical Examples

In this subsection, we present two examples involving the fitting of software reliability growth models (SRGMs) to an open-source dataset containing detected software faults. We constructed a nonlinear regression model based on an SRGM using the NonlinearModelFit function in the CAS Wolfram Mathematica [36]. Let us note that this function estimates the parameters of nonlinear models via the least-squares method, under the assumption that the residuals are independent and normally distributed. It returns a FittedModel object, which provides access to a wide range of diagnostic information. A key advantage of this model-fitting framework is the ability to retrieve detailed results and diagnostics without refitting the model. The available properties include parameter estimates, standard errors, confidence intervals, residuals and goodness-of-fit statistics.

Example 1.

The GNOME (GNU’s Not Unix Network Object Model Environment) project is an open-source platform that has a free and complete set of user friendly applications and tools. We consider the dataset (Table 2) from version GNOME 2.2 [37]. The first row is time in weeks, and the second one is for cumulative detected faults.

Table 2. Detected faults in GNOME 2.2 official public releases.

We propose a comparison between software reliability models that have horizontal asymptote

h = a

, namely, the Goel–Okumoto, delayed S-shaped, inflection S-shaped, Haque–Ahmad and Song–Kim–Chang models. Values of estimated parameters with four digits after the decimal point for the considered models are presented in Table 3.

Table 3. Parameter estimation for GNOME 2.2 dataset.

Figure 2 shows a graphical representation of the cumulative number of failures (with black dots) and estimated mean value functions of the proposed software reliability models.

Figure 2. Fitting results for GNOME 2.2 dataset.

We evaluate the goodness-of-fit of the considered software reliability models using standard criteria, available as properties of the FittedModel object. Namely, we use the Akaike information criterion (AIC), Bayesian information criterion (BIC), coefficient of determination (

R^{2}

or R-squared), and Adjusted R-squared. Let us recall that, among a set of approximation models, the preferred model is the one with the minimum AIC and BIC values and the maximum R-squared and adjusted R-squared values. The numerical results are presented in Table 4. We can see that Haque–Ahmad software reliability model

m_{H A}

has the best criteria values.

Table 4. Comparison of criteria values for GNOME 2.2 dataset.

Let us consider the Hausdorff saturation of obtained approximation models with parameters from Table 3. For Haque–Ahmad software reliability model

m_{H A}

defined by (2) for

a = 56.2411

b = 1.2771

,

α = 0.1531

and

β = 50661.1

, the value of Hausdorff distance is

d = 8.72883

with

t_{0} = 5.51209

. The values of upper and lower estimates

d_{l} = 5.12019

and

d_{r} = 12.2703

are computed using Theorem 1. According to Theorem 2 for the Song–Kim–Chang software reliability model, we have

d = 5.01575

for

t_{0} = 5.28609

with corresponding estimates

d_{l} = 3.95241

and

d_{r} = 9.99032

. Figure 3 graphically presents the obtained results. It is obvious that Haque–Ahmad software reliability model

m_{H A}

is better for the GNOME 2.2 dataset.

Figure 3. Hausdorff saturation for (a) Haque–Ahmad software reliability model. (b) Song–Kim–Chang software reliability model.

The residuals from a fitted model are defined as the differences between the observed and predicted values. The fitting residuals data is distributed according to the Normal distribution. The residual vs. predictor plot of fitting residuals for all five models is presented in Figure 4. Also Figure 5 presents the quantile plot of fitting residuals for Haque–Ahmad and Song–Kim–Chang software reliability models.

Figure 4. Residual vs. predictor plot of fitting residuals for (a) Goel–Okumoto, (b) delayed S-shaped, (c) inflection S-shaped, (d) Haque–Ahmad and (e) Song–Kim–Chang software reliability models.

Figure 5. Quantile plot of fitting residuals for (a) Haque–Ahmad software reliability model and (b) Song–Kim–Chang software reliability model.

Also we tested the hypothesis that fitting residuals for all five compared models follow a normal distribution. It is not rejected at the 5 percent significance level according to the Jarque–Bera ALM, Mardia Combined, Mardia Kurtosis, Mardia Skewness or Shapiro–Wilk standard goodness-of-fit tests.

We propose comparing different types of software reliability models for approximating the GNOME 2.2 dataset. According to the standard test of goodness-of-fit and Hausdorff saturation, we can conclude that Haque–Ahmad software reliability model

m_{H A}

is the best for predicting the number of faults in the future. Figure 6 presents actual and predicted number of software faults. The solid black points show the faults that were actually observed during testing, while the open (unfilled) points show the predicted values from our model. This kind of plot helps us assess how stable the fault detection process is and gives us a sense of whether the software is reaching a reliable state before release.

Figure 6. Prediction for the number of faults for GNOME 2.2 dataset.

Example 2.

Ansible is an open-source automation tool that supports provisioning, configuration, application deployment and infrastructure management following the Infrastructure as Code (IaC) paradigm. It uses a declarative language to define system states and operates across Unix-based systems and Windows. As a widely adopted DevOps tool, Ansible plays a crucial role in automating complex tasks and ensuring efficient, repeatable infrastructure operations.

We consider a dataset (Table 5) from Ansible software used in [38]. The time period is from 1 January 2016 to 31 December 2017 as for each month we take the cumulative detected faults.

Table 5. Detected faults in Ansible official public releases.

Table 6 presents values of estimated parameters with four digits after the decimal point for the considered models. Figure 7 shows a graphical representation of the cumulative number of failures (with black dots) and estimated mean value functions of the proposed software reliability models.

Table 6. Parameter estimation for Ansible software dataset.

Figure 7. Fitting results for Ansible dataset.

Table 7 presents the values of standard criteria for evaluation the goodness-of-fit. Although the Haque–Ahmad software reliability model

m_{H A}

has the best values for three of the proposed criteria, both the Haque–Ahmad and Song–Kim–Chang models yield extremely similar results. Therefore, we can conclude that both models are suitable for application to the considered dataset.

Table 7. Comparison of criteria values for Ansible software dataset.

The Hausdorff saturation is calculated for obtained parameters from Table 6. We obtain the value of Hausdorff distance

d = 27.52

with

t_{0} = 18.2982

for Haque–Ahmad software reliability model

m_{H A}

. By Theorem 1 we have estimates

d_{l} = 8.96625

and

d_{r} = 65.1797

. Regarding the analogue for Song–Kim–Chang software reliability model

m_{S K C}

, we have

d = 25.3841

for

t_{0} = 17.7701

with corresponding estimates

d_{l} = 8.36737

and

d_{r} = 9.99032

by Theorem 2. The obtained Hausdorff saturation values are highly similar, particularly when normalized with respect to the horizontal asymptote. This suggests that both models are equally appropriate for application to the given dataset.

Based on the fault detection data for the Ansible software over a two-year period, both the Haque–Ahmad and Song–Kim–Chang models demonstrate comparable performance in terms of goodness-of-fit and Hausdorff saturation. Therefore, both models can be effectively used for fault prediction in this context. As shown in Figure 8, the actual number of faults (solid black points) is followed by predicted values (open, unfilled points), with both models closely capturing the observed trend. This consistency indicates that either model can be reliably applied for fault prediction and reliability assessment of the software.

Figure 8. Prediction number of faults for the Ansible software for (a) Haque–Ahmad software reliability model. (b) Song–Kim–Chang software reliability model.

Example 3.

Electron is an open-source project under the OpenJS Foundation maintained by an active community of contributors. It is designed for developing cross-platform desktop applications with HTML, CSS and JavaScript. Electron is built on Node.js and Chromium, which makes it a suitable choice for web developers.

Electron frequently releases major versions in sync with every other Chromium release. We consider the issue reports from GitHub for version 31.0.0 during its Stable phase. Table 8 presents the cumulative number of detected faults over a period of 29 weeks, from 10 April 2024 to 30 October 2024.

Table 8. Detected faults in Electron official public releases.

Table 9 presents the estimated parameters for the models under consideration, with values rounded to four decimal places. Figure 9 provides a graphical representation of the cumulative number of failures (indicated by black dots) alongside the estimated mean value functions of the proposed software reliability models. A quantitative evaluation of the models’ performance, based on standard goodness-of-fit criteria, is detailed in Table 10.

Table 9. Parameter estimation for Electron 31.0.0 dataset.

Figure 9. Fitting results for Electron 31.0.0.

Table 10. Comparison of criteria values for Electron software dataset.

Using the parameters estimated in Table 9, we calculated the Hausdorff saturation for the Haque–Ahmad software reliability model (

m_{H A}

). The Hausdorff distance is

d = 11.1591

with

t_{0} = 13.1706

. Consistent with Theorem 1, the lower and upper bounds were estimated as

d_{l} = 6.68992

and

d_{r} = 22.3695

, respectively. Similarly, an analogous calculation for the Song–Kim–Chang software reliability model (

m_{S K C}

) yielded a Hausdorff distance of

d = 9.56518

at

t_{0} = 13.1212

. The corresponding estimates, as established by Theorem 2, are

d_{l} = 6.1097

and

d_{r} = 20.8072

.

The analysis of Hausdorff saturation demonstrates the stability of the models under consideration. Based on the combined results of standard goodness-of-fit tests and the Hausdorff saturation, we can conclude that the Haque–Ahmad software reliability model

m_{H A}

is the most suitable for forecasting future fault occurrences. This is illustrated in Figure 10.

Figure 10. Prediction for the number of faults for Electron dataset.

Example 4.

Eclipse Foundation platform is a major open-source software ecosystem. It is primarily Java-based with a plugin architecture supporting multiple programming languages such as C, C++, PHP, Perl, JavaScript, Python and others.

Table 11 contains real defect reports submitted by users and developers during routine software operation. The defect reports were obtained from the public Eclipse Bugzilla repository. It covers the period of 27 months (from 01.2023 to 03.2025), during which ten releases (4.26–4.35) were issued.

Table 11. Detected faults in Eclipse.

The estimated parameters of the analyzed models are given in Table 12. Table 13 summarizes the quantitative performance assessment based on standard goodness-of-fit metrics. A visual comparison is shown in Figure 11, where the cumulative observed failures (black dots) are plotted alongside the estimated mean value functions of the proposed software reliability models.

Table 12. Parameter estimation for Eclipse dataset.

Table 13. Comparison of criteria values for Eclipse software dataset.

Figure 11. Fitting results for Eclipse.

The corresponding values of the Hausdorff saturation for the Haque–Ahmad software reliability model

m_{H A}

are

d = 21.453

with

t_{0} = 7.45499

with estimates

d_{l} = 7.25165

and

d_{r} = 40.9566

, respectively. Analogous for the Song–Kim–Chang software reliability model

m_{S K C}

we have Hausdorff distance

d = 14.5305

at

t_{0} = 7.65121

with corresponding estimates

d_{l} = 6.2984

and

d_{r} = 36.2197

.

Based on the Haque–Ahmad software reliability model, Figure 12 gives the predicted with open (unfilled) points and actual number solid black points of software faults.

Figure 12. Prediction for the number of faults for Eclipse dataset.

3. Software Reliability Assessment Tool

Although software reliability growth models can be implemented manually using specialized mathematical software, this process is often time-consuming and demands a high level of expertise. Most software engineers are reluctant to implement these models due to limited awareness, a lack of understanding of the underlying mathematical concepts or time constraints. The development of software engineering tools has been driven by the objective of bridging the gap between theoretical academic research and its practical application. In [39] Xiao, Okamura and Dohi present a review of over 40 software reliability assessment tools. A review of five open-source tools, SFRAT (https://github.com/LanceFiondella/srt.core, accessed on 27 July 2025), SRATS (https://github.com/SwReliab/SRATS2010, accessed on 27 July 2025), RSRAT (https://github.com/SwReliab/Rsrat, accessed on 27 July 2025), PHSRM (https://github.com/okamumu/phsrm, accessed on 27 July 2025) and SMERFS (https://www.slingcode.com/smerfs/, accessed on 27 July 2025), can be found in [40]. Some examples of other tools are SFRAT [41], C-SFRAT [42] and C SFRAT [43]. The basic difference among the tools lie in how they collect data, which software reliability models are used and the programming languages used in their development.

Several tools support the full process of reliability analysis. For instance, STRAIT [44] is a free, open-source tool that collects issue report data directly from GitHub and supports nine software reliability models. It was developed using Java and R. Another example is RAAF-OSS [45], a reliability automatic assessment framework designed for open-source software and compatible with Jira. It is implemented in Python and supports six software reliability models.

We present a computational tool for assessing software reliability through the analysis of issue tracking patterns in GitHub repositories. The tool supports the full reliability analysis process and is designed to be accessible to users without expert knowledge.

It is a dynamic Wolfram Language environment that directly interfaces with GitHub’s API. To the best of the authors’ knowledge, this is the first tool of its kind.

The tool processes five main steps: data collection, data normalization, model fitting, visualization and evaluation.

Data Collection: In open-source software (OSS) projects, public repositories serve both as development platforms and as sources for issue data collection. Issues are retrieved from the GitHub repository, and their creation and closure timestamps are captured. The dataset used comprises closed issues from GitHub repositories, obtained via the GitHub REST API v3. Data points are organized as

{t, N (t)}

pairs, where t is the time in days since the first reported issue and

N (t)

is the cumulative number of issues up to time t. This transformed dataset emulates software reliability growth testing data, where the time of discovery of software faults (represented by issues) is modeled over calendar time. This modeling is performed in the absence of formal test-phase data, which is often unavailable in open-source projects. Our tool automates the collection and storage of issue reports directly from GitHub using the project URL. Users only need to provide the repository owner and name.

Data Normalization: Timestamps are normalized to analyze the progression of issues over time. The raw timestamps undergo the following processing steps: 1. conversion to absolute time values; 2. normalization relative to the creation date of the first issue; and 3. binning into daily intervals for consistent time series analysis. In the interactive interface, the user has opportunity to choose amount of data to include, choosing either all issue reports or only a portion using a dynamic slider. Another option allows the user to set the maximum number of iterations to be used during the model fitting process.

Accurate parameter estimation is essential for the effective application of software reliability growth models. Since software failures exhibit nonlinear behavior, merely fitting test data to models is insufficient. Instead, optimization techniques are required to identify the most suitable parameters that ensure reliable prediction accuracy. Our tool allows users to select the most appropriate optimization method for parameter estimation, including

“ConjugateGradient”—nonlinear conjugate gradient;
“Gradient” —gradient descent;
“LevenbergMarquardt”—Gauss–Newton method for least squares;
“Newton”—Newton method;
“QuasiNewton” —quasi-Newton BFGS;
“InteriorPoint”—interior point method;
“NMinimize”—optimization that always attempts to find a global minimum;
“Automatic”—automatic default method.

Model Fitting: Various reliability growth models, such as Goel–Okumoto, delayed S-shaped, inflection S-shaped, Haque–Ahmad and Song–Kim–Chang, are applied to the issue data. Each model can be selected by the user. We use CAS Wolfram Mathematica to fit data with high precision. The model fitting function NonlinearModelFit [36] returns a FittedModel object, which provides convenient access to both model predictions and diagnostic information without the need to refit the model. One of the key advantages is its ability to support direct evaluation at specific input values, as well as queries for parameter estimates, residuals and other diagnostic metrics. Once a model is chosen, the parameter estimations are displayed in a table view. The table presents the best-fit parameter information, including values for “Estimate,” “Standard Error,” “t-Statistic” and “p-Value.”

Visualization: Plots are generated to visualize how the selected models fit the actual issue data. These plots allow for analysis of issue trends and provide a clear graphical representation of model performance, helping users choose the most appropriate model.

Evaluation: Criteria values for model comparison present some properties that measure the goodness-of-fit of the considered models in table view. The presented criteria are as follows: “AIC” (Akaike information criterion), “BIC” (Bayesian information criterion), “R-Squared” (coefficient of determination

R^{2}

) and “Adjusted R-Squared” (

R^{2}

adjusted for the number of model parameters). In each row of the table, the most suitable value is highlighted in blue for the selected software reliability models—specifically, the minimum values for AIC and BIC, and the maximum values for R-squared and adjusted R-squared. This facilitates the identification of the model that demonstrates the best fit to the observed data.

The main advantages provided by the interactive interface of our tool are as follows:

Dynamic model fitting with adjustable sample sizes;
Support for multiple optimization algorithms;
Visual comparison of model fits;
Parameter estimates with statistical significance;
Ease of use, even for users without expert knowledge.

We demonstrate capabilities of our tool by using the MPAndroidChart repository as a case study (see Figure 13). MPAndroidChart is a powerful and user-friendly charting library for Android https://github.com/PhilJay/MPAndroidChart (accessed on 27 July 2025). This open-source project belongs to Philipp Jahoda and is licensed under the Apache License, Version 2.0. Its repository issue tracker is dedicated exclusively to bug reports and feature requests.

Figure 13. Software reliability assessment tool.

Analysis of the MPAndroidChart repository revealed a decreasing trend in opened issues over time, suggesting a reduction in the introduction of new defects. Also we observe a high closure rate. A significant proportion of opened issues have been resolved and closed, indicating active maintenance and responsiveness from the development community.

In our software reliability tool, we provide a comparative analysis of all five models. Based on parameter estimations and standard goodness-of-fit tests, we conclude that the Song–Kim–Chang software reliability model offers the most accurate predictions for the number of future faults.

The decreasing number of opened issues, coupled with a high closure rate, suggests that the MPAndroidChart project is maturing and becoming more reliable over time. This aligns with findings from previous studies indicating that OSS projects exhibit reliability growth patterns similar to closed source projects. Moreover, the active engagement of the community in resolving issues contributes to the project’s reliability. As more users and developers participate in identifying and fixing bugs, the software becomes more robust.

By analyzing issue tracking data, we can gain valuable insights into the reliability and maturity of OSS projects. The trends observed in the MPAndroidChart repository demonstrate that a decreasing rate of new issues and a high resolution rate are indicative of a reliable and well-maintained project.

4. Discussion and Conclusions

Open-source projects have become integral to modern software development, offering transparency and collaborative improvement. However, assessing the their reliability remains a challenge due to the decentralized nature of their development. The paper presents an analysis of the so-called Hausdorff saturation characteristic for two software reliability models based on the logistic growth model. The main goal is to propose an additional criterion for model selection. Thus, this distance acts as a geometric indicator of software reliability maturity, helping assess how close we are to a stable, dependable system. This methodology can be successfully applied to other models as well. We considered four numerical examples using fault detection datasets from two different open-source projects. In the presented two examples, the proposed metric supports the results obtained from the previous metrics, including standard goodness-of-fit tests. The question of the usefulness of the proposed Hausdorff saturation criterion remains open and will be the subject of future research, particularly in the context of large datasets. We present a new dynamic tool for evaluating the reliability of open-source software through the analysis of issue tracking dynamics. The tool supports the complete reliability assessment process in real time and is designed for use by individuals without specialized expertise. To demonstrate its applicability, we include a case study based on an open-source project. Furthermore, the tool is easily extensible, allowing for the integration of additional software reliability growth models.

The applicability of SRGMs to real-world projects is heavily dependent on the characteristics of software repositories and the adopted development methodology. For OSS in particular, development typically follows agile practices rather than classical waterfall models. This approach, characterized by short iterations, continuous feedback and the parallel execution of coding and testing, aligns perfectly with the decentralized nature of OSS communities. Consequently, the reliability assessment of OSS must account for the heterogeneity of the reported data, which includes not only genuine defects but also duplicates, enhancement requests and rejected reports. This reality distinguishes OSS reliability analysis from that of traditional systems, where testing is a more formalized and managed process. Some recent studies [46,47,48] emphasize the need to integrate repository mining and empirical software engineering into SRGM-based analysis, while foundational works [49] continue to provide the guiding theoretical framework. Future research may explore several enhancements to improve software reliability assessment. We can consider incorporating additional metrics. While this study focuses on issue opening and closing trends, integrating other indicators such as pull request activity, code churn, commit frequency and test coverage could offer a more comprehensive view of software reliability and overall project health. One promising direction is analyzing the roles and behaviors of contributors to gain insight into how community dynamics impact the evolution of reliability in open-source projects. Another avenue involves developing a real-time monitoring dashboard that continuously tracks issue dynamics and estimates software reliability, thereby supporting maintainers and contributors in evaluating project status on an ongoing basis. Another promising direction involves correlating issue trends with external quality indicators, such as user ratings, download counts or real-world usage in production systems, to strengthen the link between internal repository data and perceived reliability. Lastly, predictive modeling based on historical issue data could enable the forecasting of future reliability trends, issue arrival rates and resolution times, ultimately aiding in resource planning and risk assessment allocation for OSS projects.

Author Contributions

Conceptualization, M.T.V. and G.P.; methodology, M.T.V.; software, G.P.; validation, M.T.V. and G.P.; formal analysis, M.T.V. and G.P.; data curation, G.P.; writing—original draft preparation, M.T.V. and G.P.; writing—review and editing, M.T.V. and G.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by project No. FP25-FMI-010, “Innovative interdisciplinary research in Informatics, Mathematics, and Pedagogy of Education”, of the Scientific Fund of the Paisii Hilendarski University of Plovdiv, Bulgaria.

Data Availability Statement

The original data presented in the study are openly available in Github at https://github.com/PhilJay/MPAndroidChart (accessed on 27 July 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

References

ISO/IEC 25010:2023; Systems and Software Engineering—Systems and Software Quality Requirements and Evaluation (SQuaRE)—Product Quality Model. International Organization for Standardization: Geneva, Switzerland, 2023.
Huang, Y.S.; Chiu, K.C.; Chen, W.M. A software reliability growth model for imperfect debugging. J. Syst. Softw. 2022, 188, 111267. [Google Scholar] [CrossRef]
Tian, Q.; Yeh, C.W.; Fang, C.C. Bayesian decision making of an imperfect debugging software reliability growth model with consideration of debuggers’ learning and negligence factors. Mathematics 2022, 10, 1689. [Google Scholar] [CrossRef]
Luo, H.; Xu, L.; He, L.; Jiang, L.; Long, T. A novel software reliability growth model based on generalized imperfect debugging NHPP framework. IEEE Access 2023, 11, 71573–71593. [Google Scholar] [CrossRef]
Lee, D.; Chang, I.; Pham, H. Study of a new software reliability growth model under uncertain operating environments and dependent failures. Mathematics 2023, 11, 3810. [Google Scholar] [CrossRef]
Nazir, R.; Iqbal, J.; Masoodi, F.S.; Shrivastava, A.K. Developing an innovative imperfect debugging software reliability growth model with enhanced testing coverage strategies. Int. J. Reliab. Qual. Saf. Eng. 2024, 31, 2450017. [Google Scholar] [CrossRef]
Samal, U. Software reliability growth model considering imperfect debugging and fault removal efficiency. Qual. Reliab. Eng. Int. 2025, 41, 1268–1278. [Google Scholar] [CrossRef]
Yamada, S.; Tamura, Y. OSS Reliability Measurement and Assessment; Springer Series in Reliability Engineering; Springer: Cham, Switzerland, 2016; ISBN 978-3-319-31818-9. [Google Scholar]
Miyamoto, S.; Tamura, Y.; Yamada, S. A Method of OSS Reliability Assessment Based on Public Repository Analysis. Int. J. Reliab. Qual. Saf. Eng. 2023, 30, 2350021. [Google Scholar] [CrossRef]
Tamura, Y.; Yamada, S. Applied OSS Reliability Assessment Modeling, AI and Tools; Springer Series in Reliability Engineering; Springer: Cham, Switzerland, 2024; ISBN 978-3-031-64803-8. [Google Scholar]
Tamura, Y.; Miyamoto, S.; Zhou, L.; Anand, A.; Kapur, P.K.; Yamada, S. OSS reliability assessment method based on deep learning and independent Wiener data preprocessing. Int. J. Syst. Assur. Eng. Manag. 2024, 15, 2668–2676. [Google Scholar] [CrossRef]
Miyamoto, S.; Tamura, Y.; Yamada, S. A Method of Reliability Assessment Based on Trend Analysis for Open Source Software. Int. J. Reliab. Qual. Saf. Eng. 2024, 31, 2450004. [Google Scholar] [CrossRef]
Goel, A.L.; Okumoto, K. Time-dependent error-detection rate model for software reliability and other performance measures. IEEE Trans. Reliab. 1979, 28, 206–211. [Google Scholar] [CrossRef]
Yamada, S.; Ohba, M.; Osaki, S. S-shaped reliability growth modeling for software error detection. IEEE Trans. Reliab. 1983, 32, 475–484. [Google Scholar] [CrossRef]
Ohba, M. Inflexion S-shaped software reliability growth models. In Stochastic Models in Reliability Theory; Osaki, S., Hatoyama, Y., Eds.; Springer: Berlin, Germany, 1984; pp. 144–162. [Google Scholar]
Yamada, S.; Tokuno, K.; Osaki, S. Imperfect debugging models with fault introduction rate for software reliability assessment. Int. J. Syst. Sci. 1992, 23, 2241–2252. [Google Scholar] [CrossRef]
Pham, H.; Nordmann, L.; Zhang, X. A general imperfect software debugging model with S-shaped fault detection rate. IEEE Trans. Reliab. 1999, 48, 169–175. [Google Scholar] [CrossRef]
Pham, H.; Zhang, X. An NHPP Software Reliability Model and Its Comparison. Int. J. Reliab. Qual. Saf. Eng. 1997, 4, 269–282. [Google Scholar] [CrossRef]
Pham, H. An imperfect-debugging fault-detection dependent-parameter software. Int. J. Automat. Comput. 2007, 4, 325–328. [Google Scholar] [CrossRef]
Pham, H. A Logistic fault-dependent detection software reliability model. J. Univers. Comput. Sci. 2018, 24, 1717–1730. [Google Scholar]
Haque, M.A.; Ahmad, N. An effective software reliability growth model. Saf. Reliab. 2021, 40, 209–220. [Google Scholar] [CrossRef]
Song, K.Y.; Kim, Y.S.; Chang, I.H. Software reliability model for open-source software that considers the number of finite faults and dependent faults. Math. Biosci. Eng. 2023, 20, 11785–11804. [Google Scholar] [CrossRef]
Haque, M.A.; Ahmad, N. A logistic growth model for software reliability estimation considering uncertain factors. Int. J. Reliab. Qual. Saf. Eng. 2021, 28, 2150032. [Google Scholar] [CrossRef]
Haque, M.A.; Ahmad, N. A software reliability model using fault removal efficiency. J. Reliab. Stat. Stud. 2022, 15, 459–472. [Google Scholar] [CrossRef]
Haque, M.A.; Ahmad, N. A logistic software reliability model with Loglog fault detection rate. Iran J. Comput. Sci. 2025, 8, 1–7. [Google Scholar] [CrossRef]
Kyurkchiev, N. Selected Topics in Mathematical Modeling: Some New Trends. Dedicated to Academician Blagovest Sendov (1932–2020); LAP LAMBERT Academic Publishing: Saarbrucken, Germany, 2020; ISBN 978-613-9-45608-6. [Google Scholar]
Kyurkchiev, N. Some intrinsic properties of Tadmor–Tanner functions: Related problems and possible applications. Mathematics 2020, 8, 1963. [Google Scholar] [CrossRef]
Kyurkchiev, N.; Kyurkchiev, V.; Iliev, A.; Rahnev, A. Some nonstandard differential models with applications to the population dynamics and computer viruses propagation. Dyn. Syst. Appl. 2019, 28, 757–788. [Google Scholar]
Pavlov, N.; Iliev, A.; Rahnev, A.; Kyurkchiev, N. Some Software Reliability Models: Approximation and Modeling Aspects; LAP LAMBERT Academic Publishing: Saarbrücken, Germany, 2018; ISBN 978-613-9-82805-0. [Google Scholar]
Pavlov, N.; Iliev, A.; Rahnev, A.; Kyurkchiev, N. Nontrivial Models in Debugging Theory (Part 2); LAP LAMBERT Academic Publishing: Saarbrücken, Germany, 2018; ISBN 978-613-9-87794-2. [Google Scholar]
Pavlov, N.; Iliev, A.; Rahnev, A.; Kyurkchiev, N. Analysis of the Chen’s and Pham’s software reliability models. Cybern. Inf. Technol. 2018, 18, 37–47. [Google Scholar] [CrossRef]
Rahneva, O.; Golev, A.; Spasov, G. Investigations on Some New Models in Debugging and Growth Theory (Part 3); LAP LAMBERT Academic Publishing: Saarbrücken, Germany, 2020; ISBN 978-620-66655-8. [Google Scholar]
Kyurkchiev, N.; Iliev, A.; Golev, A.; Rahnev, A. Some Non-Standard Models in “Debugging and Test Theory” (Part 4); Plovdiv University Press: Plovdiv, Bulgaria, 2020; ISBN 978-619-2-02584-7. [Google Scholar]
Hausdorff, F. Set Theory, 2nd ed.; Chelsea Publication: New York, NY, USA, 1962. [Google Scholar]
Sendov, B.L. Hausdorff approximations. In Mathematics and Its Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 1990; Volume 50, pp. 1–367. [Google Scholar]
Wolfram Research, NonlinearModelFit (2008), Wolfram Language Function, (Updated 2025). Available online: https://reference.wolfram.com/language/ref/NonlinearModelFit.html (accessed on 27 July 2025).
Ramadan, I.S.; Harb, H.M.; Mousa, H.M.; Malhat, M.G. Reliability assessment for open-source software using deterministic and probabilistic models. Int. J. Inf. Technol. Comput. Sci. 2022, 14, 1–15. [Google Scholar] [CrossRef]
Micko, R.; Chren, S.; Rossi, B. Applicability of software reliability growth models to open source software. In Proceedings of the 2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), Gran Canaria, Spain, 31 August–2 September 2022; pp. 255–262. [Google Scholar]
Xiao, X.; Okamura, H.; Dohi, T. Survey of software reliability assessment tools. In Advances in Reliability Science, Developments in Reliability Engineering; Ram, M., Ed.; Elsevier Inc: Amsterdam, The Netherlands, 2024; pp. 575–601. [Google Scholar]
Lu, S.; Li, H.; Jiang, Z. Comparative study of open source software reliability assessment tools. In Proceedings of the 2020 IEEE International Conference on Artificial Intelligence and Information Systems (ICAIIS), Dalian, China, 20–22 March 2020; pp. 49–55. [Google Scholar]
Nagaraju, V.; Shekar, V.; Steakelum, J.; Luperon, M.; Shi, Y.; Fiondella, L. Practical software reliability engineering with the Software Failure and Reliability Assessment Tool (SFRAT). SoftwareX 2019, 10, 100357. [Google Scholar] [CrossRef]
Aubertine, J.; Nagaraju, V.; Fiondella, L. The Covariate Software Failure and Reliability Assessment Tool (C-SFRAT). In Proceedings of the International Conference on Reliability and Quality in Design, Online, 19–22 April 2021. [Google Scholar]
Aubertine, J.; Chen, K.; Nagaraju, V.; Fiondella, L. A covariate software tool to guide test activity allocation. SoftwareX 2022, 17, 100909. [Google Scholar] [CrossRef]
Chren, S.; Micko, R.; Buhnova, B.; Rossi, B. STRAIT: A tool for automated software reliability growth analysis. In Proceedings of the 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), Montreal, QC, Canada, 25–31 May 2019; pp. 105–110. [Google Scholar]
Wang, Q.; Zhang, W.; Jiang, J.; Li, L. A reliability automatic assessment framework of open source software based on JIRA. In Proceedings of the 2020 9th International Conference on Software and Computer Applications, Langkawi, Malaysia, 18–21 February 2020; pp. 188–193. [Google Scholar]
Pena, G.; Barraza, N.R.; Anand, A. Increasing failure rate software reliability models for agile projects: A comparative study. In System Performance Modeling; De Gruyter Series on the Applications of Mathematics in Engineering and Information Sciences; De Gruyter: Berlin, Germany; Boston, MA, USA,, 2020; pp. 32–58. ISBN 3110604507. [Google Scholar]
Lin, J.S.; Huang, C.Y.; Fang, C.C. Analysis and assessment of software reliability modeling with preemptive priority queueing policy. J. Syst. Softw. 2022, 187, 111249. [Google Scholar] [CrossRef]
Li, Q.; Wang, Y.; Tian, D.; Yuan, C.; Hu, C. Component-based modeling of cascading failure propagation in directed dual-weight software networks. Comput. Netw. 2024, 255, 110861. [Google Scholar] [CrossRef]
Lyu, M.R. Handbook of Software Reliability Engineering; IEEE Computer Society Press: Los Alamitos, CA, USA, 1996. [Google Scholar]

Figure 1. Graphical representation of Hausdorff saturation d of mean value function

m (t)

.

Figure 2. Fitting results for GNOME 2.2 dataset.

Figure 3. Hausdorff saturation for (a) Haque–Ahmad software reliability model. (b) Song–Kim–Chang software reliability model.

Figure 4. Residual vs. predictor plot of fitting residuals for (a) Goel–Okumoto, (b) delayed S-shaped, (c) inflection S-shaped, (d) Haque–Ahmad and (e) Song–Kim–Chang software reliability models.

Figure 5. Quantile plot of fitting residuals for (a) Haque–Ahmad software reliability model and (b) Song–Kim–Chang software reliability model.

Figure 6. Prediction for the number of faults for GNOME 2.2 dataset.

Figure 7. Fitting results for Ansible dataset.

Figure 8. Prediction number of faults for the Ansible software for (a) Haque–Ahmad software reliability model. (b) Song–Kim–Chang software reliability model.

Figure 9. Fitting results for Electron 31.0.0.

Figure 10. Prediction for the number of faults for Electron dataset.

Figure 11. Fitting results for Eclipse.

Figure 12. Prediction for the number of faults for Eclipse dataset.

Figure 13. Software reliability assessment tool.

Table 1. Summary of some basic software reliability models.

Model Name	Mean Value Function	Model Type
Goel–Okumoto [13]	$m (t) = a (1 - exp (- b t))$	Concave
Delayed S-shaped [14]	$m (t) = a (1 - (1 + b t) exp (- b t))$	S-shaped
Inflection S-shaped [15]	$m (t) = \frac{a (1 - (1 + b t) exp (- b t))}{1 + β exp (- b t)}$	S-shaped
Yamada linear imperfect debugging model [16]	$m (t) = a (1 - exp (- b t)) (1 - \frac{α}{b}) + α a t$	Concave
Yamada exponential imperfect debugging model [16]	$m (t) = \frac{a b}{α + b} (exp (α t) - exp (- b t))$	Concave
Pham–Nordmann–Zhang [17]	$m (t) = \frac{a}{1 + β exp (- b t)} ((1 - exp (- b t)) (1 - \frac{α}{b}) + α t)$	S-shaped and Concave
Pham–Zhang [18]	$m (t) = \frac{((c + a) (1 - exp (- b t)) - \frac{a b}{b - α} (exp (- α t) - exp (- b t)))}{1 + β exp (- b t)}$	S-shaped and Concave
Dependent-parameter model [19]	$m (t) = α (1 + β t) (β t + exp (- β t) - 1)$	Concave

Table 2. Detected faults in GNOME 2.2 official public releases.

week	1	2	3	4	5	6	7	8	9	10	11	13	15	16	17	18	22	24
Cumulative detected Failures	5	9	14	19	28	33	35	36	38	41	43	44	48	49	50	51	52	54

Table 3. Parameter estimation for GNOME 2.2 dataset.

Model	a	b	$α$	$β$	k
Goel–Okumoto	$57.0362$	$0.1224$	−	−	−
Delayed S-shaped	$50.7748$	$0.3325$	−	−	−
Inflection S-shaped	$53.0419$	$0.2078$	−	$0.9808$	−
Haque–Ahmad	$56.2411$	$1.2771$	$0.1531$	$50661.1$	−
Song–Kim–Chang	$49.5002$	$0.5413$	−	−	$8.9540$

Table 4. Comparison of criteria values for GNOME 2.2 dataset.

Model	Goel–Okumoto	Delayed S-Shaped	Inflection S-Shaped	Haque–Ahmad	Song–Kim–Chang
AIC	78.8995	81.7299	76.1154	71.5129	97.8165
BIC	81.5706	84.4010	79.6769	75.9648	101.378
R-Squared	0.9978	0.9974	0.9983	0.9988	0.9944
Adjusted R-Squared	0.9975	0.9971	0.9980	0.9985	0.9933

Table 5. Detected faults in Ansible official public releases.

months	1	2	3	4	5	6	7	8	9	10	11	12
Cumulative detected Failures	240	404	540	792	932	1118	1285	1504	1706	1982	2005	2454
months	13	14	15	16	17	18	19	20	21	22	23	24
Cumulative detected Failures	3152	3766	4441	5147	5629	6201	6827	7275	8375	8991	9525	9851

Table 6. Parameter estimation for Ansible software dataset.

Model	a	b	$α$	$β$	k
Goel–Okumoto	$4.3910 \times 10^{6}$	$7.8016 \times 10^{- 5}$	−	−	−
Delayed S-shaped	158,948	$0.0170$	−	−	−
Inflection S-shaped	$15,564.2$	$0.1649$	−	$27.0258$	−
Haque–Ahmad	$12,873.3$	$0.1601$	$1.0968$	$34.4353$	−
Song–Kim–Chang	$12,207.3$	$0.2740$	−	−	$526.564$

Table 7. Comparison of criteria values for Ansible software dataset.

Model	Goel–Okumoto	Delayed S-Shaped	Inflection S-Shaped	Haque–Ahmad	Song–Kim–Chang
AIC	408.901	341.398	329.312	319.08	320.008
BIC	412.435	344.932	334.024	324.97	324.72
R-Squared	0.954636	0.997276	0.998493	0.999101	0.998977
Adjusted R-Squared	0.950513	0.997028	0.998277	0.998921	0.998831

Table 8. Detected faults in Electron official public releases.

weeks	1	2	3	4	5	6	7	8	9	10	11	12	13	14	15
Cumulative Failures	1	4	9	15	17	23	29	37	42	62	74	82	89	97	113
weeks	16	17	18	19	20	21	22	23	24	25	26	27	28	29
Cumulative Failures	127	139	152	160	163	167	168	172	178	180	181	182	185	186

Table 9. Parameter estimation for Electron 31.0.0 dataset.

Model	a	b	$α$	$β$	k
Goel–Okumoto	$9417.43$	$0.0007$	−	−	−
Delayed S-shaped	$250.494$	$0.1015$	−	−	−
Inflection S-shaped	$181.97$	$0.2871$	−	$37.8345$	−
Haque–Ahmad	$189.496$	$0.4169$	$0.8222$	$68.2193$	−
Song–Kim–Chang	$184.097$	$0.3527$	−	−	$9.6001$

Table 10. Comparison of criteria values for Electron software dataset.

Model	Goel–Okumoto	Delayed S-Shaped	Inflection S-Shaped	Haque–Ahmad	Song–Kim–Chang
AIC	243.408	211.389	171.699	158.868	171.148
BIC	247.51	215.491	177.168	165.705	176.618
R-Squared	0.986373	0.995483	0.998931	0.999362	0.998951
Adjusted R-Squared	0.985364	0.995148	0.998807	0.999260	0.998830

Table 11. Detected faults in Eclipse.

months	1	2	3	4	5	6	7	8	9
Cumulative Failures	177	328	472	594	726	872	975	1070	1213
months	10	11	12	13	14	15	16	17	18
Cumulative Failures	1306	1409	1537	1609	1698	1809	1848	1876	1927
months	19	20	21	22	23	24	25	26	27
Cumulative Failures	1946	1955	1968	1978	1981	1990	1991	2000	2004

Table 12. Parameter estimation for Eclipse dataset.

Model	a	b	$α$	$β$	k
Goel–Okumoto	$2355.22$	$0.0838$	−	−	−
Delayed S-shaped	$2041.94$	$0.2340$	−	−	−
Inflection S-shaped	$2074.6$	$0.1886$	−	$1.9766$	−
Haque–Ahmad	$2057.26$	$0.4349$	$0.7453$	$13.5707$	−
Song–Kim–Chang	$1980.13$	$0.3988$	−	−	$318.35$

Table 13. Comparison of criteria values for Eclipse software dataset.

Model	Goel–Okumoto	Delayed S-Shaped	Inflection S-Shaped	Haque–Ahmad	Song–Kim–Chang
AIC	309.725	307.907	274.875	267.41	300.433
BIC	313.612	311.795	280.059	273.889	305.616
R-Squared	0.998173	0.998292	0.999535	0.999674	0.998802
Adjusted R-Squared	0.998027	0.998155	0.999477	0.999618	0.998652

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Modeling and Assessing Software Reliability in Open-Source Projects

Abstract

1. Introduction

2. Approximation Results

2.1. Hausdorff Saturation

2.2. Numerical Examples

3. Software Reliability Assessment Tool

4. Discussion and Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics