Next Article in Journal
A Bitsadze–Samarskii-Type Problem for a Second-Kind Mixed-Type Equation in a Domain with a Horizontal Half-Strip as Its Elliptic Part
Previous Article in Journal
Ellipsoid-Structured Localized Generalized Eigenvalue Proximal Support Vector Machines
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function

by
Michael Arnold Frölich
Department of Anesthesiology and Perioperative Medicine, University of Alabama at Birmingham, Birmingham, AL 35233, USA
Mathematics 2026, 14(3), 486; https://doi.org/10.3390/math14030486
Submission received: 23 December 2025 / Revised: 23 January 2026 / Accepted: 25 January 2026 / Published: 30 January 2026

Abstract

Accurate evaluation of the standard normal cumulative distribution function is fundamental in many areas of mathematics, statistics, and applied computation, yet no closed-form expression in elementary functions exists. We present a simple analytic approximation based on a logistic function with a cubic argument, designed to preserve symmetry, monotonicity, and analytic invertibility. The parameters of the approximation are obtained through numerical optimization over a wide domain, targeting both maximum absolute error and root-mean-square error. The resulting function achieves uniformly low approximation error and significantly reduces error relative to the classical logistic approximation, while remaining competitive with commonly used high-accuracy numerical methods. Unlike rational or high-degree polynomial approximations, the proposed form admits an explicit inverse, making it convenient for applications requiring analytic quantile evaluation or inverse transform sampling. Numerical error analysis and illustrative examples demonstrate that the approximation provides a practical balance between accuracy, simplicity, and analytic tractability.

1. Introduction

The standard normal cumulative distribution function (CDF), denoted by Φ ( x ) , is a cornerstone of probability theory and mathematical statistics. It gives the probability that a standard normally distributed random variable assumes a value less than or equal to x:
Φ ( x ) = 1 2 π x e t 2 / 2 d t .
The function plays a central role in statistical inference, including hypothesis testing, confidence interval construction, and maximum likelihood estimation, and its ubiquity across disciplines is largely due to the Central Limit Theorem [1]. In applied work, inference is often summarized via confidence intervals and power considerations; see, e.g., [2,3,4].
Despite its theoretical importance, Φ ( x ) does not admit a closed-form expression in terms of elementary functions, a fact that follows from classical results in differential algebra [5]. As a consequence, practical evaluation of Φ ( x ) relies on numerical approximations, polynomial expansions, or algorithmic approximants such as rational functions and error-function-based representations [6,7,8,9]. While these approaches can achieve high accuracy, they typically require numerical routines or auxiliary approximations for evaluation and inversion.
A commonly used representation expresses Φ ( x ) in terms of the error function,
erf ( x ) = 2 π 0 x e t 2 d t , Φ ( x ) = 1 2 1 + erf x 2 ,
which is elegant but still necessitates numerical approximation in most computational environments. As a result, the normal CDF remains a computational bottleneck in settings where repeated evaluation or analytic inversion is required, including symbolic computation and inverse transform sampling [10].
Numerous approximation strategies for Φ ( x ) have been proposed, differing in accuracy, complexity, and analytic properties. Rational approximations, such as those originating with Hastings [7], can achieve high numerical fidelity but are not analytically invertible. Polynomial approximations, including those of Marsaglia [8], are compact and efficient but likewise lack symbolic reversibility. Recent work has produced increasingly accurate closed-form approximations [11,12], underscoring ongoing efforts to balance numerical precision with analytic simplicity.
In contrast, logistic sigmoid-based approximations provide closed-form, strictly monotone, and analytically invertible surrogates for Φ ( x ) . The classical logistic approximation
Φ ( x ) 1 1 + e a x , a 1.702 ,
is attractive for its simplicity and symbolic tractability [13], but exhibits poor tail accuracy due to its limited curvature flexibility. This limitation motivates the introduction of nonlinear extensions that preserve invertibility while improving approximation quality.
While logistic approximations of the standard normal cumulative distribution function (CDF) are well established in the literature, the contribution of the present work is not the logistic form itself, but the systematic augmentation of the logistic argument with a cubic term. This extension yields a closed-form, invertible approximation with substantially improved accuracy while preserving analytic simplicity and computational efficiency. Related approximation approaches based on series expansions have also been proposed [14]. While such methods can achieve high precision by increasing expansion order or using additional correction terms, the present work emphasizes a single closed-form cubic–logistic expression with low parametric complexity, global smoothness, and analytic invertibility.
In this work, we introduce a closed-form logistic–cubic approximation to the standard normal CDF, defined by
Φ ( x ) 1 1 + exp ( ( a x + b x 3 ) ) ,
where the coefficients a and b are determined via hybrid numerical optimization combining Differential Evolution with Nelder–Mead refinement. The resulting approximation preserves symmetry, global monotonicity, and analytic invertibility, while achieving uniformly low absolute and root-mean-square error over a wide domain.
The proposed approximation is evaluated from three complementary perspectives:
1.
Mathematical structure and constraints: analysis of symmetry, monotonicity, and admissible parameter ranges.
2.
Numerical accuracy: comparison with established approximations using maximum absolute error and RMSE.
3.
Illustrative applications: demonstration of performance on representative empirical datasets.
All numerical comparisons are benchmarked against the reference implementation scipy.stats.norm.cdf [15].

2. Materials and Methods

This section defines the proposed cubic–logistic approximation, states its structural constraints (symmetry, monotonicity, and invertibility), and describes the numerical procedure used to fit its parameters.

2.1. Mathematical Framework and Structural Constraints

Let σ : R ( 0 , 1 ) denote the logistic sigmoid
σ ( x ) = 1 1 + e x .
We consider approximations to the standard normal CDF of the form
Φ ^ ( x ) = σ ( f ( x ) ) = 1 1 + exp ( f ( x ) ) ,
where f : R R is a smooth scalar function.
The target Φ ( x ) satisfies (i) symmetry  Φ ( x ) = 1 Φ ( x ) , (ii) strict monotonicity, and (iii) limits Φ ( x ) 0 as x and Φ ( x ) 1 as x + . The approximation (6) inherits the correct limits whenever f ( x ) ± as x ± . Moreover, Φ ^ is strictly increasing whenever f is strictly increasing, since σ ( x ) > 0 for all x.
A particularly convenient constraint is analytic invertibility. Since σ is invertible with σ 1 ( p ) = log p 1 p , the approximation admits an explicit inverse whenever f is invertible:
Φ ^ 1 ( p ) = f 1 log p 1 p , p ( 0 , 1 ) .
We therefore seek a simple choice of f that (a) preserves symmetry, (b) is strictly increasing on R , and (c) remains analytically tractable.

2.2. Cubic–Logistic Functional Form

Symmetry of the normal distribution implies that the log-odds transformation of Φ ( x ) is an odd function. Motivated by this structure, we take f to be an odd polynomial. The minimal nonlinear odd polynomial is cubic,
f ( x ) = a x + b x 3 ,
with real parameters a and b.
Optimizing the parameters with respect to the mean squared error over R yields the coefficients
a = 1.59708 , b = 0.07095 .
Accordingly, the proposed Logistic–Cubic approximation is given explicitly by
Φ ^ ( x ) = 1 1 + exp 1.59708 x + 0.07095 x 3 .
For comparison, the classical logistic approximation corresponds to Φ ^ log ( x ) = σ ( α x ) with α 1.702 . Table 1 summarizes the parameterizations of the logistic-based normal CDF approximations considered in this work.
The cubic term introduces additional curvature relative to the classical linear–logistic approximation f ( x ) = a x , improving tail behavior while maintaining closed-form simplicity. Higher odd degrees (e.g., quintic) can marginally reduce error but substantially increase symbolic complexity and raise the risk of producing unnecessary oscillation. For the cubic form, the derivative is
f ( x ) = a + 3 b x 2 .
Lemma 1 (Monotonicity).
If a > 0 and b 0 , then f ( x ) > 0 for all x R , and hence Φ ^ ( x ) is strictly increasing on R .
Proof. 
If a > 0 and b 0 , then for all x R we have f ( x ) = a + 3 b x 2 a > 0 . Thus, f is strictly increasing, and since σ ( x ) > 0 for all x, Φ ^ ( x ) = σ ( f ( x ) ) is strictly increasing.    □
This sufficient condition is used to ensure that the approximation defines a valid CDF and remains invertible.
Table 2 summarizes key symbolic properties of the proposed cubic–logistic approximation relative to common alternatives.

2.3. Optimization Criteria and Error Metrics

The parameters ( a , b ) are chosen to match Φ ( x ) over a prescribed domain using two error criteria: the root-mean-square error (RMSE) and the maximum absolute error. Given a uniform grid { x i } i = 1 N on [ 8 , 8 ] , define
RMSE ( a , b ) = 1 N i = 1 N Φ ( x i ) Φ ^ ( x i ) 2 .
Remark on evaluation intervals. The interval [ 8 , 8 ] is used for global RMSE evaluation to include extreme tail behavior of the normal distribution, whereas the interval [ 6 , 6 ] reflects the effective numerical support of Φ ( x ) in most practical applications, where values outside this range are already numerically indistinguishable from 0 or 1. The qualitative conclusions of the error analysis are not sensitive to this choice of interval.
And
ε max ( a , b ) = max 1 i N Φ ( x i ) Φ ^ ( x i ) .
Since these criteria emphasize complementary aspects of accuracy, we adopt a lexicographic objective: We first minimize ε max ( a , b ) , and among solutions with near-minimal ε max , we select the one with minimal RMSE. (In practice, “near-minimal” is defined by a small tolerance relative to the best observed ε max on the grid). This choice yields uniform control of the worst-case deviation while retaining good average fidelity.

2.4. Numerical Optimization Algorithm

The resulting optimization problem is non-convex over ( a , b ) , motivating a hybrid global–local search. We first apply Differential Evolution [16] over bounded intervals a [ 1.0 , 2.5 ] and b [ 0.01 , 0.15 ] , followed by Nelder–Mead refinement [17] initialized at the best global candidate. The bounds enforce a > 0 and b > 0 , which (by Lemma 1) guarantee monotonicity of Φ ^ on R .
Table 3 summarizes the evaluation domain, grid resolution, objective criteria, and optimization procedures used in fitting the cubic–logistic approximation.
Figure 1 illustrates the representative convergence behavior of the global and local optimization phases.
The fitted parameter values obtained from this procedure are reported in Section 3.

3. Results

This section reports numerical results for the proposed logistic–cubic normal CDF approximation. Accuracy is evaluated using the error metrics defined in Section 2, including the maximum absolute error and the root-mean-square error (RMSE). All results are obtained by direct comparison with a high-precision reference implementation of the standard normal CDF.

3.1. Evaluation Setup and Baseline Methods

All approximations are evaluated on a uniform grid of 1000 points over the interval
x [ 6 , 6 ] ,
which contains more than 99.9999998 % of the probability mass of the standard normal distribution [14]. This domain captures both the central region and the extreme tails while remaining representative of typical computational use.
The proposed logistic–cubic approximation is compared against three commonly used baseline methods: (i) the classical logistic approximation Φ ( x ) ( 1 + e 1.702 x ) 1  [13], (ii) Hastings’ rational approximation [7], and (iii) Marsaglia’s polynomial approximation [8]. These methods span a range of analytic complexity and accuracy characteristics and provide representative benchmarks.
In addition to classical approximations, we also compare the proposed model with representative closed-form normal CDF approximations, including recent explicitly invertible and high-accuracy forms reported in Refs. [12,18]. These models are selected due to their analytic tractability and their frequent use as benchmarks in the literature.
All approximations are evaluated relative to the reference implementation scipy.stats.norm.cdf from the SciPy library [15,19], which computes Φ ( x ) to machine precision. For each method, pointwise absolute errors are computed on the evaluation grid, and global summary metrics are derived from these values. The resulting numerical comparisons are presented in the following subsections.

3.2. Quantitative Accuracy Metrics

We report numerical accuracy results for the logistic–cubic normal CDF approximation using the maximum absolute error and the root-mean-square error (RMSE) defined in Section 2. These metrics respectively quantify the worst-case deviation and the average squared deviation over the evaluation domain.
Remark on tail behavior. For sufficiently large | x | , the normal cumulative distribution function rapidly saturates, with Φ ( x ) 1 for large positive x and Φ ( x ) 0 for large negative x. In many numerical settings, hard truncation outside a finite interval (e.g., | x | > 6 ) may therefore be sufficient. Nevertheless, a smooth closed-form approximation defined on R remains valuable for analytical manipulation, automatic differentiation, and inverse-CDF-based methods, where continuity and global invertibility are desirable.
Table 4 summarizes the numerical performance of the proposed approximation relative to the baseline methods described in Section 3.1. Over the interval x [ 6 , 6 ] , the logistic–cubic approximation attains a maximum absolute error of 1.7 × 10 4 and an RMSE of 4.3 × 10 5 . These values represent a substantial reduction in error relative to the classical logistic approximation and are comparable to those obtained by Hastings’ rational approximation and Marsaglia’s polynomial fit.
Table 5 compares the proposed approximation with representative closed-form normal CDF approximations using standard goodness-of-fit and error metrics.
For Refs. [12,18], we report closed-form and invertibility properties here; numerical error summaries for those specific approximations are given in the original articles and are discussed in the accompanying text.
Across both error criteria, the cubic–logistic form markedly improves upon the classical logistic approximation and remains competitive with higher-accuracy rational and polynomial methods. Additional insight into the spatial distribution of approximation error is provided by the pointwise error analysis presented in the following subsection.

3.3. Higher-Order Odd Polynomial Extensions

To assess whether additional accuracy can be achieved through increased model order, we also considered higher-order odd polynomial extensions of the logistic argument. In particular, a quintic model of the form
σ a x + b x 3 + c x 5
was optimized using the same numerical criteria applied to the cubic model.
For the quintic extension σ ( a x + b x 3 + c x 5 ) , optimization over the same evaluation grid ( x [ 6 , 6 ] , n = 1000 ) yields
a = 1.59543562 , b = 0.07359658 , c = 0.00063163 .
Table 6 summarizes performance comparisons using the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Kolmogorov–Smirnov (KS) statistic.
For the quintic extension σ ( a x + b x 3 + c x 5 ) , optimization over the same grid yields
a = 1.59543562 , b = 0.07359658 , c = 0.00063163 .
Although the quintic extension yields marginal numerical improvements, these gains are small relative to the increase in model complexity. The cubic formulation therefore represents an effective balance between accuracy, parsimony, and analytic tractability.

3.4. Visual Error Structure and Tail Behavior

To complement the global accuracy metrics reported above, we examine the pointwise absolute error of each approximation across the evaluation domain. This analysis highlights how approximation error varies with x and provides insight into tail behavior that may not be fully captured by aggregate error measures.
Figure 2 shows the absolute error | Φ ( x ) Φ ^ ( x ) | on a logarithmic scale for all methods over x [ 6 , 6 ] . The logarithmic scale facilitates comparison of error magnitudes spanning several orders and emphasizes behavior in the tails of the distribution.
The classical logistic approximation exhibits increasing error magnitude as | x | grows, reflecting limitations of a purely linear exponent in capturing tail curvature. Hastings’ rational approximation and Marsaglia’s polynomial fit achieve low error near the center of the distribution but display progressively larger deviations toward the boundaries of the evaluation domain.
The cubic–logistic approximation maintains uniformly low error across the full interval, with symmetric behavior in both tails. In particular, the maximum absolute error remains below 2 × 10 4 throughout the domain, consistent with the quantitative results reported in Table 4.

3.5. Symbolic Tractability and Invertibility

Beyond numerical accuracy, analytic properties such as symbolic tractability and explicit invertibility are important considerations in the selection of normal CDF approximations. These properties are particularly relevant in contexts involving symbolic manipulation, inverse transform sampling, and analytic differentiation.
Table 7 summarizes key symbolic characteristics of the cubic–logistic approximation in comparison with several commonly used alternatives. While all methods considered are smooth and admit closed-form expressions, only the classical logistic approximation and the proposed cubic–logistic form are analytically invertible.
The absence of analytic invertibility in rational and polynomial approximations generally necessitates numerical root-finding procedures or conditional logic for inversion, which can increase computational complexity and limit symbolic use. In contrast, the cubic–logistic approximation admits an explicit inverse through solution of a depressed cubic equation following application of the logit transformation. This property enables closed-form evaluation of quantiles and facilitates analytic manipulation.
In addition to invertibility, the simplicity of the cubic–logistic functional form supports straightforward symbolic differentiation and integration. Unlike piecewise or iterative approximations, the approximation remains globally smooth and algebraically compact, which can simplify analytic derivations and symbolic computation workflows.
These analytic considerations complement the numerical results presented in Section 3. Together, they illustrate how the cubic–logistic approximation balances accuracy with analytic simplicity, positioning it as a useful alternative in settings where both numerical fidelity and symbolic accessibility are desired.

4. Applications

The purpose of the examples in this section is to illustrate the numerical behavior and practical performance of the proposed cubic–logistic approximation in representative applied settings. These examples are not intended to redefine the primary objective of the paper, which remains the accurate approximation of the normal cumulative distribution function. To illustrate the numerical behavior of the cubic–logistic normal CDF approximation beyond synthetic test functions, we present several empirical case studies. These examples are intended to demonstrate how the approximation behaves when fitted to real-world data exhibiting skewness and heavy-tailed structure, rather than to provide domain-specific modeling conclusions. These examples are intended to illustrate numerical behavior under non-Gaussian empirical distributions rather than to assert domain-specific generative models.

4.1. Overview and Relevance of Empirical Domains

The empirical examples considered here are chosen to reflect common distributional features encountered in practice, including skewness, excess kurtosis, and tail asymmetry. Such features often lead to systematic deviations from the standard normal distribution and provide a useful testbed for evaluating approximation behavior.
All datasets are standardized to zero mean and unit variance prior to fitting. This normalization isolates shape characteristics and allows direct comparison between the standard normal CDF, the classical logistic approximation, and the proposed cubic–logistic approximation. Model comparisons are based on goodness-of-fit measures including the Kolmogorov–Smirnov statistic and information criteria (AIC and BIC).

4.2. Environmental Data—PM2.5 Air Quality Index

As a first empirical example, we consider daily PM2.5 measurements obtained from the U.S. Environmental Protection Agency monitoring station at Huntsville Old Airport [20]. The association between long-term PM2.5 exposure and adverse cardiopulmonary outcomes is well established [21]. PM2.5 data are well known to exhibit right skewness and heavy tails, making them a representative example of non-Gaussian environmental measurements [20,22].
After standardization, three models are fitted to the empirical distribution: the standard normal CDF, the classical logistic approximation, and the cubic–logistic approximation. Figure 3 shows the empirical CDF alongside the fitted models.
Goodness-of-fit statistics are summarized in Table 8. The cubic–logistic approximation yields the smallest Kolmogorov–Smirnov statistic among the three models, indicating improved agreement in the tails, while incurring only a modest increase in model complexity as reflected by AIC and BIC.

4.3. Financial Data—S&P 500 Daily Returns

As a second empirical example, we consider daily log returns of the S&P 500 index, which are known to exhibit deviations from Gaussian behavior, including excess kurtosis and heavy tails [23,24]. We analyze ten years of daily returns (2013–2023) obtained from the Stooq database. The data are standardized to zero mean and unit variance prior to fitting.
Following standardization, we compare three models fitted to the empirical distribution: the standard normal CDF, the classical logistic approximation, and the proposed cubic–logistic approximation. Figure 4 shows the empirical CDF together with the fitted curves.
Goodness-of-fit statistics are reported in Table 9. Over this dataset, the cubic–logistic approximation achieves the smallest KS statistic among the three models and yields improved information criteria relative to the normal and logistic fits, indicating closer agreement with the empirical distribution under these summary measures.

4.4. Biomedical Data—Glucose and Triglycerides

As a third empirical example, we consider two continuous biomarker distributions from the National Health and Nutrition Examination Survey (NHANES): fasting glucose (LBXGLU) and triglycerides (LBXTR). These variables commonly exhibit right-skewness and heavy-tail structure [22,25,26].
We fit the same three models as above—standard normal, classical logistic, and cubic–logistic—to the empirical distributions. Figure 5 shows the empirical CDFs together with fitted curves for both biomarkers.
Table 10 reports AIC, BIC, and KS statistics for the three models. For these data, the cubic–logistic approximation attains the lowest AIC and BIC and substantially reduces the KS statistic relative to the normal and logistic fits, consistent with improved agreement with the empirical distribution across the full support.

4.5. Small-Sample Behavior of the Cubic–Logistic Approximation

To assess parameter stability under limited sample sizes, we perform a Monte Carlo resampling study based on the standardized PM2.5 dataset used above. We generate 500 subsets of size n = 20 drawn without replacement from the full dataset. For each subset, the cubic–logistic parameters ( a , b ) are re-estimated using the Nelder–Mead algorithm, initialized at the full-sample estimates.
Figure 6 shows the empirical distributions of the fitted parameters across the 500 trials. The distribution of a ^ is approximately symmetric and concentrated around the full-sample value, while b ^ exhibits slightly greater dispersion with mild right-skewness. Both parameters remain well-behaved, with no evidence of multimodality or extreme outliers.
Overall, these results indicate that the cubic–logistic approximation exhibits stable parameter behavior even under small-sample conditions, with variability consistent with expected sampling uncertainty. This stability supports the practical use of the approximation in settings where data availability is limited.

5. Discussion

Having established the numerical performance of the logistic–cubic normal CDF approximation, we now discuss the broader implications of these results, focusing on practical applications and computational benefits.
The logistic–cubic normal CDF approximation developed in this work occupies a unique and practical middle ground between classical logistic sigmoid approximations and highly accurate numerical approximations such as rational or polynomial approximations. Unlike simpler logistic forms, which are analytically convenient but numerically limited, or highly accurate rational and polynomial methods, which lack invertibility, the logistic–cubic normal CDF approximation offers a balanced combination of symbolic simplicity, computational efficiency, and numerical precision. Its closed-form analytic structure is particularly valuable in modern computational and probabilistic frameworks, such as those employed in probabilistic machine learning, neural network modeling, and symbolic computation, where interpretability and analytic manipulation of functions are crucial [27,28].
The proposed logistic–cubic model should be viewed as an optimal low-order extension of the classical logistic approximation rather than a replacement. By incorporating a cubic term, the model achieves a marked improvement in accuracy while preserving closed-form invertibility, a property not shared by many higher-accuracy approximations.
Crucially, the practical advantages of the logistic–cubic normal CDF approximation extend beyond numerical accuracy alone. Its carefully chosen functional form—a logistic function with an embedded cubic polynomial—results in a smooth, strictly monotonic, and symmetric approximation that faithfully replicates key statistical properties of the standard normal cumulative distribution function. These structural properties are essential in applications where higher-order statistical moments and symmetry properties have analytical significance, such as higher-order statistical modeling, inference techniques sensitive to distribution shape, and analytical derivations of cumulative probability models.
Moreover, the logistic–cubic normal CDF approximation maintains full differentiability and invertibility across its entire domain. This differentiability is particularly beneficial in computational settings requiring automatic differentiation, such as gradient-based optimization algorithms, Bayesian inference frameworks, and neural network training procedures. Analytic invertibility significantly simplifies inverse computations, such as inverse transform sampling, cumulative quantile modeling, and probabilistic inference. It also ensures straightforward integration into symbolic algebra and computational statistics software, thereby enhancing its practical applicability and flexibility. The unique combination of symbolic simplicity, analytic invertibility, and numerical accuracy makes the logistic–cubic normal CDF approximation especially suited for diverse real-world scenarios. These advantages are most relevant in settings where analytic invertibility, smoothness, and symbolic tractability are required alongside reasonable numerical accuracy.
Thus, while more complex rational or polynomial approximations might occasionally deliver marginal improvements in numerical precision within narrowly defined contexts, the logistic–cubic normal CDF approximation provides an elegant and widely generalizable solution that successfully balances numerical performance, analytic convenience, and symbolic flexibility. Its potential use cases span a broad range of disciplines and computational scenarios, from symbolic artificial intelligence and statistical simulation to embedded analytics and analytical modeling frameworks, demonstrating that a closed-form and analytically invertible structure is not merely advantageous but often essential for practical and efficient real-world applications.

6. Conclusions

This paper introduced a closed-form cubic–logistic approximation to the standard normal cumulative distribution function. The proposed approximation augments the classical logistic sigmoid with a cubic polynomial term, yielding a simple two-parameter form that preserves symmetry, strict monotonicity, and analytic invertibility.
From a mathematical perspective, the approximation was constructed to satisfy key structural properties of a valid CDF while remaining analytically tractable. Numerical optimization was used to determine optimal parameters under combined uniform and average error criteria. The resulting approximation achieves substantially improved accuracy relative to the classical logistic model and performance comparable to established rational and polynomial approximations, while retaining a closed-form inverse.
Numerical experiments and illustrative empirical case studies demonstrated that the cubic–logistic approximation exhibits stable behavior across a range of distributional shapes, including skewed and heavy-tailed data. These results indicate that the approximation provides a useful balance between numerical fidelity and analytic simplicity, particularly in settings where invertibility or symbolic manipulation is required.
The proposed method is not intended to replace highly specialized numerical approximations optimized solely for minimizing approximation error over restricted domains. Rather, its contribution lies in showing that a low-degree, closed-form approximation can achieve competitive accuracy while offering analytic advantages unavailable to more complex alternatives.
Several directions for future research remain. These include extensions to multivariate settings, formal analysis of approximation error under transformations, and adaptation of the cubic–logistic framework to asymmetric or heavy-tailed target distributions. Such extensions may further expand the applicability of analytically invertible CDF approximations in computational and theoretical contexts.

Funding

This research received no external funding.

Data Availability Statement

All datasets used in this study are publicly available and cited in the manuscript. These include the EPA Air Quality System (PM2.5), NHANES biomarkers dataset, and S&P 500 return data from Stooq.

Acknowledgments

The author gratefully acknowledges the support of the University of Alabama at Birmingham and the United States Navy Reserve. The views expressed are those of the author and do not reflect the official policy or position of the Department of the Navy, Department of Defense, or the United States Government.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: Hoboken, NJ, USA, 1968; Volume 1. [Google Scholar]
  2. Bland, M. An Introduction to Medical Statistics; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
  3. Muller, K.E.; Lavange, L.M.; Ramey, S.L.; Ramey, C.T. Power calculations for general linear multivariate models including repeated measures applications. J. Am. Stat. Assoc. 1992, 87, 1209–1226. [Google Scholar] [CrossRef] [PubMed]
  4. Button, K.S.; Ioannidis, J.P.A.; Mokrysz, C.; Nosek, B.A.; Flint, J.; Robinson, E.S.J.; Munafò, M.R. Power failure: Why small sample size undermines the reliability of neuroscience. Nat. Rev. Neurosci. 2013, 14, 365–376. [Google Scholar] [CrossRef] [PubMed]
  5. Borwein, J.; Corless, R.M. Emerging tools for experimental mathematics. Am. Math. Mon. 1999, 106, 889–909. [Google Scholar] [CrossRef]
  6. Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables; National Bureau of Standards: Gaithersburg, MD, USA, 1964.
  7. Hastings, C. Approximations for Digital Computers; Princeton University Press: Princeton, NJ, USA, 1955. [Google Scholar]
  8. Marsaglia, G. Evaluating the normal distribution. J. Stat. Softw. 2004, 11, 1–11. [Google Scholar] [CrossRef]
  9. Winitzki, S. A handy approximation for the error function and its inverse. arXiv 2008, arXiv:0805.1598. [Google Scholar]
  10. Eidous, O.; Abu-Hawwas, J. An accurate approximation for the standard normal distribution function. J. Inf. Optim. Sci. 2021, 42, 17–27. [Google Scholar] [CrossRef]
  11. Eidous, O.M.; Al-Rawwash, M.Y. Approximations for standard normal distribution function and its invertible. J. Algorithms Comput. Technol. 2025, 19, 17483026251322100. [Google Scholar] [CrossRef]
  12. Lipoth, J.; Tereda, Y.; Papalexiou, S.M.; Spiteri, R.J. A new very simply explicitly invertible approximation for the standard normal cumulative distribution function. AIMS Math. 2022, 7, 11635–11646. [Google Scholar] [CrossRef]
  13. Bowling, S.R.; Khasawneh, M.T.; Kaewkuekool, S.; Cho, B.R. A logistic approximation to the cumulative normal distribution. J. Ind. Syst. Eng. 2009, 2, 256–264. [Google Scholar] [CrossRef]
  14. Blinnikov, S.; Moessner, R. Expansions for nearly Gaussian distributions. Astron. Astrophys. Suppl. Ser. 1998, 130, 193–205. [Google Scholar] [CrossRef]
  15. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef] [PubMed]
  16. Storn, R.; Price, K. Differential Evolution—A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  17. Nelder, J.A.; Mead, R. A Simplex Method for Function Minimization. Comput. J. 1965, 7, 308–313. [Google Scholar] [CrossRef]
  18. Vázquez-Leal, H.; Castañeda-Sheissa, R.; Filobello-Niño, U.; Sarmiento-Reyes, A.; Sánchez-Orea, J. High accurate simple approximation of normal distribution integral. Math. Probl. Eng. 2012, 2012, 124029. [Google Scholar] [CrossRef]
  19. van der Walt, S.; Colbert, S.C.; Varoquaux, G. The NumPy array: A structure for efficient numerical computation. Comput. Sci. Eng. 2011, 13, 22–30. [Google Scholar] [CrossRef]
  20. U.S. Environmental Protection Agency. Air Quality System (AQS) Data Mart; U.S. EPA: Washington, DC, USA, 2026. Available online: https://www.epa.gov/aqs (accessed on 10 January 2026).
  21. Pope, C.A.; Dockery, D.W. Health effects of fine particulate air pollution: Lines that connect. J. Air Waste Manag. Assoc. 2006, 56, 709–742. [Google Scholar] [CrossRef] [PubMed]
  22. Berrocal, V.J.; Guan, Y.; Muyskens, A.; Wang, H.; Reich, B.J.; Mulholland, J.A.; Chang, H.H. A comparison of statistical and machine learning methods for creating national daily maps of ambient PM2.5 concentration. Atmos. Environ. 2020, 222, 117130. [Google Scholar] [CrossRef] [PubMed]
  23. Fama, E.F. The behavior of stock market prices. J. Bus. 1965, 38, 34–105. [Google Scholar] [CrossRef]
  24. Cont, R. Empirical properties of asset returns: Stylized facts and statistical issues. Quant. Financ. 2001, 1, 223–236. [Google Scholar] [CrossRef]
  25. Davies, M.J.; Aroda, V.R.; Collins, B.S.; Gabbay, R.A.; Green, J.; Maruthur, N.M.; Rosas, S.E.; Del Prato, S.; Mathieu, C.; Mingrone, G.; et al. Management of Hyperglycemia in Type 2 Diabetes, 2022. A Consensus Report by the American Diabetes Association (ADA) and the European Association for the Study of Diabetes (EASD). Diabetes Care 2022, 45, 2753–2786. [Google Scholar] [CrossRef] [PubMed]
  26. Feingold, K.R. Dyslipidemia in Patients with Diabetes. In Endotext [Internet]; Feingold, K.R., Adler, R.A., Ahmed, S.F., Anawalt, B., Blackman, M.R., Chrousos, G., Corpas, E., de Herder, W.W., Dhatariya, K., Dungan, K., et al., Eds.; MDText.com, Inc.: South Dartmouth, MA, USA, 2023. Available online: https://www.ncbi.nlm.nih.gov/books/NBK305900/ (accessed on 21 January 2026).
  27. Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012. [Google Scholar]
  28. Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
Figure 1. Convergence behavior of the optimization procedure.
Figure 1. Convergence behavior of the optimization procedure.
Mathematics 14 00486 g001
Figure 2. Log-scale pointwise absolute error for the logistic–cubic normal CDF approximation and selected baseline methods.
Figure 2. Log-scale pointwise absolute error for the logistic–cubic normal CDF approximation and selected baseline methods.
Mathematics 14 00486 g002
Figure 3. Empirical and fitted CDFs for normalized PM2.5 data.
Figure 3. Empirical and fitted CDFs for normalized PM2.5 data.
Mathematics 14 00486 g003
Figure 4. Empirical and fitted CDFs for normalized S&P 500 daily log returns.
Figure 4. Empirical and fitted CDFs for normalized S&P 500 daily log returns.
Mathematics 14 00486 g004
Figure 5. Empirical and fitted CDFs for normalized glucose and triglyceride levels.
Figure 5. Empirical and fitted CDFs for normalized glucose and triglyceride levels.
Mathematics 14 00486 g005
Figure 6. Empirical distributions of the fitted cubic–logistic parameters (a, b) from 500 resampled datasets ( n = 20 ).
Figure 6. Empirical distributions of the fitted cubic–logistic parameters (a, b) from 500 resampled datasets ( n = 20 ).
Mathematics 14 00486 g006
Table 1. Summary of logistic-based normal CDF approximations.
Table 1. Summary of logistic-based normal CDF approximations.
ModelFunctional FormParameters
Logistic σ ( α x ) α = 1.702
Logistic–Cubic σ ( a x + b x 3 ) a = 1.59708 , b = 0.07095
Table 2. Functional and symbolic properties of selected approximations to the standard normal CDF.
Table 2. Functional and symbolic properties of selected approximations to the standard normal CDF.
PropertyClassic LogisticRational (Hastings)Cubic–Logistic Approximation (This Work)
SmoothYesYesYes
Closed-formYesYesYes
InvertibleYesNoYes
SymmetricYesYesYes
Tail FidelityNoYesYes
Table 3. Optimization configuration and parameter constraints used to fit the cubic–logistic approximation approximation.
Table 3. Optimization configuration and parameter constraints used to fit the cubic–logistic approximation approximation.
ParameterValue
Evaluation Domain x [ 8 ,   8 ]
Grid Resolution10,000 points
Objective MetricsRMSE, Max Absolute Error
Global OptimizerDifferential Evolution
Local OptimizerNelder–Mead Simplex
Parameter Bounds a [ 1.0 ,   2.5 ] , b [ 0.01 ,   0.15 ]
Monotonicity Constraint f ( x ) = a + 3 b x 2 > 0
Table 4. Error comparison between normal CDF approximations over x [ 6 , 6 ] .
Table 4. Error comparison between normal CDF approximations over x [ 6 , 6 ] .
ModelMax Abs ErrorRMSE
Classic Logistic ( a = 1.702 ) 8.3 × 10 3 3.4 × 10 3
Hastings Rational 2.6 × 10 4 5.8 × 10 5
Marsaglia Polynomial 1.9 × 10 4 4.6 × 10 5
Logistic–Cubic Approximation (This Work) 1.7 × 10 4 4.3 × 10 5
Logistic–Quintic Extension 1.85 × 10 5 1.04 × 10 5
Table 5. Comparison with representative closed-form normal CDF approximations (evaluated on 1000 points over x [ 6 , 6 ] ).
Table 5. Comparison with representative closed-form normal CDF approximations (evaluated on 1000 points over x [ 6 , 6 ] ).
ApproximationClosed-FormInvertibleAICKS
HastingsYesNo−19,510.14 2.6 × 10 4
MarsagliaYesNo−19,973.74 1.9 × 10 4
Vázquez-Leal et al. [18]YesNoN/AN/A
Lipoth et al. [12]YesYesN/AN/A
Logistic–Cubic (proposed)YesYes−20,104.62 1.7 × 10 4
Table 6. Performance comparison of odd polynomial logistic models (evaluated on 1000 points over x [ 6 , 6 ] ).
Table 6. Performance comparison of odd polynomial logistic models (evaluated on 1000 points over x [ 6 , 6 ] ).
ModelAICBICKS
Logistic−11,365.96−11,361.05 8.3 × 10 3
Logistic–Cubic−20,104.62−20,094.81 1.7 × 10 4
Logistic–Quintic−22,932.36−22,917.64 1.85 × 10 5
Table 7. Symbolic properties of selected normal CDF approximations.
Table 7. Symbolic properties of selected normal CDF approximations.
ModelClosed-FormInvertibleSmooth
Classic LogisticYesYesYes
Hastings RationalYesNoYes
Marsaglia PolynomialYesNoYes
Cubic–Logistic Approximation (This Work)YesYesYes
Table 8. Goodness-of-fit metrics for normalized PM2.5 data.
Table 8. Goodness-of-fit metrics for normalized PM2.5 data.
ModelAICBICKS Statistic
Normal1052.731063.820.0761
Logistic1048.141059.200.0815
Cubic–Logistic1050.661061.720.0634
Table 9. Goodness-of-fit metrics for normalized S&P 500 daily return data.
Table 9. Goodness-of-fit metrics for normalized S&P 500 daily return data.
ModelAICBICKS Statistic
Normal1598.421605.780.0583
Logistic1584.311591.670.0527
Cubic–Logistic1582.061593.730.0432
Table 10. Goodness-of-fit metrics for glucose and triglyceride distributions.
Table 10. Goodness-of-fit metrics for glucose and triglyceride distributions.
ModelAICBICKS Statistic
Normal2123.402135.100.0872
Logistic2110.202121.900.0685
Cubic–Logistic2098.702110.300.0441
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Frölich, M.A. A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function. Mathematics 2026, 14, 486. https://doi.org/10.3390/math14030486

AMA Style

Frölich MA. A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function. Mathematics. 2026; 14(3):486. https://doi.org/10.3390/math14030486

Chicago/Turabian Style

Frölich, Michael Arnold. 2026. "A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function" Mathematics 14, no. 3: 486. https://doi.org/10.3390/math14030486

APA Style

Frölich, M. A. (2026). A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function. Mathematics, 14(3), 486. https://doi.org/10.3390/math14030486

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop