A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function

Frölich, Michael Arnold

doi:10.3390/math14030486

Open AccessFeature PaperArticle

A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function

by

Michael Arnold Frölich

Department of Anesthesiology and Perioperative Medicine, University of Alabama at Birmingham, Birmingham, AL 35233, USA

Mathematics 2026, 14(3), 486; https://doi.org/10.3390/math14030486

Submission received: 23 December 2025 / Revised: 23 January 2026 / Accepted: 25 January 2026 / Published: 30 January 2026

Download

Browse Figures

Versions Notes

Abstract

Accurate evaluation of the standard normal cumulative distribution function is fundamental in many areas of mathematics, statistics, and applied computation, yet no closed-form expression in elementary functions exists. We present a simple analytic approximation based on a logistic function with a cubic argument, designed to preserve symmetry, monotonicity, and analytic invertibility. The parameters of the approximation are obtained through numerical optimization over a wide domain, targeting both maximum absolute error and root-mean-square error. The resulting function achieves uniformly low approximation error and significantly reduces error relative to the classical logistic approximation, while remaining competitive with commonly used high-accuracy numerical methods. Unlike rational or high-degree polynomial approximations, the proposed form admits an explicit inverse, making it convenient for applications requiring analytic quantile evaluation or inverse transform sampling. Numerical error analysis and illustrative examples demonstrate that the approximation provides a practical balance between accuracy, simplicity, and analytic tractability.

Keywords:

normal distribution; CDF approximation; logistic approximation; numerical optimization; symbolic computation

MSC:

62E17; 60E05; 65D20

1. Introduction

The standard normal cumulative distribution function (CDF), denoted by

Φ (x)

, is a cornerstone of probability theory and mathematical statistics. It gives the probability that a standard normally distributed random variable assumes a value less than or equal to x:

Φ (x) = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{x} e^{- t^{2} / 2} d t .

(1)

The function plays a central role in statistical inference, including hypothesis testing, confidence interval construction, and maximum likelihood estimation, and its ubiquity across disciplines is largely due to the Central Limit Theorem [1]. In applied work, inference is often summarized via confidence intervals and power considerations; see, e.g., [2,3,4].

Despite its theoretical importance,

Φ (x)

does not admit a closed-form expression in terms of elementary functions, a fact that follows from classical results in differential algebra [5]. As a consequence, practical evaluation of

Φ (x)

relies on numerical approximations, polynomial expansions, or algorithmic approximants such as rational functions and error-function-based representations [6,7,8,9]. While these approaches can achieve high accuracy, they typically require numerical routines or auxiliary approximations for evaluation and inversion.

A commonly used representation expresses

Φ (x)

in terms of the error function,

erf (x) = \frac{2}{\sqrt{π}} \int_{0}^{x} e^{- t^{2}} d t, Φ (x) = \frac{1}{2} [1 + erf (\frac{x}{\sqrt{2}})],

(2)

which is elegant but still necessitates numerical approximation in most computational environments. As a result, the normal CDF remains a computational bottleneck in settings where repeated evaluation or analytic inversion is required, including symbolic computation and inverse transform sampling [10].

Numerous approximation strategies for

Φ (x)

have been proposed, differing in accuracy, complexity, and analytic properties. Rational approximations, such as those originating with Hastings [7], can achieve high numerical fidelity but are not analytically invertible. Polynomial approximations, including those of Marsaglia [8], are compact and efficient but likewise lack symbolic reversibility. Recent work has produced increasingly accurate closed-form approximations [11,12], underscoring ongoing efforts to balance numerical precision with analytic simplicity.

In contrast, logistic sigmoid-based approximations provide closed-form, strictly monotone, and analytically invertible surrogates for

Φ (x)

. The classical logistic approximation

Φ (x) \approx \frac{1}{1 + e^{- a x}}, a \approx 1.702,

(3)

is attractive for its simplicity and symbolic tractability [13], but exhibits poor tail accuracy due to its limited curvature flexibility. This limitation motivates the introduction of nonlinear extensions that preserve invertibility while improving approximation quality.

While logistic approximations of the standard normal cumulative distribution function (CDF) are well established in the literature, the contribution of the present work is not the logistic form itself, but the systematic augmentation of the logistic argument with a cubic term. This extension yields a closed-form, invertible approximation with substantially improved accuracy while preserving analytic simplicity and computational efficiency. Related approximation approaches based on series expansions have also been proposed [14]. While such methods can achieve high precision by increasing expansion order or using additional correction terms, the present work emphasizes a single closed-form cubic–logistic expression with low parametric complexity, global smoothness, and analytic invertibility.

In this work, we introduce a closed-form logistic–cubic approximation to the standard normal CDF, defined by

Φ (x) \approx \frac{1}{1 + exp (- (a x + b x^{3}))},

(4)

where the coefficients a and b are determined via hybrid numerical optimization combining Differential Evolution with Nelder–Mead refinement. The resulting approximation preserves symmetry, global monotonicity, and analytic invertibility, while achieving uniformly low absolute and root-mean-square error over a wide domain.

The proposed approximation is evaluated from three complementary perspectives:

1.: Mathematical structure and constraints: analysis of symmetry, monotonicity, and admissible parameter ranges.
2.: Numerical accuracy: comparison with established approximations using maximum absolute error and RMSE.
3.: Illustrative applications: demonstration of performance on representative empirical datasets.

All numerical comparisons are benchmarked against the reference implementation scipy.stats.norm.cdf [15].

2. Materials and Methods

This section defines the proposed cubic–logistic approximation, states its structural constraints (symmetry, monotonicity, and invertibility), and describes the numerical procedure used to fit its parameters.

2.1. Mathematical Framework and Structural Constraints

Let

σ : R \to (0, 1)

denote the logistic sigmoid

σ (x) = \frac{1}{1 + e^{- x}} .

(5)

We consider approximations to the standard normal CDF of the form

\hat{Φ} (x) = σ (f (x)) = \frac{1}{1 + exp (- f (x))},

(6)

where

f : R \to R

is a smooth scalar function.

The target

Φ (x)

satisfies (i) symmetry

Φ (- x) = 1 - Φ (x)

, (ii) strict monotonicity, and (iii) limits

Φ (x) \to 0

as

x \to - \infty

and

Φ (x) \to 1

as

x \to + \infty

. The approximation (6) inherits the correct limits whenever

f (x) \to \pm \infty

as

x \to \pm \infty

. Moreover,

\hat{Φ}

is strictly increasing whenever f is strictly increasing, since

σ^{'} (x) > 0

for all x.

A particularly convenient constraint is analytic invertibility. Since

σ

is invertible with

σ^{- 1} (p) = log (\frac{p}{1 - p})

, the approximation admits an explicit inverse whenever f is invertible:

{\hat{Φ}}^{- 1} (p) = f^{- 1} (log \frac{p}{1 - p}), p \in (0, 1) .

(7)

We therefore seek a simple choice of f that (a) preserves symmetry, (b) is strictly increasing on

R

, and (c) remains analytically tractable.

2.2. Cubic–Logistic Functional Form

Symmetry of the normal distribution implies that the log-odds transformation of

Φ (x)

is an odd function. Motivated by this structure, we take f to be an odd polynomial. The minimal nonlinear odd polynomial is cubic,

f (x) = a x + b x^{3},

(8)

with real parameters a and b.

Optimizing the parameters with respect to the mean squared error over

R

yields the coefficients

a = 1.59708, b = 0.07095 .

(9)

Accordingly, the proposed Logistic–Cubic approximation is given explicitly by

\hat{Φ} (x) = \frac{1}{1 + exp (- (1.59708 x + 0.07095 x^{3}))} .

(10)

For comparison, the classical logistic approximation corresponds to

{\hat{Φ}}_{\log} (x) = σ (α x)

with

α \approx 1.702

. Table 1 summarizes the parameterizations of the logistic-based normal CDF approximations considered in this work.

The cubic term introduces additional curvature relative to the classical linear–logistic approximation

f (x) = a x

, improving tail behavior while maintaining closed-form simplicity. Higher odd degrees (e.g., quintic) can marginally reduce error but substantially increase symbolic complexity and raise the risk of producing unnecessary oscillation. For the cubic form, the derivative is

f^{'} (x) = a + 3 b x^{2} .

(11)

Lemma 1 (Monotonicity).

If

a > 0

and

b \geq 0

, then

f^{'} (x) > 0

for all

x \in R

, and hence

\hat{Φ} (x)

is strictly increasing on

R

.

Proof.

If

a > 0

and

b \geq 0

, then for all

x \in R

we have

f^{'} (x) = a + 3 b x^{2} \geq a > 0

. Thus, f is strictly increasing, and since

σ^{'} (x) > 0

for all x,

\hat{Φ} (x) = σ (f (x))

is strictly increasing. □

This sufficient condition is used to ensure that the approximation defines a valid CDF and remains invertible.

Table 2 summarizes key symbolic properties of the proposed cubic–logistic approximation relative to common alternatives.

2.3. Optimization Criteria and Error Metrics

The parameters

(a, b)

are chosen to match

Φ (x)

over a prescribed domain using two error criteria: the root-mean-square error (RMSE) and the maximum absolute error. Given a uniform grid

{x_{i}}_{i = 1}^{N}

on

[- 8, 8]

, define

RMSE (a, b) = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {[Φ (x_{i}) - \hat{Φ} (x_{i})]}^{2}} .

(12)

Remark on evaluation intervals. The interval

[- 8, 8]

is used for global RMSE evaluation to include extreme tail behavior of the normal distribution, whereas the interval

[- 6, 6]

reflects the effective numerical support of

Φ (x)

in most practical applications, where values outside this range are already numerically indistinguishable from 0 or 1. The qualitative conclusions of the error analysis are not sensitive to this choice of interval.

And

ε_{max} (a, b) = max_{1 \leq i \leq N} |Φ (x_{i}) - \hat{Φ} (x_{i})| .

(13)

Since these criteria emphasize complementary aspects of accuracy, we adopt a lexicographic objective: We first minimize

ε_{max} (a, b)

, and among solutions with near-minimal

ε_{max}

, we select the one with minimal RMSE. (In practice, “near-minimal” is defined by a small tolerance relative to the best observed

ε_{max}

on the grid). This choice yields uniform control of the worst-case deviation while retaining good average fidelity.

2.4. Numerical Optimization Algorithm

The resulting optimization problem is non-convex over

(a, b)

, motivating a hybrid global–local search. We first apply Differential Evolution [16] over bounded intervals

a \in [1.0, 2.5]

and

b \in [0.01, 0.15]

, followed by Nelder–Mead refinement [17] initialized at the best global candidate. The bounds enforce

a > 0

and

b > 0

, which (by Lemma 1) guarantee monotonicity of

\hat{Φ}

on

R

.

Table 3 summarizes the evaluation domain, grid resolution, objective criteria, and optimization procedures used in fitting the cubic–logistic approximation.

Figure 1 illustrates the representative convergence behavior of the global and local optimization phases.

The fitted parameter values obtained from this procedure are reported in Section 3.

3. Results

This section reports numerical results for the proposed logistic–cubic normal CDF approximation. Accuracy is evaluated using the error metrics defined in Section 2, including the maximum absolute error and the root-mean-square error (RMSE). All results are obtained by direct comparison with a high-precision reference implementation of the standard normal CDF.

3.1. Evaluation Setup and Baseline Methods

All approximations are evaluated on a uniform grid of 1000 points over the interval

x \in [- 6, 6],

which contains more than

99.9999998 %

of the probability mass of the standard normal distribution [14]. This domain captures both the central region and the extreme tails while remaining representative of typical computational use.

The proposed logistic–cubic approximation is compared against three commonly used baseline methods: (i) the classical logistic approximation

Φ (x) \approx {(1 + e^{- 1.702 x})}^{- 1}

[13], (ii) Hastings’ rational approximation [7], and (iii) Marsaglia’s polynomial approximation [8]. These methods span a range of analytic complexity and accuracy characteristics and provide representative benchmarks.

In addition to classical approximations, we also compare the proposed model with representative closed-form normal CDF approximations, including recent explicitly invertible and high-accuracy forms reported in Refs. [12,18]. These models are selected due to their analytic tractability and their frequent use as benchmarks in the literature.

All approximations are evaluated relative to the reference implementation scipy.stats.norm.cdf from the SciPy library [15,19], which computes

Φ (x)

to machine precision. For each method, pointwise absolute errors are computed on the evaluation grid, and global summary metrics are derived from these values. The resulting numerical comparisons are presented in the following subsections.

3.2. Quantitative Accuracy Metrics

We report numerical accuracy results for the logistic–cubic normal CDF approximation using the maximum absolute error and the root-mean-square error (RMSE) defined in Section 2. These metrics respectively quantify the worst-case deviation and the average squared deviation over the evaluation domain.

Remark on tail behavior. For sufficiently large

| x |

, the normal cumulative distribution function rapidly saturates, with

Φ (x) \approx 1

for large positive x and

Φ (x) \approx 0

for large negative x. In many numerical settings, hard truncation outside a finite interval (e.g.,

| x | > 6

) may therefore be sufficient. Nevertheless, a smooth closed-form approximation defined on

R

remains valuable for analytical manipulation, automatic differentiation, and inverse-CDF-based methods, where continuity and global invertibility are desirable.

Table 4 summarizes the numerical performance of the proposed approximation relative to the baseline methods described in Section 3.1. Over the interval

x \in [- 6, 6]

, the logistic–cubic approximation attains a maximum absolute error of

1.7 \times 10^{- 4}

and an RMSE of

4.3 \times 10^{- 5}

. These values represent a substantial reduction in error relative to the classical logistic approximation and are comparable to those obtained by Hastings’ rational approximation and Marsaglia’s polynomial fit.

Table 5 compares the proposed approximation with representative closed-form normal CDF approximations using standard goodness-of-fit and error metrics.

For Refs. [12,18], we report closed-form and invertibility properties here; numerical error summaries for those specific approximations are given in the original articles and are discussed in the accompanying text.

Across both error criteria, the cubic–logistic form markedly improves upon the classical logistic approximation and remains competitive with higher-accuracy rational and polynomial methods. Additional insight into the spatial distribution of approximation error is provided by the pointwise error analysis presented in the following subsection.

3.3. Higher-Order Odd Polynomial Extensions

To assess whether additional accuracy can be achieved through increased model order, we also considered higher-order odd polynomial extensions of the logistic argument. In particular, a quintic model of the form

σ (a x + b x^{3} + c x^{5})

(14)

was optimized using the same numerical criteria applied to the cubic model.

For the quintic extension

σ (a x + b x^{3} + c x^{5})

, optimization over the same evaluation grid (

x \in [- 6, 6]

,

n = 1000

) yields

a = 1.59543562, b = 0.07359658, c = - 0.00063163 .

Table 6 summarizes performance comparisons using the Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and Kolmogorov–Smirnov (KS) statistic.

For the quintic extension

σ (a x + b x^{3} + c x^{5})

, optimization over the same grid yields

a = 1.59543562, b = 0.07359658, c = - 0.00063163 .

Although the quintic extension yields marginal numerical improvements, these gains are small relative to the increase in model complexity. The cubic formulation therefore represents an effective balance between accuracy, parsimony, and analytic tractability.

3.4. Visual Error Structure and Tail Behavior

To complement the global accuracy metrics reported above, we examine the pointwise absolute error of each approximation across the evaluation domain. This analysis highlights how approximation error varies with x and provides insight into tail behavior that may not be fully captured by aggregate error measures.

Figure 2 shows the absolute error

| Φ (x) - \hat{Φ} (x) |

on a logarithmic scale for all methods over

x \in [- 6, 6]

. The logarithmic scale facilitates comparison of error magnitudes spanning several orders and emphasizes behavior in the tails of the distribution.

The classical logistic approximation exhibits increasing error magnitude as

| x |

grows, reflecting limitations of a purely linear exponent in capturing tail curvature. Hastings’ rational approximation and Marsaglia’s polynomial fit achieve low error near the center of the distribution but display progressively larger deviations toward the boundaries of the evaluation domain.

The cubic–logistic approximation maintains uniformly low error across the full interval, with symmetric behavior in both tails. In particular, the maximum absolute error remains below

2 \times 10^{- 4}

throughout the domain, consistent with the quantitative results reported in Table 4.

3.5. Symbolic Tractability and Invertibility

Beyond numerical accuracy, analytic properties such as symbolic tractability and explicit invertibility are important considerations in the selection of normal CDF approximations. These properties are particularly relevant in contexts involving symbolic manipulation, inverse transform sampling, and analytic differentiation.

Table 7 summarizes key symbolic characteristics of the cubic–logistic approximation in comparison with several commonly used alternatives. While all methods considered are smooth and admit closed-form expressions, only the classical logistic approximation and the proposed cubic–logistic form are analytically invertible.

The absence of analytic invertibility in rational and polynomial approximations generally necessitates numerical root-finding procedures or conditional logic for inversion, which can increase computational complexity and limit symbolic use. In contrast, the cubic–logistic approximation admits an explicit inverse through solution of a depressed cubic equation following application of the logit transformation. This property enables closed-form evaluation of quantiles and facilitates analytic manipulation.

In addition to invertibility, the simplicity of the cubic–logistic functional form supports straightforward symbolic differentiation and integration. Unlike piecewise or iterative approximations, the approximation remains globally smooth and algebraically compact, which can simplify analytic derivations and symbolic computation workflows.

These analytic considerations complement the numerical results presented in Section 3. Together, they illustrate how the cubic–logistic approximation balances accuracy with analytic simplicity, positioning it as a useful alternative in settings where both numerical fidelity and symbolic accessibility are desired.

4. Applications

The purpose of the examples in this section is to illustrate the numerical behavior and practical performance of the proposed cubic–logistic approximation in representative applied settings. These examples are not intended to redefine the primary objective of the paper, which remains the accurate approximation of the normal cumulative distribution function. To illustrate the numerical behavior of the cubic–logistic normal CDF approximation beyond synthetic test functions, we present several empirical case studies. These examples are intended to demonstrate how the approximation behaves when fitted to real-world data exhibiting skewness and heavy-tailed structure, rather than to provide domain-specific modeling conclusions. These examples are intended to illustrate numerical behavior under non-Gaussian empirical distributions rather than to assert domain-specific generative models.

4.1. Overview and Relevance of Empirical Domains

The empirical examples considered here are chosen to reflect common distributional features encountered in practice, including skewness, excess kurtosis, and tail asymmetry. Such features often lead to systematic deviations from the standard normal distribution and provide a useful testbed for evaluating approximation behavior.

All datasets are standardized to zero mean and unit variance prior to fitting. This normalization isolates shape characteristics and allows direct comparison between the standard normal CDF, the classical logistic approximation, and the proposed cubic–logistic approximation. Model comparisons are based on goodness-of-fit measures including the Kolmogorov–Smirnov statistic and information criteria (AIC and BIC).

4.2. Environmental Data—PM2.5 Air Quality Index

As a first empirical example, we consider daily PM2.5 measurements obtained from the U.S. Environmental Protection Agency monitoring station at Huntsville Old Airport [20]. The association between long-term PM2.5 exposure and adverse cardiopulmonary outcomes is well established [21]. PM2.5 data are well known to exhibit right skewness and heavy tails, making them a representative example of non-Gaussian environmental measurements [20,22].

After standardization, three models are fitted to the empirical distribution: the standard normal CDF, the classical logistic approximation, and the cubic–logistic approximation. Figure 3 shows the empirical CDF alongside the fitted models.

Goodness-of-fit statistics are summarized in Table 8. The cubic–logistic approximation yields the smallest Kolmogorov–Smirnov statistic among the three models, indicating improved agreement in the tails, while incurring only a modest increase in model complexity as reflected by AIC and BIC.

4.3. Financial Data—S&P 500 Daily Returns

As a second empirical example, we consider daily log returns of the S&P 500 index, which are known to exhibit deviations from Gaussian behavior, including excess kurtosis and heavy tails [23,24]. We analyze ten years of daily returns (2013–2023) obtained from the Stooq database. The data are standardized to zero mean and unit variance prior to fitting.

Following standardization, we compare three models fitted to the empirical distribution: the standard normal CDF, the classical logistic approximation, and the proposed cubic–logistic approximation. Figure 4 shows the empirical CDF together with the fitted curves.

Goodness-of-fit statistics are reported in Table 9. Over this dataset, the cubic–logistic approximation achieves the smallest KS statistic among the three models and yields improved information criteria relative to the normal and logistic fits, indicating closer agreement with the empirical distribution under these summary measures.

4.4. Biomedical Data—Glucose and Triglycerides

As a third empirical example, we consider two continuous biomarker distributions from the National Health and Nutrition Examination Survey (NHANES): fasting glucose (LBXGLU) and triglycerides (LBXTR). These variables commonly exhibit right-skewness and heavy-tail structure [22,25,26].

We fit the same three models as above—standard normal, classical logistic, and cubic–logistic—to the empirical distributions. Figure 5 shows the empirical CDFs together with fitted curves for both biomarkers.

Table 10 reports AIC, BIC, and KS statistics for the three models. For these data, the cubic–logistic approximation attains the lowest AIC and BIC and substantially reduces the KS statistic relative to the normal and logistic fits, consistent with improved agreement with the empirical distribution across the full support.

4.5. Small-Sample Behavior of the Cubic–Logistic Approximation

To assess parameter stability under limited sample sizes, we perform a Monte Carlo resampling study based on the standardized PM2.5 dataset used above. We generate 500 subsets of size

n = 20

drawn without replacement from the full dataset. For each subset, the cubic–logistic parameters

(a, b)

are re-estimated using the Nelder–Mead algorithm, initialized at the full-sample estimates.

Figure 6 shows the empirical distributions of the fitted parameters across the 500 trials. The distribution of

\hat{a}

is approximately symmetric and concentrated around the full-sample value, while

\hat{b}

exhibits slightly greater dispersion with mild right-skewness. Both parameters remain well-behaved, with no evidence of multimodality or extreme outliers.

Overall, these results indicate that the cubic–logistic approximation exhibits stable parameter behavior even under small-sample conditions, with variability consistent with expected sampling uncertainty. This stability supports the practical use of the approximation in settings where data availability is limited.

5. Discussion

Having established the numerical performance of the logistic–cubic normal CDF approximation, we now discuss the broader implications of these results, focusing on practical applications and computational benefits.

The logistic–cubic normal CDF approximation developed in this work occupies a unique and practical middle ground between classical logistic sigmoid approximations and highly accurate numerical approximations such as rational or polynomial approximations. Unlike simpler logistic forms, which are analytically convenient but numerically limited, or highly accurate rational and polynomial methods, which lack invertibility, the logistic–cubic normal CDF approximation offers a balanced combination of symbolic simplicity, computational efficiency, and numerical precision. Its closed-form analytic structure is particularly valuable in modern computational and probabilistic frameworks, such as those employed in probabilistic machine learning, neural network modeling, and symbolic computation, where interpretability and analytic manipulation of functions are crucial [27,28].

The proposed logistic–cubic model should be viewed as an optimal low-order extension of the classical logistic approximation rather than a replacement. By incorporating a cubic term, the model achieves a marked improvement in accuracy while preserving closed-form invertibility, a property not shared by many higher-accuracy approximations.

Crucially, the practical advantages of the logistic–cubic normal CDF approximation extend beyond numerical accuracy alone. Its carefully chosen functional form—a logistic function with an embedded cubic polynomial—results in a smooth, strictly monotonic, and symmetric approximation that faithfully replicates key statistical properties of the standard normal cumulative distribution function. These structural properties are essential in applications where higher-order statistical moments and symmetry properties have analytical significance, such as higher-order statistical modeling, inference techniques sensitive to distribution shape, and analytical derivations of cumulative probability models.

Moreover, the logistic–cubic normal CDF approximation maintains full differentiability and invertibility across its entire domain. This differentiability is particularly beneficial in computational settings requiring automatic differentiation, such as gradient-based optimization algorithms, Bayesian inference frameworks, and neural network training procedures. Analytic invertibility significantly simplifies inverse computations, such as inverse transform sampling, cumulative quantile modeling, and probabilistic inference. It also ensures straightforward integration into symbolic algebra and computational statistics software, thereby enhancing its practical applicability and flexibility. The unique combination of symbolic simplicity, analytic invertibility, and numerical accuracy makes the logistic–cubic normal CDF approximation especially suited for diverse real-world scenarios. These advantages are most relevant in settings where analytic invertibility, smoothness, and symbolic tractability are required alongside reasonable numerical accuracy.

Thus, while more complex rational or polynomial approximations might occasionally deliver marginal improvements in numerical precision within narrowly defined contexts, the logistic–cubic normal CDF approximation provides an elegant and widely generalizable solution that successfully balances numerical performance, analytic convenience, and symbolic flexibility. Its potential use cases span a broad range of disciplines and computational scenarios, from symbolic artificial intelligence and statistical simulation to embedded analytics and analytical modeling frameworks, demonstrating that a closed-form and analytically invertible structure is not merely advantageous but often essential for practical and efficient real-world applications.

6. Conclusions

This paper introduced a closed-form cubic–logistic approximation to the standard normal cumulative distribution function. The proposed approximation augments the classical logistic sigmoid with a cubic polynomial term, yielding a simple two-parameter form that preserves symmetry, strict monotonicity, and analytic invertibility.

From a mathematical perspective, the approximation was constructed to satisfy key structural properties of a valid CDF while remaining analytically tractable. Numerical optimization was used to determine optimal parameters under combined uniform and average error criteria. The resulting approximation achieves substantially improved accuracy relative to the classical logistic model and performance comparable to established rational and polynomial approximations, while retaining a closed-form inverse.

Numerical experiments and illustrative empirical case studies demonstrated that the cubic–logistic approximation exhibits stable behavior across a range of distributional shapes, including skewed and heavy-tailed data. These results indicate that the approximation provides a useful balance between numerical fidelity and analytic simplicity, particularly in settings where invertibility or symbolic manipulation is required.

The proposed method is not intended to replace highly specialized numerical approximations optimized solely for minimizing approximation error over restricted domains. Rather, its contribution lies in showing that a low-degree, closed-form approximation can achieve competitive accuracy while offering analytic advantages unavailable to more complex alternatives.

Several directions for future research remain. These include extensions to multivariate settings, formal analysis of approximation error under transformations, and adaptation of the cubic–logistic framework to asymmetric or heavy-tailed target distributions. Such extensions may further expand the applicability of analytically invertible CDF approximations in computational and theoretical contexts.

Funding

This research received no external funding.

Data Availability Statement

All datasets used in this study are publicly available and cited in the manuscript. These include the EPA Air Quality System (PM2.5), NHANES biomarkers dataset, and S&P 500 return data from Stooq.

Acknowledgments

The author gratefully acknowledges the support of the University of Alabama at Birmingham and the United States Navy Reserve. The views expressed are those of the author and do not reflect the official policy or position of the Department of the Navy, Department of Defense, or the United States Government.

Conflicts of Interest

The author declares no conflicts of interest.

References

Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: Hoboken, NJ, USA, 1968; Volume 1. [Google Scholar]
Bland, M. An Introduction to Medical Statistics; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
Muller, K.E.; Lavange, L.M.; Ramey, S.L.; Ramey, C.T. Power calculations for general linear multivariate models including repeated measures applications. J. Am. Stat. Assoc. 1992, 87, 1209–1226. [Google Scholar] [CrossRef] [PubMed]
Button, K.S.; Ioannidis, J.P.A.; Mokrysz, C.; Nosek, B.A.; Flint, J.; Robinson, E.S.J.; Munafò, M.R. Power failure: Why small sample size undermines the reliability of neuroscience. Nat. Rev. Neurosci. 2013, 14, 365–376. [Google Scholar] [CrossRef] [PubMed]
Borwein, J.; Corless, R.M. Emerging tools for experimental mathematics. Am. Math. Mon. 1999, 106, 889–909. [Google Scholar] [CrossRef]
Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables; National Bureau of Standards: Gaithersburg, MD, USA, 1964.
Hastings, C. Approximations for Digital Computers; Princeton University Press: Princeton, NJ, USA, 1955. [Google Scholar]
Marsaglia, G. Evaluating the normal distribution. J. Stat. Softw. 2004, 11, 1–11. [Google Scholar] [CrossRef]
Winitzki, S. A handy approximation for the error function and its inverse. arXiv 2008, arXiv:0805.1598. [Google Scholar]
Eidous, O.; Abu-Hawwas, J. An accurate approximation for the standard normal distribution function. J. Inf. Optim. Sci. 2021, 42, 17–27. [Google Scholar] [CrossRef]
Eidous, O.M.; Al-Rawwash, M.Y. Approximations for standard normal distribution function and its invertible. J. Algorithms Comput. Technol. 2025, 19, 17483026251322100. [Google Scholar] [CrossRef]
Lipoth, J.; Tereda, Y.; Papalexiou, S.M.; Spiteri, R.J. A new very simply explicitly invertible approximation for the standard normal cumulative distribution function. AIMS Math. 2022, 7, 11635–11646. [Google Scholar] [CrossRef]
Bowling, S.R.; Khasawneh, M.T.; Kaewkuekool, S.; Cho, B.R. A logistic approximation to the cumulative normal distribution. J. Ind. Syst. Eng. 2009, 2, 256–264. [Google Scholar] [CrossRef]
Blinnikov, S.; Moessner, R. Expansions for nearly Gaussian distributions. Astron. Astrophys. Suppl. Ser. 1998, 130, 193–205. [Google Scholar] [CrossRef]
Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef] [PubMed]
Storn, R.; Price, K. Differential Evolution—A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
Nelder, J.A.; Mead, R. A Simplex Method for Function Minimization. Comput. J. 1965, 7, 308–313. [Google Scholar] [CrossRef]
Vázquez-Leal, H.; Castañeda-Sheissa, R.; Filobello-Niño, U.; Sarmiento-Reyes, A.; Sánchez-Orea, J. High accurate simple approximation of normal distribution integral. Math. Probl. Eng. 2012, 2012, 124029. [Google Scholar] [CrossRef]
van der Walt, S.; Colbert, S.C.; Varoquaux, G. The NumPy array: A structure for efficient numerical computation. Comput. Sci. Eng. 2011, 13, 22–30. [Google Scholar] [CrossRef]
U.S. Environmental Protection Agency. Air Quality System (AQS) Data Mart; U.S. EPA: Washington, DC, USA, 2026. Available online: https://www.epa.gov/aqs (accessed on 10 January 2026).
Pope, C.A.; Dockery, D.W. Health effects of fine particulate air pollution: Lines that connect. J. Air Waste Manag. Assoc. 2006, 56, 709–742. [Google Scholar] [CrossRef] [PubMed]
Berrocal, V.J.; Guan, Y.; Muyskens, A.; Wang, H.; Reich, B.J.; Mulholland, J.A.; Chang, H.H. A comparison of statistical and machine learning methods for creating national daily maps of ambient PM_2.5 concentration. Atmos. Environ. 2020, 222, 117130. [Google Scholar] [CrossRef] [PubMed]
Fama, E.F. The behavior of stock market prices. J. Bus. 1965, 38, 34–105. [Google Scholar] [CrossRef]
Cont, R. Empirical properties of asset returns: Stylized facts and statistical issues. Quant. Financ. 2001, 1, 223–236. [Google Scholar] [CrossRef]
Davies, M.J.; Aroda, V.R.; Collins, B.S.; Gabbay, R.A.; Green, J.; Maruthur, N.M.; Rosas, S.E.; Del Prato, S.; Mathieu, C.; Mingrone, G.; et al. Management of Hyperglycemia in Type 2 Diabetes, 2022. A Consensus Report by the American Diabetes Association (ADA) and the European Association for the Study of Diabetes (EASD). Diabetes Care 2022, 45, 2753–2786. [Google Scholar] [CrossRef] [PubMed]
Feingold, K.R. Dyslipidemia in Patients with Diabetes. In Endotext [Internet]; Feingold, K.R., Adler, R.A., Ahmed, S.F., Anawalt, B., Blackman, M.R., Chrousos, G., Corpas, E., de Herder, W.W., Dhatariya, K., Dungan, K., et al., Eds.; MDText.com, Inc.: South Dartmouth, MA, USA, 2023. Available online: https://www.ncbi.nlm.nih.gov/books/NBK305900/ (accessed on 21 January 2026).
Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012. [Google Scholar]
Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]

Figure 1. Convergence behavior of the optimization procedure.

Figure 2. Log-scale pointwise absolute error for the logistic–cubic normal CDF approximation and selected baseline methods.

Figure 3. Empirical and fitted CDFs for normalized PM2.5 data.

Figure 4. Empirical and fitted CDFs for normalized S&P 500 daily log returns.

Figure 5. Empirical and fitted CDFs for normalized glucose and triglyceride levels.

Figure 6. Empirical distributions of the fitted cubic–logistic parameters (a, b) from 500 resampled datasets (

n = 20

).

Figure 6. Empirical distributions of the fitted cubic–logistic parameters (a, b) from 500 resampled datasets (

n = 20

).

Table 1. Summary of logistic-based normal CDF approximations.

Model	Functional Form	Parameters
Logistic	$σ (α x)$	$α = 1.702$
Logistic–Cubic	$σ (a x + b x^{3})$	$a = 1.59708, b = 0.07095$

Table 2. Functional and symbolic properties of selected approximations to the standard normal CDF.

Property	Classic Logistic	Rational (Hastings)	Cubic–Logistic Approximation (This Work)
Smooth	Yes	Yes	Yes
Closed-form	Yes	Yes	Yes
Invertible	Yes	No	Yes
Symmetric	Yes	Yes	Yes
Tail Fidelity	No	Yes	Yes

Table 3. Optimization configuration and parameter constraints used to fit the cubic–logistic approximation approximation.

Parameter	Value
Evaluation Domain	$x \in [- 8, 8]$
Grid Resolution	10,000 points
Objective Metrics	RMSE, Max Absolute Error
Global Optimizer	Differential Evolution
Local Optimizer	Nelder–Mead Simplex
Parameter Bounds	$a \in [1.0, 2.5], b \in [0.01, 0.15]$
Monotonicity Constraint	$f^{'} (x) = a + 3 b x^{2} > 0$

Table 4. Error comparison between normal CDF approximations over

x \in [- 6, 6]

.

Table 4. Error comparison between normal CDF approximations over

x \in [- 6, 6]

.

Model	Max Abs Error	RMSE
Classic Logistic ( $a = 1.702$ )	$8.3 \times 10^{- 3}$	$3.4 \times 10^{- 3}$
Hastings Rational	$2.6 \times 10^{- 4}$	$5.8 \times 10^{- 5}$
Marsaglia Polynomial	$1.9 \times 10^{- 4}$	$4.6 \times 10^{- 5}$
Logistic–Cubic Approximation (This Work)	$1.7 \times 10^{- 4}$	$4.3 \times 10^{- 5}$
Logistic–Quintic Extension	$1.85 \times 10^{- 5}$	$1.04 \times 10^{- 5}$

Table 5. Comparison with representative closed-form normal CDF approximations (evaluated on 1000 points over

x \in [- 6, 6]

).

Table 5. Comparison with representative closed-form normal CDF approximations (evaluated on 1000 points over

x \in [- 6, 6]

).

Approximation	Closed-Form	Invertible	AIC	KS
Hastings	Yes	No	−19,510.14	$2.6 \times 10^{- 4}$
Marsaglia	Yes	No	−19,973.74	$1.9 \times 10^{- 4}$
Vázquez-Leal et al. [18]	Yes	No	N/A	N/A
Lipoth et al. [12]	Yes	Yes	N/A	N/A
Logistic–Cubic (proposed)	Yes	Yes	−20,104.62	$1.7 \times 10^{- 4}$

Table 6. Performance comparison of odd polynomial logistic models (evaluated on 1000 points over

x \in [- 6, 6]

).

Table 6. Performance comparison of odd polynomial logistic models (evaluated on 1000 points over

x \in [- 6, 6]

).

Model	AIC	BIC	KS
Logistic	−11,365.96	−11,361.05	$8.3 \times 10^{- 3}$
Logistic–Cubic	−20,104.62	−20,094.81	$1.7 \times 10^{- 4}$
Logistic–Quintic	−22,932.36	−22,917.64	$1.85 \times 10^{- 5}$

Table 7. Symbolic properties of selected normal CDF approximations.

Model	Closed-Form	Invertible	Smooth
Classic Logistic	Yes	Yes	Yes
Hastings Rational	Yes	No	Yes
Marsaglia Polynomial	Yes	No	Yes
Cubic–Logistic Approximation (This Work)	Yes	Yes	Yes

Table 8. Goodness-of-fit metrics for normalized PM2.5 data.

Model	AIC	BIC	KS Statistic
Normal	1052.73	1063.82	0.0761
Logistic	1048.14	1059.20	0.0815
Cubic–Logistic	1050.66	1061.72	0.0634

Table 9. Goodness-of-fit metrics for normalized S&P 500 daily return data.

Model	AIC	BIC	KS Statistic
Normal	1598.42	1605.78	0.0583
Logistic	1584.31	1591.67	0.0527
Cubic–Logistic	1582.06	1593.73	0.0432

Table 10. Goodness-of-fit metrics for glucose and triglyceride distributions.

Model	AIC	BIC	KS Statistic
Normal	2123.40	2135.10	0.0872
Logistic	2110.20	2121.90	0.0685
Cubic–Logistic	2098.70	2110.30	0.0441

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Frölich, M.A. A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function. Mathematics 2026, 14, 486. https://doi.org/10.3390/math14030486

AMA Style

Frölich MA. A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function. Mathematics. 2026; 14(3):486. https://doi.org/10.3390/math14030486

Chicago/Turabian Style

Frölich, Michael Arnold. 2026. "A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function" Mathematics 14, no. 3: 486. https://doi.org/10.3390/math14030486

APA Style

Frölich, M. A. (2026). A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function. Mathematics, 14(3), 486. https://doi.org/10.3390/math14030486

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Closed-Form Cubic–Logistic Approximation to the Normal Cumulative Distribution Function

Abstract

1. Introduction

2. Materials and Methods

2.1. Mathematical Framework and Structural Constraints

2.2. Cubic–Logistic Functional Form

2.3. Optimization Criteria and Error Metrics

2.4. Numerical Optimization Algorithm

3. Results

3.1. Evaluation Setup and Baseline Methods

3.2. Quantitative Accuracy Metrics

3.3. Higher-Order Odd Polynomial Extensions

3.4. Visual Error Structure and Tail Behavior

3.5. Symbolic Tractability and Invertibility

4. Applications

4.1. Overview and Relevance of Empirical Domains

4.2. Environmental Data—PM2.5 Air Quality Index

4.3. Financial Data—S&P 500 Daily Returns

4.4. Biomedical Data—Glucose and Triglycerides

4.5. Small-Sample Behavior of the Cubic–Logistic Approximation

5. Discussion

6. Conclusions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI