1. Introduction
Many discrete datasets encountered in practice take values on the non-negative integers that are routinely modeled using standard families, such as the Poisson or geometric distributions. In contrast, there are important situations where the natural support is the whole set of integers , most notably when observations are signed differences of counts or other zero-centered measurements. Canonical examples include score differences in sports, day-to-day changes in transaction counts or sales, inter-rater differences in clinical tallies, and discretized (symmetric) return increments in finance. For such problems, modeling directly on with distributions that respect symmetry around zero is both natural and desirable.
Several integer-valued distributions on
have been proposed. Prominent instances include the Skellam distribution [
1], obtained as the difference of two independent Poisson variables; the discrete Laplace distribution [
2], along with related skew/asymmetric variants [
3,
4]; the discrete normal distribution [
5]; and, recently, perturbed Laplace–type models [
6]. Applications of signed count differences appear in medical and reliability studies [
7,
8] and in sports analytics, such as goal differences [
9]. For general background on count modeling and discrete distributions, see refs. [
10,
11]. In addition, substantial probability mass at zero frequently arises in practice, creating links with the zero-inflated literature [
12].
Beyond these classical constructions, there has been a notable increase in recent work on flexible integer-valued distributions supported on
, including several models explicitly designed to capture symmetry and tunable dispersion. For example, ref. [
13] introduced the discrete skew logistic distribution, which can accommodate symmetric and asymmetric count data and provides a useful reference for tail-shape control. Two recent contributions by refs. [
14,
15] developed new symmetric and perturbation-based distributions on the integers, with applications to stock exchange and hydrological data. In parallel, ref. [
6] proposed a general perturbation of the discrete Laplace distribution, demonstrating improvements in financial and health datasets. More broadly, ref. [
16] reviewed Skellam-type models and related integer-supported families, while ref. [
17] provided an up-to-date survey on models for integer-valued data, highlighting the importance of distributions supported on
in modern applications. These recent works underscore the need for simple, interpretable, and analytically tractable symmetric models on
, a gap that the present Sy-
family aims to fill. These recent developments further motivate the need for a symmetric decomposition-based model with explicit identifiability and analytical tractability, such as the Sy-
and Sy-
formulations proposed in this work.
This paper introduces a unified and tractable framework for symmetric integer-valued data on , named the Sy- family. The construction separates a three-point symmetric sign component from a non-negative magnitude: a data-generating sign takes values in with a tunable mass at zero and is multiplied by an independent, non-negative integer-valued variable. This sign–magnitude representation yields zero-centered, exactly symmetric models with interpretable control of the atom at zero while allowing the analyst to inherit tail behavior, dispersion, and computational convenience from the chosen baseline magnitude distribution.
We develop a coherent set of distributional results for the family: closed-form probability mass functions and cumulative distribution functions, bilateral probability generating functions, and moment identities. Symmetry implies vanishing odd moments, whereas even moments factor through the baseline magnitude. We also establish a characterization by symmetry and independence: an integer-valued distribution belongs to the proposed family if and only if it is symmetric and its sign is independent of the magnitude with a three-point symmetric distribution. Beyond these foundations, we study general consequences of the product structure, including tail transfer from the magnitude, conditions for unimodality or bimodality, and simple obstructions to infinite divisibility.
A distinctive feature of this framework is a strong symmetry property for operations on independent variables, where for two independent members of the family, the sum and the difference have the same distribution. This identity is a direct consequence of the bilateral generating function symmetry and does not generally hold for standard two-sided competitors, such as Skellam [
1], discrete Laplace [
2], perturbed Laplace [
6], or extended Poisson models [
18].
As a central example, we particularize the family with a Poisson magnitude, thereby obtaining the symmetric Poisson model. We derive distributional formulas (including entropy), discuss the induced zero mass in relation to zero-inflated counts [
12], and develop estimation via the method of moments and maximum likelihood. Simulation studies assess finite-sample behavior, and applications to datasets from finance and education illustrate a competitive or improved fit relative to established alternatives supported on
.
The remainder of the paper is organized as follows.
Section 2 introduces the Sy-
family, detailing the construction from a symmetric modified Bernoulli sign and an independent non-negative magnitude.
Section 3 develops core distributional results for the family, including closed-form probability mass function (PMF) and cumulative distribution function (CDF) identities, bilateral generating functions, characterization by symmetry and sign–magnitude independence, tail transfer from the magnitude, modality conditions, criteria precluding infinite divisibility, the quantile function, and the median, as well as a discussion of first-order stochastic dominance for
.
Section 4 specializes in the Sy–Poisson model, deriving the moment generating function (MGF) and probability generating function (PGF), closed-form even moments (via Touchard polynomials), skewness, kurtosis, and Shannon entropy.
Section 5 presents inference for Sy–Poisson: method-of-moments estimation (with asymptotic variance via the delta method), likelihood-based estimation, and both observed and expected Fisher information.
Section 6 reports a simulation study evaluating the finite-sample bias and mean squared error of the maximum likelihood estimators.
Section 7 provides two empirical applications (finance and education) comparing Sy–Poisson with established competitors on
. A concluding section summarizes implications and outlines directions for future research.
4. Special Case: The Sy-Poisson Distribution
In this subsection, we particularize the Sy- family by taking the mixing variable Y to be Poisson. Recall the set-up in Definition 2 (and Proposition 8): with , takes values in , and .
Definition 3. Let with and , , be independent. The random variable is said to follow a Sy-Poisson distribution, denoted . Its PMF, inherited from the Sy- construction, isConsequently, the CDF of Z is obtained from Proposition 8 by replacing with the Poisson CDF. It is useful to contrast the proposed Sy–Poisson specification with classical symmetric models on . The Skellam distribution, obtained as the difference of two independent Poisson variables, is symmetric but couples the zero mass and tail decay through a single intensity parameter. Similarly, symmetric negative binomial variants provide heavier tails but still link dispersion and central mass through a common shape parameter. Zero-inflated symmetric count models allow for additional mass at zero but do not preserve exact symmetry unless extra constraints are imposed. In contrast, the Sy–Poisson model separates the sign and magnitude mechanisms, offering exact bilateral symmetry, independent control of the zero mass via , and inherited Poisson-type tail behavior through . This decomposition makes the model both flexible and analytically tractable, and it ensures identifiability under mild parametric assumptions, providing advantages over the alternatives above.
Proposition 12. Let with , , , and be independent. Then the CDF of iswhere for , , and denotes the incomplete gamma function defined by . Proof. Apply Proposition 8 with equal to the CDF: for , and for . □
Remark 6. The distribution is symmetric about 0
and zero–inflated, with . For small λ, most of the mass is concentrated at 0
, so the PMF is sharply unimodal at the origin. As λ increases, the Poisson component spreads out, and two symmetric shoulders emerge on the positive and negative sides; for moderate and large λ, the central mode at 0
is flanked by two lighter side peaks, yielding an overall three–bump shape. In the limit , the point mass at 0
tends to , and the side peaks become more pronounced, whereas when , the concentration at 0
dominates for any fixed λ. These behaviors are illustrated by the PMF in Figure 1 and Figure 2, and the corresponding CDF in Figure 3 and Figure 4. 4.1. Generating Functions
Proposition 13. Let with and , constructed as where and are independent. Then:Moreover, for all , reflecting the exact symmetry of Z. Proof. Condition on X. Since , , and . Use , , and the independence of X and Y. □
Corollary 8. All odd raw moments vanish, for . For ,where is the r-th Touchard polynomial [10] (raw moment of a Poisson). In particular,Hence . Corollary 9. Skewness is 0 due to symmetry, and the (non–excess) kurtosis isand the excess kurtosis equals . Remark 7. (i) The identities (
14)
and (15)
yield derivatives at (or ) that recover the even moments without resorting to series expansions. (ii) The symmetry implies that the distributions of and coincide for independent Sy–Poisson variables; see Section 3.9. 4.2. The Total Time on Test Transform
The total time on test (TTT) transform is a standard tool in reliability analysis and quality-control methodology for assessing distributional shape and aging properties. For a non-negative random variable
X with distribution function
F and survival function
, the TTT transform is defined by
where
denotes the (generalized) quantile function of
F. The function
is increasing in
u, and its curvature reveals information about the underlying failure rate behavior: concave curves indicate a decreasing failure rate (DFR), convex curves indicate an increasing failure rate (IFR), and curves close to the diagonal
correspond to approximately exponential or memoryless behavior.
In practice, the TTT transform is implemented through its discrete empirical version. For ordered non-negative observations
, the empirical TTT values are computed as
and the TTT plot is obtained by graphing
against
. The diagonal line
serves as a natural reference: empirical curves lying above the diagonal suggest DFR behavior, while those below the diagonal suggest IFR behavior.
In our setting, we apply the TTT transform to the non-negative magnitudes
(equivalently, to the
Y component in the Sy-
representation), and we compare the empirical TTT plot with the TTT curve implied by the fitted
model as a diagnostic tool; see
Section 6.
4.3. Shannon Entropy
Proposition 14. Under the assumptions of Proposition 13, the Shannon entropy (natural logarithm) admits an exact representationwhere and . In base-2 units, replace with . Proof. By symmetry, write
and, for
,
. Then
Using
and factoring out
,
This yields (
16). □
Corollary 10. Using Stirling’s expansion [19] and taking expectations for ,so that 6. Simulation Study
To evaluate the finite-sample properties of the proposed estimators for
, we performed a comprehensive Monte Carlo experiment. This simulation mechanism follows directly from Proposition 1, which characterizes any Sy-
random variable as the product of an independent symmetric sign and a non-negative magnitude. Specifically, the representation
The constructive form is associated with the probability mass function given in Equation (
12):
determines whether
or
applies,
selects the sign when
, and
Y supplies the magnitude. Thus, the generative steps above reproduce exactly the
distribution implied by the theoretical results. For each parameter configuration, 10,000 independent samples of size
were drawn under the true parameters
. This parameter configuration is representative of many practical scenarios:
yields a moderate zero mass in the
model, while
produces a magnitude distribution with moderate dispersion. Together, these values generate datasets whose symmetry, central concentration, and tail behavior closely resemble those encountered in empirical applications, making them well suited for assessing estimation accuracy in realistic settings. The log-likelihood
was maximized numerically to obtain MLEs, and MoM estimators were computed from the first two empirical moments. The Algorithm 2 below provides a concise summary of the exact data-generation procedure used in all Monte Carlo experiments. This step-by-step formulation clarifies how independent draws from the
model are produced and ensures full reproducibility of the simulation design.
| Algorithm 2 Generation of an i.i.d. sample from |
- 1:
for
do - 2:
Draw . If , set and continue. - 3:
Draw and set . - 4:
Draw and set . - 5:
end for
|
6.1. Comparison of MLE and MoM Estimators
For each
n, bias and mean-squared error (MSE) were calculated as
with
.
Figure 5 plots Monte Carlo estimates of
against
n, whereas
Figure 6 plots Monte Carlo estimates of
against
n.
Both MLEs and MoM estimators show rapidly vanishing bias and MSE as the sample size increases, consistent with standard asymptotic theory. The MoM estimator displays a slightly higher MSE for small n compared to the MLE, but its performance converges to that of the MLE as .
Across the grid of
n,
shows a mild positive bias, while
tends to underestimate the true value. Moreover,
remains larger than
, reflecting the higher sampling variability of the rate parameter. These bias patterns are consistent with the structure of the log-likelihood and the Fisher information described in
Section 5.8. The information for
is primarily driven by the zero and near zero observations, where the curvature of the likelihood is pronounced; this yields relatively strong identifiability for
and explains the small positive finite-sample bias. In contrast, the information for
depends on the dispersion of the magnitude component: when many observations fall at or near zero, the effective information about
is reduced, leading to a mild tendency toward underestimation. As the sample size increases, both information components scale linearly with
n, causing the biases in
and
to diminish, in agreement with the asymptotic theory.
6.2. Standard-Error Accuracy and Confidence-Interval Coverage
To assess standard-error accuracy and validate asymptotic normality, we formed observed Wald confidence intervals using the inverse Hessian at the MLE, as given by
. For each replication and parameter, we constructed two-sided Wald intervals at the 90%, 95%, and 99% nominal levels and recorded empirical coverage along with the average interval length.
Figure 7 reports the CI coverage, whereas
Figure 8 shows the average CI length as functions of
n.
The observed Wald intervals show coverage approaching nominal levels as n increases, with noticeable gains between small and moderate sample sizes. In cases of small , the intervals can be slightly conservative for very small n, but accuracy improves quickly with n, and the average lengths decrease at the expected rate, consistent with the asymptotic theory for the MLE.
A further point of reassurance concerns the use of Wald confidence intervals in a discrete setting. Although discrete models sometimes induce irregular likelihood shapes and poor Wald performance, the
model benefits from a smooth and strictly concave log-likelihood in both parameters. The independence between the symmetric sign and the Poisson magnitude yields well-behaved score functions and a Fisher information matrix that remains finite and positive for all admissible
. These properties ensure that the MLEs lie well within the interior of the parameter space and satisfy standard differentiability conditions, so Wald intervals inherit the usual asymptotic validity even in finite samples. This explains why the simulation results show accurate coverage levels despite the inherent discreteness of the data-generating process. The observed coverage behavior can be directly linked to the analytical form of the Fisher information derived in
Section 5.8. When the zero mass
is large, the term
in the information matrix (where
) dominates, yielding high curvature of the log-likelihood with respect to
and, therefore, tighter confidence intervals. Conversely, the information component associated with
depends on both
and
through
, which can be relatively flat for small
. This explains why the empirical coverage for
tends to be slightly conservative in small samples, whereas the coverage for
rapidly approaches nominal levels. As the sample size increases, both components of the Fisher information scale linearly with
n, leading to asymptotic normality and the near-exact coverage observed for
.
8. Concluding Remarks
In this paper, we introduce the Sy- family, a unified and tractable framework for symmetric integer-valued modeling based on a simple sign-magnitude decomposition. Writing with and as independent variables yields models that are exactly symmetric around zero, allowing for interpretable control of the atom at zero and inheriting tail behavior and dispersion from the chosen magnitude distribution. Within this framework, we derived closed-form expressions for the PMF and CDF, bilateral generating functions, and even-order moments; established a characterization by symmetry and sign-magnitude independence; and studied tail transfer, modality, and the equality in law between sums and differences of independent Sy- variables.
Specializing in a Poisson magnitude leads to the Sy-Poisson model, for which we obtained explicit generating functions, moment identities, and entropy, and developed both method-of-moments and likelihood-based inference. Monte Carlo simulations showed that the maximum likelihood estimators exhibit small finite-sample bias and accurate Wald confidence-interval coverage, while the TTT plots and empirical applications in finance and education confirmed that Sy-Poisson can match or improve upon classical two-sided competitors on . Beyond these case studies, the sign-magnitude structure suggests further applications in quality-control contexts, where signed deviations from target defect levels or specification limits arise naturally. Promising directions for future work include regression extensions, dependence modeling and time-series formulations, multivariate constructions, and the development of Sy- based monitoring tools for quality-control problems.