Next Article in Journal
Directional Quaternion Step Differentiation and a Bicomplex Double-Step Calculus for Cancellation-Free First and Second Derivatives
Previous Article in Journal
FDC-LGL: Fast Discrete Clustering with Local Graph Learning for Large-Scale Datasets
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hypothesis Tests for Comparing Point Processes

1
Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
2
Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(4), 727; https://doi.org/10.3390/math14040727
Submission received: 16 January 2026 / Revised: 12 February 2026 / Accepted: 18 February 2026 / Published: 19 February 2026
(This article belongs to the Section D1: Probability and Statistics)

Abstract

This paper presents a comprehensive study of statistical tests for comparing temporal point processes in general, with a particular focus on Poisson processes. We explore three key approaches: (1) an intensity-based test specifically for Poisson processes, (2) general parametric tests using the notion of maximum likelihood estimation, and (3) a general nonparametric test using the Isometric Log-Ratio (ILR) transformation. The first approach adopts a three-step procedure for comparing inhomogeneous Poisson processes by testing total and normalized intensities separately and then combining the corresponding p-values using Fisher’s method. The second method proposes a likelihood-based parametric test to examine the conditional intensity functions in point processes, emphasizing the application to Hawkes processes. Lastly, the third approach introduces a nonparametric test for general point processes, by transforming inter-event times into a Euclidean space via the ILR transformation, followed by conventional depth-based methods on multivariate data. We then conduct thorough studies on simulations as well as real-world data to illustrate these testing procedures and demonstrate their effectiveness.

1. Introduction

Point processes are mathematical models which represent random events occurring in time or space. These processes are commonly used in various fields such as telecommunications, ecology, seismology, and finance, where events are randomly distributed and their temporal or spatial patterns are of particular interest. The most extensively studied class of point processes is the temporal Poisson process, known for its simplicity and utility in modeling events that occur independently over time. The homogeneous Poisson process assumes a constant event rate (intensity), while the inhomogeneous Poisson process accounts for a time-varying intensity function. By contrast, both methodological developments and inferential tools for more general temporal point processes are relatively scarce, and formal hypothesis testing procedures for comparing point processes are particularly limited.
Poisson processes have a rich history in statistical modeling, and traditional methods focus heavily on mathematical frameworks, likelihood representations, model estimation, and simulations [1]. One key feature of Poisson processes is their independence from history, which means they can be fully characterized by a deterministic intensity function λ ( t ) 0 . This intensity function dictates the rate of event occurrence over time. If λ ( t ) is constant, the process is homogeneous; otherwise, it is inhomogeneous. While the study of homogeneous Poisson processes is well established, comparing the intensities of inhomogeneous Poisson processes remains an area with many unexplored challenges.
Most classical studies focused on comparing homogeneous Poisson processes [2,3]. These approaches are equivalent to comparing the means of two Poisson distributions. More recent advancements, including the E-test [4] and parametric bootstrap tests [5], have extended these methods. However, much of the literature still concentrates on the homogeneous case, with fewer studies addressing the more complex inhomogeneous Poisson processes. Some notable exceptions include investigating the ratio of intensities [6] and normalized intensities [7]. In this paper, we propose a three-step procedure to compare two inhomogeneous Poisson processes. Our procedure addresses both the magnitude and the distribution of event times. The results from both tests are combined using Fisher’s combined probability test [8], providing an overall p-value to assess whether the two intensity functions are equal.
Beyond Poisson processes, more complex temporal point processes, such as the Hawkes process, introduce additional challenges. The Hawkes process [9] is a “self-exciting” process where each event increases the probability of future events, making it highly suitable for modeling clustered or cascading events, such as earthquakes, trade orders, or bank default. The intensity function of a Hawkes process is not only time-dependent but is also influenced by past events, allowing the process to capture dependencies between events. This flexibility makes the Hawkes process a valuable tool in modeling phenomena where events are interdependent.
Parametric tests are suitable for processes with a specific form of the conditional intensity functions, such as the Poisson and Hawkes processes, since the parameters can be estimated using conventional log-likelihood. We introduce parametric tests based on maximum likelihood estimation for such processes, taking advantage of their well-defined likelihood structures to conduct parameter estimation and hypothesis testing.
In cases where conditional intensity assumptions are inappropriate or unknown, nonparametric methods provide an alternative. A prominent nonparametric approach is the Isometric Log-Ratio (ILR) transformation [10], which maps compositional data, such as inter-event times in point processes, from a constrained simplex space to an unconstrained Euclidean space. This transformation enables the application of multivariate statistical techniques to compare the transformed data, overcoming the limitations imposed by the compositional nature of the original data.
To assess the similarity of point processes based on their transformed data, we employ depth-based hypothesis testing. Several depth-based tests have been proposed for multivariate distributions, including depth versus depth plots [11] and multivariate spacings [12]. In this paper, we implement a quality index Q based on data depth [13] to compare the distributions of two groups of point processes. The quality index is defined as the probability that a random sample from one distribution has a smaller depth with respect to the other distribution. A Q-value of 0.5 indicates that the two distributions are identical, while values deviating from 0.5 suggest differences in location or scale.
The structure of this paper is as follows. Section 2 introduces the hypothesis tests for comparing Poisson processes, including the proposed three-step procedure for inhomogeneous processes. Section 3 discusses parametric tests based on maximum likelihood estimation, with a focus on point processes such as the Hawkes process. Section 4 presents nonparametric tests using the ILR transformation and depth-based methods for comparing general point processes. Simulation studies are provided throughout to demonstrate the effectiveness of each approach. In Section 5, we provide simulation studies to demonstrate the effectiveness of the proposed approaches under various scenarios. We also apply the methods to real-world data, showcasing their practical relevance. Finally, Section 6 concludes the paper with a summary of the key findings and outlines potential directions for future research. Mathematical details are given in Appendices.

2. A Three-Step Procedure for Comparing Poisson Processes

Poisson processes are classical stochastic processes, and their intensity functions play a critical role in determining the behavior of the process. In this section, we propose a three-step hypothesis test to compare the intensity functions of two inhomogeneous Poisson processes. This method extends the conventional comparison framework by considering both the total intensities and the normalized intensities of the processes. The combined test allows for a more detailed comparison of Poisson processes in various practical scenarios.
In practice, a process is often given in a finite-time domain, and we limit our study in a finite interval [ 0 , T ] . Assume the intensities of two Poisson processes are λ 1 ( t ) and λ 2 ( t ) , respectively. We are interested in testing
H 0 : λ 1 ( t ) = λ 2 ( t ) vs . H 1 : λ 1 ( t ) λ 2 ( t ) for some t .
The first step of the test examines whether the total intensities, denoted by Λ 1 = 0 T λ 1 ( t ) d t and Λ 2 = 0 T λ 2 ( t ) d t , of the two processes are equal. A z-test is used to compare the two intensities based on large sample properties. If the total intensities are found to be equal, the second step compares the normalized intensities, or densities, λ 1 ( t ) Λ 1 and λ 2 ( t ) Λ 2 , using the Kolmogorov–Smirnov test to assess whether the distributions of the event times are the same. Finally, the results from both tests are combined using Fisher’s combined probability test [8] to form an overall p-value, which evaluates the hypothesis that the two Poisson processes have the same intensity function.
Suppose { N ( t ) , t [ 0 , T ] } is an inhomogeneous Poisson process with an intensity function λ ( t ) . Given that N ( T ) = K , the unordered K arrival times t 1 , , t K are independent and share a common density function
f ( t ) = λ ( t ) Λ , 0 t T ,
where Λ = 0 T λ ( t ) d t is the total intensity.
Consider two inhomogeneous Poisson processes, { N 1 ( t ) , t [ 0 , T ] } and { N 2 ( t ) , t [ 0 , T ] } , with intensity functions λ 1 ( t ) and λ 2 ( t ) , respectively. The number of events in the interval [ 0 , T ] for each process, denoted as N i ( T ) , follows a Poisson distribution with mean Λ i 0 T λ i ( t ) d t , i = 1 , 2 . Specifically, for any integer k 0 , the probability mass function is
P { N i ( T ) = k } = exp ( Λ i ) Λ i k k ! , i = 1 , 2 .
Let S ˜ 1 = { s 11 , , s 1 M 1 } and S ˜ 2 = { s 21 , , s 2 M 2 } be two sets of independent realizations from the first and second processes, respectively, where M 1 and M 2 denote the number of realizations. The numbers of events in these realizations are denoted as N 11 , N 12 , , N 1 M 1 and N 21 , N 22 , , N 2 M 2 . Each N 1 k Poisson ( Λ 1 ) and N 2 l Poisson ( Λ 2 ) for k = 1 , , M 1 and l = 1 , , M 2 . Furthermore, each realization consists of event times in strictly increasing order, represented as:
s i j = { t i j ( 1 ) , t i j ( 2 ) , , t i j ( N i j ) } ,
where t i j ( k ) denotes the k-th event time in the j-th realization of the i-th process, with k = 1 , , N i j , j = 1 , , M i , and i = 1 , 2 .
To test whether the intensity functions are equal, i.e.,
H 0 : λ 1 ( t ) = λ 2 ( t ) , t [ 0 , T ] ,
we conduct two independent subtests. The first subtest is to compare the total intensities of the two processes
H 0 : Λ 1 = Λ 2 vs . H 1 : Λ 1 Λ 2 .
We apply a z-test with the following test statistic
Z = Λ ^ 2 Λ ^ 1 Λ ^ 1 M 1 + Λ ^ 2 M 2 ,
where Λ ^ i = 1 M i j = 1 M i N i j for i = 1 , 2 . Under the null hypothesis, Z asymptotically follows a standard normal distribution N ( 0 , 1 ) . The p-value for this subtest is given by
p 1 = 2 ( 1 Φ ( | Z | ) ) ,
where Φ ( · ) is the cumulative distribution function (c.d.f.) of N ( 0 , 1 ) .
The second subtest compares the distributions of the arrival times by testing the normalized intensities:
H 0 : λ 1 ( t ) Λ 1 = λ 2 ( t ) Λ 2 vs . H 1 : λ 1 ( t ) Λ 1 λ 2 ( t ) Λ 2 .
Based on the conditional property described in Equation (1), for any realization j { 1 , , M i } , the unsorted event times t i j ( 1 ) , t i j ( 2 ) , , t i j ( N i j ) are independently distributed with density function λ i ( t ) / Λ i , i = 1 , 2 . Let
s 1 = j = 1 M 1 s 1 j , s 2 = j = 1 M 2 s 2 j ,
where each set s i contains K i = j = 1 M i N i j independent event times with density function λ i ( t ) / Λ i . As the sample sizes M 1 and M 2 increase, K i almost surely. We use the two-sample Kolmogorov–Smirnov (K-S) test to compare the empirical distribution functions of s 1 and s 2 , i.e., to compare λ 1 ( t ) / Λ 1 and λ 2 ( t ) / Λ 2 .
The test statistic for the K-S test is
D K 1 , K 2 = K 1 K 2 K 1 + K 2 1 2 sup x | F K 1 ( x ) G K 2 ( x ) | ,
where F K 1 ( x ) and G K 2 ( x ) are the empirical c.d.f.s of s 1 and s 2 , respectively. The p-values for the KS test can be derived from tables. If K 1 and K 2 are sufficiently large, D K 1 , K 2 approximately follows the Kolmogorov–Smirnov distribution.
After obtaining the p-values from both subtests, we calculate an overall p-value to compare the two intensity functions. Note that both subtests partially address the same hypothesis—whether λ 1 ( t ) = λ 2 ( t ) . Indeed, the first subtest compares if the numbers of events in the two processes follow the same distribution, with null hypothesis H 0 : Λ 1 = Λ 2 . The second subtest compares if the distributions of events in the two processes are the same, with null hypothesis H 0 : λ 1 ( t ) / Λ 1 = λ 2 ( t ) / Λ 2 . If both subtests confirm the null hypothesis, we conclude that the two intensity functions are equal.
Since the p-values p 1 and p 2 are asymptotically independent when M 1 and M 2 are large, we apply Fisher’s method to combine them:
X = 2 ( log ( p 1 ) + log ( p 2 ) ) ,
where X has a chi-squared distribution with four degrees of freedom. The combined p-value, which is computed using the test statistic X, is the overall p-value for the test H 0 : λ 1 ( t ) = λ 2 ( t ) , t [ 0 , T ] . The following algorithm outlines the test procedure for comparing two inhomogeneous Poisson processes.
The overall test procedure for the comparison of the two inhomogeneous Poisson processes is given in the following Algorithm 1.
Algorithm 1. (Given two groups of Poisson realizations s 11 , s 12 , , s 1 M 1 and s 21 , s 22 , , s 2 M 2 )
1.
Count the number of events of every realization: N 11 , , N 1 M 1 and N 21 , , N 2 M 2 ;
2.
Perform a z-test on the total intensities and compute the p-value p 1 ;
3.
Merge all samples in each group: s 1 = j = 1 M 1 s 1 j , s 2 = j = 1 M 2 s 2 j ;
4.
Apply two-sample Kolmogorov–Smirnov test to compare distribution functions of s 1 and s 2 and compute the p-value p 2 ;
5.
Use Fisher’s method to combine p 1 and p 2 , compute the overall p-value using the test statistic X = 2 ( log ( p 1 ) + log ( p 2 ) ) , where X χ 4 2 .
Remark 1. 
The proposed three-step procedure can be naturally extended to compare multiple groups of inhomogeneous Poisson processes. Suppose we have K groups of independent Poisson realizations, each with an intensity function λ k ( t ) , k = 1 , , K . The first subtest compares the total intensities of the K processes:
H 0 : Λ 1 = = Λ K vs . H 1 : Λ i Λ j   f o r   s o m e   i , j { 1 , , K } .
For this step, we can use statistical tests such as Pearson’s χ 2 test or the likelihood ratio test [14].
The second subtest compares the distributions of event times among the K Poisson processes:
H 0 : λ 1 ( t ) Λ 1 = = λ K ( t ) Λ K vs . H 1 : λ i ( t ) Λ i λ j ( t ) Λ j for   some   i , j { 1 , , K } .
The K-sample Anderson–Darling test [15] is appropriate for this step.
Finally, we still apply Fisher’s method to combine p-values from previous two steps to generate an overall p-value for the hypothesis:
H 0 : λ 1 ( t ) = = λ K ( t ) vs . H 1 : λ i ( t ) λ j ( t ) for   some   i , j { 1 , , K } .
Building upon this method for Poisson processes, we can also explore parametric approaches for more general point processes. The following section discusses the use of maximum likelihood estimation as a parametric method for comparing point processes.

3. Parametric Tests Based on the Maximum Likelihood Estimation

In this section, we explore hypothesis testing for realizations from general point processes. The maximum likelihood estimation (MLE) approach offers a robust parametric framework for comparing point processes. By estimating the unknown parameters of intensity functions, we can construct test statistics to evaluate whether two processes share the same underlying intensity.
Suppose two point processes have conditional intensity functions λ ( t | H t , θ 1 ) and λ ( t | H t , θ 2 ) in closed form, respectively, where H t denotes history, and θ 1 and θ 2 are unknown parameters. The null hypothesis H 0 : θ 1 = θ 2 can be tested by first estimating the parameters θ 1 and θ 2 using MLE. Once the parameters are estimated, the Wald test is applied to test whether the two estimated intensities are equal. The MLE approach enables the use of optimization methods such as the Newton–Raphson method to maximize the log-likelihood functions and obtain parameter estimates.
We also present two examples: the Hawkes process [9] and the Poisson process software reliability model [16]. While these examples demonstrate the feasibility of our method, the approach is applicable to any point process as long as MLE can be performed.

3.1. Tests Using Maximum Likelihood Estimation for Point Processes

Given a point process realization with conditional intensity λ * ( t | θ ) λ ( t | H t , θ ) , where events t 1 , , t n are on an finite interval [ 0 , T ] , the likelihood L of this point process is [1]
L = i = 1 n λ * ( t i | θ ) exp 0 T λ * ( u | θ ) d u .
Consider two independent point processes with conditional intensity functions λ * ( t | θ 1 ) and λ * ( t | θ 2 ) on [ 0 , T ] , where θ 1 and θ 2 are p-dimensional parameter vectors. Assume that the realizations S ˜ 1 = { s 11 , , s 1 M 1 } have intensity λ * ( t | θ 1 ) , and S ˜ 2 = { s 21 , , s 2 M 2 } have intensity λ * ( t | θ 2 ) . We test the null hypothesis:
H 0 : θ 1 = θ 2 .
For set S ˜ 1 , suppose the likelihood of each independent realization s 1 j is L 1 j ( θ 1 ) , j = 1 , M 1 . Then, the likelihood of the entire set S ˜ 1 is L 1 ( θ 1 ) = j = 1 M 1 L 1 j ( θ 1 ) . Similarly, the likelihood of the set S ˜ 2 is denoted as L 2 ( θ 2 ) . To maximize the log-likelihood, we calculate the gradients and Hessian matrices of L 1 ( θ 1 ) and L 2 ( θ 2 ) . If the Hessian is well-defined and has a closed form, it is negative semi-definite, allowing us to apply the Newton–Raphson method or other appropriate optimization techniques to obtain the MLE.
Assuming MLEs θ ^ 1 and θ ^ 2 exist, their asymptotic distributions are
M 1 ( θ ^ 1 θ 1 ) d N 0 , I ( θ 1 ) 1 and M 2 ( θ ^ 2 θ 2 ) d N 0 , I ( θ 2 ) 1 ,
as M 1 , M 2 . The proof of asymptotic properties follows standard techniques and is provided in Appendix A.1.
The Wald test statistic for H 0 : θ 1 = θ 2 is
W = M 1 M 2 ( θ ^ 1 θ ^ 2 ) ( M 1 I 1 ( θ ^ 1 ) + M 2 I 1 ( θ ^ 2 ) ) ( θ ^ 1 θ ^ 2 ) ,
where I ( θ ^ 1 ) is the Fisher information matrix estimated by the negative Hessian
I ( θ ^ 1 ) = 2 log L X ( θ ^ 1 ) θ ^ 1 2 .
Under the null hypothesis, W follows an asymptotic χ r 2 distribution, where r is the dimension of θ , that is, the number of parameters being tested.
In the remainder of this section, we present two examples of point processes for which MLE can be feasibly applied.

3.2. Example: Hawkes Process

The Hawkes process models events where each occurrence increases the likelihood of subsequent events, such as earthquakes or trade orders. The conditional intensity function of a Hawkes process is [1]:
λ * ( t ) = μ + 0 t ν ( t s ) d N ( s ) ,
where μ > 0 is the background intensity, and ν : ( 0 , ) [ 0 , ) is the excitation function.
A widely used choice for the excitation function is an exponentially decaying function. In this case, ν ( t ) = α e β t for α , β > 0 , and the intensity function is given by
λ * ( t ) = μ + t α e β ( t s ) d N ( s ) .
Given a realization of such a Hawkes process, { t 1 , t 2 , , t N } , on the interval [ 0 , T ] , the intensity function becomes
λ * ( t ) = μ + α t i < t e β ( t t i ) ,
The log-likelihood is given as [17]
log L = i = 1 N log ( λ * ( t i α , β ) ) 0 T λ * ( u α , β ) d u = i = 1 N log { μ + α A ( i ) } + i = 1 N α β e β ( T t i ) 1 μ T .
We compute the partial derivatives and the Hessian matrix of the log-likelihood function to obtain the MLE of α , β and μ . The gradients and Hessians are provided in Appendix A.2. Optimization techniques, such as the Newton–Raphson method and the Berndt–Hall–Hall–Hausman (BHHH) method, are employed to maximize the log-likelihood.

3.3. Example: Poisson Process Software Reliability Model

The maximum likelihood estimators for a general nonhomogeneous Poisson process do not always exist. Therefore, in our second example we focus on a software reliability model, a nonhomogeneous Poisson process on [ 0 , T ] with intensity function λ ( t ) = α ( t / β ) k β k ! exp ( t / β ) , where the unknown parameters are α and β . The maximum likelihood estimations of α and β exist as T with probability 1 [16]. For k = 1 , the log-likelihood becomes
log L = n log α 2 n log β i = 1 n t i β + i = 1 n log t i α 1 1 + T β exp T β .
Gradients and Hessians are computed to obtain the maximum likelihood estimator using numerical optimization methods. The gradients and Hessians are given as follows:
log L α = n α + 1 + T β exp T β 1 ,
log L β = 2 n β + i = 1 n t i β + α T 2 β 3 exp T β .
The Hessian matrix is provided by
2 log L α 2 = n α 2 ,
2 log L β 2 = 2 n β 2 i = 1 n 2 t i β 3 + α T 3 β 5 3 T 2 β 4 exp T β ,
2 log L α β = T 2 β 3 exp T β .

4. Nonparametric Tests for General Point Processes

In many cases, parametric models require a known functional form for the intensity function. However, identifying an appropriate parametric form in practice can be challenging, especially when there is limited prior knowledge about the process. Even when a reasonable parametric form is available, the estimation process may encounter difficulties as an MLE for point processes may not always exist. For example, consider a Poisson process with intensity λ ( t ) = ( 1 + α sin ( t ) ) on [ 0 , 2 π ] , where α is the parameter. The likelihood of a realization { t 1 , , t n } is L = i = 1 n ( 1 + α sin ( t i ) ) exp ( 2 π ) , the likelihood L as α . Hence, alternative testing methods are needed in such cases.
In general, nonparametric tests for point processes are highly desirable in cases where no appropriate assumptions can be made about the intensity functions. To address this issue, we propose to apply the well-known Isometric Log-Ratio (ILR) transformation [10] to inter-event times of the processes, mapping them into a Euclidean space. This transformation allows us to use multivariate statistical tools to compare the distributions of these transformed points. The ILR-based method is flexible and can be applied to a wide range of point processes, making it an ideal choice when parametric assumptions are not suitable. Specifically, the test compares whether the multivariate distributions of transformed inter-event times from two processes are statistically different. This method is advantageous when the intensity functions of the processes are unknown or too complicated to estimate parametrically.
We now consider tests for point processes without making assumptions about the intensity function. For a fixed time interval [ 0 , T ] , two groups of point processes are observed, denoted as S ˜ 1 = { s 11 , , s 1 M 1 } and S ˜ 2 = { s 21 , , s 2 M 2 } . Suppose S ˜ i has a conditional intensity function λ i * ( t ) λ i ( t | H t ) for i = 1 , 2 . We aim to test the null hypothesis:
H 0 : S ˜ 1 and S ˜ 2 are from the same point process ,
or equivalently
H 0 : λ 1 * = λ 2 * .
However, directly conducting nonparametric tests to compare conditional intensity functions is challenging. To address this, we introduce the ILR transformation on inter-event times. This transformation maps data from a simplex space to Euclidean space, allowing us to transform point processes into points in Euclidean space and then compare the multivariate distributions of these points.
Given a cardinality K N + in the time domain [ 0 , T ] , a point process can fully be characterized by its joint density function. If the point processes in S ˜ i have a joint density function f i ( t 1 , , t K ) for i = 1 , 2 , let S ˜ i K denote the set of all point processes having cardinality K in S ˜ i . Then, S ˜ i = K = 0 S ˜ i K for i = 1 , 2 . Let | s 1 | and | s 2 | represent the cardinalities of the point processes in S ˜ 1 and S ˜ 2 , respectively. The hypothesis in Equation (5) can be reformulated as:
1.
Condition 1. The cardinalities of point processes in both groups follow the same distribution: P ( | s 1 | = K ) = P ( | s 2 | = K ) for all K.
2.
Condition 2. For any fixed cardinality K, the point processes in S ˜ 1 K and S ˜ 2 K have the same joint density function: f 1 ( t 1 , , t K | | s 1 | = K ) = f 2 ( t 1 , , t K | | s 2 | = K ) .
Based on these conditions, we propose a series of subtests to compare S ˜ 1 and S ˜ 2 :
H 00 : P ( | s 1 | = K ) = P ( | s 2 | = K ) for all K ; H 01 : f 1 ( t 1 , t 2 , , t K 1 | | s 1 | = K 1 ) = f 2 ( t 1 , t 2 , , t K 1 | | s 2 | = K 1 ) ; H 02 : f 1 ( t 1 , t 2 , , t K 2 | | s 1 | = K 2 ) = f 2 ( t 1 , t 2 , , t K 2 | | s 2 | = K 2 ) ; H 0 m : f 1 ( t 1 , t 2 , , t K m | | s 1 | = K m ) = f 2 ( t 1 , t 2 , , t K m | | s 2 | = K m ) .
We choose K 1 , , K m such that all corresponding subsets S ˜ 1 K i and S ˜ 2 K i have sufficiently large sample sizes.
For the first subtest H 00 , we compare the distributions of cardinalities using Pearson’s chi-square test for homogeneity. Let the cardinalities of the realizations in the two groups be
C 1 = { C 11 , C 12 , , C 1 M 1 } , C 2 = { C 21 , C 22 , , C 2 M 2 } .
The detailed computation of this test is presented in [18].
For the subsequent subtests H 01 H 0 m , we use a quality index based on data depth as proposed by [13]. However, since the point processes with a fixed cardinality K are in irregular spaces, we need to map them into regular Euclidean spaces to apply depth-based methods effectively. Let S denote the set of all point processes on [ 0 , T ] and S K denote the set of all point processes with cardinality K. Specifically, S K = { ( t 1 , t 2 , , t K ) R K | 0 t 1 t 2 t K T } for some non-negative integer K and hence S = K = 0 S K . Since depth functions are not well-defined in irregular spaces like S K , we apply the ILR transformation to map the point processes into regular Euclidean spaces, enabling the use of depth-based methods for hypothesis testing.

4.1. Isometric Log-Ratio Transformation

We use the notion of inter-event time (IET) representation for the ILR transformation [19]. Suppose s = ( t 1 , t 2 , , t K ) S k is a point process with given cardinality K over the time domain [ 0 , T ] . This process can be equivalently represented using inter-event times. Let
u = ( u 1 , u 2 , , u K + 1 ) ( t 1 , t 2 t 1 , , t K t K 1 , T t K ) .
Since i = 1 K + 1 u i = T and u i > 0 almost surely for all i = 1 , , K + 1 , any point process with cardinality K can be viewed as a vector in the simplex
S K + 1 = u = ( u 1 , u 2 , , u K + 1 ) R K + 1 | u i > 0 , i = 1 , , K + 1 ; i = 1 K + 1 u i = T .
To perform statistical analysis, we use the ILR transformation to map the IET-represented point processes from the simplex S K + 1 to unconstrained Euclidean space R K , allowing us to apply depth-based methods.
The ILR transformation is an isometry between S K + 1 and R K [10]. For a vector u S K + 1 , the ILR transformation is defined as [20]
u * = ilr ( u ) Φ · log u 1 g ( u ) , log u 2 g ( u ) , , l o g u K + 1 g ( u ) ,
where g ( u ) = i = 1 K + 1 u i 1 / ( K + 1 ) is the component-wise geometric mean of u , and Φ is a matrix in R K × ( K + 1 ) satisfying Φ Φ = I K and Φ Φ = I K + 1 1 K + 1 1 K + 1 1 K + 1 , where I K and I K + 1 are identity matrices, and 1 K + 1 is a ( K + 1 )-dimensional vector of ones. The inverse of the ILR transformation, given as u * R K , is
u = ilr 1 ( u * ) T exp ( u * Φ ) exp ( u * Φ ) · 1 K + 1 .
The matrix Φ is not unique. A common form, obtained via the Gram–Schmidt process, is given by [20]
Φ i j = 1 ( K + 1 i ) ( K + 2 i ) , j K + 1 i ( K + 1 i ) ( K + 2 i ) , j = K + 2 i 0 otherwise
Using the IET representation and ILR transformation, we establish bijective relationships between point process space S K , the simplex S K + 1 and Euclidean space R K . We can now map two groups of point process realizations to vectors in R K , denoted as
X 1 = { X s 11 , , X s 1 M 1 } , X 2 = { X s 21 , , X s 2 M 2 } ,
Our goal is to compare the multivariate distributions of X 1 and X 2 . Given a cardinality K, assume that the random vectors in X 1 have density function f 1 K , and those in X 2 have f 2 K . Then, the hypothesis in Equation (6) can be reformulated as
H 00 : P ( | s 1 | = K ) = P ( | s 2 | = K ) for all K ; H 01 : f 1 K 1 x | | s 1 | = K 1 = f 2 K 1 x | | s 2 | = K 1 ; H 02 : f 1 K 2 x | | s 1 | = K 2 = f 2 K 1 x | | s 2 | = K 2 ; H 0 m : f 1 K m x | | s 1 | = K m = f 2 K m x | | s 2 | = K m ,
where s 1 S ˜ 1 and s 2 S ˜ 2 . Next, we compare the multivariate distributions in R K applying depth-based methods.

4.2. Multivariate Distribution Test Based on Data Depth

Several depth-based tests have been proposed for multivariate distribution, including depth versus depth plots [11] and multivariate spacings [12]. In our approach, we implement a quality index Q, defined in [13], as follows:
Q ( F , G ) = P D ( F ; X ) D ( F ; Y ) | X F , Y G ,
where F and G are two distribution functions on R p for some p 1 , and D ( F ; · ) is a data depth function with respect to distribution F. The value of Q lies in the interval [ 0 , 1 ] , with Q = 1 2 indicating that F = G , and Q < 1 2 suggesting a location shift or scale difference. We use this index to test the hypothesis
H 0 : Q = 1 2
to determine whether F and G are statistically identical.
Given two samples X = { X 1 , , X m } from F and Y = { Y 1 , , Y n } from G, the empirical estimate of Q is
Q ^ Q ^ ( F m , G n ) Q ^ ( X , Y ) = 1 n i = 1 n R ( F m ; Y i ) ,
where R ( F m ; Y i ) is the proportion of X j s that satisfy D ( F m ; X j ) D ( F m ; Y i ) , and D ( F m ; · ) is the empirical depth function with respect to the empirical distribution F m . When Q ^ is calculated using the Mahalanobis depth, and under certain conditions, the asymptotic behavior of Q ^ is given by [13]
1 12 1 m + 1 n 1 2 Q ^ 1 2 N ( 0 , 1 )
as m , n .
Now, consider hypothesis in Equation (9). Given two point process samples S ˜ 1 and S ˜ 2 and a cardinality K, we extract two subsets S ˜ 1 K and S ˜ 2 K . Applying the ILR transformation to these subsets yields two groups of vectors in R K , denoted as X 1 K and X 2 K . Rather than directly comparing the point processes in S ˜ 1 K and S ˜ 2 K , we compare the K-dim random vectors in X 1 K and X 2 K by calculating Q ^ ( X 1 K , X 2 K ) .
Using the asymptotic distribution of Q ^ from Equation (10), we can perform a large-sample z-test to determine whether Q ^ = 1 2 . In practice, we recommend selecting K m 8 , as larger cardinalities may result in inaccurate estimates of Mahalanobis depth due to covariance matrix instability. Additionally, testing in high-dimensional settings requires a large number of samples to ensure reliable statistical inference.

4.3. Test Procedure

After obtaining the p-values of all ( m + 1 ) subtests in Equation (9), the final step is to combine these multiple comparisons into a single p-value. Fisher’s method [21] is a well-known approach for combining p-values, under the assumption that the individual tests are independent. However, in our context, the subtests H 01 , , H 0 m are not independent due to shared data structures across the transformed point processes.
To address this dependency, we consider Bonferroni correction [22], which is effective under arbitrary dependence structures. However, Bonferroni correction can be overly conservative, particularly in cases where the number of comparisons is large. To overcome this, we use Holm’s modification [23], which provides a stepwise procedure to control the family-wise error rate while being less conservative than the traditional Bonferroni correction.
Assume p 1 , p 2 , , p m are p-values for m hypotheses H 01 , H 02 , , H 0 m , and α is the significance level. Holm’s method involves sorting the p-values in ascending order, denoted by p ( 1 ) p ( 2 ) p ( m ) , with corresponding hypotheses H 0 ( 1 ) , H 0 ( 2 ) , , H 0 ( m ) . The stepwise procedure is detailed in Algorithm 2.
Algorithm 2. Holm’s Method for Combining Hypotheses
1.
If p ( 1 ) α m , reject H 0 ( 1 ) and proceed to Step 2; otherwise, accept all H 0 ( 1 ) , H 0 ( 2 ) , , H 0 ( m ) and stop;
2.
If p ( 2 ) α m 1 , reject H 0 ( 2 ) and proceed to Step 3; otherwise, accept H 0 ( 2 ) , H 0 ( 3 ) , , H 0 ( m ) and stop;
3.
If p ( 3 ) α m 2 , reject H 0 ( 3 ) and proceed to Step 4; otherwise, accept H 0 ( 3 ) , H 0 ( 4 ) , , H 0 ( m ) and stop;
4.
Continue this process, stop if a hypothesis is accepted;
5.
If p ( m ) α , reject H 0 ( m ) ; otherwise, accept it and stop.
Holm’s method adjusts the original p-values to corrected values:
m p ( 1 ) , ( m 1 ) p ( 2 ) , ( m 2 ) p ( 3 ) , p ( m ) .
Since the global null hypothesis H 0 : λ 1 * = λ 2 * holds if all sub-hypotheses H 00 , H 01 , , H 0 m in Equation (9) are true, the overall p-value is
min { m p ( 1 ) , ( m 1 ) p ( 2 ) , ( m 2 ) p ( 3 ) , , p ( m ) } .
Our test procedure is summarized in Algorithm 3.
Algorithm 3. Input: two groups of realizations S ˜ 1 = { s 11 , , s 1 M 1 } and S ˜ 2 = { s 21 , , s 2 M 2 }
1.
Count the cardinality of each realization, denoted as C 1 = { C 11 , C 12 , , C 1 M 1 } and C 2 = { C 21 , C 22 , , C 2 M 2 } ;
2.
Use Chi-square test to compare the distributions of C 1 and C 2 , and obtain the p-value p 0 ;
3.
Let S ˜ i K denote the set of all point processes with cardinality K in S ˜ i , i = 1 , 2 , choose cardinalities K 1 , , K m such that the corresponding subsets S ˜ 1 K j and S ˜ 2 K j have sufficiently large sample sizes;
4.
For each j { 1 , , m } , apply the IET representation and ILR transformation to map the realizations in S ˜ 1 K j and S ˜ 2 K j into Euclidean space R K j , yielding two groups of K i -dimensional vectors, denoted as X 1 K j and X 2 K j ;
5.
For each j { 1 , , m } , compute the quality index Q ^ ( X 1 K j , X 2 K j ) , test the hypothesis Q ^ = 1 2 , and obtain the p-value p j for each comparison;
6.
Collect all p-values p 0 , p 1 , , p m , and arrange them in ascending order as p ( 1 ) p ( 2 ) p ( m + 1 ) . Using Holm’s correction, compute the overall p-value as
Overall p - value = min { ( m + 1 ) p ( 1 ) , m p ( 2 ) , ( m 1 ) p ( 3 ) , , p ( m + 1 ) } .

5. Numerical Studies

5.1. Simulations

In this subsection, we conduct extensive simulations to evaluate the performance of the proposed hypothesis tests. We present results for three different approaches: the three-step test for Poisson processes, the parametric test using maximum likelihood estimation, and the nonparametric test based on the ILR transformation.

5.1.1. Simulation for Three-Step Test for Poisson Processes

To assess the effectiveness of the three-step procedure for comparing inhomogeneous Poisson processes, we generate two groups of Poisson processes with intensity functions:
λ 1 ( t ) = 1 + sin ( t ) , λ 2 ( t ) = d ( 1 + sin ( t + c π ) ) , t [ 0 , 2 π ] ,
where M 1 and M 2 denote the respective sample sizes. This process is repeated 1000 times to obtain stable estimates of the test statistics. We analyze how the test behaves under varying values of c and d, which control the phase shift and amplitude scaling, respectively.
For the first simulation scenario, we fix d = 1 and vary c, and the results are summarized in Table 1. Since the total intensities are equal ( Λ 1 = Λ 2 ), the p-values for the z-test remain approximately 0.5. However, as c increases, the p-values for the K-S test decrease, indicating increasing differences in the distribution of event times.
In the second scenario, we fix c = 0 and vary d, and the results are presented in Table 1. Here, the normalized intensities remain identical, but the total intensities differ. Consequently, the p-values for the K-S test remain close to 0.5, while the z-test p-values decrease as d increases. Overall, these results demonstrate that our method effectively detects differences in both the total intensity and the distribution of events in Poisson processes.
The simulations demonstrate that as the differences between the two intensity functions increase, the overall p-value decreases, indicating that the proposed three-step test effectively detects deviations in point process characteristics. Moreover, the empirical power increases steadily with the effect size and improves substantially as the sample size grows, demonstrating the reliability of the proposed procedure under both types of alternatives.

5.1.2. Simulation for MLE Parametric Tests

Next, we evaluate the performance of the MLE-based parametric test using two types of point processes: the Hawkes process and a Poisson process with an intensity function based on a software reliability model.
Hawkes Process 
We generate two groups of Hawkes processes with intensity functions:
λ i * ( t ) = μ i + t α i e β i ( t u ) d N ( u ) , i = 1 , 2 , t [ 0 , 20 ] .
Each group consists of 50 realizations. We fix α 1 = 4 , β 1 = 5 , and μ 1 = 0.5 and systematically alter α 2 , β 2 , and μ 2 . We repeat this simulation 1000 times. The realizations of one of these simulations are shown in Figure 1, and the results are shown in Table 2.
Poisson Process 
We generate two groups of Poisson processes with intensity functions:
λ i ( t ) = α i ( t / β i ) β i exp ( t / β i ) , i = 1 , 2 , t [ 0 , 10 ] .
Each group consists of 100 realizations. We fix α 1 = 20 , β 1 = 1 and vary α 2 and β 2 . The log-likelihood functions are maximized using numerical optimization, and hypothesis testing is performed based on the estimated parameters. This experiment is repeated 1000 times. The realizations of one of these simulations are shown in Figure 2, and the p-values (p) of each parameter setting are shown in Table 3.
Both simulations demonstrate that as the differences between the two intensity functions increase, the overall p-value decreases, indicating that the MLE-based test effectively detects deviations in point process characteristics. Additionally, the empirical power increases with the effect size and with larger sample sizes, confirming the stability of the proposed procedure under both types of alternatives.

5.1.3. Simulation for ILR-Based Nonparametric Tests

We now evaluate the ILR transformation-based test for general point processes. We still conduct simulations on two different types of point processes: the Hawkes process and the Poisson process.
Hawkes Process 
We generate two groups of Hawkes processes with intensity functions:
λ i * ( t ) = μ i + t α i e β i ( t u ) d N ( u ) , i = 1 , 2 , t [ 0 , 10 ] .
Each group contains 5000 realizations, and we fix α 1 = 4 , β 1 = 5 and μ 1 = 0.5 and alter one of the values of α 2 , β 2 and μ 2 at a time.
To conduct the ILR transformation-based test, we apply the ILR mapping on inter-event times and use multivariate depth-based hypothesis testing. K 1 , K 2 , K m in Equation (9) are set to 2 , 3 , , 8 . This process is repeated 1000 times, and we record the mean and standard deviation of the p-values (p). The results are presented in Table 4.
Poisson Process 
Next, we simulate two groups of Poisson processes, each characterized by the intensity function:
λ i ( t ) = α i t β i 2 exp ( t / β i ) , i = 1 , 2 , t [ 0 , 1 ] .
Each group contains 1000 realizations, and we fix α 1 = 5 , β 1 = 0.2 , and change the values of α 2 and β 2 . This process is repeated 1000 times. The p-values (p) of each parameter setting are shown in Table 5.
Both simulation studies confirm that as the differences between the intensity functions increase, the overall p-value decreases, indicating that the ILR transformation-based test effectively detects differences in point process distributions. The empirical power increases steadily as the parameter deviation becomes larger in both the Poisson and Hawkes settings. However, due to the multi-step construction and the combination of multiple conditional subtests, the procedure can be relatively sensitive in finite samples and may exhibit elevated rejection rates under the global null.

5.2. Real Data Analysis: Dopamine Spiking in Mouse VTA

We illustrate the proposed ILR-based test using a neural spiking dataset from the ventral tegmental area (VTA) of mice [24]. The data consist of spike times from optogenetically identified dopaminergic neurons recorded during classical conditioning tasks, where 1-second odor presentations served as conditioned stimuli and water deliveries as rewards (the dataset is publicly available from CRCNS.org as “vta-1”; see Starkweather et al. [25] for a detailed description). Across sessions, animals received either Salvinorin B (SalB), an agonist for Kappa Opioid Receptor DREADD expressed in the medial prefrontal cortex, or Saline injections, leading to different dopaminergic response patterns under uncertainty in reward prediction.
In this dataset, trials are organized into four experimental conditions: SalB1, SalB2, Saline1, and Saline2. SalB1 and SalB2 correspond to the two classical conditioning tasks performed under SalB injections, whereas Saline1 and Saline2 represent the same tasks performed under saline injections. The two tasks differ in their cue–reward contingencies and in the level of state uncertainty [24]. Within each condition, we observe spike trains from optogenetically identified dopaminergic neurons across many repetitions of the task, so that each group consists of a collection of long spike trains recorded on a common time window for multiple neuron–trial pairs.
To obtain many comparable point patterns, we segmented each long spike train into short point processes aligned to odor onset times. Specifically, we started from the first odor onset time t 0 and collected all spikes in the time interval [ t 0 , t 0 + 1000 ] ms. We then set the next t 0 to be the first odor onset strictly after t 0 + 1000 and repeated this procedure, thereby constructing a sequence of non-overlapping 1000 ms windows. Each segment was then shifted so that its starting time is 0, yielding short point processes defined on the common domain [ 0 , 1000 ] ms in each experimental group. This procedure yields more than 8000 short point processes in each of the four groups, all defined on the common domain [ 0 , 1000 ] . The time interval length of 1000 ms was chosen so that most segments contain at most 10 spikes while still retaining enough events for the ILR-based test.
We first compared the four groups at the between-condition level. For a given Monte Carlo sample size S { 1000 , 2000 , 3500 } , we repeatedly (with N iter = 1000 iterations) drew S spike trains without replacement from each of the two groups under comparison and applied the ILR-based two-sample test on the resulting collections of point processes. Within each iteration, we evaluated the test over cardinalities 2 , , 8 , using the conditional framework described in Section 4, and recorded the resulting p-values. Table 6 reports the mean and standard deviation of the p-values over the 1000 repetitions for each pair of groups.
Across all choices of S , the between-group comparisons yield small p-values, and the null hypothesis of equal point process distributions is consistently rejected. This indicates systematic differences in the spike timing patterns between SalB and Saline conditions, as well as between the two task-related groups. As expected for a powerful test, the ILR-based procedure is sensitive to such distributional changes and tends to reject the null whenever there are systematic differences in cardinality or temporal structure. Moreover, for each pairwise comparison, the mean p-values decrease as S increases, reflecting the gain in power with larger sample sizes.
To assess the behavior of the test under the null, we also performed within-group comparisons. Using the same resampling scheme (with S { 1000 , 2000 , 3500 } and N iter = 1000 ), we randomly split each group into two subsamples of equal size and applied the ILR test to these artificially constructed pairs. Table 7 summarizes the mean and standard deviation of the resulting p-values.
The within-group p-values are moderate and do not provide evidence against the null hypothesis for any of the four groups. Although the ILR-based test is powerful and tends to reject the null when there are genuine differences between point process distributions, these internal comparisons suggest that its type I error is under reasonable control in this setting. Taken together, the results in Table 6 and Table 7 demonstrate that the proposed ILR-based framework can detect meaningful changes in spike timing patterns across experimental conditions while remaining stable under repeated subsampling from the same underlying process. Moreover, for each within-group comparison, the mean p-values increase as S grows, which is consistent with more stable estimation under larger sample sizes when the null hypothesis holds.

6. Summary and Future Work

In this paper, we proposed and examined three statistical hypothesis testing approaches for comparing point processes, particularly focusing on Poisson processes. The three methods include: (1) a three-step test designed specifically for inhomogeneous Poisson processes, (2) a parametric test based on maximum likelihood estimation for structured point processes such as Hawkes processes, and (3) a nonparametric approach utilizing the Isometric Log-Ratio transformation for general point processes.
The three-step test for inhomogeneous Poisson processes evaluates differences in total intensity and normalized intensity separately, combining results using Fisher’s method to obtain an overall p-value. Through extensive simulations, we demonstrated that this method effectively identifies both magnitude and distributional differences in event occurrences.
The MLE-based parametric test was applied to both Hawkes processes and a Poisson process model used in software reliability. By estimating intensity function parameters through numerical optimization, the test successfully detected differences between two processes in simulated experiments. The results confirmed that as the differences between intensity functions grow, the test reliably identifies deviations.
For more general point processes where a parametric form may not be suitable, we introduced a nonparametric ILR transformation-based test. By mapping inter-event times into a Euclidean space, we leveraged depth-based statistical methods to compare point process distributions. Simulation studies on both Hawkes and Poisson processes demonstrated that the ILR-based test effectively captures differences in the underlying event structure.
Additionally, we applied the ILR-based test to real spike-train recordings. We analyzed spike-time distributions across multiple experimental conditions, focusing on groups with similar or distinct pharmacological manipulations. The results showed that comparisons between conditions with similar neural activity patterns exhibited no significant differences in their spike timing, whereas conditions involving altered excitability displayed clear and statistically significant deviations.
While the proposed methods provide effective tools for comparing point processes, there are several directions for future research:
  • Extending the three-step procedure to multi-group comparisons: Our three-step test was primarily designed for comparing two Poisson processes. Extending this framework to compare multiple inhomogeneous Poisson processes simultaneously would be an interesting avenue for further study.
  • Robustness of MLE-based methods in more complex point process models: The MLE-based test performed well in Poisson and Hawkes processes, but further research could explore its effectiveness in more complex models, such as renewal processes or self-correcting point processes.
  • Optimization of ILR-based methods for small sample sizes: The ILR transformation provides a flexible framework for analyzing point processes, but its performance can be limited when sample sizes are small. Future research can explore ways to refine the transformation or incorporate additional statistical techniques to improve sensitivity and power in low-sample scenarios.
  • Application to other real-world datasets: While we applied our methods to spiking data, similar techniques could be used to analyze other event-based data, such as financial transactions, earthquake occurrences, or neuronal spike trains.
Overall, our study provides a basic framework for statistical tests on point processes, and the proposed methods have demonstrated their effectiveness in both simulated and real-world settings. Future work will aim to refine and extend these methodologies to further improve their applicability and robustness in diverse domains.

Author Contributions

Conceptualization, Y.M. and W.W.; methodology, Y.M. and W.W.; formal analysis, Y.M. and W.W.; data curation, Y.M.; writing—original draft preparation, Y.M.; writing—review and editing, Y.M. and W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study are publicly available from online sources cited in the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Technical Details for Maximum Likelihood Estimation of Point Processes

Appendix A.1. Asymptotic Normality of Maximum Likelihood Estimators for Point Processes

Let { s 1 , , s M } be a set of independent realizations of a point process with intensity λ ( t | θ ) , where θ Θ are unknown parameters. Let θ ^ M be the maximum likelihood estimation of θ . Let l M be the normalized log-likelihood function:
l M ( θ ) = 1 M log i = 1 M L s i ( θ ) ,
where L s i is the likelihood of s i . By the definition of maximum likelihood estimation, θ ^ M = arg max θ Θ log i = 1 M L s i ( θ ) , so l M ( θ ^ M ) = 0 . Then by mean value theorem, l M ( θ ^ M ) = l M ( θ ) + l M ( θ ˜ ) ( θ ^ M θ 0 ) for some θ ˜ between θ and θ ^ M . It follows that M ( θ ^ M θ ) = M l M ( θ ) l M ( θ ˜ ) . By the central limit theorem, M l M ( θ ) d M ( 0 , I ( θ ) ) . If we assume θ ^ M p θ , then by the weak law of large numbers, l M ( θ ˜ ) p I ( θ ) . Finally, Slutsky’s theorem implies the asymptotic normality
θ ^ M d M ( θ , I ( θ ) 1 )
as M .

Appendix A.2. Gradients and Hessians for Hawkes Process

The gradients are
log L α = i = 1 n 1 β e β ( T t i ) 1 + i = 1 n A ( i ) μ + α A ( i ) ,
log L β = α i = 1 n 1 β ( T t i ) e β ( T t i ) + 1 β 2 e β ( T t i ) 1 i = 1 n α B ( i ) μ + α A ( i ) ,
log L μ = T + i = 1 n 1 μ + α A ( i ) ,
where B ( i ) = t j < t i ( t i t j ) e β ( t i t j ) for i 2 and B ( 1 ) = 0 .
The Hessian matrix is
2 log L α 2 = i = 1 n A ( i ) μ + α A ( i ) 2 ,
2 log L β α = i = 1 n 1 β ( T t i ) e β ( T t i ) + 1 β 2 e β ( T t i ) 1 + i = 1 n B ( i ) μ + α A ( i ) + α A ( i ) B ( i ) ( μ + α A ( i ) ) 2 .
2 log L β 2 = α i = 1 n 1 β ( T t i ) 2 e β ( T t i ) + 2 β 2 ( T t i ) e β ( T t i ) + 2 β 3 e β ( T t i ) 1 + i = 1 n α C ( i ) μ + α A ( i ) α B ( i ) μ + α A ( i ) 2 .
where C ( i ) = t j < t i ( t i t j ) 2 e β ( t i t j ) for i 2 and C ( 1 ) = 0 ,
2 log L μ 2 = i = 1 n 1 ( μ + α A ( i ) ) 2 ,
2 log L α μ = i = 1 n A ( i ) ( μ + α A ( i ) ) 2 ,
2 log L β μ = i = 1 n α B ( i ) ( μ + α A ( i ) ) 2 .

References

  1. Daley, D.J.; Vere-Jones, D. An Introduction to the Theory of Point Processes: Volume I: Elementary Theory and Methods; Springer: New York, NY, USA, 2003. [Google Scholar]
  2. Cox, D. Some simple approximate tests for Poisson variates. Biometrika 1953, 40, 354–360. [Google Scholar] [CrossRef]
  3. Gail, M. Power computations for designing comparative Poisson trials. Biometrics 1974, 30, 231–237. [Google Scholar] [CrossRef]
  4. Krishnamoorthy, K.; Thomson, J. A more powerful test for comparing two Poisson means. J. Stat. Plan. Inference 2004, 119, 23–35. [Google Scholar] [CrossRef]
  5. Chiu, S.N. Parametric bootstrap and approximate tests for two Poisson variates. J. Stat. Comput. Simul. 2010, 80, 263–271. [Google Scholar] [CrossRef]
  6. Deshpande, J.; Mukhopadhyay, M.; Naik-Nimbalkar, U. Testing of two sample proportional intensity assumption for non-homogeneous Poisson processes. J. Stat. Plan. Inference 1999, 81, 237–251. [Google Scholar] [CrossRef]
  7. Fromont, M.; Laurent, B.; Reynaud-Bouret, P. The two-sample problem for poisson processes: Adaptive tests with a nonasymptotic wild bootstrap approach. Ann. Stat. 2013, 41, 1431–1461. [Google Scholar] [CrossRef]
  8. Fisher, R.A. Statistical Methods for Research Workers; Springer: Berlin/Heidelberg, Germany, 1992; pp. 66–70. [Google Scholar]
  9. Hawkes, A.G. Point spectra of some mutually exciting point processes. J. R. Stat. Soc. Ser. B Stat. Methodol. 1971, 33, 438–443. [Google Scholar] [CrossRef]
  10. Egozcue, J.J.; Pawlowsky-Glahn, V.; Mateu-Figueras, G.; Barcelo-Vidal, C. Isometric logratio transformations for compositional data analysis. Math. Geol. 2003, 35, 279–300. [Google Scholar] [CrossRef]
  11. Li, J.; Liu, R.Y. New nonparametric tests of multivariate locations and scales using data depth. Stat. Sci. 2004, 19, 686–696. [Google Scholar] [CrossRef]
  12. Zhang, C.; Xiang, Y.; Shen, X. Some multivariate goodness-of-fit tests based on data depth. J. Appl. Stat. 2012, 39, 385–397. [Google Scholar] [CrossRef]
  13. Liu, R.Y.; Singh, K. A quality index based on data depth and multivariate rank tests. J. Am. Stat. Assoc. 1993, 88, 252–260. [Google Scholar] [CrossRef]
  14. Chiu, S.N.; Wang, L. Homogeneity tests for several Poisson populations. Comput. Stat. Data Anal. 2009, 53, 4266–4278. [Google Scholar] [CrossRef]
  15. Scholz, F.W.; Stephens, M.A. K-Sample Anderson–Darling Tests of Fit for Continuous and Discrete Cases; Technical Report Technical Report 81; Department of Statistics, University of Washington: Seattle, WA, USA, 1986. [Google Scholar]
  16. Zhao, M.; Xie, M. On maximum likelihood estimation for a general non-homogeneous Poisson process. Scand. J. Stat. 1996, 23, 597–607. [Google Scholar]
  17. Ozaki, T. Maximum likelihood estimation of Hawkes’ self-exciting point processes. Ann. Inst. Stat. Math. 1979, 31, 145–155. [Google Scholar] [CrossRef]
  18. Franke, T.M.; Ho, T.; Christie, C.A. The chi-square test: Often used and more often misinterpreted. Am. J. Eval. 2012, 33, 448–458. [Google Scholar] [CrossRef]
  19. Zhou, X.; Ma, Y.; Wu, W. Statistical Depth for Point Process via the Isometric Log-Ratio Transformation. arXiv 2022, arXiv:2203.04454. [Google Scholar] [CrossRef]
  20. Pawlowsky-Glahn, V.; Egozcue, J.J.; Tolosana-Delgado, R. Lecture Notes on Compositional Data Analysis; Departament d’Informàtica i Matemàtica Aplicada, Universitat de Girona: Girona, Spain, 2007; Technical report. [Google Scholar]
  21. Edwards, A.W. RA Fischer, statistical methods for research workers, (1925). In Landmark Writings in Western Mathematics 1640–1940; Elsevier: Amsterdam, The Netherlands, 2005; pp. 856–870. [Google Scholar]
  22. Bonferroni, C. Teoria statistica delle classi e calcolo delle probabilita. Pubbl. Del R Ist. Super. Di Sci. Econ. E Commericiali Di Firenze 1936, 8, 3–62. [Google Scholar]
  23. Holm, S. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 1979, 6, 65–70. [Google Scholar]
  24. Starkweather, C.K.; Gershman, S.J.; Uchida, N. The Medial Prefrontal Cortex Shapes Dopamine Reward Prediction Errors under State Uncertainty. Neuron 2018, 98, 616–629.e6. [Google Scholar] [CrossRef] [PubMed]
  25. Starkweather, C.K.; Gershman, S.J.; Uchida, N. Spiking activity of dopaminergic neurons in mouse ventral tegmental area (VTA) during chemogenetic suppression of the mPFC, during two classical conditioning tasks. CRCNS. org 2018, 10, K01J97X6. [Google Scholar] [CrossRef]
Figure 1. Simulated realizations of Hawkes processes from one representative experiment. Each point represents an event time. The x-axis denotes the observation time, and the y-axis indexes different realizations. (Left): Group 1. (Right): Group 2. The two groups are generated under different model parameters, with α 2 = 3.5 , β 2 = 5 , μ 2 = 0.5 . The corresponding p-value is p = 1.9868 × 10 5 .
Figure 1. Simulated realizations of Hawkes processes from one representative experiment. Each point represents an event time. The x-axis denotes the observation time, and the y-axis indexes different realizations. (Left): Group 1. (Right): Group 2. The two groups are generated under different model parameters, with α 2 = 3.5 , β 2 = 5 , μ 2 = 0.5 . The corresponding p-value is p = 1.9868 × 10 5 .
Mathematics 14 00727 g001
Figure 2. Simulated realizations of Poisson processes from one representative experiment. Each point represents an event time. The x-axis denotes the observation time, and the y-axis indexes different realizations. (Left): Group 1. (Right): Group 2. The two groups are generated under different intensity levels, with α 2 = 18 and β 2 = 1 . The corresponding p-value is p = 0.0256 .
Figure 2. Simulated realizations of Poisson processes from one representative experiment. Each point represents an event time. The x-axis denotes the observation time, and the y-axis indexes different realizations. (Left): Group 1. (Right): Group 2. The two groups are generated under different intensity levels, with α 2 = 18 and β 2 = 1 . The corresponding p-value is p = 0.0256 .
Mathematics 14 00727 g002
Table 1. Mean (±S.D.) of p-values and empirical power ( α = 0.05 ) of the three-step tests in the simulation data.
Table 1. Mean (±S.D.) of p-values and empirical power ( α = 0.05 ) of the three-step tests in the simulation data.
M 1 M 2 cd p 1 p 2 pPower
30300.11.0 0.5206 ( ± 0.2902 ) 0.3624 ( ± 0.2688 ) 0.4162 ( ± 0.2858 ) 0.0790
50500.11.0 0.4877 ( ± 0.2868 ) 0.2683 ( ± 0.2379 ) 0.3202 ( ± 0.2577 ) 0.1680
30300.21.0 0.4855 ( ± 0.2819 ) 0.0781 ( ± 0.1019 ) 0.1308 ( ± 0.1511 ) 0.3990
50500.21.0 0.5083 ( ± 0.2928 ) 0.0176 ( ± 0.0316 ) 0.0417 ( ± 0.0699 ) 0.7530
30300.31.0 0.5079 ( ± 0.2853 ) 0.0033 ( ± 0.0100 ) 0.0094 ( ± 0.0254 ) 0.9560
50500.31.0 0.5102 ( ± 0.2827 ) 0.0001 ( ± 0.0006 ) 0.0004 ( ± 0.0026 ) 0.9990
30300.01.1 0.3798 ( ± 0.3044 ) 0.5168 ( ± 0.2909 ) 0.4100 ( ± 0.3002 ) 0.1140
50500.01.1 0.3175 ( ± 0.2979 ) 0.5309 ( ± 0.2919 ) 0.3637 ( ± 0.2952 ) 0.1660
30300.01.2 0.1664 ( ± 0.2297 ) 0.5249 ( ± 0.2943 ) 0.2145 ( ± 0.2438 ) 0.3430
50500.01.2 0.1005 ( ± 0.1761 ) 0.5132 ( ± 0.2925 ) 0.1398 ( ± 0.2013 ) 0.5170
30300.01.3 0.0476 ( ± 0.1074 ) 0.5194 ( ± 0.2923 ) 0.0806 ( ± 0.1468 ) 0.6590
50500.01.3 0.0132 ( ± 0.0464 ) 0.5085 ( ± 0.2866 ) 0.0267 ( ± 0.0720 ) 0.8610
Table 2. Mean (±S.D.) of p-values and empirical power ( α = 0.05 ) in the simulated Hawkes process for the MLE-based test.
Table 2. Mean (±S.D.) of p-values and empirical power ( α = 0.05 ) in the simulated Hawkes process for the MLE-based test.
α 2 β 2 μ 2 pPower
460.5 0.0027 ( ± 0.0163 ) 0.9880
45.50.5 0.0924 ( ± 0.1584 ) 0.6160
3.550.5 0.0413 ( ± 0.0981 ) 0.8100
3.750.5 0.1997 ( ± 0.2445 ) 0.3900
450.35 0.0078 ( ± 0.0329 ) 0.9650
450.4 0.0967 ( ± 0.1654 ) 0.6240
Table 3. Mean (±S.D.) of p-values and empirical power ( α = 0.05 ) in the simulated Poisson process for the MLE-based test.
Table 3. Mean (±S.D.) of p-values and empirical power ( α = 0.05 ) in the simulated Poisson process for the MLE-based test.
α 2 β 2 pPower
200.90 0.0023 ( ± 0.0149 ) 0.9890
200.95 0.1390 ( ± 0.2043 ) 0.5040
171 0.0016 ( ± 0.0116 ) 0.9920
181 0.0325 ( ± 0.0926 ) 0.8460
191 0.2629 ( ± 0.2704 ) 0.2750
Table 4. Mean (±S.D.) of p 0 , overall p-values, and empirical power ( α = 0.05 ) in the simulated Hawkes process for the ILR-based test. When ( α 2 , β 2 , μ 2 ) = ( α 1 , β 1 , μ 1 ) , the reported power corresponds to the empirical size.
Table 4. Mean (±S.D.) of p 0 , overall p-values, and empirical power ( α = 0.05 ) in the simulated Hawkes process for the ILR-based test. When ( α 2 , β 2 , μ 2 ) = ( α 1 , β 1 , μ 1 ) , the reported power corresponds to the empirical size.
α 2 β 2 μ 2 p 0 pPower
4.05.00.50 0.4916 ( ± 0.2835 ) 0.1073 ( ± 0.1310 ) 0.4490
3.55.00.50 0.4482 ( ± 0.2919 ) 0.1107 ( ± 0.1408 ) 0.4680
3.05.00.50 0.4093 ( ± 0.2843 ) 0.0406 ( ± 0.0678 ) 0.7530
2.85.00.50 0.3941 ( ± 0.2981 ) 0.0235 ( ± 0.0440 ) 0.8660
4.04.50.50 0.4750 ( ± 0.2890 ) 0.0779 ( ± 0.1039 ) 0.5640
4.04.00.50 0.3703 ( ± 0.2926 ) 0.0316 ( ± 0.0548 ) 0.8180
4.03.80.50 0.2898 ( ± 0.2675 ) 0.0214 ( ± 0.0474 ) 0.8860
4.05.00.45 0.3466 ( ± 0.2876 ) 0.0962 ( ± 0.1296 ) 0.5150
4.05.00.40 0.0841 ( ± 0.1394 ) 0.0387 ( ± 0.0677 ) 0.7770
4.05.00.60 0.1944 ( ± 0.2331 ) 0.0482 ( ± 0.0771 ) 0.7100
4.05.00.62 0.1474 ( ± 0.1975 ) 0.0354 ( ± 0.0665 ) 0.8040
Table 5. Mean (±S.D.) of p 0 , overall p-values, and empirical power ( α = 0.05 ) in the simulated Poisson process for the ILR-based test. When ( α 2 , β 2 ) = ( α 1 , β 1 ) , the reported power corresponds to the empirical size.
Table 5. Mean (±S.D.) of p 0 , overall p-values, and empirical power ( α = 0.05 ) in the simulated Poisson process for the ILR-based test. When ( α 2 , β 2 ) = ( α 1 , β 1 ) , the reported power corresponds to the empirical size.
α 2 β 2 p 0 pPower
50.2 0.5045 ( ± 0.2885 ) 0.1241 ( ± 0.1521 ) 0.446
50.21 0.4937 ( ± 0.2871 ) 0.1102 ( ± 0.1404 ) 0.460
50.22 0.4506 ( ± 0.2842 ) 0.0965 ( ± 0.1276 ) 0.504
50.25 0.2618 ( ± 0.2582 ) 0.0326 ( ± 0.0527 ) 0.806
50.27 0.1051 ( ± 0.1650 ) 0.0119 ( ± 0.0210 ) 0.947
50.18 0.4783 ( ± 0.2947 ) 0.0848 ( ± 0.1144 ) 0.539
50.17 0.4536 ( ± 0.2906 ) 0.0558 ( ± 0.0767 ) 0.664
50.15 0.4077 ( ± 0.2903 ) 0.0061 ( ± 0.0133 ) 0.983
5.10.2 0.4787 ( ± 0.2924 ) 0.1214 ( ± 0.1544 ) 0.443
5.30.2 0.2237 ( ± 0.2377 ) 0.1104 ( ± 0.1418 ) 0.458
5.50.2 0.0589 ( ± 0.1241 ) 0.0622 ( ± 0.0960 ) 0.668
5.70.2 0.0076 ( ± 0.0298 ) 0.0193 ( ± 0.0538 ) 0.901
Table 6. ILR-based two-sample tests between experimental groups in the VTA dataset. Reported values are mean (±S.D.) of p-values over N iter = 1000 repetitions.
Table 6. ILR-based two-sample tests between experimental groups in the VTA dataset. Reported values are mean (±S.D.) of p-values over N iter = 1000 repetitions.
Comparison S = 1000 S = 2000 S = 3500
SalB1 vs. SalB20.0159 (±0.0401)0.0042 (±0.0128)0.0004 (±0.0021)
Saline1 vs. Saline20.0012 (±0.0052)0.00003 (±0.00011)0.0000 (±0.0000)
SalB1 vs. Saline10.0473 (±0.0741)0.0281 (±0.0503)0.0071 (±0.0137)
SalB2 vs. Saline20.0192 (±0.0346)0.0054 (±0.0128)0.0006 (±0.0019)
Table 7. ILR-based two-sample tests within each experimental group in the VTA dataset. Reported values are mean (±S.D.) of p-values over N iter = 1000 repetitions.
Table 7. ILR-based two-sample tests within each experimental group in the VTA dataset. Reported values are mean (±S.D.) of p-values over N iter = 1000 repetitions.
Group S = 1000 S = 2000 S = 3500
SalB10.1445 (±0.1623)0.2170 (±0.2237)0.2652 (±0.2564)
SalB20.1066 (±0.1263)0.1944 (±0.2106)0.2428 (±0.2345)
Saline10.1370 (±0.1511)0.2066 (±0.2094)0.2713 (±0.2530)
Saline20.1130 (±0.1330)0.1837 (±0.1934)0.2372 (±0.2293)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mu, Y.; Wu, W. Hypothesis Tests for Comparing Point Processes. Mathematics 2026, 14, 727. https://doi.org/10.3390/math14040727

AMA Style

Mu Y, Wu W. Hypothesis Tests for Comparing Point Processes. Mathematics. 2026; 14(4):727. https://doi.org/10.3390/math14040727

Chicago/Turabian Style

Mu, Yue, and Wei Wu. 2026. "Hypothesis Tests for Comparing Point Processes" Mathematics 14, no. 4: 727. https://doi.org/10.3390/math14040727

APA Style

Mu, Y., & Wu, W. (2026). Hypothesis Tests for Comparing Point Processes. Mathematics, 14(4), 727. https://doi.org/10.3390/math14040727

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop