Next Article in Journal
COVID-19 Spread Forecasting, Mathematical Methods vs. Machine Learning, Moscow Case
Previous Article in Journal
Zero-Free Intervals of Chromatic Polynomials of Mixed Hypergraphs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Asymptotics of Optimal Stopping Times

by
Hugh N. Entwistle
,
Christopher J. Lustri
and
Georgy Yu. Sofronov
*
Department of Mathematics and Statistics, Macquarie University, Sydney, NSW 2109, Australia
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(2), 194; https://doi.org/10.3390/math10020194
Submission received: 12 December 2021 / Revised: 7 January 2022 / Accepted: 7 January 2022 / Published: 9 January 2022
(This article belongs to the Section Probability and Statistics)

Abstract

:
We consider optimal stopping problems, in which a sequence of independent random variables is drawn from a known continuous density. The objective of such problems is to find a procedure which maximizes the expected reward. In this analysis, we obtained asymptotic expressions for the expectation and variance of the optimal stopping time as the number of drawn variables became large. In the case of distributions with infinite upper bound, the asymptotic behaviour of these statistics depends solely on the algebraic power of the probability distribution decay rate in the upper limit. In the case of densities with finite upper bound, the asymptotic behaviour of these statistics depends on the algebraic form of the distribution near the finite upper bound. Explicit calculations are provided for several common probability density functions.

1. Introduction

Optimal stopping problems pose the challenge of deciding when to stop some stochastic process in order to maximise some objective, or utility. This problem has arisen in a number of contexts, such as deciding how many candidates for a job should be interviewed before deciding upon one particular candidate, known in literature as the secretary problem [1,2,3]. Optimal stopping has been used to determine when assets should be bought and sold in order to maximize profits; this has been described as the house-selling problem and is discussed in [4,5,6]. Optimal stopping has been incorporated into the theory of online auctions, as in [7]. In this study, the authors linked the analysis of the secretary problem to the design of a mechanism for online auctions. Related analyses may be found in [8,9]. Optimal stopping also features in financial and economic applications, such as in the pricing of American options [10], games [11], operational risk insurance [12], dynamic pricing [13] and more complicated buying–selling problems [14,15,16].
Optimal stopping problems are formulated in terms of observing random variables and determining the stopping point in order to maximize a particular reward function. The problem considered here involved observing a sequence of random variables y 1 , y 2 , , y N and making the decision to stop after a particular number of observations, denoted by m, where 1 m N , based on the variables that had been previously observed at that stage. After stopping, we receive a reward which is a function of the already observed values y 1 , , y m . This problem is one representative of a class of optimal stopping problems that consists of finding a sequential procedure that maximizes the expected reward. For a more extensive discussion of this class of problems, see Section 13.4 [17].
While there exists extensive literature on the theory of optimal stopping problems [17,18], less attention has been paid to the asymptotic properties of stopping times. Most existing asymptotic results focus on “no-information” problems in which the distribution of the observations is unknown.
In no-information problems, an observer determines the relative rank of each observation and the reward function is a function of these ranks. An example of a no-information problem is the secretary problem [1,19]. In this problem, the reward function is the indicator variable of the best object, which means that the observer aims to maximise the probability that the best object is selected. As a consequence, the secretary problem is sometimes referred to as the no-information best-choice problem.
There are many other variations of the no-information problem and significant work exists on the asymptotic properties of the stopping time (see, for example, [19,20,21]). It was shown in [22] that, in the secretary problem with a sequence of N observations, the asymptotic expectation and variance for the stopping time is 2 N / e and 2 / e 5 / e 2 N 2 , respectively, where e is taken to be Euler’s constant. Asymptotic descriptions of statistical properties for other no-information problem variants can be found in [20,23], where the techniques used are dependent on the structure and variation of each problem. The asymptotics where the number of observations is random has also been studied in [24].
There is substantially less literature describing the asymptotic behaviour of statistical properties for “full-information” problems, when the distribution of random variables is known beforehand. Gilbert and Mosteller [23] studied the optimal stopping strategy for the full-information problem in which the objective is identical to that of the secretary problem, known as the full-information best-choice problem [25]. In this setting, the optimal stopping strategy was shown to consist of stopping and choosing the m-th observation y m if it is the highest ranked out of all observations made at this point and has a value exceeding a threshold, which depends on m. Asymptotically, it was shown that v n , the value of a sequence with n steps remaining, v n 1 c / n , where c 0.804352 . An intermediate classification of the “partial information” problem occurs when the observations are taken from a known distribution but containing one, or potentially more, unknown parameters. For example, DeGroot (see Section 13.6; [17]) analysed the stopping problem involving sampling from a normal distribution with unknown mean. Such problems are dealt with using sequential estimation methodologies and Bayesian methods. As we focus on full-information problems in this paper, these methods are not needed and are deemed out of scope. In best-choice settings, variables that do not follow the uniform distribution can be re-scaled by applying the cumulative density function of the particular distribution to achieve a uniform distribution that is monotonically equivalent, as far as the best-choice is concerned. The asymptotic probability of finding the best object (see Section 3 of [23,26]) is independent of the distribution of the variables, tending to approximately 0.58 as the number of observations in the sequence becomes large.
The best-choice problems that are described above are interested in finding the very best object. For the full-information case, this condition can be relaxed by instead setting our reward at stopping time m to y m and seeking to maximise the expected reward. A special case of this problem, called the uniform game (Section 5a of [23]), is related to the well-known Cayley’s problem (see, for example, [1,27]). In [23], the authors showed that the expected reward of a sequence of n independent and identically distributed (iid) random variables having the standard uniform distribution can be approximated in the following way (see also [27]):
v n 1 2 n + log ( n + 1 ) + 1.767 as n .
In [22], the authors found the asymptotics of the expected value and the variance of the stopping time as N / 3 and N 2 / 18 , respectively, for the uniform game. The asymptotic techniques used to compute this behaviour are highly dependent on the distribution and cannot be easily generalized to more general classes of distribution. In [28,29], using the extreme value theory, the authors proved limit theorems for threshold-stopped random variables and derived the asymptotic distribution of the reward sequence of the optimal stopping (iid) random variables.
In the full-information problem, where the reward at time m is y m , such as that considered in the present study, the distribution of the observations plays a significant role in the asymptotic behaviour of the outcome statistics. This study offers a general asymptotic technique for calculating the asymptotic behaviour of E ( τ N ) and Var ( τ N ) as N for general classes of probability distributions in the full-information problem where we wish to maximise the expected reward y m . The value y m could be interpreted as an offer for an asset (e.g., a house), at some time m, which the decision maker would like to sell. Having simple asymptotic results for E ( τ N ) and Var ( τ N ) can be useful in analysing properties of the asymptotic distribution. For example, we may substitute for the expected value and variance in inequalities such as the Chebyshev’s and Markov’s to obtain probability bounds and avoid the need for simulation, which can save time and resources. It is worth noting that the asymptotic behaviour of the optimal stopping problem can be useful in understanding resource allocation in situations where there may be a fixed cost per observation, i.e., the decision maker would be able to have a conservative understanding of how much cost they would incur before their stopping strategy is completed, hence being able to allocate their resources accordingly. In addition to recovering existing results from [22] for the uniform distribution, we extended the analysis to consider the effect of drawing elements of the sequence from a wide range of common distributions.

2. Formulation

Let y 1 , y 2 , , y N be a sequence of independent, identically distributed (iid) random variables from a known distribution. We are allowed to observe the y n sequentially, stopping at any variable we choose. If we stop at time m after observations ( y 1 , , y m ) , then we receive reward y m . This decision to stop depends solely on the values of the random sequence already observed and not on any future values. We are interested in the problem where we find an optimal stopping rule that maximises our expected gain. More formally, we assume we have (i) a probability space ( Ω , F , { F n } n = 1 , , N , P ) , (ii) an increasing sequence { F n } n = 1 , , N of sub- σ -algebras of F and (iii) that the sequence y 1 , , y N of random variables is such that y n is measurable with respect to F n for each n = 1 , , N . Note that, here, we take F n = B ( y 1 , , y n ) , where B ( A ) is the σ -algebra generated by A . The optimal stopping problem can be formulated as finding the value process
V n = ess sup τ S n E { y τ | F n } ,
where S n is the class of all F -measurable stopping times with values in { n , , N } and E { · | F n } is the conditional expectation with respect to F n . We note that the (iid) assumption allows us to drop this conditional.
A well-known method to solve such optimal stopping problems is to use a recursive scheme called the Snell envelope. This is the main tool to derive optimal solutions and to characterise them; see, for example, [30]. In the present study, the problem of finding the optimal stopping rule can be solved by backward induction using the Snell envelope (see, for example, [17,18]).
v n = E ( max { y N n + 1 , v n 1 } ) , n = 1 , , N , v 0 = ,
where v n is the value of a sequence with n steps and v N is the expected reward. Here, max { y N n + 1 , v n 1 } represents the maximum gain that is possible to obtain having n steps left. If n = 1 , we have to stop and our gain would be y N . If 1 < n N , we can either stop or continue. If we stop, our gain is y N n + 1 and if we continue, our expected gain is v n 1 .
If y 1 , y 2 , , y N are iid continuous random variables with common probability density function (pdf) f ( y ) , then
v n = max { y , v n 1 } f ( y ) d y , n = 1 , , N , v 0 = .
The optimal stopping rule τ N is
τ N = min { m : 1 m N , y m v N m } .
This optimal stopping rule formulation is for single stopping. It can be extended for multiple stopping for sums of random variables; see [31,32].

3. Computing v n Behaviour

3.1. Recurrence Relation for v n

Let us assume that y n is drawn from a continuous pdf f ( y ) , which has unbounded support in the positive direction. This function has a cumulative distribution function (cdf) F ( y ) and a survival function h ( y ) = 1 F ( y ) .
Theorem 1.
Let Y be an integrable random variable whose expectation exists and which is drawn from a continuous distribution f ( y ) .
The value of a sequence with n + 1 steps is given by
v n + 1 = v n + v n h ( y ) d y .
Proof. 
We recall that, for any integrable random variable X, we have that
E ( X ) = 0 ( 1 F ( x ) ) d x 0 F ( x ) d x .
Thus,
v n + 1 = 0 P ( max ( Y , v n ) x ) d x + 0 v n P ( max ( Y , v n ) > x ) d x + v n P ( max ( Y , v n ) > x ) d x = 0 + 0 v n 1 d x + v n P ( Y > x ) d x = v n + v n h ( y ) d y .
We note that, if f ( y ) has bounded support in the positive direction such that f ( y ) = 0 for y > y max , it follows, from above, that
v n + 1 = v n + v n y max h ( y ) d y .

3.2. Asymptotics of v n as n

Using the asymptotic controlling factor method (found in Section 5.3 of [33]), we know that, if a function v n grows no more rapidly than e a n b with b < 1 as n , then the leading-order asymptotic solution for v n satisfies
v n + 1 v n v n as n ,
where asymptotic equivalence is defined in the usual fashion and denotes differentiation with respect to n. It is straightforward to obtain a finite upper bound for the integral in (5), which implies that the growth of v n can be no faster than linear in n. Hence, this theorem holds.
Therefore, we can use (5) to write the asymptotic relation
v n v n h ( y ) d y as n ,
with the upper bound replaced by b in the finite upper support case, corresponding to (7).
Remark 1.
The expression in (9) can be manipulated to obtain a convenient asymptotic representation for v n , subject to some additional assumptions. Let us suppose that v n exists, is integrable for sufficiently large values of n and has an asymptotic series representation. These conditions are sufficient to allow both sides of (9) to be differentiated (see, for example, the discussion in Section 3.8 of [33]) to give
v n h ( v n ) v n as n .
Let us assume that there exists some k such that v n 0 for n > k . Therefore, Equation (10) gives
h ( v n ) v n v n as n .
The assumptions required for (11) to be valid hold for all distributions considered in the present study. Therefore, this relationship is used to simplify a later result for a collection of common distributions.

3.3. Example Calculations

We illustrate the application of these ideas to some common distributions, such as the exponential distribution, which is studied in Example 1. In Example 2, we demonstrate that the same method can be applied to heavy-tailed distributions by considering the Pareto distribution and, in Example 3, we show that this method can be easily applied to distributions with finite support by considering the uniform distribution.
Finally, in Example 4, we consider a broad class of distributions with exponential upper tails and show that the asymptotic behaviour of v n is fully determined for this class of distribution, subject to the assumption that v n increases without bound.
Example 1.
The exponential distribution is given by
f ( y ) = 1 β e y / β ,
with y [ 0 , ) and β > 0 .
The survival function h ( y ) is given by h ( y ) = e y / β . Hence, (9) becomes
d v n d n β e v n / β as n .
Solving this formal equation gives the large-n asymptotic behaviour as
v n β log ( n ) as n .
Note that the constant term obtained by solving the ordinary differential equation does not contribute to the leading-order expression in the large-n limit.
Example 2.
The Pareto distribution is given by
f ( y ) = α β α y α + 1 ,
on y [ β , ) , where β > 0 . We assume that α > 1 .
The survival function is given by h ( y ) = ( β / y ) α . Therefore, the differential equation for the large-n asymptotic behaviour is given by
d v n d n β α α 1 v n 1 α .
Solving this formal equation gives the asymptotic behaviour in the large-n limit as
v n β α n α 1 1 / α as n .
Example 3.
The uniform distribution is given by
f ( y ) = 1 b a ,
on y [ a , b ] , where b > a .
The survival function is given by h ( y ) = b y b a . Therefore, the differential equation for the large-n asymptotic behaviour is given from (7) by
d v n d n ( b v ) 2 2 ( b a ) as n .
Solving this formal equation gives the asymptotic behaviour in the large-n limit as
v n b 2 ( b a ) n as n .
Example 4.
A distribution f ( y ) is given with a survival function that, for sufficiently large values of y, satisfies
h ( y ) γ e ( y / β ) α < e ( y / β ) α y Δ ,
for positive Δ and where α, β and γ are positive constants. Let us assume that v n increases without bound (i.e., for any N, there exists an n such that v n > N ).
The condition in (21) gives the asymptotic behaviour of h ( y ) for large values of y. Assuming that v n increases without bound as n increases, there must be some sufficiently large n such that this inequality holds. Therefore, we may replace the full expression for h ( y ) in (9) with its asymptotic value in the large-n limit, giving the ordinary differential equation
d v n d n v n h ( y ) d y = γ β α Γ 1 α , v n α β α + g ( n ) ,
where Γ represents the upper incomplete gamma function and
| g ( n ) | < v n e ( y / β ) α y Δ d y < 1 v n Δ v n e ( y / β ) α d y .
As v n is assumed to increase without bound, this integral is asymptotically subdominant compared to the right-hand side of (22) as n . Therefore, the solution to the differential Equation (22) may be approximated using standard asymptotic methods to give the formal asymptotic relation
v n β log ( n ) 1 / α as n .
It is straightforward to show, using (11), that large-n asymptotics of h ( v n ) are independent of α and β to leading order, giving h ( v n ) 1 / n in the limit n .

3.4. Tabulation of Further Examples

Using methods similar to the previous examples, the asymptotic behaviour of v n may be computed for a wide range of common distributions. Table 1 contains the large-n asymptotics for a number of common distributions.
These densities contain a number of parameters, some of which are required to satisfy particular conditions. The normal distribution permits arbitrary μ , but requires σ > 0 . The gamma distribution requires α > 0 and β > 0 and the function γ represents the lower incomplete gamma function, found in [34]. We also assume that α 1 for the purposes of the calculation shown here. The α = 1 case requires a different analysis and corresponds to the exponential distribution. The Pareto distribution requires β > 0 and α > 1 .
The uniform distribution requires a < b , while the triangular distribution requires a < c < b . The Wigner distribution requires R > 0 and the beta distribution requires α > 0 and β > 0 . In the expression for the beta distribution, the function B ( α , β ) represents the standard beta function, while B ( y ; α , β ) represents the incomplete beta function (see [34]).
The function W denotes the Lambert-W function. The asymptotic expressions for v n associated with the normal and gamma distributions can be further simplified by observing that the argument of the Lambert-W function becomes large in the asymptotic limit and
W ( x ) = log ( x ) log ( log ( x ) ) + log ( log ( x ) ) log ( x ) + o 1 log ( x ) as x .
Using the first two terms of this expansion produces an asymptotic expression for v n with error that is o ( 1 ) as n for both the normal and gamma distributions.
We note that the results obtained using this asymptotic formulation for the uniform distribution are consistent with previous analyses from [20,22].
Importantly, we see that v n approaches a maximum value for each distribution with compact support, corresponding to the maximum possible value of y in the domain, as assumed and subsequently confirmed in the previous analysis. In contrast, for each pdf on a domain with unbounded upper support, v n increases monotonically without bound. The asymptotic behaviour of v n is subsequently used to determine the expectation and variance of τ N for each example.

4. Calculating Optimal Stopping Statistics

4.1. Calculating the Expectation

Let w i = P ( y < v i ) = 1 h ( v i ) . The expectation is now given by
E ( τ N ) = n = 1 N n P ( y 1 < v N 1 , , y n 1 < v N n + 1 , y n v N n )
= ( 1 w N 1 ) + 2 w N 1 ( 1 w N 2 ) + + N w N 1 w 1
= 1 + n = 1 N 1 i = n N 1 w i .
We select a value k such that 0 k N ; a representative choice is k = N . This choice does not affect the asymptotic values computed here, but would impact the form of higher-order corrections. We write the sum of products as
E ( τ N ) = 1 + n = 1 k 1 i = n N 1 w i + n = k N 1 i = n N 1 w i .
We now consider the first summation term in (29). Recalling that w n is non-negative and bounded above by one, we see that
0 < n = 1 k 1 i = n N 1 w i < n = 1 k 1 i = n N 1 1 = k 1 .
The purpose of (29) is to split the sum such that an asymptotic expression can be used for w i in order to approximate the second summation term. As k is large in the limit N , we may use the asymptotic approximation for v n in the large n limit for each term in the product. Consequently, if v n satisfies the requirements for (11) to hold, we find that
n = k N 1 i = n N 1 w i n = k N 1 i = n N 1 1 + v i v i as N .
In fact, for many distributions, this may be further simplified, using the large-n asymptotics for v n and its derivatives. For the distributions in Examples 1–4, as well as each distribution from Table 1, it is straightforward to show, by direct calculation using (11), that
h ( v n ) λ n as n ,
for some positive constant λ . In fact, from Example 4, we see that any distribution with an exponential upper tail has λ = 1 . For distributions where v n satisfies (32),
n = k N 1 i = n N 1 w i n = k N 1 i = n N 1 1 λ i = n = k N 1 ( n λ ) N n ( n ) N n = Γ ( N λ ) Γ ( N ) n = k N 1 Γ ( n ) Γ ( n λ ) ,
in the limit N , where ( a ) n is the Pochhammer symbol [34]. The asymptotic behaviour of this expression is given by
Γ ( N λ ) Γ ( N ) n = k N 1 Γ ( n ) Γ ( n λ ) N λ + 1 as N .
By comparing (34) with (30), we see that the second summation term dominates the expression for large N. Hence,
E ( τ N ) N λ + 1 as N .
We note that, while this leading-order approximation is correct in the limit N , for practical purposes, it is often necessary to retain the first correction term in the approximation for w n in the limit n . While this does not give an expression as simple as (35), it does significantly increase the accuracy of the approximation for moderately large values of N.
For example, in Section 5, we compute the expectation (and variance) of τ N for the normal distribution. Computing only the leading-order expression for the expectation and variance of τ N causes this computation to converge to the asymptotic value extremely slowly, due to the log ( log ( n ) ) terms present in the first correction to w n as n . By including these terms in the asymptotic expansion for w n , we are able to show that the simulations agree with the asymptotic predictions within the simulated range of N.

4.2. Calculating the Variance

A similar process may be used to determine the square expectation, hence the variance. The square expectation is given by
E ( τ N 2 ) = n = 1 N n 2 P ( y 1 < v N 1 , , y n 1 < v N n + 1 , y n v N n )
= ( 1 w N 1 ) + 2 2 w N 1 ( 1 w N 2 ) + + N 2 w N 1 w 1
= 1 + n = 1 N 1 ( 2 N + 1 2 n ) i = n N 1 w i .
We again define k such that 0 k N and split this series to obtain
E ( τ N 2 ) = 1 + n = 1 k 1 ( 2 N + 1 2 n ) i = n N 1 w i + n = k N 1 ( 2 N + 1 2 n ) i = n N 1 w i .
We now consider the first summation term in (39). As before, we note that w i is positive and less than one, giving
0 < n = 1 k 1 ( 2 N + 1 2 n ) i = n N 1 w i < n = 1 k 1 ( 2 N + 1 2 n ) i = n N 1 1 = 2 k N 2 N k 2 + 2 k 1 .
We have again split the sum such that an asymptotic expression can be used for w i in order to approximate the second summation term using the asymptotic approximation for v n in the large n limit for each term in the product. If v n satisfies the requirements for (11) to hold, we find that
n = k N 1 ( 2 N + 1 2 n ) i = n N 1 w i n = k N 1 ( 2 N + 1 2 n ) i = n N 1 1 + v i v i as N .
We again observe that, for many distributions, including those in Examples 1–4 and each distribution from Table 1, the asymptotic behaviour of v n and its derivatives may be used to simplify h ( v n ) , giving the expression in (32). Noting this result, the sum can be evaluated to give
n = k N 1 ( 2 N + 1 2 n ) i = n N 1 w i n = k N 1 ( 2 N + 1 2 n ) i = n N 1 1 λ i
= n = k N 1 ( 2 N + 1 2 n ) ( n λ ) N n ( n ) N n
= Γ ( N λ ) Γ ( N ) n = k N 1 ( 2 N + 1 2 n ) Γ ( n ) Γ ( n λ ) .
The asymptotic behaviour of this expression can be evaluated to give
Γ ( N λ ) Γ ( N ) n = k N 1 ( 2 N + 1 2 n ) Γ ( n ) Γ ( n λ ) 2 N 2 ( λ + 1 ) ( λ + 2 )
in the limit N . This expression dominates the sum in (40). Hence, the evaluation of the asymptotic behaviour of the variance as N gives
Var ( τ N ) = E ( τ N 2 ) E ( τ N ) 2 2 N 2 ( λ + 1 ) ( λ + 2 ) N λ + 1 2 .
This expression may be simplified to give an asymptotic approximation for the variance.
Var ( τ N ) λ N 2 ( λ + 1 ) 2 ( λ + 2 ) as N .
As before, it is sometimes of practical value to retain higher corrections in the approximation for w n as n , such as in the computations performed in Section [5] (Figure 1) on the normal distribution. This allows the asymptotic prediction to be compared with the simulations for the moderately large values of N considered here.

4.3. Higher Moments

We note that it is possible to calculate the statistics comprising of higher moments, since the above calculations for the first and second moment are easily generalised through the following expression for the j th raw moment:
E ( τ N j ) = 1 + n = 1 N 1 ( N n + 1 ) j ( N n ) j i = n N 1 w i .
Finding asymptotic expressions for moments of interest can be performed by using a very similar approach as above.

4.4. Example Calculations

In Table 2, we compute the expectation and variance for the common pdfs computed in Examples 1–3 and those shown in Table 2. This table illustrates the asymptotics of h ( y ) in the limit y for pdfs with infinite upper support and in the limit y y max for domains with finite upper support. For each pdf, the expectation and variance of τ N are obtained using the expectation formula from (35) and the variance formula from (47).
In each of the densities with exponentially decaying tails (exponential and gamma), the expectation and variance are identical, corresponding to setting λ = 1 in the expectation and variance formulae. This is consistent with the result of Example 4, which showed that the large-n asymptotics are identical for any pdf with an exponentially decaying upper tail, corresponding to λ = 1 .
However, in the remaining pdfs, the expectation and variance are not identical. For distributions with heavy tails or a finite upper bound, the optimal stopping statistics are determined by the rate at which the distribution function decays as the upper bound is approached, characterised by the asymptotic behaviour of h ( y ) in this limit. Consequently, the expectation and variance associated with each of these distributions take different values.
We can see that setting β = 1 in the beta distribution gives identical expectation and variance to the uniform distribution, while setting β = 2 or β = 3 / 2 give identical expectation and variance to the triangular and Wigner distributions, respectively. This is caused by the fact that the asymptotic decay of h ( y ) in the limit y y max occurs at the same algebraic power in each case and these statistics depend entirely on the algebraic power of the decay rate.

5. Numerical Comparisons

For each of the distributions in Table 2, we validated the asymptotic predictions for the expectation and variance of τ N for large N by comparing against numerical simulations. These results were simulated using the R programming language [35], with graphs plotted using Tikz [36]. For each value of N from 10 to 1000, in increments of 10, we simulated the optimal stopping problem 10,000 times, using the optimal stopping rule from (4) to determine the stopping point. The expected value and variance were then estimated for each of these N. The results of these computations are illustrated in Figure 1 and Figure 2 for a range of unbounded and bounded distributions, respectively.
We note that the calculated expectation and variance were scaled by a factor of N and N 2 , respectively, so that they tended to constant asymptotic values. The variance of the simulations for these calculated values appeared constant under this scaling, which indicates that the simulated expectation and variance both varied more significantly about the asymptotic predicted values as N increased.
In each computation, the numerical results support the corresponding asymptotic predictions. As N increased, the expectation and variance of each set of trials approached the value predicted by the asymptotic calculations. For the Pareto and beta distributions, the numerical comparisons were performed for three different sets of distribution parameters. We saw that changing the parameters of the Pareto and beta distributions had the effect of changing the asymptotic expectation and variance, as predicted by the calculated values in Table 2.
In contrast, Figure 3 contains the simulated results for the exponential function with β = 0.1 , β = 1 and β = 10 . In each of these figures, the asymptotic behaviour of the expectation and variance of τ N tended to identical values for the expectation and variance. These figures support the predictions from Section 4, in which it is determined that the leading-order large-N approximation of the expectation and variance do not depend on the distribution parameters for distributions with exponential upper tails.
In our asymptotic prediction for the mean and variance of the normal distribution, we retained the first correction term for h ( v n ) as n , which corresponded to taking the first correction in the expansion of the Lambert-W function (25). This gave
h ( v n ) 1 n + 1 n log ( n 2 / 2 π ) as n ,
which allowed us to more accurate estimate E ( τ N ) using (34) and Var ( τ N ) using (31). This first correction was small compared to the leading-order behaviour as N , but decayed so slowly that it had to be included in order to obtain accurate predictions for the expectation and variance of τ N for even moderately large values of N. By comparing the behaviour of the simulated and asymptotic results, we saw that the computations agreed with this more accurate asymptotic prediction.

6. Applications

Historically, asymptotics are used alongside numerical computation in order to understand the long-term behaviour of such problems that are difficult to analyse with exact results. While the asymptotic approximation of optimal stopping moments in this study does not assist with computation, these results can provide insight into the long-term structure of how these optimal decisions might be made. Typically, decision makers are risk-averse and tend to make sub-optimal decisions that are made too early [37]. Therefore, understanding the basic long-term behaviour of what might happen would benefit how decision makers perceive risk and their expected profits alongside with any losses they would incur before making profit.
Example 5.
In some processes, such as in the selling of the asset, the decision maker may receive a sequence of offers that follow some distribution. Additionally, there may be some fixed cost $ c N of observing each offer in the sequence; therefore, we must factor in the cumulative increase in these costs into the decision making. This may be especially true in the situation where there are limited resources that would prevent the ability to wait until an optimal decision can be made. The problem can be remodelled by creating the cumulative sequence { c n } n = 1 N , where c n corresponds to the amount paid in costs when there are n steps remaining. Here, we would have that c n = ( N n ) × c N and the recurrent equations for v n would now be a function of c n , which requires a new consideration of the problem. However, the asymptotics for the stopping time in the basic problem (without cost) can provide a conservative insight into how much money might be spent, or lost, before the stopping rule has been actioned. For example, if we observe a sequence of uniformly distributed variables on the interval [ a , b ] , under the basic optimal stopping strategy, we may expect to lose roughly $ c 3 before arriving at a decision. Under this strategy, we would expect a profit of
v N c 3 b 2 ( b a ) N c 3 .
These would be, in fact, underestimates of the strategy, since the optimal strategy would change. Nevertheless, very simple (conservative) estimates of the expected profit and cost can be obtained from this theory.
Example 6.
In this study, we calculated asymptotics for the expected value and variance of τ N . These can now be substituted into well-known inequalities, such as the Chebyshev’s and Markov’s inequalities, to obtain bounds for probabilities concerning the duration of the optimal stopping rule. This is useful when the underlying probability distribution is unknown and not roughly normal, which is the case for the stopping duration. We recall that, if X is an integrable non-negative random variable with finite expectation and finite non-zero variance and a > 0 is a real number, then the probability that X is at least a is, at most, the expectation of X divided by a. We note that τ N satisfies these conditions; thus, we apply Markov’s inequality to obtain
P τ N N δ δ E ( τ N ) N δ λ + 1 .
We may apply Chebyshev’s inequality to provide a bound for probability that the stopping time exceeds a certain distance from its mean.
P τ N E ( τ N ) N δ δ 2 Var ( τ N ) N 2 .
For large values of N, we obtain
P τ N N λ + 1 N δ λ δ 2 ( λ + 1 ) 2 ( λ + 2 )
By the corresponding substitution for λ , these calculations can be performed for any of the distributions highlighted in this study. We note that these bounds are not incredibly precise; for example, in the case for optimal stopping on the uniform distribution, we have that λ = 2 and so
P τ N N 3 N δ δ 2 18 ,
which would only be useful on 3 2 < δ < 3 2 . We simulated the uniform stopping problem for N = 5000 (conducting 10,000 simulations) to obtain the theoretical and simulated bounds (see Table 3).

7. Conclusions

In this paper, we derived asymptotics of optimal stopping times for sequences of independent identically distributed continuous random variables. In particular, we found the asymptotics of the expected value and the variance of the stopping time for large classes of density functions whose domains have either infinite or finite upper bounds.
Asymptotic calculations were performed on a number of probability distributions, on both bounded and unbounded domains. Numerical simulations were subsequently performed to calculate the expectation and variance of the optimal stopping rule for a range of values of N, ranging from 10 to 1000. In each case, the simulated results tended towards the asymptotic prediction in the large-N limit, validating the asymptotic approach.
One particularly interesting observation is that, if a density function f ( y ) has no upper bound and the upper tail decays at certain exponential rate as y , the expectation and variance of the optimal stopping rule are given by E ( τ N ) = N / 2 and Var ( τ N ) = N 2 / 12 . These asymptotic values do not depend on any other features of the distribution and hold for any distribution with exponentially decaying upper tail behaviour. This parameter independence was tested for the exponential distribution, for which the simulated values converged to the large-N asymptotic prediction given a range of different parameter choices.
The independence of the expectation and variance of the optimal stopping rule from distribution parameters does not hold for distributions with algebraically decaying upper tails, such as the Pareto distribution. We found, in our analyses, that the expectation and variance of the optimal stopping rule, in this case, depends on the algebraic power of the tail decay rate, which was supported by numerical computation for a range of different parameter values.
In most cases, the leading-order asymptotic behaviour was sufficient to explain the simulated results; however, this is not necessarily true. Comparing the leading-order asymptotic approximation with the simulated results for the normal distribution showed a significant error between the predicted and simulated values. This error was resolved by including an extra correction term in the asymptotic calculations, leading to agreement between the asymptotic and simulated results. It is important to note that, for some distributions, higher correction terms to the value of a sequence can be required in order for the asymptotics to be useful in predicting statistical properties for moderately large values of the asymptotic parameter.
The key contribution from this study is outlining a general strategy for obtaining the asymptotic behaviour of the expectation and variance of the optimal stopping rule that can be applied to a wide range of distributions in a consistent fashion. This method recovers existing results from [22], while also making predictions for many other distributions that were subsequently validated through comparison with simulated results.

Author Contributions

Conceptualisation, C.J.L. and G.Y.S.; methodology, C.J.L. and G.Y.S.; simulation, H.N.E.; writing—original draft preparation, H.N.E. and C.J.L.; writing—review and editing, C.J.L. and G.Y.S.; visualisation, H.N.E. and C.J.L.; supervision, C.J.L. and G.Y.S.; project administration, G.Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

C.J.L. acknowledges the support of the Australian Research Council Discovery Project 190101190.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data sets generated during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The author declare no conflict of interest.

References

  1. Ferguson, T.S. Who solved the secretary problem? Stat. Sci. 1989, 4, 282–296. [Google Scholar] [CrossRef]
  2. Seale, D.A.; Rapoport, A. Sequential Decision Making with Relative Ranks: An Experimental Investigation of the “Secretary Problem”. Organ. Behav. Hum. Decis. Process. 1997, 69, 221–236. [Google Scholar] [CrossRef]
  3. Stein, W.E.; Seale, D.A.; Rapoport, A. Analysis of heuristic solutions to the best choice problem. Eur. J. Oper. Res. 2003, 151, 140–152. [Google Scholar] [CrossRef]
  4. David, I. Explicit results for a class of asset-selling problems. Eur. J. Oper. Res. 1998, 110, 576–584. [Google Scholar] [CrossRef]
  5. Karlin, S. Stochastic models and optimal policy for selling an asset. In Studies in Applied Probability and Management Science; K. J. Arrow, S.K., Scarf, H., Eds.; Stanford University Press: Stanford, CA, USA, 1962. [Google Scholar]
  6. Sofronov, G.Y. An Optimal Decision Rule for a Multiple Selling Problem with a Variable Rate of Offers. Mathematics 2020, 8, 690. [Google Scholar] [CrossRef]
  7. Hajiaghayi, M.T.; Kleinberg, R.; Parkes, D.C. Adaptive limited-supply online auctions. In Proceedings of the 5th ACM Conference on Electronic Commerce, New York, NY, USA, 17–20 May 2004; pp. 71–80. [Google Scholar]
  8. Harrell, G.; Harrison, J.; Mao, G.; Wang, J. Online auction and secretary problem. In Proceedings of the International Conference on Scientific Computing (CSC), Las Vegas, NV, USA, 27–30 July 2015; pp. 241–244. [Google Scholar]
  9. Kleinberg, R.D. A multiple-choice secretary algorithm with applications to online auctions. In Proceedings of the SODA’05: 16th Annual ACM-SIAM Sympos, Discrete Algorithms, Vancouver, BC, Canada, 23–25 January 2005; Volume 5, pp. 630–631. [Google Scholar]
  10. Egloff, D. Monte Carlo algorithms for optimal stopping and statistical learning. Ann. Appl. Probab. 2005, 15, 1396–1432. [Google Scholar] [CrossRef]
  11. Ivashko, A.; Mazalov, V.; Mazurov, A. A Game-Theoretic Approach to Team Formation in The Voice Show. In Mathematical Optimization Theory and Operations Research; Kochetov, Y., Bykadorov, I., Gruzdeva, T., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 216–230. [Google Scholar]
  12. Targino, R.S.; Peters, G.W.; Sofronov, G.Y.; Shevchenko, P.V. Optimal exercise strategies for Operational Risk insurance via multiple stopping times. Methodol. Comput. Appl. Probab. 2017, 19, 487–518. [Google Scholar] [CrossRef] [Green Version]
  13. Karpowicz, A.; Szajowski, K. Double optimal stopping times and dynamic pricing problem: Description of the mathematical model. Math. Methods Oper. Res. 2007, 66, 235–253. [Google Scholar] [CrossRef]
  14. Sofronov, G.Y.; Keith, J.M.; Kroese, D.P. An optimal sequential procedure for a buying-selling problem with independent observations. J. Appl. Probab. 2006, 43, 454–462. [Google Scholar] [CrossRef]
  15. Sofronov, G.Y. A multiple optimal stopping rule for a buying-selling problem with a deterministic trend. Stat. Pap. 2016, 57, 1107–1119. [Google Scholar] [CrossRef]
  16. Sofronov, G.Y. An optimal double stopping rule for a buying-selling problem. Methodol. Comput. Appl. Probab. 2020, 22, 1–12. [Google Scholar] [CrossRef]
  17. DeGroot, M.H. Optimal Statistical Decisions; John Wiley & Sons: Hoboken, NJ, USA, 2005. [Google Scholar]
  18. Chow, Y.S.; Robbins, H.; Siegmund, D. Great Expectations: The Theory of Optimal Stopping; Houghton Mifflin: Boston, MA, USA, 1971. [Google Scholar]
  19. Freeman, P.R. The Secretary Problem and Its Extensions: A Review. Int. Stat. Rev. 1983, 51, 189–206. [Google Scholar] [CrossRef]
  20. Demers, S. The duration of optimal stopping problems. arXiv 2019, arXiv:1810.11557. [Google Scholar]
  21. Goldenshluger, A.; Malinovsky, Y.; Zeevi, A. A unified approach for solving sequential selection problems. Probab. Surv. 2020, 17, 214–256. [Google Scholar] [CrossRef]
  22. Mazalov, V.V.; Peshkov, N.V. On asymptotic properties of optimal stopping time. Theory Probab. Its Appl. 2004, 48, 549–555. [Google Scholar] [CrossRef]
  23. Gilbert, J.P.; Mosteller, F. Recognizing the Maximum of a Sequence. J. Am. Stat. Assoc. 1966, 61, 35–73. [Google Scholar] [CrossRef]
  24. Masami, Y. Asymptotic Results for the Best-Choice Problem with a Random Number of Objects. J. Appl. Probab. 1984, 21, 521–536. [Google Scholar]
  25. Gnedin, A.V. On the Full Information Best-Choice Problem. J. Appl. Probab. 1996, 33, 678–687. [Google Scholar] [CrossRef]
  26. Samuels, S.M. Secretary Problems as a Source of Benchmark Bounds. Lect.-Notes-Monogr. Ser. 1992, 22, 371–387. [Google Scholar]
  27. Moser, L. On a problem of Cayley. Scr. Math. 1956, 22, 289–292. [Google Scholar]
  28. Kennedy, D.P.; Kertz, R.P. Limit Theorems for Threshold-Stopped Random Variables with Applications to Optimal Stopping. Adv. Appl. Probab. 1990, 22, 396–411. [Google Scholar] [CrossRef]
  29. Kennedy, D.P.; Kertz, R.P. The asymptotic behavior of the reward sequence in the optimal stopping of iid random variables. Ann. Probab. 1991, 19, 329–341. [Google Scholar] [CrossRef]
  30. Peskir, G.; Shiryaev, A. Optimal Stopping and Free-Boundary Problems; Birkhäuser Verlag: Basel, Switzerland, 2006. [Google Scholar]
  31. Nikolaev, M.L.; Sofronov, G.Y. A multiple optimal stopping rule for sums of independent random variables. Discret. Appl. Math. 2007, 17, 463–473. [Google Scholar] [CrossRef]
  32. Sofronov, G.Y. An optimal sequential procedure for a multiple selling problem with independent observations. Eur. J. Oper. Res. 2013, 225, 332–336. [Google Scholar] [CrossRef]
  33. Bender, C.M.; Orszag, S.A. Advanced Mathematical Methods for Scientists and Engineers I: Asymptotic Methods and Perturbation Theory; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  34. Olver, F.W.J.; Daalhuis, A.B.O.; Lozier, D.W.; Schneider, B.I.; Boisvert, R.F.; Clark, C.W.; Miller, B.R.; Saunders, B.V.; Cohl, H.S.; McClain, M.A. (Eds.) NIST Digital Library of Mathematical Functions . Release 1.0.26 of 2020-03-15; 2020. Available online: http://dlmf.nist.gov/ (accessed on 11 December 2021).
  35. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
  36. Tantau, T. The TikZ and PGF Packages, Manual for version 3.0.0; Institut für Theoretische Informatik Universität zu Lübeck: Lübeck, Germany, 2013. [Google Scholar]
  37. Müller, A. Expected utility maximization of optimal stopping problems. Eur. J. Oper. Res. 2000, 122, 101–114. [Google Scholar] [CrossRef]
Figure 1. Comparison of large-N asymptotic predictions for the expectation and variance of the optimal stopping rule for a range of distributions on unbounded domains. The asymptotic prediction is shown in red, while the simulated result is shown in blue. Each point on the simulated curve was obtained by simulating the optimal stopping problem 10,000 times, with the results used to estimate the expectation and variance of the optimal stopping point. We note that, for convenience of presentation, the expectation and variance were scaled by N and N 2 , respectively, so that the curves tended to a constant value. The first correction term was retained in the asymptotic comparison for the normal distribution, as the asymptotic decay of this term was particularly slow in the large-N limit.
Figure 1. Comparison of large-N asymptotic predictions for the expectation and variance of the optimal stopping rule for a range of distributions on unbounded domains. The asymptotic prediction is shown in red, while the simulated result is shown in blue. Each point on the simulated curve was obtained by simulating the optimal stopping problem 10,000 times, with the results used to estimate the expectation and variance of the optimal stopping point. We note that, for convenience of presentation, the expectation and variance were scaled by N and N 2 , respectively, so that the curves tended to a constant value. The first correction term was retained in the asymptotic comparison for the normal distribution, as the asymptotic decay of this term was particularly slow in the large-N limit.
Mathematics 10 00194 g001
Figure 2. Comparison of large-N asymptotic predictions for the expectation and variance of the optimal stopping rule for a range of distributions on bounded domains. The asymptotic prediction is shown in red, while the simulated result is shown in blue. Each point on the simulated curve was obtained by simulating the optimal stopping problem 10,000 times, with the results used to estimate the expectation and variance of the optimal stopping point. We note that, for convenience of presentation, the expectation and variance were scaled by N and N 2 , respectively, so that the curves tended to a constant value.
Figure 2. Comparison of large-N asymptotic predictions for the expectation and variance of the optimal stopping rule for a range of distributions on bounded domains. The asymptotic prediction is shown in red, while the simulated result is shown in blue. Each point on the simulated curve was obtained by simulating the optimal stopping problem 10,000 times, with the results used to estimate the expectation and variance of the optimal stopping point. We note that, for convenience of presentation, the expectation and variance were scaled by N and N 2 , respectively, so that the curves tended to a constant value.
Mathematics 10 00194 g002
Figure 3. Comparison of large-N asymptotic predictions for the expectation and variance of the optimal stopping rule for the exponential distribution with a range of parameter values. In each case, the behaviour tended to the same value in the large-N limit, validating the conclusion that the leading-order asymptotics of statistical properties of optimal stopping do not depend on the distribution parameters for distributions with exponential upper tails. This is in contrast with the Pareto distribution, seen in Figure 1, which has algebraic upper tails; in this case, the asymptotic values changed as the parameters were varied.
Figure 3. Comparison of large-N asymptotic predictions for the expectation and variance of the optimal stopping rule for the exponential distribution with a range of parameter values. In each case, the behaviour tended to the same value in the large-N limit, validating the conclusion that the leading-order asymptotics of statistical properties of optimal stopping do not depend on the distribution parameters for distributions with exponential upper tails. This is in contrast with the Pareto distribution, seen in Figure 1, which has algebraic upper tails; in this case, the asymptotic values changed as the parameters were varied.
Mathematics 10 00194 g003
Table 1. This table details the behaviour of v n in the large-n limit for several common probability density functions. For each example, the table contains the pdf equation f ( y ) and the density domain. The next column contains the survival function h ( y ) . The final column shows the asymptotic behaviour of v n in the limit n . In each case, v n increases without bound.
Table 1. This table details the behaviour of v n in the large-n limit for several common probability density functions. For each example, the table contains the pdf equation f ( y ) and the density domain. The next column contains the survival function h ( y ) . The final column shows the asymptotic behaviour of v n in the limit n . In each case, v n increases without bound.
Distribution h ( y ) Tail Asymptotics as n
Normal, y ( , ) :
f ( y ) = 1 2 π σ 2 e ( y μ ) 2 / 2 σ 2 1 2 1 2 erf y μ σ 2 v n μ + σ W n 2 2 π
Gamma, y ( 0 , ) :
f ( y ) = β α Γ ( α ) y α 1 e y / β 1 γ ( α , y / β ) Γ ( α ) v n β ( α 1 ) W 1 α 1 n Γ ( α ) 1 / ( α 1 )
Triangular, y [ a , b ] :
y c : 2 ( y a ) ( b a ) ( c a ) y > c : 2 ( b y ) ( b a ) ( b c ) y c : ( y a ) 2 ( b a ) ( c a ) y > c : ( b y ) 2 ( b a ) ( b c ) v n b 3 ( b a ) ( b c ) 2 n
Wigner, y [ R , R ] :
f ( y ) = 2 R 2 y 2 π R 2 1 2 y R 2 y 2 π R 2 1 π arcsin y R v n R 1 1 2 5 π 2 n 2 / 3
Beta, y ( 0 , 1 ) :
f ( y ) = y α 1 ( 1 y ) β 1 B ( α , β ) 1 B ( y ; α , β ) B ( α , β ) v n 1 ( β + 1 ) B ( α , β ) n 1 / β
Table 2. This table contains the asymptotic behaviour of optimal stopping statistics for several common probability density functions, as well as the intermediate quantities used to compute these statistics. For each distribution, the table describes the pdf name and domain, the asymptotic behaviour of h ( y ) in the limit y or y y max for domains with infinite and finite upper support, respectively, and the value of λ associated with this asymptotic behaviour. The final two columns contain the expectation and variance of τ N in the limit N .
Table 2. This table contains the asymptotic behaviour of optimal stopping statistics for several common probability density functions, as well as the intermediate quantities used to compute these statistics. For each distribution, the table describes the pdf name and domain, the asymptotic behaviour of h ( y ) in the limit y or y y max for domains with infinite and finite upper support, respectively, and the value of λ associated with this asymptotic behaviour. The final two columns contain the expectation and variance of τ N in the limit N .
DistributionDomain h ( y ) as y λ E ( τ N ) Var( τ N )
Exponential [ 0 , ) e y / β 1 N 2 N 2 12
Gamma [ 0 , ) β 1 α Γ ( α ) y α 1 e y / β 1 N 2 N 2 12
Pareto [ β , ) β y α α 1 α α N 2 α 1 α 2 ( α 1 ) N 2 ( 2 α 1 ) 2 ( 3 α 1 )
DistributionDomain h ( y ) as y y max λ E ( τ N ) Var( τ N )
Uniform [ a , b ] b y b a 2 N 3 N 2 18
Triangular [ a , b ] ( b y ) 2 ( b a ) ( b c ) 3 2 2 N 5 12 N 2 175
Wigner [ R , R ] 4 2 ( R y ) 3 / 2 3 π R 3 / 2 5 3 3 N 8 45 N 2 704
Beta ( 0 , 1 ) Γ ( α + β ) ( 1 y ) β Γ ( α ) Γ ( β + 1 ) β + 1 β β N 2 β + 1 β 2 ( β + 1 ) N 2 ( 2 β + 1 ) 2 ( 3 β + 1 )
Table 3. Values for the standard uniform distribution.
Table 3. Values for the standard uniform distribution.
δ 1.7522.252.52.7533.253.5
Chebyshev Bound0.17010.22220.28130.34720.42010.50000.58680.6806
Simulated Result0.01030.03180.05410.07900.09770.11480.18340.2401
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Entwistle, H.N.; Lustri, C.J.; Sofronov, G.Y. On Asymptotics of Optimal Stopping Times. Mathematics 2022, 10, 194. https://doi.org/10.3390/math10020194

AMA Style

Entwistle HN, Lustri CJ, Sofronov GY. On Asymptotics of Optimal Stopping Times. Mathematics. 2022; 10(2):194. https://doi.org/10.3390/math10020194

Chicago/Turabian Style

Entwistle, Hugh N., Christopher J. Lustri, and Georgy Yu. Sofronov. 2022. "On Asymptotics of Optimal Stopping Times" Mathematics 10, no. 2: 194. https://doi.org/10.3390/math10020194

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop