Next Article in Journal
A Transformational Modified Markov Process for Chord-Based Algorithmic Composition
Previous Article in Journal
ssMousetrack—Analysing Computerized Tracking Data via Bayesian State-Space Models in R
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating Parameters in Mathematical Model for Societal Booms through Bayesian Inference Approach

Department of Management, Okayama University of Science, 1-1 Ridaicyou, Kita Ward, Okayama 700-0005, Japan
*
Author to whom correspondence should be addressed.
Math. Comput. Appl. 2020, 25(3), 42; https://doi.org/10.3390/mca25030042
Submission received: 29 April 2020 / Revised: 29 June 2020 / Accepted: 8 July 2020 / Published: 10 July 2020

Abstract

:
In this study, based on our previous study in which the proposed model is derived based on the SIR model and E. M. Rogers’s Diffusion of Innovation Theory, including the aspects of contact and time delay, we examined the mathematical properties, especially the stability of the equilibrium for our proposed mathematical model. By means of the results of the stability in this study, we also used actual data representing transient and resurgent booms, and conducted parameter estimation for our proposed model using Bayesian inference. In addition, we conducted a model fitting to five actual data. By this study, we reconfirmed that we can express the resurgences or minute oscillations of actual data by means of our proposed model.

1. Introduction

Booms emerge in many fields and are closely tied to our everyday life. For example, a fashion in clothing, makeup, sports, a movie and food (we call “societal booms”). These examples show that “interesting information” about the individual boom passed at a rapid rate to a large number of people in a short period of time. Furthermore, in a sense, we can regard an infectious disease as a boom, like influenza or SARS (we call “epidemiological booms”), which is infected by viruses that are transmitted from person to person. In the above examples, it is the most important things that both “interesting information” and “viruses” are transmitted by some form of contact. Hence, we considered that a spread with an interest in products, movie, food, etc resemble the transmission dynamics of viruses in ways.
The modern studies on epidemiological booms were developed by Kermack and McKendrick, and others in the early twentieth century [1]. This field of study gained attention among researchers owing to the spread of emerging infectious diseases, such as AIDS in the 1980s, that posed risks in developed countries. This research continues to progress [2,3,4]. On the other hand, there are many studies that researched societal booms from a sociological or psychological perspective, but few were conducted from a mathematical point of view. However, in recent years, companies have concentrated their marketing efforts on the development of hit products that emphasize customer taste and trend analysis by using social networking services (SNS) such as Twitter and Facebook.
In this study, we focus on societal booms and we develop research for it. Ishi et al. [5] have derived a mathematical model for the “hit” phenomenon in entertainment within a society, which is presented as a stochastic process of human dynamics interactions. Ishi et al. have performed calculations using their proposed equation for many movies in the Japanese market. Moreover, Abdullah and Wu [6] have built a mathematical model to explain how news actually spreads on Twitter. Specifically, in [6] they have studied how Twitter activity can be described by using well-known deterministic SIR model [4]. On the other hand, Nakagiri and Kurita [7] conducted one study that focused on societal booms. Nakagiri et al. used a system of simultaneous linear differential equations to develop a mathematical model to describe problems in societal booms, and performed a model fitting to actual data. The mathematical model proposed in this study is simple but extremely versatile. Additionally, Ueda and Asahi [8] expanded on the model developed by Nakagiri et al. to conduct an analysis using actual data by constructing and verifying a model of the changing interests among Twitter users. In [9], we proposed the mathematical boom model developed by Nakagiri et al. in consideration of the SIR model [4], which is a leading idea to describe biological booms such as viral infections, and the Diffusion of Innovation theory [10] proposed by sociologist E. M. Rogers. In this study, we examine the stability of the equilibrium of our proposed model in [9]. Moreover, using actual booms data we evaluate the parameters and examine the fit of our proposed model.
This study is divided into six parts. In Section 2, we explain the ideas at the core of our proposed mathematical model for Societal Booms, which was proposed in our previous study. In Section 3, we investigate the stability of the equilibrium point to the reduced model of our proposed model and derive the sufficient condition for parameters. In Section 4, we explain the Bayesian inference approach, which was used to estimate the parameters of our proposed model, and discusses the numerical exploration of the posterior state space by the MCMC method. In Section 5, at first we introduce the coefficient of determination that forms the standard for the fit. Next, we evaluate the parameters of our proposed model and examine fitting our proposed model to actual data, using five actual data for societal booms.

2. Mathematical Model

In this section, we explain a mathematical model for societal booms which was derived in [9].

2.1. Three Key Points to Derive Our Proposed Model

Here, we explain the three key points which were discussed to derive our proposed model.
The first point is the contact. Infectious disease epidemics such as influenza are thought to occur when a virus invades and infects a healthy person’s body from contact with an infected person. In our proposed model, we define “interesting information” to be a “virus”, which is transmitted from people in an on-boom state to those whom the boom has not reached yet (pre-boom). Thus, our proposed model incorporates the perspective of the contact, which was not considered in [7].
The second point is the time delay. The Diffusion of Innovation theory, developed by E. M. Rogers, separates consumers into five categories based on the speed at which people are likely to adopt innovation (innovators, early adopters, early majority, late majority, and laggards). Based on this theory, we think that time lags exist in the adoption of booms by people in a social system, and thus developed a model that considers the effects of a time delay. Hutchinson [11] suggested the following logistic equation with time delay τ . This equation shows that the solution does not fluctuate monotonously but exhibits complex behavior such as oscillatory behavior depending on the magnitude of the time delay:
d x ( t ) d t = α x ( t ) 1 x ( t τ ) K ( α , K > 0 ) .
Furthermore, the biologist R. M. May [12] regarded (1) as a mathematical model that expresses the temporal changes of the herbivorous animal population x ( t ) , and asserted that the biological definition of a time delay was “the time required for the regeneration of plants that is suitable for animals to eat”. Additionally, May received acclaim for fitting results from an experiment on the Australian sheep blowfly (Lucilia cuprina) conducted by Nicholson [13]. Based on these experimental results, we regarded the definition of a time delay for societal booms as “the time required for a boom adopter associated with contact and resurgence to pick up a boom and take action”, and incorporated the concept of the time delay into the derivation of a mathematical model. The time delay for societal booms is similar to Cooke’s [14] mathematical model that explained infections spread by mosquito carriers, where he defined a time delay as the “time required for an uninfected mosquito to become an infected mosquito (so-called incubation period)”. In particular, there have been many attempts to include time delays in mathematical models, and particularly in differential equations, across various fields such as biology, epidemiology, and engineering [15,16,17]. Similar to Cooke, the proposed model views booms to be transmitted (“infected”) through contact as follows: people in a pre-boom state can become infected by coming into contact with information of interest, and they can enter an on-trend state after a given time. Then, when the person enters an established-trend state, he will be able to “infect” (transmit the trend to) others in a pre-boom state.
The last point is the existence of influencer and “Sakura” (“Sakura” are people who were compelled to boom state). Our proposed model expresses their presence by depicting a resurgence caused by opinion leaders and forced changes in the number of people.
As described above, our proposed model is a natural extension of the boom model developed in [7] and is derived from the above three perspectives. We expect that the model will be able to capture various types of boom data.

2.2. Mathematical Model for Societal Booms

In [9], we proposed a new mathematical model to explain societal booms based on the background described above. Here, according to [9] we derive the mathematical model for societal booms.
First, we assume the state of the boom participants at any given time to be one of the following:
  • State1 Pre-boom: Condition where there is a potential to adopt a boom.
  • State2 On-boom: Condition where the boom is captured.
  • State3 Rooted-boom: Condition where the boom is retained.
  • State4 Unrooted -boom: Condition where the boom did not take off.
Furthermore, at a given time t, we assume the number of boom participants in each state to be y 1 ( t ) , y 2 ( t ) , y 3 ( t ) , and y 4 ( t ) respectively, for states 1–4. Then we represent the changes of a customer’s state by using the following equations:
d y 1 ( t ) d t = α y 1 ( t ) y 2 ( t τ 1 ) δ y 1 ( t ) + ε y 2 ( t τ 2 ) + ζ , d y 2 ( t ) d t = α y 1 ( t ) y 2 ( t τ 1 ) ( β + γ ) y 2 ( t ) + δ y 1 ( t ) , d y 3 ( t ) d t = β y 2 ( t ) ε y 2 ( t τ 2 ) ζ , d y 4 ( t ) d t = γ y 2 ( t ) .
Here, the variables α , β , γ , δ , ε , ζ and τ 1 , τ 2 in (2) respectively represent the rate of transmission (“infection”) of the boom among people in a pre-boom state per unit time, rate of retention among people in an on-boom state, rate of people who quit the boom, adoption rate of the boom by people in a pre-boom state, rate of resurgence from rooted boom to pre-rooted state, and the production rate of people in an rooted boom state. τ 1 and τ 2 are parameters that show the time delay, and τ 1 < τ 2 . Here, α > 0 , β + γ > 0 , δ > 0 . In particular, α y 2 ( t τ 1 ) is called as infectivity and is an important indicator that characterizes the boom model.
In this model, the rate at which people go from a pre-boom state to an on-boom state is proportional to the number of people in a pre-boom state and the number of people who changed to an on-boom state before τ 1 (the first term of the first equation in (2)), and we assume that people in a pre-boom state naturally adopt booms at a fixed rate (the second term of the first equation in (2)). Moreover, it expresses the resurgence of the boom by transferring the rate of people who became on-boom prior to τ 2 (the third term of the first equation in (2)). In addition, ζ expresses the ratio of people in an rooted boom state who were compelled to enter that state (in other words, “Sakura”). Furthermore, the given ratio of people in an on-boom state enter an rooted or unrooted state (the second and third term of the second equation in (2)).

3. Stability of the Equilibrium Point for the Reduced Model

Since the first and second equations in system (2) are independent of the third and forth equations, it suffices to consider the first two equations, that is, we will focus on the reduced model in the following discussions.
d y 1 ( t ) d t = α y 1 ( t ) y 2 ( t τ 1 ) δ y 1 ( t ) + ε y 2 ( t τ 2 ) + ζ , d y 2 ( t ) d t = α y 1 ( t ) y 2 ( t τ 1 ) ( β + γ ) y 2 ( t ) + δ y 1 ( t ) .
Then we can find equilibrium points by setting the right hand of system (3) equal to zero:
α y 1 ( t ) y 2 ( t τ 1 ) δ y 1 ( t ) + ε y 2 ( t τ 2 ) + ζ = 0 , α y 1 ( t ) y 2 ( t τ 1 ) ( β + γ ) y 2 ( t ) + δ y 1 ( t ) = 0 .
Obviously, a trivial solution of (4) is E 0 = ( 0 , 0 ) , if ζ = 0 , and non trivial solution:
E 1 = ( y 1 * , y 2 * ) = ( β + γ ) ζ α ζ + δ ( β + γ ε ) , ζ β + γ ε ,
if ζ 0 .
In this study, we analyze the stability of non trivial equilibrium point E 1 . First, upon the following change of functions and substitutions
a ( t ) = y 1 ( t ) y 1 * , b ( t ) = y 2 ( t ) y 2 * ,
then, system (3) becomes
d a d t = α a ( t ) b ( t τ 1 ) α a ( t ) y 2 * α y 1 * b ( t τ 1 ) α y 1 * y 2 * δ a ( t ) δ y 1 * + ε b ( t τ 2 ) + ε y 2 * + ζ , d b d t = α a ( t ) b ( t τ 1 ) + α a ( t ) y 2 * + α b ( t τ 1 ) y 1 * + α y 1 * y 2 * + δ a ( t ) + δ y 1 * ( β + γ ) b ( t ) ( β + γ ) y 2 * .
Since y 1 * , y 2 * are solutions of (4), system (5) becomes the following form:
d a d t = ( α y 2 * + δ ) a ( t ) α y 1 * b ( t τ 1 ) + ε b ( t τ 2 ) , d b d t = ( α y 2 * + δ ) a ( t ) + α y 1 * b ( t τ 1 ) ( β + γ ) b ( t ) .
Regarding this equation as system ODE with the equilibrium point ( 0 , 0 ) , we have
d d t a ( t ) b ( t ) = A 1 a ( t ) b ( t ) + A 2 a ( t τ 1 ) b ( t τ 1 ) + A 3 a ( t τ 2 ) b ( t τ 2 ) ,
where
A 1 = α y 2 * δ 0 α y 2 * + δ ( β + γ ) , A 2 = 0 α y 1 * 0 α y 1 * , A 3 = 0 ε 0 0 .
Therefore, the characteristics equation of the above system ODE takes the following form:
d e t ( λ I A 1 A 2 e τ 1 λ A 3 e τ 2 λ ) = 0 .
That is,
λ + ( α y 2 * + δ ) α y 1 * e τ 1 λ ε e τ 2 λ ( α y 2 * + δ ) λ + ( β + γ ) α y 1 * e τ 1 λ = 0 ,
which implies
λ 2 + ( β + γ + δ α y 1 * e τ 1 λ + α y 2 * ) λ + ( α y 2 * + δ ) ( β + γ ε e τ 2 λ ) = 0 .
Theorem 1
(Stability of the Equilibrium Point for Reduced Model). We consider system (3), where α , β , γ , δ , and τ 1 , τ 2 are non-negative numbers and ε , ζ are real numbers. Furthemore, let us assume that
β + γ ε > 0 ,
α ζ + δ ( β + γ ε ) α ζ ( β + γ ) > τ 1 2 ,
moreover,
α ζ + δ ( β + γ ε ) > 0 ,
1 ε δ ( β + γ ) ( β + γ ε ) 2 ( α ζ + δ ( β + γ ε ) ) 2 + 1 > τ 2 ,
or
α ζ + δ ( β + γ ε ) < 0 ,
1 ε δ ( β + γ ) ( β + γ ε ) 2 ( α ζ + δ ( β + γ ε ) ) 2 + 1 < τ 2 .
Then we shall show that every root of the characteristic equation (7) must have negative real part.
Proof. 
Suppose for contradiction that λ = μ + i ω is a root of (7) with μ 0 , ω 0 . Then,
I m [ { λ 2 + ( β + γ + δ + α ( y 2 * y 1 * e τ 1 λ ) λ + ( α y 2 * + δ ) ( β + γ ε e τ 2 λ ) } / ω ] = I m [ { ( μ + i ω ) 2 + ( β + γ + δ + α y 2 * α y 1 * e τ 1 ( μ + i ω ) ( μ + i ω ) + ( α y 2 * + δ ) ( β + γ + ε e τ 2 ( μ + i ω ) ) } / ω ] = { 2 μ ω + ( β + γ + δ + α y 2 * α y 1 * e τ 1 μ cos τ 1 ω ) ω + α y 1 * e τ 1 μ sin τ 1 ω μ + ( α y 2 * + δ ) ε e τ 2 μ sin τ 2 ω } / ω = 2 μ + β + γ + δ + α y 2 * α y 1 * e τ 1 μ cos τ 1 ω + α y 1 * e τ 1 μ sin τ 1 ω ω τ 1 μ + ( α y 2 * + δ ) ε e τ 2 μ sin τ 2 ω ω τ 2 .
From τ i > 0 , μ > 0 ,
e τ i μ < 1 , 1 sin τ i ω τ i ω 1 ( i = 1 , 2 ) ,
therefore, we have
f ( μ ) > 2 μ + β + γ + δ + α y 2 * α y 1 * α y 1 * τ 1 μ ( α y 2 * + δ ) ε τ 2 = ( 2 α y 1 * τ 1 ) μ + β + γ + δ + α ( y 2 * y 1 * ) ( α y 2 * + δ ) ε τ 2 .
At first, from μ > 0 and (9), we have
2 α y 1 * τ 1 > 2 α ζ ( β + γ ) B 2 B α ζ ( β + γ ) = 0 .
Next,
β + γ + δ + α ( y 2 * y 1 * ) ( α y 2 * + δ ) ε τ 2 = ( β + γ + δ ) A B + α ( ζ B ( β + γ ) ζ A ) ( α ζ + δ A ) ε B τ 2 A B = { ( β + γ + δ ) B α ζ ( β + γ ) δ ε B τ 2 } A + ( α ζ α ζ ε τ 2 ) B A B = { δ B ( 1 ε τ 2 ) + ( β + γ ) δ A } A + α ζ B ( 1 ε τ 2 ) A B = ( β + γ ) δ A 2 + ( 1 ε τ 2 ) B 2 A B .
From (10), (11) or (12), (13), we have
( β + γ ) δ A 2 + ( 1 ε τ 2 ) B 2 A B > 0 .
This is contradiction shows that every λ has negative real part. Hence every solution of (3) tends to zero exponentially as t . □

4. Bayesian Inference Approach for Estimating Parameters

The Bayesian inference approach is widely used with great successes in various real–world problems. In particular, by recently the development of analytical techniques such as Markov chain Monte Carlo methods (MCMC) (for detail, see [18,19]), it has found application in a wide range of activities such as quantitative finance, stochastic epidemic, biometrics, remote sensing, heat conductivity, sesmic inversion, machine learning [20,21,22,23,24,25,26,27,28,29].
In this section, we discuss how the Bayesian inference approach can be used for evaluating the parameters of a differential equation model, such as our proposed model. If a differential equation model can be solved analytically, then the usual non-linear least squares (NLS) can be used to estimate the unknown parameters [30,31]. However, in most of the practical situations, such analytical solutions are not available as evidenced in the proposed model. In the ordinary differential equation model, the Bayesian inference approach was considered in the works of Gelman et al. and Girolami [32,33]. They solved ordinary differential equation numerically and hence constructed the likelihood. A prior density functions were assigned on θ and the MCMC was used to generate samples from the posterior probability density function (PPDF).
The underlying concept of a Bayesian inference approach is Bayes’ theorem, which relates the parameters θ and the observed data Y as follows:
π ( θ | Y ) = π ( Y | θ ) π ( θ ) π ( Y ) .
(14) states that the posterior probability density function(PPDF) π ( θ | Y ) for parameter θ is proportional to the product of the likelihood function π ( Y | θ ) and the prior density function π ( θ ) .
Here, let us define m dimensional vectors Y , F θ and E as follows:
Y = ( Y 1 , Y 2 , , Y m ) = ( Y ( t 1 ) , Y ( t 2 ) , , Y ( t m ) ) , F θ = ( F 1 θ , F 2 θ , , F m θ ) ,   = ( y 2 θ ( t 1 ) + y 3 θ ( t 1 ) , y 2 θ ( t 2 ) + y 3 θ ( t 2 ) , , y 2 θ ( t m ) + y 3 θ ( t m ) ) , E = ( ε 1 , ε 2 , , ε m ) ,
where t j ( j = 1 , , m ) are the measurement points, F j θ = y 2 θ ( t j ) + y 3 θ ( t j ) are the solution of proposed model (2) for the unknown parameters θ . Moreover, ε j is the noise, assumed as white Gaussian noise as follows:
ε j i . i . d . N ( 0 , Σ θ 2 ) ,
where the i . i . d . means every residual is independent and identically distributed. They all have the same distribution, which is defined right afterward.
Then we seek the parameters θ ¯ , which assumedly represent the true value of θ , such that
Y = F θ + E .
Here the likelihood function π ( Y | θ ) is then given as
π ( Y | θ ) = exp ( Y F θ ) T ( Y F θ ) 2 Σ ε 2 .
In this study the prior density function is simply assumed as π ( θ ) = U [ θ 0 , θ 0 ] , where U [ θ 0 , θ 0 ] is a uniform distribution with the parameter θ 0 which is a sufficiently large positive constant. Eventually the PPDF of the parameters θ can be written as follows:
π ( θ | Y ) exp ( Y F θ ) T ( Y F θ ) 2 Σ ε 2 .
Here, the standard derivation Σ ε is known and can be regarded as a regularization parameter.

Markov Chain Monte Carlo Methods (MCMC)

When we wish to simulate directly from a PPDF even though it is impossible for various reasons, the MCMC algorithm provides the solution based on the mechanism that it is much easier to construct an ergodic Markov chain with PPDF as a stationary probability measure than to simulate directly from PPDF. In particular, this can be achieved by Metropolis–Hastings (M–H) algorithm (see Hastings [34], Metropolis et al. [35]) which takes an arbitrary Markov chain, and adjusts it using a simple system such as selecting acceptance or rejection based on suitable conditions.
In this study, we employ the M–H algorithm to evaluate the parameters of our proposed model from actual data.
M–H Algorithm
  • Step1: Generate θ q ( · | θ k ) = N ( θ k , σ θ k 2 ) (the normal distribution) with a given stander derivation σ k for given θ k .
  • Step2: Calculate the choice α ( θ , θ k ) = min 1 , f ( θ | Y ) / f ( θ k | Y ) .
  • Step3: Update θ k as θ k + 1 = θ with probability α ( θ , θ k ) but otherwise set θ k + 1 = θ k .
By running M–H Algorithm, we can sample the distribution f ( θ | Y ) , and usually the mean value θ CM
θ CM 1 K k * k = k * + 1 K θ k ,
of θ j , after a given burn-in time k * .

5. Using Real Data to Evaluate Validity of Proposed Model

The proposed model is derived for the purpose of expressing the resurgences or slightly oscillations, but it is expected that the solution of this model converges stably to the equilibrium point without explosive divergence when we apply this model to real marketing strategy. Therefore, we give five examples and confirm that suitable parameters evaluated by the method of Bayesian inference approach from each actual data satisfy or do not satisfy conditions (8), (9) and (10), (11) or (12), (13) of Theorem 1 in the previous Section 3.

5.1. Parameter Estimation Steps

In first step, we determine the values for time delay parameters τ 1 and τ 2 from the actual data, referencing events that occurred prior to each cultural phenomenon. In particular, for τ 1 we observed the first peak and set the value. Moreover, for τ 2 we set the value to be the duration between this point of change to a large peak, on unusual changes such as peaks created from resurgence.
Next, we set arbitrarily the initial value α 0 ζ 0 for α ζ , but we check that the solution to the proposed model is obtained (for detail, see each table in five examples below). Furthemore, we perform the sampling 100 , 000 iterations to evaluate parameters α ζ according to M–H Algorithm in Section 4.
In the final step, we use the mean value (19) of 40 , 000 iterations after an initial number 60 , 000 iterations were discarded and we perform fitting our proposed model to actual data.

5.2. Coefficient of Determination

In this study, we introduce the coefficient of determination R 2 as an index, which indicates the percentage change between the solution of the proposed model and an actual data, as follows:
R 2 = 1 i = 1 n ( E i E ¯ ) 2 i = 1 n ( Y i Y ¯ ) 2
Then, from the following form (20), the range is from 0 to 1 and the R 2 of 1 means an actual data can be predicted without error from the solution of the proposed model. Here, Y ¯ is a mean of an actual data, and E i = Y i F i θ , E ¯ is a mean of E i .

5.3. Model Fitting to Actual Data

La La Land. La La Land is a buzz-worthy American romantic musical film. La La Land premiered at the 73rd Venice International Film Festival on 31 August 2016, and was released in the United States on 9 December 2016 and has grossed USD 446 million to date. Moreover, it received 14 Academy Award nominations at the 89th Academy Awards and won in six categories, including Best Director and Best Actress. From the above points, we regards the film “La La Land” to be appropriate for reflecting a transitory boom and used the data as the subject for testing.
In this study, we used the number of user reviews left on Yahoo! Japan Movies during the period from August 2016 to July 2018, including March 2017 when La La Land won in six categories at the 89th Academy with respect to the fit proposed model to actual data.
The actual data in Figure 1 show a decline after the first peak on March 2017 and finally decrease after two small peaks. The graph from the proposed model shows a high degree of fit to the first peak. In addition, we observe that it is able to largely replicate the subsequent curves including the leveling-off and decrease rapidly towards the end of the period after two small peaks. In addition, the coefficient of determination, which is the measure of how well a model explains the data, shows a high value at R 2 = 0.9690 ; this reveals that the proposed model has a high degree of fitness. On the other hand, we observe that fitting to two small peaks in the middle of the period is not good. Moreover, in this example the parameters in Table 1 satisfy conditions in Theorem 1.
“Ten-nensui (Pure Water)”. Suntory “Ten-nensui” is best-selling mineral water which is made with water from renowned water resources in Japan, including the Minami-Alps. All Suntory Ten-nensui products are made from “soft water”, clear in color, and beloved for their refreshing taste (https://www.suntory.com/brands/suntorytennensui/).
In this study, we used Twitter data from before and after the product launch on 4/17/2019 to test the effectiveness of the “GREEN TEA CAMPAIGN” which is the new product of “Ten-nensui (Pure water)” with respect to the fit proposed model to actual data.
The actual data in Figure 2 show a decline after the first peak. After that, it has a small second peak once more, but it decreases again. Similar to the other examples, the proposed model shows a high degree of fit to the first peak. In addition, we observe that the graph from the proposed model is able to well reconstruct the processes of declining and even expresses the small decline that occurs from 4/18/2019 to 4/21/2019. The high degree of accuracy in the fitness is evident from the large coefficient of determination, R 2 = 0.9646 . On the other hand, we observe that the graph from the proposed model cannot express well the second peak in the middle of the period. Moreover, in this example the parameters in Table 2 satisfy conditions in Theorem 1.
Honkirin Beer. Honkirin Beer is a Happoshu (low-malt) beer that was introduced on 13 March 2018. In a cost-conscious environment, Honkirin Beer became the biggest hit among new beer releases in FY 2018 owing to its high quality and low price. According to the brewer Kirin, Honkirin Beer underwent a product renewal in mid-January 2019 for an even more refined authentic taste. As a result, the product logged a record sales volume in February (1.12 million cases), second only to its release in March 2018 (1.17 million cases).
Here, too, we used Twitter data from before and after the product launch on 4/22/2019 to test the effectiveness of the Honkirin Beer campaign with respect to the fit of the proposed model to actual data.
The actual data in Figure 3 show a decline after the first peak slowly with one peak and one bottom on 26 and 28 April 2019. Similar to the other examples, the graph from the proposed model shows a high degree of fit to the first peak. In addition, we observe that it is able to fit the subsequent curves to the actual data except for two specific points. The high degree of accuracy in the fitness is evident from the large coefficient of determination, R 2 = 0.9367 . On the other hand, we observe that the graph from the proposed model is not able to express well two specific points. Future challenges include improvements to these points. Moreover, in this example the parameters in Table 3 satisfy conditions in Theorem 1.
Cup-noodle. The first instant noodles were invented by Momofuku Ando, who later founded the well-known food company Nissin Food, in Japan in Osaka in 1958. Currently, the instant noodles accomplish evolution of various form and are mainly sold as Cup-noodle at convenience stores and supermarkets. In Japan, Cup-noodle is the most convenient quick meal and from the busy Japanese businessmen and women to the school children, it seems everyone enjoys them.
Here, we used Twitter data from 4/10/2019 to 5/3/2019 before and after the product presentation on 4/23/2019 to test the effectiveness of the cup-noodle presentation with respect to the fit of the proposed model to actual data.
The actual data in Figure 4 show a decline after the first peak, and several oscillations and a sharp increase again towards the end day of this period. Similar to the other examples, the graph from the proposed model shows a high degree of fit to the first peak. The high degree of accuracy in the fitness is evident from the large coefficient of determination, R 2 = 0.9354 . In particular, the graph of the proposed model expresses the rapid increase at the end day of this period. These points are advantages of our proposed model. Moreover, in this example the parameters in Table 4 do not satisfy conditions in Theorem 1, but the numerical solution made from these parameters converges stably to the equilibrium point.
“Reiwa (Beautiful Harmony)”. In Japan, there are both the Christian era and the Japanese imperial era names are used, and when we have new emperor, new era starts. There was a big event in the Japanese royal family this year. Current Emperor Akihito abdicated the throne on 30 April and his son, Crown Prince Naruhito, enthroned the throne on May 1. New Imperial era “Reiwa (Beautiful Harmony)” is taken from a verse in the oldest anthology of poems called Manyo-shu. Here, we used Twitter data after the name of new Imperial era was announced on April 1 to test public interest of new Imperial era “Reiwa” for the fit of the proposed model to actual data.
The actual data in Figure 5 show a rapid, sharp spike followed by an easing, which is a characteristic pattern of transitory booms. The graph from the proposed model fits the actual data with a very high overall accuracy. The high degree of accuracy in the fitness is evident from the value of the coefficient of determination, R 2 = 0.9907 . In particular, the solution from the proposed model even expresses the small decline that occurs from 4/4/2019 to 4/6/2019. On the other hand, from the graph of the solution to the proposed model, we can see interests of Twitter users of new Imperial era “Reiwa” towards the date when the Era name changes from “Heisei” to “Reiwa”. Thus, we believe that an advantage of the proposed model is its ability to express oscillations by incorporating time delays and “Sakura” data. Moreover, in this example the parameters in Table 5 do not satisfy conditions in Theorem 1, and the numerical solution made from these parameters begins to oscillate halfway and explosively diverges.

6. Conclusions

In this paper, based on our previous study, we examined the mathematical properties, especially the stability of the equilibrium to the reduced model of our proposed model. Moreover, we also used actual data representing transient booms and resurgent booms and conducted parameter estimation for the proposed model using Bayesian inference approach (MCMC-MH methods). In addition, we conducted a model fitting to actual data. By this study, we reconfirmed that we can express the resurgences or slightly oscillations of actual data by means of our proposed model. However, our proposed model was unable to capture secondary peak and detect slight oscillations, as we had expected. The main difficulty is that the mathematical model using the ordinary differential equation will have fewer reproducibility in terms of the actual data involving more peaks or valleys. Although it is difficult to express all peaks or valleys by proposed model, we will be able to reproduce the important peaks or valleys by considering information of the actual data and adjusting τ 1 , τ 2 , which express time delays, and ε , ζ , which express resurgences. Moreover, it is thought that technique of the sampling using MCMC to evaluate parameters has a difficulty. In our future work, we will try to capture peaks or valleys of the actual data by dividing the timeline into pieces and evaluating the parameter for each piece by Bayesian inference approach.
In addition, there are several potential extensions of the present method. The first one is the validity of the proposed model. This study and our previous study only tested a total of 10 cases. In our future work, it will be necessary to take various examples of boom data, test the validity of our proposed model, and address the relationship between the actual data and parameters. Additionally, we would like to conduct an analysis using data not just for domestic booms but also global booms to test the validity of our proposed model.
The expansion of the model is also an important topic. In particular, a mathematical model should be developed that incorporates probabilistic items to handle unexpected events. The “Sakura” parameter is a unique point of the proposed model, but realistically, a model is needed for scenarios other than that with a constant presence of “Sakura”. We also must consider models with functions that have time-dependent parameters and time-dependent “Sakura”. Moreover, we must also advance mathematical analysis of areas such as the asymptotic behavior of mathematical models and their behavior around the points of equilibrium. In this study, we derived a sufficient condition in regards to the stability of the equilibrium for the reduced model. In the parameter fitting to five actual data for societal booms, “Cup-noodle” and “Reiwa” did not satisfy this sufficient condition. In fact, it was shown that the numerical solution of “Reiwa” begins to oscillate halfway and explosively diverges. However, it is interesting that the numerical solution of “Cup-noodle” converges stably to the equilibrium point. This result implies the improvement of the sufficient condition of the stability in our future challenges.
In recent years, the development of technology has made it relatively easy to obtain large data (both quantitatively and qualitatively), but a significant challenge has been how to use this large amount of data. We would like to continue research efforts in anticipation of the practical contribution of the mathematical model and parameter estimation method using a Bayesian inference approach in this study to corporate marketing strategies.

Author Contributions

Conceptualization, Y.O.; methodology, Y.O. and N.M.; software, Y.O.; validation, Y.O. and N.M.; formal analysis, Y.O.; investigation, Y.O.; resources, Y.O. and N.M.; data curation, Y.O.; writing—original draft preparation, Y.O.; writing—review and editing, Y.O. and N.M.; visualization, Y.O. and N.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by JSPS KAKENHI Grant Number 18K03439 and 17K04029.

Acknowledgments

The author would like to thank the referees for carefully reading our manuscript and for giving such constructive comments which substantially helped improving the quality of our paper. Moreover, the author wish to thank the Editor for helpful cooperation that have improved the layout of the paper. The first author would like to acknowledge the supports from JSPS Grant-in-Aid for Scientific Research (C) 18K03439 and the second author would like to acknowledge the supports from JSPS Grant-in-Aid for Scientific Research (C) 17K04029.

Conflicts of Interest

The authors declare that they have no conflict of interest regarding the publication of the research article.

References

  1. Anderson, R.M. Discussion: The Kermack–McKendrick epidemic threshold theorem. Bull. Math. Biol. 1991, 53, 1–32. [Google Scholar] [CrossRef]
  2. Busenberg, S.; Cooke, K. Vertically Transmitted Diseases Models and Dynamics; Springer: Berlin, Germany, 1993. [Google Scholar]
  3. Dietz, K. The first epidemic model A historical note on P.D. En’ko. Aust. J. Stat. 1988, 30, 56–65. [Google Scholar] [CrossRef]
  4. Kermack, W.O.; McKendrick, A.G. Contributions to the mathematical theory of epidemics—I. Proc. R. Soc. 1927, 115, 700–721. [Google Scholar]
  5. Ishii, A.; Arakaki, H.; Matsuda, N.; Umemura, S.; Urushidani, T.; Yamagata, N.; Yoshida, N. The ’hit’ phenomenon: A mathematical model of human dynamics interactions as a stochastic process. New J. Phys. 2012, 14, 063018. [Google Scholar] [CrossRef]
  6. Abdullah, S.; Wu, X. An Epidemic Model for News Spreading on Twitter. In Proceedings of the 2011 23rd IEEE International Conference on Tools with Artificial Intelligence, Boca Raton, FL, USA, 7–9 November 2011; Volume 1, pp. 163–169. [Google Scholar]
  7. Nakagiri, Y.; Kurita, O. On a differential equation model of booms. Trans. Oper. Res. Soc. Jpn. 2004, 47, 83–105. (In Japanese) [Google Scholar] [CrossRef] [Green Version]
  8. Ueda, Y.; Asahi, Y. Construction and verification for the interest shift model of Twitter users [Twitter-riyousya no kanshin-ikou model no kouchiku to kenshou]. Oper. Res. Manag. Sci. Res. 2014, 59, 219–228. (In Japanese) [Google Scholar]
  9. Ota, Y.; Mizutani, N. A proposal of mathematical model on booms for human and social life—Using diffusion of innovation theory and delay differential equations. Trans. Oper. Res. Soc. Jpn. 2020. to appear (In Japanese) [Google Scholar]
  10. Rogers, E.M. The Diffusion of Innovations, 5th ed.; The Free Press: New York, NY, USA, 2003. [Google Scholar]
  11. Hutchinson, G.E. Circular causal systems in ecology. Ann. N. Y. Acad. Sci. 1948, 50, 221–246. [Google Scholar] [CrossRef]
  12. May, R.M. Stability and Complexity in Model Ecosystems; Princeton University Press: Princeton, NJ, USA, 1974. [Google Scholar]
  13. Nicholson, A.J. The Self-Adjustment of Populations to Change. Cold Spring Harb Symp. Quant. Biol. 1957, 22, 153–173. [Google Scholar] [CrossRef]
  14. Cooke, K.L. Stability analysis for a vector disease model. Rocky Mount. J. Math. 1979, 9, 31–42. [Google Scholar] [CrossRef]
  15. Mena-Lorcat, J.; Hethcote, H.W. Dynamic models of infectious diseases as regulators of population sizes. J. Math. Biol. 1992, 30, 693–716. [Google Scholar] [CrossRef] [PubMed]
  16. Pyragas, K. Continuous control of chaos by self-controlling feedback. Phys. Lett. A 1992, 170, 421–428. [Google Scholar] [CrossRef]
  17. Smith, H.L. Subharmonic bifurcation in an S-I-R epidemic model. J. Math. Biol. 1983, 17, 163–177. [Google Scholar] [CrossRef] [PubMed]
  18. Kaipio, J.; Somersalo, E. Statistical and Computational Inverse Problems; Springer: New York, NY, USA, 2005. [Google Scholar]
  19. Robert, C.; Casella, G. Monte Carlo Statistical Methods; Springer: New York, NY, USA, 2004. [Google Scholar]
  20. Andersson, H.; Britton, T. Stochastic Epidemic Models and their Statistical Analysis; Springer: New York, NY, USA, 2000. [Google Scholar]
  21. Andrieu, C.; de Freitas, N.; Doucet, A.; Jordan, M.I. An Introduction to MCMC for Machine Learning. Mach. Learn. 2003, 50, 5–43. [Google Scholar] [CrossRef] [Green Version]
  22. Haario, H.; Laine, M.; Lehtinen, M.; Saksman, E.; Tamminen, J. Markov chain Monte Carlo methods for high dimensional inversion in remote sensing. J. R. Stat. Soc. B Stat. Methodol. 2004, 66, 591–608. [Google Scholar] [CrossRef]
  23. Jacquier, E.; Jarrow, R. Bayesian analysis of contingent claim model error. J. Econom. 2000, 94, 145–180. [Google Scholar] [CrossRef]
  24. Jacquier, E.; Polson, N. Bayesian methods in finance. In Oxford Handbook of Bayesian Econometrics; Oxford University Press: Oxford, UK, 2010; pp. 439–512. [Google Scholar]
  25. Martin, J.C.; Wilcox, L.C.; Burstedde, C.; Ghattas, O. A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion. SIAM J. Sci. Comput. 2012, 34, A1460–A1487. [Google Scholar] [CrossRef]
  26. Michelot, T.; Blackwell, P.G.; Jammes, S.C.; Matthiopoulos, J. Inference in MCMC step selection models. J. Int. Biom. Soc. 2019, 76, 438–447. [Google Scholar] [CrossRef]
  27. O’Neill, P.D.; Roberts, G.O. Bayesian inference for partially observed stochastic epidemics. J. R. Stat. Soc. A 1999, 162, 121–129. [Google Scholar] [CrossRef]
  28. Tunaru, R.; Zheng, T. Parameter estimation risk in asset pricing and risk management: A Bayesian approach. Int. Rev. Financ. Anal. 2017, 53, 80–93. [Google Scholar] [CrossRef]
  29. Wang, J.; Zabaras, N. A Bayesian inference approach to the inverse heat conduction problem. Int. J. Heat Mass Transf. 2004, 47, 3927–3941. [Google Scholar] [CrossRef]
  30. Levenberg, K. A method for the solution of certain non-linear problems in least squares. Q. Appl. Math. 1944, 2, 164–168. [Google Scholar] [CrossRef] [Green Version]
  31. Marquardt, D. An algorithm for least-squares estimation of nonlinear parameters. SIAM J. Appl. Math. 1963, 11, 431–441. [Google Scholar] [CrossRef]
  32. Gelman, A.; Bois, F.; Jiang, J. Physiological pharmacokinetic analysis using population modeling and informative prior distributions. J. Am. Stat. Assoc. 1996, 91, 1400–1412. [Google Scholar] [CrossRef]
  33. Girolami, M. Bayesian inference for differential equations. Theor. Comput. Sci. 2008, 408, 4–16. [Google Scholar] [CrossRef] [Green Version]
  34. Hastings, W.K. Monte Carlo sampling methods using Markov chains and their application. Biometrika 1970, 57, 97–109. [Google Scholar] [CrossRef]
  35. Metropolis, N.; Rosenbluth, A.W.; Rosenbluth, M.N.; Teller, A.H.; Teller, E. Equations of state calculations by fast computing machines. J. Chem. Phys. 1953, 21, 1087–1092. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Fitting to La La Land data.
Figure 1. Fitting to La La Land data.
Mca 25 00042 g001
Figure 2. Fitting to “Ten-nensui” data.
Figure 2. Fitting to “Ten-nensui” data.
Mca 25 00042 g002
Figure 3. Fitting to “Honkirin” data.
Figure 3. Fitting to “Honkirin” data.
Mca 25 00042 g003
Figure 4. Fitting to Cup-noodle data.
Figure 4. Fitting to Cup-noodle data.
Mca 25 00042 g004
Figure 5. Fitting to “Reiwa” data.
Figure 5. Fitting to “Reiwa” data.
Mca 25 00042 g005
Table 1. Various parameter values.
Table 1. Various parameter values.
Lalaland
α 0 0.0001 α 0.000267 y 1 ( 0 ) 6700
β 0 0.001 β 0.010366 y 2 ( 0 ) 0
γ 0 0.01 γ 0.074493 y 3 ( 0 ) 1792
δ 0 0.001 δ 0.007580 y 4 ( 0 ) 0
ε 0 0.001 ε 0.006209 τ 1 14
ζ 0 0.001 ζ 0.012396 τ 2 405
Table 2. Various parameter values.
Table 2. Various parameter values.
Pure_Water
α 0 0.0001 α 0.000174 y 1 ( 0 ) 19,000
β 0 0.01 β 0.035623 y 2 ( 0 ) 0
γ 0 0.01 γ 0.109534 y 3 ( 0 ) 1207
δ 0 0.01 δ 0.017018 y 4 ( 0 ) 0
ε 0 0.01 ε 0.018134 τ 1 14
ζ 0 0.01 ζ 9.216067 τ 2 72
Table 3. Various parameter values.
Table 3. Various parameter values.
Honkirin Beer
α 0 0.0001 α 0.000147 y 1 ( 0 ) 4700
β 0 0.01 β 0.013049 y 2 ( 0 ) 0
γ 0 0.01 γ 0.027167 y 3 ( 0 ) 63
δ 0 0.01 δ 0.007777 y 4 ( 0 ) 0
ε 0 0.01 ε 0.003496 τ 1 48
ζ 0 0.01 ζ 0.123903 τ 2 158
Table 4. Various parameter values.
Table 4. Various parameter values.
Cupnoodle
α 0 0.0001 α 0.000036 y 1 ( 0 ) 8200
β 0 0.01 β 0.001943 y 2 ( 0 ) 0
γ 0 0.01 γ 0.047971 y 3 ( 0 ) 1633
δ 0 0.01 δ 0.004999 y 4 ( 0 ) 0
ε 0 0.01 ε 0.009125 τ 1 11
ζ 0 0.01 ζ 1.636896 τ 2 444
Table 5. Various parameter values.
Table 5. Various parameter values.
Beautiful_Harmony
α 0 0.00001 α 0.000004 y 1 ( 0 ) 2,700,000
β 0 0.01 β 0.012495 y 2 ( 0 ) 0
γ 0 0.01 γ 0.085684 y 3 ( 0 ) 0
δ 0 0.01 δ 0.001983 y 4 ( 0 ) 0
ε 0 0.01 ε 0.011766 τ 1 10
ζ 0 0.01 ζ 131.696203 τ 2 62

Share and Cite

MDPI and ACS Style

Ota, Y.; Mizutani, N. Estimating Parameters in Mathematical Model for Societal Booms through Bayesian Inference Approach. Math. Comput. Appl. 2020, 25, 42. https://doi.org/10.3390/mca25030042

AMA Style

Ota Y, Mizutani N. Estimating Parameters in Mathematical Model for Societal Booms through Bayesian Inference Approach. Mathematical and Computational Applications. 2020; 25(3):42. https://doi.org/10.3390/mca25030042

Chicago/Turabian Style

Ota, Yasushi, and Naoki Mizutani. 2020. "Estimating Parameters in Mathematical Model for Societal Booms through Bayesian Inference Approach" Mathematical and Computational Applications 25, no. 3: 42. https://doi.org/10.3390/mca25030042

Article Metrics

Back to TopTop