Next Article in Journal
Analysis of Weak Fault in Hydraulic System Based on Multi-scale Permutation Entropy of Fault-Sensitive Intrinsic Mode Function and Deep Belief Network
Next Article in Special Issue
A Data-Weighted Prior Estimator for Forecast Combination
Previous Article in Journal
Optimizing Deep CNN Architectures for Face Liveness Detection
Previous Article in Special Issue
Acknowledging Uncertainty in Economic Forecasting. Some Insight from Confidence and Industrial Trend Surveys
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems

by
Yuri S. Popkov
1,2,3,4
1
Federal Research Center “Computer Science and Control” of Russian Academy of Sciences, 119333 Moscow, Russia
2
Institute of Control Sciences of Russian Academy of Sciences, 117997 Moscow, Russia
3
Department of Software Engineering, ORT Braude College, 216100 Karmiel, Israel
4
Yugra Research Institute for Information Technologies, 628011 Khanty-Mansiysk, Russia
Entropy 2019, 21(4), 424; https://doi.org/10.3390/e21040424
Submission received: 19 March 2019 / Revised: 10 April 2019 / Accepted: 11 April 2019 / Published: 20 April 2019
(This article belongs to the Special Issue Entropy Application for Forecasting)

Abstract

:
The paper suggests a randomized model for dynamic migratory interaction of regional systems. The locally stationary states of migration flows in the basic and immigration systems are described by corresponding entropy operators. A soft randomization procedure that defines the optimal probability density functions of system parameters and measurement noises is developed. The advantages of soft randomization with approximate empirical data balance conditions are demonstrated, which considerably reduces algorithmic complexity and computational resources demand. An example of migratory interaction modeling and testing is given.

1. Introduction

The mutual influence of migratory processes in regional systems is a problem of growing significance in the modern world. The socioeconomic statuses of different regions demonstrate higher heterogeneity in response to rising political and military tension. All these factors cause an abrupt redistribution of migration flows and regional population variations, thereby increasing the cost of regional population maintenance [1,2,3,4]. Therefore, it is important to develop different tools (mathematical models, algorithms, and software) for forecasting the distribution of migration flows with adaptation to their dynamics considering available resources.
The authors of [5] suggested a dynamic entropy model for the migratory interaction of regional systems. In comparison with biological reproduction, migration mobility is a rather fast process [1,6]. Thus, the short-term dynamics of regional population size are described by the locally stationary state of a migratory process [7]. The latter can be simulated under the hypothesis that all migrants have a random and independent spatial distribution over interacting regional systems with given prior probabilities. The mathematical model of a locally stationary state is given by a corresponding entropy operator that maps the space of admissible resources into the space of migratory processes [8].
Mathematical modeling and analysis of interregional migration is considered in numerous publications. First, it seems appropriate to mention the monographs [9,10] that are dedicated to a wide range of interregional migration problems, including mathematical modeling of migration flows. Note that the problem of migration touches upon many aspects of socioeconomic, psychological and political status of the space of migratory movements. Thus, of crucial role is the structural analysis of inter- and intraregional migration flows [4] and motivations that generate them [2,11]. The results of structural and motivational analysis of migratory processes are used for computer simulation. There exist three directions of research in this field, each relying on some system of hypotheses. One of the directions involves the stochastic hypothesis about the origin of migratory motivations [12], which is simulated using agent technologies [13,14]. This direction is adjoined by investigations based on the thermodynamic model of migration flows [3,8]. Of course, the short list above does not exhaust the whole variety of migration studies, merely outlining some topics of research.
This paper studies a stochastic version of the model in [5], in which random parameters and measurement noises are characterized by probability density functions (PDFs). These functions are estimated using retrospective information on the real dynamics of regional population size with “soft” randomized machine learning [15]. The learned model was implemented in the form of computer simulations, i.e., generation of an ensemble of random trajectories with the entropy-optimal PDFs of the model parameters and measurement noises. The resulting ensemble was used for testing of the model and also for short-term forecasting.
The method developed below is illustrated by an example of the randomized modeling and forecasting of the migratory interaction among three EU countries (Germany, France, and Italy—the system GFI ) and two countries as sources of immigration (Syria and Libya—the system SL ).

2. Randomized Model of Migratory Interaction

Consider the dynamic discrete-time model of migratory interaction with shared resource constraints that is presented in [5]. The first sub-model represents migration flows within the system GFI and is described by the dynamic regression equation
K [ ( s + 1 ) h ] = ( A E ) K [ s h ] + F ( z [ s h ] ) , ( K , F ) R N , s = 0 , K 1 ¯ ,
where
A = h 1 α 2 a 21 α N a N 1 α 1 a 12 1 α N a N 2 α 1 a 1 N α 2 a 2 N 1 ,
E = h diag [ α n , n = 1 , N ¯ ] .
In these equations, K [ s h ] denotes the population distribution in the regional system GFI at a time s h .
At a time s h , the distribution of immigration flows from the regional system SL to the regional system GFI in terms of an entropy operator is modeled by the second sub-model, which can be described by a vector function F ( z [ s h ] ) with the components
f n [ s h ] = h j = 1 M b j n ( z [ s h ] ) c j n , n = 1 , N ¯ , s = 0 , K 1 ¯ ,
The variable z, which is the exponential Lagrange multiplier in the entropy-optimal distribution problem of immigration flows, satisfies the equation
k = 1 M n = 1 N c k n b k n ( z [ s h ] ) c k n = T [ s h ] ,
where T [ s h ] is the amount of a shared resource used by all regions from the system GFI to maintain immigrants.
In this model, the input data are the amounts T [ 0 ] , T [ h ] , , T [ ( K 1 ) h ] ; and the output data are the regional population distributions K [ 0 ] , K [ h ] , , K [ ( K 1 ) h ] .
The dynamic model in Equations (1)–(5) contains the following parameters:
  • α n [ 0 , 1 ] , n = 1 , N ¯ , as the shares of mobile population in system regions;
  • a i n [ 0 , 1 ] , ( i , n ) = 1 , N ¯ , as the prior probabilities of individual migration in the system GFI ;
  • b k n , k = 1 , M ¯ , n = 1 , N ¯ , as the prior probabilities of individual immigration from region k of the system SL to region n of the system GFI ; and
  • c k n , k = 1 , M ¯ , n = 1 , N ¯ , as the normalized 1 specific generalized cost of immigration maintenance.
Normalization means that 0 < c k n < 1 , k = 1 , M ¯ , n = 1 , N ¯ .
The parameters form three groups: mobility, migratory movements within the system GFI , and immigratory movements from the system SL to the system GFI . All these characteristics are specified by the regions of both systems. The dimensionality of the parametric space is reduced using the same approach as in [5]. The whole essence is to assign a relative regional differentiation of all parameters except for the weights b 1 (mobility) and b 2 (internal migration) of these groups, which are considered as model variables.
This approach leads to the parametric transformation
α n = b 1 m n , a i n = b 2 h i n , ( i , n ) = 1 , N ¯ ; k = 1 , M ¯ ,
where m n and h i n are given parameters which characterize the relation of variables.
Then, the dynamic model of migratory interaction in Equations (1)–(5) takes the form
K [ ( s + 1 ) h ] = ( b 1 b 2 A ˜ b 1 E ˜ ) K [ s h ] + F ˜ ( z [ s h ] ) ,
with the matrix
A ˜ = h 1 m 2 h 21 m N h N 1 m 1 h 12 1 m N h N 2 m 1 h 1 N m 2 h 2 N 1
and the diagonal matrix
E ˜ = h diag [ m n , n = 1 , N ¯ ] .
The vector F ˜ ( μ , z ) [ s h ] consists of the components
f ˜ n ( z [ s h ] ) = h k = 1 M q k n ( z [ s h ] ) c k n , n = 1 , N ¯ , s = 0 , K 1 ¯ .
For each time s h , the variable z satisfies the equation
k = 1 M n = 1 N c k n q k n ( z [ s h ] ) c k n = T [ s h ] , s = 0 , K 1 ¯ ,
i.e., there exist K values z = z [ s h ] , s = 0 , K 1 ¯ .
The randomized version of this model is described by Equations (7)–(11) but some parameters (variables) have random character. These are two randomized parameters, b 1 and b 2 , as well as the variable z = b 3 , all of the interval type. More specifically, the parameters b 1 and b 2 belong to the intervals
B 1 = [ b 1 , b 1 + ] , B 2 = [ b 2 , b 2 + ] .
The interval B 3 of the variable b 3 is given by Equation (11).
Theorem 1.
Let the parameters b k n and c k n in Equation (11) be positive and c k n [ 0 , 1 ] . Then, the solution b 3 of this equation belongs to the interval
B 3 = [ b 3 , b 3 + ] ,
where
b 3 = T [ s h ] M N c m a x b m a x 1 / c m a x ; b 3 + = T [ s h ] M N c m i n b m i n 1 / c m i n ; c m i n = min k n c k n , c m a x = max k n c k n ; b m i n = min k n b k n , b m a x = max k n b k n .
The proof is postponed to the Appendix A.
Therefore, the randomized dynamic model in Equations (7)–(11) includes three random parameters b = { b 1 , b 2 , b 3 } of the interval type that are defined over the three-dimensional cube with faces (Equations (12) and (13)), i.e.,
B = j = 1 3 B j .
The probabilistic properties of the randomized parameters are described by a continuously differentiable PDF W ( b ) .
By assumption, real distributions of regional population sizes contain errors that are simulated by a random vector ξ ¯ [ s h ] R N with the interval components
ξ ¯ [ s h ] Ξ s = ξ ¯ [ s h ] , ξ ¯ + [ s h ] .
The probabilistic properties of this vector are described by a continuously differentiable PDF Q ( ξ ¯ ) .
The measured output of the randomized model has an additive noise,
v [ s h ] = K [ s h ] + ξ ¯ [ s h ] .

3. Characterization of Empirical Risk and Measurement Noises

Construct a synthetic functional J [ W ( b ) , Q ( ξ ¯ ) ] that depends on the PDFs of the model parameters and measurement noises for assessing in quantitative terms the empirical risk (the difference between the regional population distribution generated by the model in Equations (7)–(11) and the real counterpart) and the guaranteed power of these noises. The functional must have components characterizing an intrinsic uncertainty of randomized machine learning (RML) procedures, the approximation quality of empirical balances (the empirical risk) and the worst properties of the corresponding random interval-type noises.
1. Uncertainty. In accordance with the general concept of RML, the first component among the listed ones is an entropy functional that describes the level of uncertainty:
H [ b ) , Q ( ξ ¯ ) ] = B W ( b ) ln W ( b ) d b Ξ Q ( ξ ¯ ) ln Q ( ξ ¯ ) d ξ ¯ .
The two other functional components are constructed using Hölder’s vector and matrix norms (The vector norm has the form a = max n | a n | ; the matrix norm, the form A = max i j | a i j | .) [16].
2. Approximate empirical balances. First, consider a characterization of the empirical risk. For the model in Equations (7)–(11), the deviation between the output and real data vectors is given by
ε ¯ [ s h ] = b 1 b 2 A ˜ b 1 E ˜ Y [ s h ] + F ( b 3 [ s h ] ) Y [ s h ] , s = 0 , K 1 ¯ .
Using well-known inequalities for the matrix and vector norms, it is possible to write
ϵ ¯ [ s h ] b 1 b 2 A ˜ b 1 E ˜ Y [ s h ] + F ( b 3 [ s h ] ) + Y [ ( s + 1 ) h ] = = φ ( b 1 , b 2 , b 3 , s ) , s = 0 , K 1 ¯ .
Introducing the average matrix and vector norms over the observation interval,
φ ( b 1 , b 2 , b 3 ) h 1 K s = 0 K 1 max n y n [ s h ] b 1 max n m n + b 1 b 2 max i , j h i j + + 1 K s = 0 K 1 max n y n [ ( s + 1 ) h ] + M N c m a x b m a x ( b 3 ) c m a x .
The parameters b 1 and b 2 take values within the intervals B 1 and B 2 (Equation (12)) while the parameter b 3 within the interval
B 3 = [ T m a x M N c m a x q m a x 1 / c m a x , T m a x M N c m i n q m i n 1 / c m i n ] ,
where
T m a x = max s T [ s h ] .
Denote
U 1 = h 1 K s = 0 K 1 max n y n [ s h ] max n m n ; U 2 = h 1 K s = 0 K 1 max n y n [ s h ] max i , j h i j ; U 3 = M N h c m a x b m a x ; U 4 = 1 K s = 0 K 1 max n y n [ ( s + 1 ) h ] .
Then, the function φ ( b 1 , b 2 , b 3 ) takes the form
φ ( b 1 , b 2 , b 3 ) = b 1 U 1 + b 1 b 2 U 2 + ( b 3 ) c m a x U 3 + U 4 .
Note that the coefficients U 1 , , U 4 are determined by real data on regional population distributions and also by the characteristics of internal migration within the system GFI and immigration flows from the system SL .
The equality in Equation (25) defines a function φ ( b 1 , b 2 , b 3 ) of random variables. Let its expectation be the characteristic of the empirical risk, i.e.,
r [ W ( b ) ] = B W ( b ) φ ( b ) d b ,
where B = B 1 B 2 B 3 and the intervals B 1 and B 2 have given limits. At the same time, the limits of the interval B 3 are specified by the equalities in Equation (22).
Power of noises. The measurement noises are simulated by random vectors ξ ¯ [ s h ] R N , s = 0 , K 1 ¯ . The components of these vectors may have different domains (ranges of values) at different times s = 0 , K 1 ¯ . For each time, introduce the Euclidean norm ξ ¯ [ s h ] N 2 and its expectation
n s [ Q ( ξ ¯ [ s h ] ) ] = Ξ Q ( ξ ¯ [ s h ] ) ξ ¯ [ s h ] N 2 d ξ ¯ [ s h ] .
The average expectation of this norm over the time interval has the form
n ¯ s [ Q ( ξ ¯ [ s h ] ) ] = 1 K s = 0 K 1 n s [ Q ( ξ ¯ [ s h ] ) ] .
If the measurement noises are the same on the observation interval, then the noise power functional can be written as
n ¯ s [ Q ( ξ ¯ [ s h ] ) ] = n [ Q ( ξ ¯ ) ] = Ξ Q ( ξ ¯ ) ξ ¯ N 2 d ξ ¯ .
This formula involves the Euclidean norm for a quantitative characterization of the noise power. However, it is possible to choose other norms depending on problem specifics.

4. Soft Randomized Estimation of Model Parameters

The model characteristics and measurement noises are estimated using a learning data collection: the real cost of immigrants maintenance T [ 0 ] , , T [ ( K 1 ) h ] (input data) and the real distributions of regional population sizes Y [ 0 ] , , Y [ ( K 1 ) h ] (output data).
In accordance with the general procedure of soft randomized machine learning [15], the optimal probability density functions W ( b ) (model parameters) and Q ( ξ ¯ ) (measurement noises) are calculated by the constrained minimization of the synthetic functional J [ W ( b ) , Q ( ξ ¯ ) ] that contains the following functionals:
  • the entropy
    H [ W ( b ) ] = B W ( b ) ln W ( b ) d b Ξ Q ( ξ ¯ ) ln Q ( ξ ¯ ) d ξ ¯ ;
  • the average empirical risk over the observation interval
    r [ W ( b ) ] = B W ( b ) ( b 1 U 1 + b 1 b 2 U 2 + ( b 3 ) c m a x U 3 + U 4 ) d b ;
    and
  • the average error norm
    n [ Q ( ξ ¯ ) ] = Ξ Q ( ξ ¯ ) i = 1 N ξ i 2 d ξ ¯ .
The soft randomized learning algorithm has the form
J [ W ( b ) , Q ( ξ ¯ ) ] = H [ W ( b ) ] r [ W ( b ) ] n [ Q ( ξ ¯ ) ] max , B W ( b ) d b = 1 , Ξ Q ( ξ ¯ ) d ξ ¯ = 1 .
The solution of this problem is the optimal PDFs under maximal uncertainty, for the model parameters of the form
W ( b ) = exp b 1 U 1 b 1 b 2 U 2 ( b 3 ) c m a x U 3 U 4 P ,
where
P = B exp b 1 U 1 b 1 b 2 U 2 ( b 3 ) c m a x U 3 U 4 d b ,
and for the measurement noises of the form
Q ( ξ ¯ ) = exp i = 1 N ξ i 2 Q ,
where
Q = Ξ exp i = 1 N ξ i 2 d ξ ¯ .
In the case of soft randomization, there is no need for solving the empirical balance equations, which have high complexity and computational intensiveness due to the presence of integral components. Here, computational resources are required for the normalization procedure of the resulting PDFs. On the other hand, the morphology of the optimal PDFs depends on a specific choice of the approximate data balancing criterion and a numerical characterization of the measurement noises.

5. Randomized Forecasting of Dynamic Migratory Interaction

Consider randomized forecasting of dynamic migratory interaction using the principle of soft randomization. Let T p r = [ s 0 h , s p r h ] be the forecasting interval and assume the initial state (the regional population distribution at the initial time s 0 h ) coincides with the real distribution, i.e., K [ s 0 h ] = Y [ s 0 h ] . The shared cost of the system GFI to maintain immigrants is distributed in accordance with a given scenario. For each scenario, the value T m a x and also the interval B 3 in Equations (12), (22), and (23) are determined.
The forecasted trajectories are constructed using the randomized model in Equations (7), (10), and (11)
K [ ( s + 1 ) h ] = b 1 b 2 A ˜ b 1 E ˜ K [ s h ] + F [ s h | b 3 ] , F [ s h | b 3 ] = { k = 1 M b k n ( b 3 ) c k n , n = 1 , N ¯ } , s = s 0 , s p r ¯ , K [ s 0 h ] = Y [ s 0 h ] .
The randomized parameters b 1 , b 2 , and b 3 take values within the corresponding intervals with the probability density function W ( b ) (Equation (34)).
An ensemble of the forecasted trajectories for the model’s output is obtained taking into account a random vector ξ ¯ Ξ with the PDF Q ( ξ ¯ ) (Equation (36)):
v [ s h ] = K [ s h ] + ξ ¯ , s = s 0 , s p r ¯ .
For each scenario T [ s 0 h ] , , T [ s p r h ] , an ensemble K of random forecasting trajectories is generated via sampling (the transformation of a PDF into a corresponding sequence of random vectors of length I) of the optimal PDFs of the model parameters and measurement noises for each time s h . The resulting ensemble allows deriving empirical estimates of different numerical characteristics as follows:
  • the average trajectory
    K ¯ [ s h ] = 1 I i = 1 I K ( i ) [ s h ] , s = s 0 , s p r ¯ ;
  • the variance trajectory
    σ ¯ 2 [ s h ] = 1 I 1 i = 1 I K ( i ) [ s h ] K ¯ [ s h ] 2 , s = s 0 , s p r ¯ ;
  • the variance pipe, i.e., the set of random trajectories that almost surely (since an ensemble consists of a finite number of trajectories, the matter concerns not probability but its empirical estimate) belong to the domain
    D = { K [ s h ] : K ¯ [ s h ] σ ¯ 2 [ s h ] K [ s h ] K ¯ [ s h ] + σ ¯ 2 [ s h ] , s = s 0 , s p r ¯ } ;
  • the empirical probability distribution and its dynamics on the forecasting interval
    P K [ s h ] Δ , s = s 0 , s p r ¯ = I Δ I ,
    where I Δ denotes the number of vectors K [ s h ] whose components are smaller than Δ ; and
  • the median trajectory K ^ [ s h ] , s = s 0 , s p r ¯ , which satisfies the equation
    P ( K [ s h ] ) = 0 , 5 ; s = s 0 , s p r ¯ .
The ensemble K can be used to calculate other characteristics, e.g., α -quantiles, confidence probabilities, etc.

6. Example

The appearance of territories with low economic status always causes the growth of immigration. The early 2000s were remarkable for the formation of several such territories in Northern and Central Africa, the Near East, Afghanistan, etc. As a result, tens of millions of migrants moved to the EU as the level of life in these territories dropped below the subsistence minimum. The EU countries have to allocate considerable financial resources for their filtering and accommodation, which are often unacceptable. An example below illustrates the use of soft randomization for estimating and forecasting of immigration flows from Syria (1) and Libya (2) (the system SL ) to Germany (1), France (2), and Italy (3) (the system GFI ).
1. Randomized model, parameters, measurement errors, time intervals, and real data collections. Choose the randomized mathematical model (Equation (25)) with the normalized variables
p n [ s h ] = K n [ s h ] K m a x , n = 1 , 3 ¯ .
This gives
p n [ ( s + 1 ) h ] = ( 1 b 1 m n ) p n [ s h ] + h b 1 b 2 i = 1 , i n 3 m i h i n p i [ s h ] + h f n [ s h ] , f n [ s h ] = i = 1 M b i n b 3 c i n , n = 1 , 3 ¯ , T [ s h ] = n = 1 3 i = 1 2 c i n b i n b 3 c i n .
The state variables of the system GFI and also the immigration flows from the system SL are normalized, i.e.,
0 p n [ s h ] 1 , 0 f n [ s h ] 1 , n = 1 , N ¯ .
The variable z characterizes the entropy operator of the immigration process and satisfies the last equation in Equation (46). The values of the parameters m i , h i n , b i n , and c i n are combined in Table 1, where columns are different values of corresponding parameter. Recall that the two lowest rows of Table 1 indicate the values of the parameters c i n . By assumption, the regions of both systems have the same specific cost.
In accordance with this table, m m a x = 0.5 , h m a x = 0.5 , b m i n = 0.3 , b m a x = 0.4 , and c m a x = c m i n = c = 0.5 . The measurement errors of population sizes ξ ¯ [ s h ] R 3 (in normalized units) belong to the intervals
ξ ¯ [ s h ] Ξ = [ ξ ¯ , ξ ¯ + ] , ξ n ± = 0.01 ,
and by assumption they have the same limits for times s h .
The normalized observation (model output) has the form
v [ s h ] = p [ s h ] + ξ ¯ [ s h ] .
The random parameter model in Equation (46) was employed for estimating parameter characteristics and testing on corresponding time intervals with step h = 1 year:
  • T e s t = 2009 –2013 as the estimation interval; and
  • T t s t = 2014 –2018 as the testing interval.
2. Entropy estimation of PDFs of model parameters and measurement noises (interval T e s t ). This problem was solved using available data on regional population distribution for Germany ( n = 1 ), France ( n = 2 ), and Italy ( n = 3 ) and also on the shared cost of immigrants maintenance on the estimation interval (see Table 2 and UNdata service at https://data.un.org/).
In this model, the random parameters b 1 , b 2 , and b 3 take values within the intervals
b 1 B 1 = [ 1.0 , 2.5 ] ; b 2 B 2 = [ 0.5 , 1.8 ] , b 3 B 3 = [ 0.3 , 1.5 ] .
In accordance with Equation (24),
U 1 = 0.5 ; U 2 = 0.5 ; U 3 = 1.2 ; U 4 = 0.986 .
Then, the soft RML procedure yields the following optimal PDFs of the model parameters and measurement noises:
W ( b ) = exp 0.5 b 1 0.5 b 1 b 2 1.2 b 3 0 . 5 0.986 W , Q ( ξ ¯ ) = exp n = 1 3 ξ n 2 Q ,
where
W = B 1 B 2 B 3 exp 0.5 b 1 0.5 b 1 b 2 1.2 b 3 0.5 0.986 d b 1 d b 2 d b 3 , Q = n = 1 3 0.01 0.01 exp ( ξ 2 ) d ξ .
The two-dimensional sections of the three-dimensional PDFs of the model parameters are shown in Figure 1a–c, while the graphs of the PDFs of the measurementnoises in Figure 2.
3. Model testing. The randomized model in Equation (49) with the optimal PDFs in Equations (52) and (53) was tested using the above data on regional population sizes from the UNdata service (https://data.un.org/) (see Table 3). This table also presents the testing results in terms of the ensemble-average trajectories p ¯ 1 [ s h ] , p ¯ 2 [ s h ] , and p ¯ 3 [ s h ] .
Testing was performed via sampling of the randomized interval parameters with the PDFs in Equations (52) and (53) and construction of the corresponding trajectories by Equation (49). Figure 3a–c shows ensembles of such trajectories v 1 [ s h ] , v 2 [ s h ] , v [ s h ] as well as the ensemble-average trajectories v ¯ 1 [ s h ] , v ¯ 2 [ s h ] , v ¯ 3 [ s h ] (Graph 1); the real trajectories y 1 [ s h ] , y 2 [ s h ] , y 3 [ s h ] of regional population sizes (Graph 2); and the limits of the variance pipes p ¯ 1 [ s h ] ± σ 1 , p ¯ 2 [ s h ] ± σ 2 , p ¯ 3 [ s h ] ± σ 3 (Graph 3).
The testing accuracy was estimated in terms of the relative root-mean-square error
δ n = s = 0 4 p ¯ n [ s h ] y n [ s h ] 2 s = 0 4 ( p ¯ n [ s h ] ) 2 + s = 0 4 ( y n [ s h ] ) 2 .
In the example under study, it constituted 4.6% (Region 1), 3.5% (Region 2), and 2.6% (Region 3).

7. Conclusions

This paper has developed a mathematical model for dynamic migratory interaction of regional systems with locally stationary states described by corresponding entropy operators. The model incorporates random parameters, and their probabilistic characteristics—the probability density functions of system parameters and measurement noises—have been calculated using soft randomized machine learning. An example of migratory interaction modeling and testing has been given.

Funding

This work was supported by Russian Science Foundation (project No. 17-11-01220).

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Proof of Theorem 1.
Consider the function
φ ( z ) = μ k = 1 M n = 1 N c k n q k n ( z [ s h ] ) c k n ,
which appears on the left-hand side of Equation (19). Taking advantage of the obvious inequalities,
φ ( z ) = M N μ c m i n q m i n ( z ) c m i n < φ ( z ) < M N μ c m a x q m a x ( z ) c m a x = φ + ( z ) .
The variables are 0 < c m i n < 1 , 0 < c m a x < 1 , c m i n < c m a x , and c m i n < c k n < c m a x . Consider the equations
φ ( z ) = T [ s h ] , φ ( z ) = T [ s h ] , φ + ( z ) = T [ s h ] .
The functions φ ( z ) , φ ( z ) , and φ + ( z ) are strictly convex. Therefore, the solutions of these equations has the relationship
z < z < z + ,
which concludes the proof of Theorem 1. □

References

  1. Bilecen, B.; Van Mol, C. Introduction: International academic mobility and inequalities. J. Ethic Migr. Stud. 2017, 43, 1241–1255. [Google Scholar] [CrossRef]
  2. Black, R.; Xiang, B.; Caller, M.; Engberson, G.; Heering, L.; Markova, E. Migration and development: Causes and consequences. In The Dynamics of International Migration and Settlement in Europe: A State of the Art; Penninx, R., Berger, M., Kraal, K., Eds.; Amsterdam University Press: Amsterdam, The Netherlands, 2006; pp. 41–63. [Google Scholar]
  3. Imel’baev, S.S.; Shmul’yan, B.L. Modeling of stochastic communication systems. In Entropy Methods for Complex Systems Modeling; Wilson, A.G., Ed.; Nauka: Moscow, Russian, 1975; pp. 170–234. [Google Scholar]
  4. Van der Knaap, G.A.; Steegers, W.F. Structural analysis of interregional and intraregional migration patterns. In Demographic Research and Spatial Policy; Heide, H., Willekens, F., Eds.; Academic Press: Voorburg, The Netherlands, 1984. [Google Scholar]
  5. Popkov, Y.S. Dynamic entropy model for migratory interaction of regional systems. Tr. Inst. Sist. Analiz. Ross. Akad. Nauk 2018, 2, 3–11. [Google Scholar]
  6. Zelinsky, W. The hypothesis of the mobility transition. Geogr. Rev. 1971, 46, 219–249. [Google Scholar] [CrossRef]
  7. Popkov, Y.S. Mathematical Demoeconomy: Integrating Demographic and Economic Approaches; De Gruyter: Berlin, Germany, 2014. [Google Scholar]
  8. Wilson, A.G. Entropy in Urban and Regional Modelling; Routledge: London, UK, 1970. [Google Scholar]
  9. Rogers, A.; Willekens, F.; Raymer, J. Modelling interregional migration flows: continuity and change. J. Math. Popul. Stud. 2001, 9, 231–263. [Google Scholar] [CrossRef]
  10. Rogers, A.; Little, J.; Raymer, J. The Indirect Estimation of Migration: Methods for Dealing with Irregular, Inadequate, and Missing Data; Springer Science & Business Media: Berlin, Germany, 2010. [Google Scholar]
  11. Volpert, V.; Petrovskii, S.; Zincenko, A. Interaction of human migration and wealth distribution. Nonlinear Anal. 2017, 150, 408–423. [Google Scholar] [CrossRef]
  12. Pan, J.; Nagurney, A. Using Markov chains to model human migration in a network equilibrium framework. Math. Comput. Model. 1994, 19, 31–39. [Google Scholar] [CrossRef]
  13. Klabunde, A.; Willekens, F. Decision-making in agent-based models of migration: State of the art and challenges. Eur. J. Popul. 2016, 32, 73–97. [Google Scholar] [CrossRef] [PubMed]
  14. Klabunde, A.; Zinn, S.; Willekens, F.; Leuchter, M. Multistable modelling extended by behavioural rules. An application to migration. Popul. Stud. 2017, 71, 61–67. [Google Scholar] [CrossRef] [PubMed]
  15. Popkov, Y.S. Soft randomized machine learning. Doklady Math. 2018, 98, 646–647. [Google Scholar] [CrossRef]
  16. Voevodin, V.V.; Kuznetsov, Y.A. Matrices and Calculations; Nauka: Moscow, Russian, 1984. (In Russian) [Google Scholar]
Figure 1. 2-dimensional section of W.
Figure 1. 2-dimensional section of W.
Entropy 21 00424 g001
Figure 2. 2-dimensional section of Q.
Figure 2. 2-dimensional section of Q.
Entropy 21 00424 g002
Figure 3. (a) v ¯ 1 [4], (b) v ¯ 2 [4], (c) v ¯ 3 [4].
Figure 3. (a) v ¯ 1 [4], (b) v ¯ 2 [4], (c) v ¯ 3 [4].
Entropy 21 00424 g003
Table 1. Values of relative parameters.
Table 1. Values of relative parameters.
m n 0.430.500.40
h 1 n 00.30.3
h 2 n 0.300.3
h 3 n 0.50.40
b 1 n 0.40.30.3
b 2 n 0.30.10.4
c 1 n 0.40.40.3
c 2 n 0.40.40.3
Table 2. Input and output data collections.
Table 2. Input and output data collections.
Year20092010201120122013
s01234
Y 1 [ s ] 81.9081.7780.2780.4280.64
y 1 [ s ] 1.000.9980.9800.9820.985
Y 2 [ s ] 62.4762.8063.1163.4163.70
y 2 [ s ] 0.7620.7670.7710.7740.778
Y 3 [ s ] 59.3959.5359.6359.7159.75
y 3 [ s ] 0.7250.7270.7280.7290.726
T [ s ] (billion)0.0930.0940.0950.0960.097
Table 3. Input and output data collections.
Table 3. Input and output data collections.
Year20142015201620172018
s01234
Y 1 [ s ] 81.48981.70782.06382.38682.674
y 1 [ s ] 0.9850.9880.9930.9961.000
p ¯ 1 [ s h ] 0.9860.6150.7430.6390.999
Y 2 [ s ] 64.19064.45764.79165.13465.484
y 2 [ s ] 0.7210.4720.5640.5290.708
p ¯ 2 [ s h ] 0.7220.6950.7070.6910.715
Y 3 [ s ] 59.58559.50459.50459.50959.516
y 3 [ s ] 0.7750.6090.5620.6990.650
p ¯ 3 [ s h ] 0.7760.6170.6070.7050.628
T [ s ] (billion)0.0970.0970.0970.0980.098

Share and Cite

MDPI and ACS Style

Popkov, Y.S. Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems. Entropy 2019, 21, 424. https://doi.org/10.3390/e21040424

AMA Style

Popkov YS. Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems. Entropy. 2019; 21(4):424. https://doi.org/10.3390/e21040424

Chicago/Turabian Style

Popkov, Yuri S. 2019. "Soft Randomized Machine Learning Procedure for Modeling Dynamic Interaction of Regional Systems" Entropy 21, no. 4: 424. https://doi.org/10.3390/e21040424

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop