Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models

Basile, Antonio; Gallo, Mariano

doi:10.3390/world6020046

Open AccessArticle

Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models

by

Antonio Basile

and

Mariano Gallo

^*

Department of Engineering, University of Sannio, 82100 Benevento, Italy

^*

Author to whom correspondence should be addressed.

World 2025, 6(2), 46; https://doi.org/10.3390/world6020046

Submission received: 24 January 2025 / Revised: 28 March 2025 / Accepted: 3 April 2025 / Published: 7 April 2025

Download

Browse Figures

Versions Notes

Abstract

:

This study estimates the impact of some socio-economic, real estate, and accessibility factors on the demographic change of the Italian provinces. Migration rates were analysed for one hundred and thirteen provincial capitals, or former provincial capitals, and their territories, and the correlation between them and various socio-economic and accessibility variables was studied. The data analysis showed significant heterogeneity between the different territorial areas of the country, highlighting the well-known phenomenon of migration from southern to northern regions. The aim of this paper is to investigate whether and to what extent accessibility variables have a direct influence on internal migration phenomena, in addition to the other socio-economic variables involved. Multiple linear regression models were specified and calibrated to correlate migration rates with various socio-economic and accessibility variables. The results show a non-negligible impact of certain accessibility variables on the migration phenomenon and suggest the need to work on the infrastructure front to rebalance the country’s demographic and socio-economic structure.

Keywords:

accessibility; migration flows; regression models

1. Introduction

The demographic decrease in Europe is a well-known, widely studied, and constantly evolving phenomenon. The European Union has set up a standing committee of 43 MEPs called the Committee on Regional Development (REGI), which is responsible, among other things, for implementing and improving the Union’s regional development and cohesion policy according to the EU Treaties. Based on the report produced by this commission in 2021 [1], the European Parliament considered the combination of migration flows and low birth rate to be decisive in the depopulation of some regions, especially in Eastern and Southern Europe. Furthermore, it identified several critical issues caused by this phenomenon, including the so-called ‘brain drain’, i.e., the impoverishment of highly skilled and qualified personnel, deficits in the provision of health and cultural services, information technology, and physical connectivity, particularly in terms of transport, education, and employment opportunities. Among other things, the transport system was identified as a possible lever to counter this phenomenon, and the inclusion of specific actions for rural and peripheral regions would also be a step towards solving the problem: “The EU should not neglect the rural and remote regions in its mobility strategy: transport networks can halt depopulation by reinforcing rural-urban connectivity” [2].

Regarding Italy, the subject of this work, the ISTAT 2023 report [3] identified a decrease in emigration and an increase in internal mobility and immigration. In particular, the regions of Southern Italy suffered a demographic decline of 525,000 inhabitants between 2012 and 2021.

The reasons for this phenomenon are complex and heterogeneous, and their identification is not easy since there are several factors involved. Although the phenomenon has been studied from different points of view, the socio-economic conditions, the availability of services, and job opportunities as the main factors favouring internal migration from the most depressed areas to the richest ones of a country have not been considered in previous studies. To the best of our knowledge, a specific study on the influence that accessibility may have on this phenomenon in the Italian context is not yet available in the literature. This study aims to fill this research gap.

Therefore, the aim of this study is to investigate the correlation between changes in the total migration rate (to and from other Italian provinces and to and from abroad) and accessibility, trying to find correlated variables and to identify a possible causal relationship. Verifying how and in what way some accessibility variables influence the migration rate and being able to quantify the effect of interventions on transport systems from this point of view can guide transport planning and policy, as well as taking into account these effects, which have a social, as well as economic, impact that is relevant for the organic and inclusive development of the country.

Over 180 linear regression models were calibrated by setting the total migration rate as a function of various accessibility and socio-economic variables. The models were calibrated using demographic data from 2017 to 2022 for 105 Italian provinces.

Since these data are usually available, the proposed methodology is applicable to any territory in the world. It is also applicable to superordinate territorial units, such as regions and nations, and to subordinate territorial units, such as municipalities.

The main limitation of the proposed methodology is that it cannot examine migration flows within a territorial unit; this aspect of the problem can be dealt with in further research developments.

This document is structured as follows. Section 2 examines the background. Section 3 defines the variables and data used. Section 4 describes the specification and calibration of the regression models. Section 5 discusses the results. Finally, Section 6 summarises the conclusions. Appendix A reports the data used.

2. Background

Italy is a southern European nation with a population of about 59 million and a territory of about 302,000 square kilometres; the average population density is about 195 inhabitants/km². The per capita GDP is about EUR 33,000 per inhabitant [4], with substantial disparities between different areas of the country: the regions of Northern Italy have an average per capita GDP of about EUR 40,000, which is reduced to about EUR 35,000 for the regions of Central Italy and reaches about EUR 22,000 for the regions of Southern Italy and the islands. This economic disparity, which is also representative of a different labour market, is one of the causes of internal migration that this study deals with.

Statistical data [4,5] show that in 2023, there were about 1.44 million changes in residence (see Figure 1), in line with the growing trend of the last decade.

Southern Italy records an outflow of inhabitants that is not compensated for by a corresponding inflow: in 2023, internal outward migration from the southern regions was about 407,000, while incoming migration was about 344,000, with a net loss of 63,000 inhabitants. The migration rate in these regions is −3.2 per thousand inhabitants. Northern Italian regions, on the contrary, are more attractive: transfers to a Northern Italian municipality from any other municipality amounted to about 842 thousand, of which 785 thousand came from another northern municipality, with a migration rate of +2.1 per thousand inhabitants.

International migratory flows show about 416 thousand entries and a growth of +1.1% compared to the previous year. On the contrary, outgoing migration from Italy to other countries is strongly decreasing: −5.6% compared to 2022 and −21% compared to 2019.

An additional data analysis [4] highlights an increase in internal migration, while outgoing international migration is still in line with pre-COVID-19 pandemic data. The restrictions of the pandemic have strongly influenced international inbound migration: after an all-time peak of 301,000 entries in 2017, Italy recorded a drop of 192,000 foreign entries in 2020 and 244,000 in 2021. In 2022, there were 336,000 entries; in 2023, they reached 360,000, setting new records for immigration from abroad. Based on these data, it can be said with certainty that COVID-19, while having impacted international inbound migration, did not negatively affect internal migration, which actually increased.

Migration to some territories at the detriment of others is a phenomenon that can cause socio-economic imbalances with an inefficient spatial distribution of resources.

The phenomenon of migration has been widely studied in the literature, with different methodological and theoretical approaches. Many studies have dealt with international migration, for which several theories have been proposed. A review and analysis of the theories of global migration can be found in [6,7,8]. The main theories are (i) the neoclassical theory, (ii) the new economics theory, (iii) the world systems theory, and (iv) the dual labour market theory.

The neoclassical theory [9] hypothesises that migration is determined by the differences between the returns on labour in different markets, particularly salary differences, and the different supply and demand for labour. In addition to a macro approach, i.e., based on aggregate data, it is possible to simulate individual choice at a micro level with the human capital theory of migration [10].

The new economics theory [11] hypothesises that the decision to migrate is not made by a single person but by entire families, taking into account not only income factors but also the difference in the family’s income compared to other families and the risk aversion of sending a family member abroad. This theory is usually considered less solid and less frequently used than others, due to the difficulty of evaluating some variables involved that would be necessary for its implementation.

The world systems theory [12] considers that the migratory phenomenon is influenced by structural changes in world markets, globalisation, the interdependence between economies, and innovative forms of production. Therefore, capital mobility is seen as interconnected to the mobility of people for work reasons. The theory also takes into account global political and economic inequalities.

The dual labour market theory [13] hypothesises that migration is mainly influenced by the demand for labour, distinguishing between capital-intensive economies, where both skilled and unskilled labour are required, and labour-intensive economies, where the primary demand is for unskilled labour. This theory cannot explain the different immigration rates in countries with similar types of economies.

Internal migration within a country has also been studied; the phenomenon differs for developing countries [14] and developed countries [15].

In the first case, the search for a job is one of the main factors that leads to internal migration; the attractiveness of areas depends on the presence of industrial activities, salary differences, and employment levels. Furthermore, the presence of a network, i.e., relatives or friends in a specific area, has an additional attractive effect, also due to the availability and ease of obtaining information on job opportunities. In the second case, migration allows the workforce to be redistributed geographically according to economic and demographic changes and differences. The human capital model is a valuable tool for studying labour economics, but it is not sufficient to explain the phenomenon of internal migration on its own.

One study [16] focused on the Italian case study and used a gravitational model to study internal migration based on human capital, considering per capita GDP, unemployment rate, population, and level of education as variables—the work identified per capita GDP and the unemployment rate as the main variables influencing migratory flows.

Some classic migration theories were tested in [17], highlighting how it would be useful to develop more complex models to interpret the phenomenon, considering that some theories, when verified on real data, fail to reproduce them accurately.

Internal migration is influenced by the same variables that generate international migration. The objective of this work is to verify if and to what extent the accessibility of the territories and the available transportation services, with reference to high-speed rail, influence internal migratory choices. Transport infrastructures and services generate economic and spatial impacts [18]; among the spatial impacts, those on accessibility are particularly relevant, influencing regional development in the European Union [19].

The importance of infrastructure and transportation services for the socio-economic development of a territory or a community has been widely studied in the literature.

The effect that an inefficient distribution of high-speed rail (HRS) services can have on certain territories was recently studied in [20]. It was found that opening new high-speed stations can reduce migration not only in the cities directly involved but also in neighbouring territories. It has been shown [21] that implementing a high-speed rail network can produce economic growth in urban areas with appreciable effects over five years. According to a Chinese study [22], a reduction in the resident population can be aggravated by improper planning of motorway and high-speed rail infrastructure, particularly for cities that are affected by demographic and economic shrinking. Another study in China [23] focused on the impact of HSR services on the development of the industrial structure of some of the country’s non-central cities; the positive effects were mainly shown for cities less than 100 km away from a central city (provincial capital or city controlled by the central government). Kim [24] examined, for a case study in Korea, the impact of the development of the high-speed railway between Seoul and Pusan on changes in the spatial structure in the region, finding a trend towards population concentration in the area around the capital.

Since the effects of high-speed railways on the labour market have been little studied in previous decades, some recent efforts have been made, such as the calibration of a linear regression model limited to the Madrid metropolitan area [25], which shows that while the growth of labour contracts has increased thanks to high-speed rail services, the unemployment rate and house prices became increasingly less significant in the reference period (2004–2015). Furthermore, the greater accessibility enabled by the introduction of high-speed rail increased interregional economic connections, which also included the labour force, and interregional economic models shifted from a point–axis model to a network model [26].

The economic impacts, in terms of interregional disparity and territorial distribution, produced by introducing a large-scale HSR system were estimated by combining industrial input–output relations with changes in passenger accessibility [27].

Road infrastructure can also have an important socio-economic impact in some rural areas. A study in China [28] showed that improving road infrastructure impacts the income of rural residents; this impact was more significant for those who initially had lower incomes.

A study in the United States [29] showed how restrictive housing policies can contribute to the decline in growth by generating a phenomenon of spatial deallocation.

Regarding inter-urban migration phenomena in large cities and other urban planning and transport aspects, urban ‘sprawl’ and the accessibility of transport to urban centres have been identified as causes of migration to large cities in Russia [30]. The impact of international immigration on the unemployment rate in Canada has been studied in [31], showing that in the short term, there is an increase in the number of people unemployed due to the difficulty of integration, but in the long term, this effect is reabsorbed. The link between migration and employment rates in the United Kingdom has been studied in [32]. A study of the impact of migration on Asian countries is reported in [33].

The accessibility and availability of transport infrastructure and services significantly impact property values [34,35,36,37], highlighting the importance of analysing all externalities, positive and negative, of transport investments.

The effects of the correlation between infrastructure development and the property market have been studied in China [38]; in particular, the increase in housing prices went hand in hand with the rise in the demand and the introduction of transit systems, such as trams [39], even if these studies are limited to a restricted territorial area.

Demographic imbalances in Italy are commonly recognised, and the resulting inequalities are considered very impactful and predominantly linked to socio-economic variables [40]. A study of forty years of data on Italian migration has shown that individual characteristics such as sex, age, and skills have an impact on the weight of economic factors [41]. A recent study on Italian demographic vibrancy [42] included accessibility as an index based on travel time and concluded that the introduction of high-speed rail services alone is not able to change demographic trends, even though it influences the dynamics of some demographic indices by attracting weaker demographic classes to more accessible locations.

3. Data and Methods

The data collection and analysis phase encountered difficulties due to the heterogeneity of the different sources, which sometimes referred to different periods. Demographic data and some socio-economic data were taken from ISTAT (National Institute of Statistics), income data from the MEF (Ministry of the Economy and Finance), data on property values from the Italian Fiscal Agency, data on university locations and personnel from the MUR (Ministry of Universities and Research), and, finally, accessibility data were calculated using specifically implemented models.

The database has the province as its basic territorial unit; where necessary, data were derived from the municipalities composing each province. Since the study needed to be based on the variation over the years of the characteristics of the territories, and the migration rate refers to a specific time interval, in some cases, it was necessary to combine or separate the territorial units. Indeed, during the reference period, some provinces were merged or separated; therefore, the data for these provinces were not homogeneous. Where it was possible to correct the data, they were homogenised and maintained in the database (6 provinces), while, in other cases, removing them from the database was necessary. Overall, only 2 provinces of 107 were removed from the database for these reasons; the provinces not considered are highlighted in Figure 2 with a dotted background (N/A) and are ‘Monza e Brianza’ and ‘Sud Sardegna’.

This study analysed the total migration balance between 2017 and 2022; in the following sections, this variable takes on the role of the dependent variable, representing the phenomenon we want to explain according to the available data. The generic territory (in our case, the generic province) is denoted by i, and the total migration balance is denoted by tmb_i. This variable is equal to the sum of the annual migration balance for 2017 to 2022; the yearly migration balance is the difference between the number of registrations and the number of cancellations from the population registers. It is then related to the number of inhabitants of each territorial unit so that it is expressed in percentage terms and, more precisely, as per thousand (⁰/₀₀). The values of this indicator were calculated based on registry data provided by ISTAT; their distribution is shown in Figure 3, while Figure 2 shows the different values on a map. It should be noted that, as the calculation was based on registry data, the migratory balance also includes data on legal immigrants moving from one region of the country to another. Given the study’s primary objective, which is to assess the impact of accessibility, this is not considered to have altered the phenomenon analysed. The period analysed includes the COVID-19 pandemic, during which the migratory phenomenon was limited, if not absent. To limit the impact of this disturbance on the data, an extended period (5 years) was used, and the data were the sum of the annual migratory balances, which led to an estimate of the overall variation, not an average value, of the population in the 5 years. Considering that the objective of the analysis was to evaluate the impact of accessibility on the phenomenon, it is believed that the approach used can be regarded as valid for this specific study.

The explanatory variables, i.e., those able to explain the phenomenon of internal migration flow, are described in the following subsections and classified into:

Socio-economic variables;
Property value variables;
Accessibility variables.

The choice of variables was based on the analysis of the literature and the authors’ experience in the sector.

The socio-economic variables considered refer mainly to three factors: (1) aspects related to the labour market (employment/unemployment rate and number of job holders); (2) aspects related to the economic wealth of the territory (average income); and (3) the presence or absence of a university. Except for the variable under point (3), which was included to verify whether the presence or absence of a university could have a significant influence on the phenomenon of migration, the other variables represent the main factors influencing internal migration in Western countries, as can be seen from the analysis of the literature and documents produced by the European Union. For international migration, other factors must be considered alongside the economic ones, from wars to climate change, which are clearly not considered in this case. The choice for the real estate values was the variable usually used in similar studies [34]. For the accessibility variables, those most frequently used in studies concerning other phenomena in which accessibility plays an important role were chosen [43,44].

The values of all variables tested, both those taken from public databases and those generated by the transport supply model or calculated by us, are reported in Appendix A so as to allow the reproducibility of the results obtained.

3.1. Socio-Economic Variables

The socio-economic variables, calculated by provincial territory, refer to the employment rate, the number of employed persons classified by sector of activity, and the variables for the presence or absence of a university. These variables and their description are given below:

or_i, occupation rate, calculated as the ratio of employed persons to the population aged 15 years or over;
ur_i, unemployment rate, calculated as the ratio of persons seeking jobs to the labour force, where the latter includes employed and unemployed persons;
aipc_i, average income per capita, calculated by dividing the total taxable income by the number of inhabitants;
jh_i, job holders per inhabitant, calculated as the ratio of the number of job holders to the population;
mjh_i, job holders in manufacturing activities per inhabitant;
cjh_i, job holders in commercial activities per inhabitant;
ejh_i, job holders in educational activities per inhabitant;
ajh_i, job holders in all other activities per inhabitant;
ujh_i, job holders in university teaching and research activities per inhabitant;
uni_i, presence of a university, dummy variable (0/1).

3.2. Property Value Variables

Property values can also impact migration phenomena, although the influence is uncertain: high values may indicate a good quality of life, thus attracting people, but can also be an obstacle for lower incomes. In this work, the following variable was considered:

apv_i, an average of residential property values by provincial territory. The median between the minimum and maximum residential property values, in euros per square metre, for municipalities j in each province i was calculated based on the values provided by the Italian Fiscal Agency. Of these values, the average for each province i was calculated:

$a {p v}_{i} = (\sum_{j \in p r o v_i} \frac{{V m i n}_{j} - {V m a x}_{j}}{2}) / n_{i} \forall_{i} [€ / m^{2}]$

where
prov_i is the set of municipalities belonging to province i;
Vmin_j [Vmax_j] is the minimum [maximum] property value per square metre in municipality j;
n_i is the number of municipalities within province i.

3.3. Accessibility Variables

The accessibility variables were calculated using a method similar to that proposed in [43]. The calculation of some of these variables required the construction of a road transport supply model. This model was developed in previous research, and the paper was just cited. Figure 4 shows the graph of the road network, which consists of 767,674 links and represents 202,628 km of road (more details are given in [43]).

The following subsections describe the different accessibility variables proposed.

3.3.1. Availability of High-Speed Rail Services

It measures the presence of high-speed rail services. The variable, expressed as hsr_i, reports the number of runs on high-speed railway lines arriving at/departing from the station of the provincial capital. If the service is absent, the value of this variable is zero.

3.3.2. Distance from Rome Leonardo da Vinci Airport

The Rome Leonardo da Vinci Airport is Italy’s largest and most important hub. Most intercontinental and international connections depart from and arrive at this airport. Therefore, it is considered that greater or lesser proximity may affect migration. The proposed variable, dha_i, is the reciprocal of the distance between the provincial capital and the airport site, measured on the road graph described above:

{d h a}_{i} = 1 / {d i s t_h a}_{i} \forall_{i}

where dist_ha_i is the distance between province capital i and Leonardo Da Vinci airport (hundreds of km).

3.3.3. Distance from the Nearest International Airport

A variable similar to the previous one refers to the nearest international airport. In each region, only the international airport with the most passengers served was considered in case the region hosted more than one airport. The list of international airports considered is shown in Table A4 in Appendix A.

The variable, dia_i, is calculated as the reciprocal of the distance of the provincial capital from the nearest airport:

{d i a}_{i} = 1 / {{m i n}_{k} (d i s t_i a}_{i, k}) \forall_{i}

where

k indicates the generic international airport;

dist_ia_i,k is the distance between province capital i and the international airport k (hundreds of km).

3.3.4. Population-Weighted Road Accessibility

Another aspect that is supposed to be relevant to the migration phenomenon is road accessibility. This type of accessibility can be calculated in several ways; in this work, the calculation was carried out using the gravity-based method proposed by Hansen [45], which was adequately adapted to the purposes of the analysis.

For each province, i, the variable, denoted ra_i, was calculated as

{r a}_{i} = \sum_{m \neq i} {i n h}_{m} / {r t}_{i, m} \forall_{i}

where

m indicates a generic municipality other than i;

inh_m indicates the number of inhabitants in the municipality m;

rt_i_,m is the travel time by car between locations i and m, calculated using the supply model described above.

3.3.5. Road Distance Indicator

Another indicator of accessibility, rd_i, is based on the distance within the road network (in km × 10⁻³) between provincial capital i and any other municipality, m. We calculate this indicator as follows:

{r d}_{i} = 1 / \sum_{m \neq i} d_{i, m} \forall_{i}

where d_i_,m represents the distance between provincial capital i and municipality m.

3.3.6. Road Travel Time Indicator

This indicator, denoted by rtt_i, is similar to the previous one, only it is based on car travel time, calculated using the transport supply model described above:

{r t t}_{i} = 1 / \sum_{m \neq i} {r t}_{i, m} \forall_{i}

where rt_i_,m has already been defined above.

4. Regression Models

The objective of this study was to calibrate a model that is able to estimate the influence that the variables defined above may have on migratory phenomena. Even if the main focus is on accessibility, it is necessary to consider all the variables that are considered useful in explaining the phenomenon so as to be able to weigh the contribution of each variable correctly. To formulate a model that best represents the contribution of each variable, it is necessary to test multiple combinations of them, thus calibrating multiple models and choosing the one that best approximates the reality.

Among the different possible models, we chose to use multiple linear regression models [46], which have strengths and weaknesses. Among the strengths, the main ones refer to the speed with which they can be calibrated (practically, in closed form and many software programs, including Excel, have tools for their calibration), to the many who have used them in different scientific sectors, and to the ease of generating reliable statistical tests to evaluate the goodness of the model. The main weak point is the hypothesis that the studied phenomenon has a linear relationship with each explanatory variable.

More complex models, such as spatial econometric models [47] or machine learning techniques [48], can interpret migratory phenomena by considering other variables, such as cultural ties, language, prevailing religion, and government structures, which can influence some migratory decisions. In this study, we considered it sufficient to use a multiple linear regression model, both because the primary purpose was to evaluate the impact of accessibility variables and because, when dealing with internal migration, the aforementioned variables have a less significant impact than in the case of international migration.

A multiple linear regression model relates a dependent variable, Y, with several independent variables, X; a general formulation can be expressed as

Y = β_{0} + β_{1} X_{1} + β_{2} X_{2} + \dots + β_{n} X_{n}

where the β terms represent the coefficients of the model to be calibrated.

The linear regression model must be ‘specified’ and ‘calibrated’. The specification phase defines the variables to be present in the model, while the calibration phase defines the values assumed by the coefficients.

The specification was carried out using a trial-and-error procedure: (1) a model specification was assumed; (2) the model was calibrated; and (3) statistical tests were used to check whether the calibrated model succeeded in interpreting the phenomenon. Usually, several model specifications are tested to find the best one.

The calibration phase is based on the availability of observed data for the dependent variables, y_i, and the independent variables, x_i. These data can be arranged in a vector, y, and a matrix, x. In our case, the vector y has a number of elements equal to the number of provinces, and the matrix x has a number of rows equal to the number of provinces and a number of columns equal to the number of variables considered in the specification of each model. The values that the independent variables assume for each observation are also called ‘predictors’. The coefficients of the model, β, can be ordered into a vector, β, that has as many elements as there are coefficients, i.e., equal to the number of variables plus one, to account for the term β₀, also known as the intercept of the model. Finally, we introduce the vector of statistical errors, ε, which has the same numerosity as y. We can, therefore, write

y_{i} = β_{0} + \sum_{z} β_{z} x_{i, z} + ε_{i} \forall i

or

y = x β + ε

The calibration of the model searches for the vector of coefficients that minimises statistical errors. The greater the model’s accuracy, the smaller the error in reproducing the observed data.

We can write, for the i-th row of the matrix x:

y_{i} = x^{i} β + ε_{i}

and

ε_{i} = y_{i} - x^{i} β

The calibration of the model can be based on the method of least squares [46]. This method, which has been widely consolidated in the literature, minimises the sum of squares of the statistical errors, formulating the following optimisation model:

β^{o p t} = {A r g}_{β} m i n \sum_{i} (y_{i} {- x^{i} β)}^{2}

The accuracy of the model in reproducing the observed data can be measured through some indicators, including the coefficient of determination, expressed as

R^{2} = 1 - (\sum_{i} {(y_{i} {- x}^{i} β)}^{2} / \sum_{i} {(y_{i} - y^{^})}^{2})

where y^ is the average of the values of y_i. The closer this indicator is to 1, the more accurate the model is. The coefficient of determination increases as the number of independent variables increases, which can lead to an overestimation of accuracy in the case of a large number of independent variables. To overcome this problem, the adjusted coefficient of determination,

R_{a d j}^{2}

, can be used, which penalises the introduction of superfluous variables into the model specification. This indicator is calculated as

R_{a d j}^{2} = 1 - (\frac{n - 1}{n - p - 1}) \cdot (1 - R^{2})

where n represents the number of observations and p is the number of degrees of freedom of the model. As can be seen, this indicator mitigates the effect of increasing the number of independent variables in the model, given by p. For the case under study, having considered 105 provinces as observable data, the value does not differ much from R², but it is still the case that it is evaluated when comparing the different model specifications.

The coefficient of determination measures the model’s ability to reproduce the observed data but cannot assess whether the variables included in it provide a valid contribution to explaining the phenomenon. Indeed, the combined contribution of some variables may increase the coefficient of determination but, at the same time, provide β coefficient values that are not truly representative of the real contribution of each to the explanation of the observed phenomenon. The statistical t-test, on the other hand, can assess whether a variable is significant within the model; the value that the t-test must assume for a variable to be significant depends on the model’s degrees of freedom, which is equal to the number of independent variables. Table 1 shows the minimum values that, in absolute value, must be met for a variable to be considered significant. If this test is not satisfied, the model thus specified cannot explain the phenomenon, and an alternative specification must be proposed. It is the value of the t-student distribution relating to the degrees of freedom of the model with a 95% confidence interval (t₉₅).

Another useful test to check the goodness of the model is the F-test (Fisher’s test); the value taken by this test must be close to 0, or at least below 0.05, for the model to be valid [46].

Finally, checking the signs of the beta coefficients is another operation that must be performed in order to consider the model valid. Indeed, there are certain variables that are known to give a positive or negative contribution to a certain phenomenon (e.g., an increase in the cost of a good always corresponds to a reduction in the sales of that good); for these variables, we can check the sign of the corresponding coefficient to verify whether the specification of the model is valid. In some cases, however, we may not know a priori whether the variable’s contribution is positive or negative, and, for these variables, the sign is not verified.

A procedure based on a trial-and-error methodology was used to specify and calibrate the model. This procedure consists of inserting a variable into the model and checking whether all statistical tests are satisfied. To the model thus formulated, we tried adding another variable until we obtained a model in which the addition of other variables did not comply with the statistical tests and/or the sign of the coefficients. At this point, we could try to replace some of the variables in the obtained model with others not yet included to see if we could obtain a model with better performance (higher coefficient of determination, in addition to compliance with all tests).

To guide this procedure, it is useful to calculate the correlation coefficient that each independent variable has with the dependent variable so that the variables that presumably explain the phenomenon most strongly are included in the model first. Therefore, the correlation coefficient (or Pearson’s coefficient) was calculated for all the variables defined in Section 3. The results, ordered by decreasing coefficient values, are shown in Table 2; this coefficient assumes values between −1 and 1, and the higher the correlation between two variables, the higher the absolute value of the coefficient. The correlation coefficient, ρ_xy, is calculated as

ρ_{x y} = σ_{x y} / σ_{x} σ_{y}

where σ_xy is the covariance of the two variables and σ_x and σ_y are the standard deviations of x and y, respectively.

Variables with a correlation coefficient of less than 0.1 in absolute value were not used in the model specification and calibration phase. It should be noted that there was no sufficient correlation between the presence of a university and the total migration balance, so this variable was not considered in the calibration of the models. This result can be interpreted by hypothesising, first of all, that families do not consider this a fundamental element for migrating, and considering that students who leave home, at least in Italy, seldom transfer their residence to the city where they go to study, and, therefore, are not included in the migration data.

To limit the number of models to be tested, we also calculated the correlation matrix between the independent variables. This matrix makes it possible to identify independent variables that are closely correlated with each other, which, therefore, is not useful to include in the same model. This matrix is shown in Table 3. As can be seen, there was a significant correlation between the variable aipc_i and the occupational variables; therefore, these variables were not considered jointly in the specification of the models. Furthermore, rd_i and rtt_i were closely correlated; thus, these two variables were not jointly included in the tested models.

Using the trial-and-error procedure, 181 multiple linear regression models relating the migration rate to the independent variables were calibrated. The correlation analyses shown in Table 2 and Table 3 were used to guide the procedure, avoiding the testing of models that, with a high probability, could not be valid either due to a too-low correlation between one of the independent variables and the migration rate (see Table 2) or due to a high correlation between two independent variables (see Table 3), which, if included in the model at the same time, would have created problems in the phase of calibration and in the significance of the variables.

Of all the models tested, Table 4, Table 5 and Table 6 summarise the results for 25 of them. In addition to the best models from the point of view of statistical tests, the table also shows those in which accessibility variables had a significant weight, given the purpose of our study, which concerns the impact of accessibility on the migration phenomenon. In Table 4, Table 5 and Table 6, for each model tested, the following information is given: independent variables, coefficients of the model, values of the coefficients of determination (normal and adjusted), value of Fisher’s test, values of the t-tests for each variable, and an indication of the significance or otherwise of the model derived from the analysis of the signs of the coefficients and statistical indicators. The valid values have been underlined in the tables. All calibrations were made using Excel’s multiple linear regression model calibration tool, which also provides the calculation of all indicators.

Of these models, twelve were found to be statistically acceptable, and of these, No. 11 is the best as it had the highest R² and R_adj² values, 0.764 and 0.755, respectively. These values, considering the complexity of the phenomenon studied, can be considered valid for identifying the variables that influence the migration rate and their weight. The main features of this model are summarised in Table 7.

This model has four variables: aipc_i, apv_i, rtt_i, and hsr_i. The first represents the average per capita income; the second is the value of residential property; the third is the overall road accessibility (calculated with travel time); and the last represents the high-speed rail services. If we focus on the last variable, we can observe that the coefficient is negative: the more significant the availability of high-speed rail services, the lower the migration rate to that territory. This result may seem counterintuitive, but it can be interpreted. Indeed, if a city is well served by high-speed rail, I can work there without needing to move my residence because the service allows me to commute between where I live and where I work. For this reason, the better-served territories may have a lower incoming migration rate, all other conditions being equal.

The model is therefore formulated as follows:

{t m b}_{i} = - 71.538 + 0.003767 {\cdot a i p c}_{i} + 33.042 {\cdot r t t}_{i} + 0.007256 {\cdot a p v}_{i} - 0.187 {\cdot h s r}_{i}

Figure 5 shows a scatter diagram in which the x-axis shows the actual values of the migration rate based on ISTAT data, and the y-axis shows the values calculated by applying the calibrated model. This graph confirms the goodness of the model since the points are included in a range not far from the bisector, the place of the points that would represent the perfect reproducibility of the phenomenon.

5. Discussion

The calibrated model allows us to estimate the impact of certain factors on the total migration balance between 2017 and 2022. After testing several models, combining different socio-economic and accessibility variables, we found a dependence of the migration rate on four variables: average income per capita, average property values in the area, overall accessibility by car (measured by travel time), and high-speed rail services.

The dependence on the average income per capita is in line with all the literature on international and national migratory phenomena. This variable was significant in many of the calibrated models, as well as the one with the highest correlation coefficient with the dependent variable, highlighting how the economic conditions of the place of destination are the main factor that induces a person to migrate from one area to another in the country. This factor is linked to the area’s employment conditions, so much so that the employment-related variables do not appear explicitly in the model, being somehow included in the average per capita income variable.

Property values also had a non-negligible influence on the phenomenon. Although the opposite effect could also be expected (higher property values and lower propensity to move due to the higher economic burden of renting or buying a house), areas with higher average real estate values are more attractive. This factor is also related to the general economic conditions in the area, but according to the results of significance tests, it also gives an independent contribution to the choice to migrate. It can, therefore, be assumed that the quality of the real estate fabric influences the studied phenomenon. This aspect of the problem would deserve a more in-depth study, in line with some research on the subject [49,50], but we will have to consider it in future research.

The variable of overall road accessibility, based on travel time, was another particularly relevant factor for interpreting the migration phenomenon. The possibility and ease of reaching where one decides to live and travelling from this to other places assumes significant relevance. After all, it is well known that the depopulation phenomena of certain areas are also significant due to their poor accessibility.

The fourth variable found to be significant in the model concerns high-speed rail services. In this case, as mentioned earlier, the negative coefficient of the variable seems to produce a counterintuitive result. An analysis of the findings in Table 4 shows that this variable was significant in some calibrated models (# 8, 9, 10, 11), always assuming a negative sign, which are also those with the highest values of the coefficient of determination. The interpretation is that a city better served by high-speed rail is also a city that can be reached more easily and more quickly and, therefore, allows those who have to work there to choose another place to establish their residence and be a commuter.

The results show how the accessibility of territories and the availability of transport services can influence internal migratory phenomena. Two of the four variables that emerged as significant refer to aspects linked to transport and accessibility. Most of the literature in this field, as reported in Section 2, studies migratory phenomena almost exclusively with regard to economic conditions and the labour market; this approach is largely justified for international migratory phenomena, in which the accessibility of a part of the territory, such as a province, is negligible compared to the other variables involved. When we move on to internal migration, economic and labour market factors certainly play the most important role, an aspect also confirmed by the calibrated model, but accessibility variables begin to assume greater importance. From the calibrated model, two contrasting trends were found. Road accessibility, which is a more general measure, had an attractive effect on internal migration: all other factors being equal, the most accessible areas were preferred to the less accessible ones. On the other hand, high-speed rail services had the opposite effect because they make it possible to live in a different area from the one where people work, as the fast connection makes it possible to organise a daily commute.

This result is in line with what was found in [41], where it was shown that cities well connected by high-speed railways attract skilled workers who prefer to travel rather than change their residence.

In Section 4, 25 models were summarised; of these, 12 were valid, that is, they satisfied the statistical tests and the tests on the sign of the coefficient. The model chosen, as mentioned previously, is the one that, among the valid models, had the highest R² value. This model relates the total migratory balance to four variables. Analysing all the calibrated models that are valid allowed us to identify which other variables were significant for the phenomenon, even if they could not be included in the best model because the statistical tests would not be respected. There were only two of these variables, which led to the calibration of a valid model but with a lower R² and, therefore, were less able to explain the phenomenon: rd_i and or_i. The first is overall accessibility based on distance within the road network; clearly, this could not be included in the calibrated model because it is closely correlated with rtt_i, which is the same measurement based on time but gave better results in the calibration of the model. The second variable is the employment rate; this is certainly important but closely correlated with the average per capita income, which performed better in the construction of the model. The other variables, on the other hand, were never significant in the models in which they were considered. The other socio-economic variables, probably because they are too specific, and the accessibility variables, probably because the variable considered, rtt_i, which represents overall accessibility, also partly includes accessibility to airports and accessibility weighted on the population.

The proposed study concerns Italy, and the calibrated model is valid only for this case study, which is based on specific input data. This study can contribute to the development of similar models for other European countries that present similar internal migration phenomena, which are present in many contexts where the socio-economic conditions are substantially different. The methodology proposed in this paper can be replicated in other contexts, as the data used are usually available and easily accessible in all developed countries. The construction of the road network graph, which is necessary to obtain most of the accessibility variables, must be implemented for specific case studies, and the procedure described can help in its construction. Another contribution that may help others replicate this study in other countries regards the variables that were tested in the model and some of the results obtained, such as, for example, the usefulness of considering the average value of the properties or the impact that the presence of a high-speed service has on the migration rate.

6. Conclusions

Like other European countries, Italy has a substantial disparity in socio-economic conditions between the different parts of its territory. In particular, a rich and industrialised north is contrasted by a poorer and less industrialised south. These strong differences, which have never been compensated for despite being known and indisputable since the post-war period, have generated internal migratory flows from the southern to the northern regions, as is clearly visible in the statistical data (see, for example, Figure 1 and Figure 3). This migration phenomenon has intensified in recent years; in particular, young people tend to emigrate for work reasons, as they do not find opportunities commensurate with their skills and education in their place of origin.

In this work, an attempt was made to model the phenomenon using multiple linear regression models, considering the influence that the accessibility of territories can have on their attractiveness. In addition to various accessibility variables, other socio-economic factors were considered, such as average income per capita, number of employees, level of employment, and so on. The specification and calibration of numerous models, which combined the different variables considered in various ways, led to the identification of a multiple linear regression model capable of representing the phenomenon with a good level of accuracy. The dependent variable was the total migration balance, while there were four independent variables, two socio-economic ones and two related to accessibility.

Road network accessibility, which can be calculated with a model of the Italian primary road network, was shown to have a positive effect on the migration phenomenon: the greater the accessibility of an area, the greater its attractiveness. The other accessibility variable, related to the availability of high-speed rail services, instead presented a negative effect; this can be interpreted as the variable being linked to ease of commuting and, therefore, avoiding moving to a city well served by fast rail transport.

The main contribution of this work to the literature is the study of how some accessibility parameters influence internal migration phenomena. The literature has almost always studied migration phenomena only in relation to socio-economic and labour market variables, which are certainly the most important, both for internal migration and, even more so, for international migration. The accessibility variables between small- and medium-sized territories, such as provinces, have not been considered previously, to the best of our knowledge.

Based on the results obtained, it is possible to make some recommendations for policymakers and transport planners. Firstly, overall accessibility is an essential factor in the development of a territory and also impacts migration. Making some areas of the country more accessible could mitigate some migratory trends, even if socio-economic aspects, which were also included in our model, are prevalent. Another important indication is that, to avoid a high concentration of population in some cities and the depopulation of other areas of the country, it is necessary to improve high-speed rail services, which allow commuting for work or study, limiting the depopulation of more economically and socially depressed areas.

Future research can be directed in different directions. Firstly, non-linear models or experiments on other European countries with similar migratory phenomena could be proposed; secondly, similar models could be proposed to verify how accessibility can influence international migration; and, finally, more complex models could be studied and applied, such as spatial econometric models, temporal dynamic models, a longitudinal panel data approach, or machine learning techniques, which can interpret even more complex variables that could influence migratory phenomena.

Author Contributions

Conceptualisation, A.B. and M.G.; methodology. A.B. and M.G.; model, A.B. and M.G.; numerical results, A.B. and M.G.; resources, M.G.; data curation, A.B.; writing—original draft preparation, A.B. and M.G.; writing—review and editing, A.B. and M.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by the University of Sannio (FRA 2024).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The main data are summarised in Appendix A.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

This appendix summarises the data used for calibrating the models.

Table A1. Data—Part 1.

Province i	tmb_i (‰)	or_i	ur_i	aipc_i	jh_i	mjh_i
Agrigento	−30.4	30.633	26.482	8105.726	0.148	0.013
Alessandria	12.5	46.504	9.695	15,217.504	0.340	0.082
Ancona	10.8	45.468	9.921	15,117.216	0.400	0.102
Barletta–Andria–Trani	−19.2	33.879	20.703	8547.393	0.222	0.043
Valle d’Aosta/Vallée d’Aoste	4.0	49.584	7.609	16,698.839	0.380	0.036
Arezzo	9.0	49.473	8.521	14,708.922	0.377	0.108
Ascoli Piceno	−2.5	42.159	13.802	12,895.026	0.318	0.062
Asti	12.9	46.608	8.731	14,601.626	0.305	0.071
Avellino	−24.4	38.319	16.756	9822.479	0.225	0.045
Bari	−0.6	35.490	18.516	11,129.248	0.278	0.040
Belluno	13.2	50.944	5.768	16,500.126	0.417	0.123
Benevento	−25.7	37.865	16.439	9647.707	0.206	0.033
Bergamo	16.4	51.184	5.520	15,811.489	0.433	0.122
Biella	6.0	47.238	8.689	16,088.507	0.364	0.097
Bologna	32.3	52.486	5.997	19,013.735	0.461	0.099
Bolzano/Bozen	14.2	55.053	4.371	18,710.271	0.454	0.065
Brescia	19.4	49.499	6.785	15,128.927	0.424	0.118
Brindisi	−11.0	32.470	22.315	10,142.003	0.197	0.028
Cagliari	11.5	39.628	16.954	16,143.789	0.330	0.023
Caltanissetta	−41.5	34.276	20.020	8444.495	0.175	0.021
Campobasso	−17.2	36.144	16.018	10,690.066	0.231	0.036
Caserta	−7.5	34.383	15.964	8623.568	0.188	0.027
Catania	−2.9	32.150	19.911	9172.715	0.204	0.021
Catanzaro	−23.1	32.538	21.157	9528.218	0.199	0.017
Forlì–Cesena	24.1	52.871	5.466	15,661.406	0.418	0.094
Chieti	0.9	40.520	13.204	11,816.753	0.333	0.085
Como	20.6	51.502	6.332	15,755.662	0.355	0.092
Cosenza	−21.1	32.936	22.491	8784.594	0.174	0.015
Cremona	18.4	50.555	6.485	16,112.273	0.336	0.092
Crotone	−51.0	26.960	25.911	7488.733	0.167	0.017
Cuneo	16.5	51.706	6.397	15,272.695	0.396	0.103
Enna	−36.5	34.092	18.811	8556.434	0.152	0.016
Fermo	1.0	44.278	10.880	12,784.453	0.380	0.139
Ferrara	27.5	49.467	7.947	15,628.447	0.301	0.062
Firenze	11.3	48.532	8.034	17,025.033	0.421	0.089
Foggia	−22.6	32.644	18.112	9135.671	0.184	0.020
Frosinone	−9.1	43.268	12.070	10,936.108	0.255	0.048
Genova	21.9	44.524	9.508	18,007.969	0.356	0.054
Gorizia	37.0	46.294	7.042	16,130.133	0.319	0.077
Grosseto	17.9	49.507	7.792	13,881.890	0.262	0.026
Imperia	33.0	44.365	10.256	13,307.659	0.235	0.018
Isernia	−24.1	36.275	14.642	10,829.293	0.235	0.037
La Spezia	26.9	44.330	10.329	15,629.200	0.285	0.044
L’Aquila	−10.4	42.856	10.219	12,607.340	0.252	0.033
Latina	18.1	45.155	12.333	11,370.519	0.250	0.042
Lecce	−4.6	34.177	20.258	9828.447	0.223	0.031
Lecco	14.4	51.724	5.730	17,853.426	0.393	0.129
Livorno	12.4	46.974	8.836	15,099.017	0.300	0.037
Lodi	25.0	52.416	6.085	16,171.173	0.285	0.061
Lucca	26.4	46.181	9.901	14,815.287	0.363	0.076
Macerata	−10.0	45.517	10.199	13,441.405	0.374	0.105
Mantova	22.3	51.756	5.908	15,277.793	0.386	0.119
Massa–Carrara	10.6	45.062	12.905	13,909.359	0.289	0.044
Matera	−10.9	38.086	12.167	10,390.104	0.215	0.032
Messina	−17.7	33.327	23.777	10,365.033	0.198	0.020
Milano	19.6	51.390	6.602	20,263.901	0.571	0.079
Modena	24.4	52.621	6.066	17,217.241	0.470	0.140
Napoli	−25.3	31.739	24.023	9127.180	0.217	0.028
Novara	16.2	48.782	10.080	16,393.869	0.356	0.097
Nuoro	−21.9	40.618	13.628	10,113.251	0.189	0.022
Oristano	−8.8	40.022	15.571	10,218.299	0.190	0.019
Padova	16.9	50.319	6.840	16,080.757	0.437	0.109
Palermo	−21.6	30.949	23.945	9533.516	0.191	0.013
Parma	32.6	52.015	5.920	17,888.034	0.450	0.115
Pavia	29.3	51.734	5.889	16,133.822	0.287	0.056
Perugia	8.4	48.003	10.457	13,824.157	0.346	0.074
Pesaro e Urbino	10.7	45.080	9.934	13,996.821	0.399	0.111
Pescara	4.5	41.752	11.778	12,293.565	0.287	0.040
Piacenza	30.7	49.983	6.264	16,901.887	0.383	0.084
Pisa	20.9	48.340	7.898	15,274.636	0.353	0.077
Pistoia	29.5	47.964	8.749	13,417.413	0.313	0.068
Pordenone	23.6	50.154	6.299	15,992.809	0.406	0.133
Potenza	−25.5	38.409	13.526	10,633.470	0.248	0.047
Prato	37.2	50.341	8.398	15,169.672	0.497	0.161
Ragusa	10.7	36.255	17.758	8947.946	0.210	0.025
Ravenna	26.9	50.283	8.146	16,332.475	0.365	0.072
Reggio di Calabria	−30.4	30.974	21.444	9205.039	0.153	0.013
Reggio nell’Emilia	12.8	53.031	5.243	16,624.772	0.465	0.136
Rieti	7.5	45.896	10.195	12,267.318	0.196	0.024
Rimini	30.5	47.509	8.144	14,324.882	0.394	0.059
Roma	11.3	48.814	9.898	16,363.368	0.530	0.020
Rovigo	9.2	50.549	6.169	14,248.537	0.335	0.078
Salerno	−14.8	38.038	19.078	9559.298	0.227	0.032
Sassari	−0.6	39.197	20.401	11,357.691	0.201	0.043
Savona	20.0	43.604	8.734	15,662.312	0.287	0.039
Siena	11.9	49.650	7.330	15,781.504	0.399	0.065
Siracusa	−11.5	33.071	23.699	9496.523	0.186	0.025
Sondrio	14.1	50.016	6.389	14,517.668	0.360	0.063
Taranto	−10.1	33.356	16.958	10,546.641	0.203	0.035
Teramo	−1.0	43.677	9.414	11,462.891	0.330	0.085
Terni	0.4	42.180	11.160	13,599.182	0.297	0.052
Torino	7.2	46.104	10.269	16,564.342	0.386	0.088
Trapani	−10.8	32.416	21.049	9072.155	0.180	0.021
Trento	16.6	52.676	4.736	16,233.803	0.396	0.060
Treviso	9.8	50.553	5.894	15,630.068	0.436	0.132
Trieste	36.0	45.400	6.584	18,012.088	0.360	0.056
Udine	10.1	46.938	6.527	16,019.002	0.372	0.084
Varese	17.8	51.581	5.902	16,121.014	0.371	0.108
Venezia	10.7	47.364	6.832	15,816.744	0.369	0.059
Verbano–Cusio–Ossola	14.9	46.820	7.239	14,075.422	0.278	0.052
Vercelli	10.7	47.093	9.881	15,413.035	0.328	0.084
Verona	23.3	51.054	5.879	15,859.832	0.424	0.090
Vibo Valentia	−32.4	31.204	21.902	8744.138	0.162	0.019
Vicenza	7.2	51.677	5.860	15,768.392	0.475	0.163
Viterbo	18.5	45.032	12.662	12,283.924	0.216	0.029

Table A2. Data—Part 2.

Province i	cjh_i	ejh_i	ajh_i	ujh_i	uni_i	apv_i
Agrigento	0.040	0.002	0.093	0.000	0.000	684.655
Alessandria	0.056	0.001	0.201	0.000	0.000	851.430
Ancona	0.058	0.003	0.236	0.002	1.000	1119.907
Barletta–Andria–Trani	0.055	0.001	0.122	0.000	0.000	1259.949
Valle d’Aosta/Vallée d’Aoste	0.051	0.009	0.284	0.000	1.000	1704.875
Arezzo	0.058	0.003	0.208	0.000	0.000	1273.339
Ascoli Piceno	0.063	0.002	0.191	0.000	0.000	1089.177
Asti	0.054	0.001	0.179	0.000	0.000	664.207
Avellino	0.041	0.001	0.138	0.000	0.000	701.045
Bari	0.056	0.003	0.178	0.002	1.000	1259.949
Belluno	0.056	0.005	0.233	0.000	0.000	1139.577
Benevento	0.042	0.003	0.128	0.001	1.000	1119.462
Bergamo	0.060	0.004	0.247	0.000	1.000	1151.500
Biella	0.055	0.001	0.211	0.000	0.000	519.749
Bologna	0.071	0.008	0.283	0.004	1.000	1648.643
Bolzano/Bozen	0.079	0.005	0.305	0.001	1.000	2453.173
Brescia	0.058	0.005	0.243	0.001	1.000	1442.099
Brindisi	0.045	0.002	0.123	0.000	0.000	969.160
Cagliari	0.064	0.007	0.236	0.003	1.000	1108.149
Caltanissetta	0.039	0.002	0.114	0.000	0.000	573.756
Campobasso	0.041	0.003	0.151	0.001	1.000	613.851
Caserta	0.044	0.005	0.113	0.000	0.000	763.588
Catania	0.051	0.004	0.128	0.001	1.000	862.875
Catanzaro	0.047	0.001	0.133	0.001	1.000	699.635
Forlì–Cesena	0.076	0.003	0.244	0.000	0.000	1467.718
Chieti	0.050	0.003	0.195	0.001	1.000	793.109
Como	0.053	0.003	0.207	0.000	1.000	1438.463
Cosenza	0.044	0.003	0.113	0.001	1.000	615.615
Cremona	0.047	0.003	0.194	0.000	0.000	754.007
Crotone	0.038	0.001	0.111	0.000	0.000	711.274
Cuneo	0.066	0.002	0.226	0.000	1.000	1016.056
Enna	0.038	0.001	0.097	0.001	1.000	761.765
Fermo	0.056	0.001	0.184	0.000	0.000	1089.177
Ferrara	0.051	0.004	0.184	0.002	1.000	985.875
Firenze	0.070	0.006	0.256	0.002	1.000	2171.626
Foggia	0.043	0.002	0.118	0.001	1.000	949.381
Frosinone	0.046	0.002	0.159	0.001	1.000	970.909
Genova	0.058	0.005	0.239	0.002	1.000	2121.818
Gorizia	0.042	0.002	0.198	0.000	0.000	1013.240
Grosseto	0.056	0.001	0.179	0.000	0.000	2390.868
Imperia	0.060	0.002	0.155	0.000	0.000	1994.141
Isernia	0.042	0.001	0.155	0.000	0.000	605.518
La Spezia	0.051	0.003	0.188	0.000	0.000	1918.612
L’Aquila	0.043	0.004	0.172	0.002	1.000	759.928
Latina	0.054	0.002	0.152	0.000	0.000	1518.241
Lecce	0.050	0.002	0.139	0.001	1.000	711.133
Lecco	0.051	0.004	0.209	0.000	0.000	1240.890
Livorno	0.063	0.002	0.198	0.000	0.000	2251.028
Lodi	0.043	0.003	0.178	0.000	0.000	1093.036
Lucca	0.063	0.002	0.222	0.000	1.000	1782.557
Macerata	0.061	0.004	0.203	0.002	1.000	1022.751
Mantova	0.053	0.002	0.212	0.000	0.000	797.018
Massa–Carrara	0.058	0.002	0.186	0.000	0.000	1489.734
Matera	0.045	0.001	0.138	0.000	0.000	682.647
Messina	0.045	0.005	0.128	0.002	1.000	1001.770
Milano	0.103	0.007	0.382	0.003	1.000	1815.224
Modena	0.065	0.004	0.261	0.001	1.000	1179.969
Napoli	0.048	0.004	0.137	0.001	1.000	1685.833
Novara	0.057	0.001	0.201	0.000	0.000	968.743
Nuoro	0.042	0.001	0.123	0.000	0.000	1036.360
Oristano	0.047	0.002	0.123	0.000	0.000	572.071
Padova	0.077	0.006	0.245	0.003	1.000	1212.944
Palermo	0.041	0.005	0.132	0.001	1.000	745.984
Parma	0.058	0.005	0.272	0.002	1.000	1091.804
Pavia	0.049	0.004	0.178	0.002	1.000	1010.330
Perugia	0.062	0.005	0.206	0.002	1.000	976.512
Pesaro e Urbino	0.057	0.005	0.226	0.000	0.000	1305.428
Pescara	0.056	0.002	0.189	0.001	1.000	901.984
Piacenza	0.061	0.003	0.236	0.000	0.000	1061.140
Pisa	0.055	0.007	0.214	0.005	1.000	1503.038
Pistoia	0.059	0.002	0.184	0.000	0.000	1565.556
Pordenone	0.052	0.003	0.217	0.000	0.000	800.112
Potenza	0.041	0.003	0.158	0.001	1.000	632.682
Prato	0.075	0.003	0.259	0.000	0.000	1965.972
Ragusa	0.055	0.002	0.128	0.000	0.000	780.630
Ravenna	0.060	0.002	0.231	0.000	0.000	1397.542
Reggio di Calabria	0.044	0.002	0.093	0.001	1.000	613.429
Reggio nell’Emilia	0.062	0.004	0.264	0.001	1.000	948.859
Rieti	0.034	0.001	0.136	0.000	0.000	1117.607
Rimini	0.072	0.003	0.259	0.000	0.000	2010.219
Roma	0.052	0.006	0.452	0.002	1.000	2147.743
Rovigo	0.056	0.002	0.199	0.000	0.000	1013.272
Salerno	0.049	0.003	0.143	0.001	1.000	1038.758
Sassari	0.035	0.002	0.122	0.001	1.000	1528.476
Savona	0.056	0.003	0.188	0.000	0.000	2472.678
Siena	0.055	0.007	0.272	0.003	1.000	1505.065
Siracusa	0.038	0.002	0.121	0.000	0.000	735.518
Sondrio	0.061	0.004	0.231	0.000	0.000	1194.566
Taranto	0.042	0.001	0.125	0.000	0.000	845.842
Teramo	0.050	0.002	0.193	0.001	1.000	852.634
Terni	0.054	0.002	0.188	0.000	0.000	1076.746
Torino	0.053	0.003	0.241	0.002	1.000	1172.762
Trapani	0.044	0.002	0.113	0.000	0.000	987.059
Trento	0.058	0.010	0.269	0.002	1.000	1891.078
Treviso	0.060	0.003	0.240	0.000	0.000	1166.532
Trieste	0.044	0.008	0.252	0.004	1.000	1905.556
Udine	0.055	0.004	0.229	0.001	1.000	818.160
Varese	0.053	0.004	0.206	0.001	1.000	1160.785
Venezia	0.068	0.004	0.238	0.001	1.000	1686.245
Verbano–Cusio–Ossola	0.047	0.001	0.177	0.000	0.000	1144.223
Vercelli	0.047	0.003	0.194	0.003	1.000	980.565
Verona	0.078	0.005	0.251	0.001	1.000	1263.086
Vibo Valentia	0.038	0.002	0.103	0.000	0.000	598.190
Vicenza	0.063	0.003	0.246	0.000	0.000	1341.906
Viterbo	0.050	0.002	0.135	0.001	1.000	1173.961

Table A3. Data—Part 3.

Province i	hsr_i	dha_i	dia_i	ra_i	rd_i	rtt_i
Agrigento	0	0.143	0.647	6.708	0.031	0.024
Alessandria	1	0.189	1.427	18.272	0.071	0.066
Ancona	14	0.358	5.358	14.340	0.073	0.066
Barletta–Andria–Trani	0	0.249	1.723	11.692	0.047	0.045
Valle d’Aosta/Vallée d’Aoste	0	0.143	0.883	11.150	0.055	0.052
Arezzo	3	0.453	1.280	16.152	0.077	0.072
Ascoli Piceno	3	0.486	1.202	13.213	0.066	0.061
Asti	1	0.176	1.593	16.798	0.067	0.062
Avellino	0	0.370	1.801	14.703	0.051	0.049
Bari	15	0.223	20.000	13.844	0.045	0.044
Belluno	0	0.171	0.956	13.211	0.067	0.060
Benevento	5	0.399	1.469	13.550	0.053	0.050
Bergamo	2	0.172	1.173	22.416	0.074	0.068
Biella	0	0.158	1.403	14.612	0.063	0.058
Bologna	85	0.267	20.000	22.396	0.086	0.079
Bolzano/Bozen	5	0.157	20.000	13.157	0.065	0.060
Brescia	27	0.188	0.776	20.299	0.079	0.072
Brindisi	9	0.178	0.834	10.062	0.039	0.038
Cagliari	0	0.192	13.451	9.764	0.036	0.039
Caltanissetta	0	0.142	0.926	7.349	0.031	0.025
Campobasso	10	0.400	0.826	11.816	0.054	0.051
Caserta	5	0.452	3.081	16.666	0.053	0.052
Catania	0	0.128	20.000	10.496	0.030	0.027
Catanzaro	4	0.166	2.787	8.220	0.035	0.034
Forlì–Cesena	0	0.305	1.140	16.872	0.082	0.073
Chieti	0	0.444	7.642	14.318	0.063	0.059
Como	0	0.163	2.354	20.684	0.069	0.064
Cosenza	5	0.185	1.451	8.913	0.038	0.037
Cremona	0	0.198	0.786	19.034	0.080	0.073
Crotone	0	0.164	0.971	7.685	0.035	0.033
Cuneo	0	0.168	1.030	12.930	0.060	0.056
Enna	0	0.140	1.170	7.175	0.031	0.025
Fermo	0	0.385	1.308	13.518	0.069	0.063
Ferrara	2	0.236	1.984	18.313	0.083	0.074
Firenze	58	0.356	1.186	21.471	0.082	0.076
Foggia	14	0.294	0.788	12.493	0.052	0.049
Frosinone	0	0.919	0.919	16.021	0.060	0.058
Genova	8	0.216	20.000	18.874	0.071	0.066
Gorizia	0	0.162	5.181	11.902	0.060	0.055
Grosseto	6	0.642	0.650	13.271	0.072	0.065
Imperia	0	0.172	0.850	11.239	0.058	0.054
Isernia	0	0.493	0.972	12.639	0.056	0.052
La Spezia	7	0.266	1.257	15.732	0.076	0.069
L’Aquila	0	0.650	1.047	14.021	0.062	0.059
Latina	0	1.223	1.223	15.507	0.059	0.055
Lecce	9	0.168	0.644	9.263	0.037	0.036
Lecco	0	0.163	1.416	19.155	0.070	0.064
Livorno	6	0.351	3.998	15.790	0.076	0.067
Lodi	0	0.184	1.290	21.598	0.077	0.071
Lucca	0	0.309	3.296	16.914	0.078	0.070
Macerata	1	0.415	1.851	13.550	0.071	0.063
Mantova	1	0.214	0.979	19.896	0.083	0.076
Massa–Carrara	6	0.286	1.836	15.874	0.077	0.069
Matera	1	0.225	1.653	11.165	0.044	0.042
Messina	0	0.144	0.962	9.051	0.032	0.030
Milano	94	0.174	2.301	30.949	0.074	0.069
Modena	11	0.248	2.573	21.135	0.085	0.078
Napoli	48	0.421	20.000	21.502	0.052	0.051
Novara	0	0.169	3.475	19.735	0.069	0.064
Nuoro	0	0.267	0.608	6.287	0.040	0.031
Oristano	0	0.228	1.113	7.106	0.038	0.034
Padova	45	0.204	2.421	20.855	0.079	0.072
Palermo	0	0.174	0.480	10.517	0.035	0.025
Parma	10	0.220	1.093	19.569	0.083	0.075
Pavia	2	0.178	1.471	21.159	0.074	0.069
Perugia	1	0.534	20.000	14.443	0.074	0.065
Pesaro e Urbino	16	0.339	1.746	15.317	0.078	0.070
Pescara	12	0.423	20.000	14.412	0.063	0.059
Piacenza	10	0.197	0.891	20.347	0.079	0.074
Pisa	7	0.327	20.000	16.567	0.077	0.069
Pistoia	0	0.314	1.367	18.365	0.080	0.074
Pordenone	3	0.178	1.360	14.892	0.068	0.061
Potenza	1	0.260	0.810	11.389	0.047	0.045
Prato	0	0.339	1.237	20.876	0.081	0.076
Ragusa	0	0.124	1.105	7.069	0.028	0.025
Ravenna	3	0.280	1.289	17.386	0.082	0.073
Reggio di Calabria	4	0.143	0.756	8.144	0.032	0.031
Reggio nell’Emilia	30	0.234	1.566	20.227	0.084	0.077
Rieti	0	0.946	0.946	14.772	0.067	0.061
Rimini	17	0.311	1.091	16.367	0.080	0.072
Roma	103	3.731	20.000	30.293	0.064	0.060
Rovigo	2	0.220	1.257	18.089	0.082	0.073
Salerno	15	0.345	1.593	14.849	0.049	0.048
Sassari	0	0.250	0.461	6.511	0.039	0.030
Savona	0	0.195	1.948	14.117	0.065	0.060
Siena	0	0.464	0.936	15.673	0.078	0.071
Siracusa	0	0.120	1.783	7.822	0.028	0.026
Sondrio	0	0.148	0.691	11.671	0.063	0.056
Taranto	2	0.194	1.078	11.470	0.041	0.040
Teramo	1	0.492	1.651	13.636	0.064	0.060
Terni	3	0.885	1.084	15.232	0.069	0.063
Torino	31	0.161	6.587	21.782	0.062	0.058
Trapani	0	0.247	2.140	12.459	0.047	0.045
Trento	5	0.172	1.766	15.373	0.071	0.066
Treviso	3	0.194	2.996	18.666	0.075	0.067
Trieste	4	0.156	3.025	11.761	0.057	0.053
Udine	3	0.165	2.691	12.953	0.062	0.056
Varese	0	0.160	3.488	18.817	0.068	0.063
Venezia	46	0.198	20.000	18.758	0.075	0.068
Verbano–Cusio–Ossola	0	0.152	1.650	19.053	0.062	0.058
Vercelli	0	0.170	1.923	17.808	0.068	0.063
Verona	32	0.202	0.851	20.637	0.081	0.074
Vibo Valentia	2	0.166	2.493	8.097	0.035	0.034
Vicenza	24	0.197	1.417	19.590	0.079	0.071
Viterbo	0	1.231	1.231	14.566	0.069	0.063

Table A4. International airports.

Region	Airport	Municipality
Piemonte	Turin–Caselle	Caselle Torinese
Aosta Valley	N/A	N/A
Lombardy	Milan–Malpensa	Ferno
Trentino–Alto Adige	Bolzano	Bolzano
Veneto	Venice–Tessera	Venice
Friuli–Venezia Giulia	Trieste–Ronchi dei Legionari	Ronchi dei Legionari
Liguria	Genoa–Sestri	Genoa
Emilia-Romagna	Bologna–Borgo Panigale	Bologna
Tuscany	Pisa–San Giusto	Pisa
Umbria	Perugia	Perugia
Marche	Ancona–Falconara	Falconara Marittima
Lazio	Rome–Fiumicino	Fiumicino
Abruzzo	Pescara	Pescara
Molise	N/A	N/A
Campania	Naples–Capodichino	Naples
Apulia	Bari–Palese	Bari
Basilicata	N/A	N/A
Calabria	Lamezia Terme	Lamezia Terme
Sicily	Catania–Fontanarossa	Catania
Piemonte	Turin–Caselle	Caselle Torinese
Sardinia	Cagliari–Elmas	Elmas

References

European Parliament. REPORT on Reversing Demographic Trends in EU Regions Using Cohesion Policy Instruments. 2021. Available online: https://www.europarl.europa.eu/doceo/document/A-9-2021-0061_EN.html (accessed on 3 April 2025).
European Parliament. How to Tackle Population Decline in Europe’s Regions? 2021. Available online: https://www.europarl.europa.eu/topics/en/article/20210414STO02006/what-solutions-to-population-decline-in-europe-s-regions (accessed on 3 April 2025).
ISTAT. REPORT: Migrazioni Interne e Internazionali Della Popolazione Residente—Anno 2021; ISTAT: Rome, Italy, 2023; Available online: https://www.istat.it/it/files/2023/02/REPORT_MIGRAZIONI_2021.pdf (accessed on 3 April 2025).
ISTAT. Conti Economici Territoriali—Anni 2021–2023; Ufficio Stampa ISTAT: Rome, Italy, 2025; Available online: https://www.istat.it/wp-content/uploads/2025/01/REPORT-CONTI-TERRITORIALI_Anni-2021-2023.pdf (accessed on 3 April 2025).
ISTAT. Indicatori Demografici—Anno 2023; ISTAT: Rome, Italy, 2024; Available online: https://www.istat.it/wp-content/uploads/2024/03/Indicatori_demografici.pdf (accessed on 3 April 2025).
Massey, D.S.; Arango, J.; Hugo, G.; Kouaouci, A.; Pellegrino, A.; Taylor, J.E. Theories of international migration: A review and appraisal. Popul. Dev. Rev. 1993, 19, 431–466. [Google Scholar] [CrossRef]
Faist, T. The Volumes and Dynamics of International Migration and Transnational Social Spaces; Oxford University Press: Oxford, UK, 2000. [Google Scholar]
Arango, J. Explaining migration: A critical view. Int. Soc. Sci. J. 2000, 52, 283–296. [Google Scholar] [CrossRef]
Hicks, J.R. The Theory of Wages; Macmillan: London, UK, 1932. [Google Scholar]
Todaro, M.P. A model of labor migration and urban unemployment in less developed countries. Am. Econ. Rev. 1969, 59, 138–148. [Google Scholar]
Stark, O. The Migration of Labor; Basil Blackwell: Cambridge, UK, 1991. [Google Scholar]
Wallerstein, E. The Modern World System: Capitalist Agriculture and the Origins of the European World Economy in the 16-th Century; Academic Press: New York, NY, USA, 1974. [Google Scholar]
Piore, M.J. Birds of Passage: Migrant Labor and Industrial Societies; Cambridge University Press: Cambridge, UK, 1979. [Google Scholar]
Lucas, R.E.B. Chapter 13 Internal migration in developing countries. In Handbook of Population and Family Economics; Elsevier: Amsterdam, The Netherlands, 1997; Volume 1, pp. 721–798. [Google Scholar]
Greenwood, M.J. Chapter 12 Internal migration in developed countries. In Handbook of Population and Family Economics; Elsevier: Amsterdam, The Netherlands, 1997; Volume 1, pp. 647–720. [Google Scholar]
Piras, R. A long-run analysis of push and pull factors of internal migration in Italy. Estimation of a gravity model with human capital using homogeneous and heterogeneous approaches. Pap. Reg. Sci. 2017, 96, 571–602. [Google Scholar]
Batista, C.; McKenzie, D. Testing Classic Theories of Migration in the Lab. J. Int. Econ. 2023, 145, 103826. [Google Scholar]
Rosik, P.; Wójcik, J. Transport Infrastructure and Regional Development: A Survey of Literature on Wider Economic and Spatial Impacts. Sustainability 2023, 15, 548. [Google Scholar]
Vickerman, R.; Spiekermann, K.; Wegener, M. Accessibility and Economic Development in Europe. Reg. Stud. 1999, 33, 1–15. [Google Scholar]
Yan, L.; Tu, M.; Chagas, A.L.; Tai, L. The Impact of High-Speed Railway on Labor Spatial Misallocation—Based on Spatial Difference-in-Differences Analysis. Transp. Res. Part A-Policy Pract. 2022, 164, 82–97. [Google Scholar] [CrossRef]
Kong, Q.; Li, R.; Sun, P.; Peng, D. Has Transportation Infrastructure Development Improved the Quality of Economic Growth in China’s Cities? A Quasi-Natural Experiment Based on the Introduction of High-Speed Rail. Res. Int. Bus. Financ. 2022, 62, 101726. [Google Scholar] [CrossRef]
Lee, Y.; Chen, Z. Does Transportation Infrastructure Accelerate Factor Outflow from Shrinking Cities? An Evidence from China. Transp. Policy 2023, 134, 180–190. [Google Scholar]
Li, Z.; Wang, Q.; Cai, M.; Wong, W.-K. Impacts of high-speed rail on the industrial developments of non-central cities in China. Transp. Policy 2023, 134, 203–216. [Google Scholar] [CrossRef]
Kim, K.S. High-speed rail developments and spatial restructuring: A case study of the Capital region in South Korea. Cities 2000, 17, 251–262. [Google Scholar]
Guirao, B.; Lara-Galera, A.; Campa, J.L. High Speed Rail Commuting Impacts on Labour Migration: The Case of the Concentration of Metropolis in the Madrid Functional Area. Land Use Policy 2017, 66, 131–140. [Google Scholar] [CrossRef]
Zou, M.; Li, C.; Xiong, Y. Analysis of Coupling Coordination Relationship between the Accessibility and Economic Linkage of a High-Speed Railway Network Case Study in Hunan, China. Sustainability 2022, 14, 7550. [Google Scholar] [CrossRef]
Sugimori, S.; Hayashi, Y.; Takeshita, H.; Isobe, T. Evaluating the Regional Economic Impacts of High-Speed Rail and Interregional Disparity: A Combined Model of I/O and Spatial Interaction. Sustainability 2022, 14, 11545. [Google Scholar] [CrossRef]
Lu, H.; Zhao, P.; Hu, H.; Yan, J.; Chen, X. Exploring the heterogeneous impact of road infrastructure on rural residents’ income: Evidence from nationwide panel data in China. Transp. Policy 2023, 134, 155–166. [Google Scholar]
Hsieh, C.-T.; Moretti, E. Housing Constraints and Spatial Misallocation. Am. Econ. J. Macroecon. 2019, 11, 1–39. [Google Scholar]
Mkrtchyan, N.V.; Gilmanov, R.I. Large Russian Cities and Their Suburbs as Centers of Attraction for Internal Migrants. Reg. Res. Russ. 2024, 14, 14–24. [Google Scholar] [CrossRef]
Latif, E. The relationship between immigration and unemployment: Panel data evidence from Canada. Econ. Model. 2015, 50, 162–167. [Google Scholar]
Langella, M.; Manning, A. Residential mobility and unemployment in the UK. Labour Econ. 2022, 75, 102104. [Google Scholar]
Huynh, H.H.; Vo, D.H. The Effects of Migration on Unemployment: New Evidence from the Asian Countries. Sustainability 2023, 15, 11385. [Google Scholar] [CrossRef]
Gallo, M. The impact of urban transit systems on property values: A model and some evidences from the city of Naples. J. Adv. Transp. 2018, 2018, 1767149. [Google Scholar] [CrossRef]
Kasraian, D.; Li, L.; Raghav, S.; Shalaby, A.; Miller, E.J. Regional transport accessibility and residential property values: The case study of the Greater Toronto and Hamilton area. Case Stud. Transp. Policy 2023, 11, 100932. [Google Scholar] [CrossRef]
Cardenas, J.; Gallego, J.M.; Urrutia, M.A. Announcement of the first metro line and its impact on housing prices in Bogotá. Case Stud. Transp. Policy 2023, 11, 100941. [Google Scholar] [CrossRef]
Chen, Z.; Haynes, K.E. Impact of high speed rail on housing values: An observation from the Beijing–Shanghai line. J. Transp. Geogr. 2015, 43, 91–100. [Google Scholar] [CrossRef]
Chen, H.; Zhang, Y.; Zhang, N.; Zhou, M.; Ding, H. Analysis on the Spatial Effect of Infrastructure Development on the Real Estate Price in the Yangtze River Delta. Sustainability 2022, 14, 7569. [Google Scholar] [CrossRef]
Chwiałkowski, C.; Zydroń, A. The Impact of Urban Public Transport on Residential Transaction Prices: A Case Study of Poznań, Poland. ISPRS Int. J. Geo-Inf. 2022, 11, 74. [Google Scholar] [CrossRef]
Asso, P.F. New Perspectives on Old Inequalities: Italy’s North-South Divide. Territ. Politics Gov. 2020, 9, 346–364. [Google Scholar] [CrossRef]
Biagi, B.; Detotto, C.; Faggian, A. Evidence of Self-selection and Spatial Mismatch in Interregional Migration: The Case of Italy. Oxf. Econ. Pap. 2023, 75, 858–872. [Google Scholar] [CrossRef]
Cisco, G.; Fiduccia, A.; Lopresti, I.; Tartaglia, M. Transport Accessibility and Demographic Vibrancy: Evidence from the High-Speed Railways in Italy; Springer International Publishing: Cham, Switzerland, 2024; pp. 283–299. [Google Scholar]
Gallo, M.; La Rocca, R.A. The Impact of High-Speed Rail Systems on Tourist Attractiveness in Italy: Regression Models and Numerical Results. Sustainability 2022, 14, 13818. [Google Scholar] [CrossRef]
Gallo, M.; Marinelli, M.; Cavaiuolo, I. The Effects of Accessibility on the Location of Manufacturing Companies: The Italian Case Study. Adv. Intell. Syst. Comput. 1150, 2020, 1362–1372. [Google Scholar]
Hansen, W.G. How Accessibility Shapes Land Use. J. Am. Plan. Assoc. 1959, 25, 73–76. [Google Scholar]
Chatterjee, S.; Simonoff, J.S. Handbook of Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2013. [Google Scholar]
Seya, H.; Yoshida, T.; Yamagata, Y. Chapter Five—Spatial econometric models. In Spatial Analysis Unsing Big Data; Academic Press: Cambridge, MA, USA, 2020; pp. 113–158. [Google Scholar]
Li, H. Introduction to Machine Learning and Supervised Learning. In Machine Learning Methods; Springer: Singapore, 2024; pp. 1–37. [Google Scholar]
Ganong, P.; Shoag, D. Why has regional income convergence in the U.S. declined? J. Urban Econ. 2017, 102, 76–90. [Google Scholar]
Causa, O.; Abendschein, M.; Cavalleri, M.C. The laws of attraction: Economic drivers of interregional migration, housing costs and the role of policies. In OECD Economics Department Working Papers No. 1679; OECD: Paris, France, 2021. [Google Scholar]

Figure 1. Internal migration rates in Italian regions.

Figure 2. Total migration balance values.

Figure 3. Distribution function of total migration balance values.

Figure 4. Graph of the road network [43].

Figure 5. Scatter diagram for the calibrated model.

Table 1. Minimum value of t₉₅ in function of degrees of freedom (dfs) of the model.

df	1	2	3	4	5	6	7	8	9	10
t₉₅	6.314	2.920	2.353	2.132	2.015	1.943	1.895	1.860	1.833	1.812

Table 2. Correlation coefficients with the total migration balance, tmb_i.

Variable	Correlation Coefficient
aipc_i	0.836
or_i	0.833
ur_i	−0.815
rd_i	0.766
rtt_i	0.762
jh_i	0.706
ajh_i	0.665
ra_i	0.604
cjh_i	0.585
mjh_i	0.566
apv_i	0.536
ejh_i	0.316
ujh_i	0.144
hsr_i	0.140
dia_i	0.095
dha_i	0.025
uni_i	−0.021

Table 3. Correlation coefficient matrix.

	aipc_i	or_i	ur_i	rd_i	rtt_i	jh_i	ajh_i	ra_i	cjh_i	mjh_i	apv_i	ejh_i	ujh_i	hsr_i
aipc_i	1.00	0.90	−0.88	0.79	0.79	0.87	0.86	0.69	0.65	0.66	0.52	0.52	0.49	0.27
or_i	0.90	1.00	−0.96	0.87	0.86	0.85	0.78	0.69	0.62	0.73	0.45	0.36	0.11	0.18
ur_i	−0.88	−0.96	1.00	−0.86	−0.86	−0.81	−0.75	−0.65	−0.57	−0.70	−0.42	−0.33	−0.10	−0.15
rd_i	0.79	0.87	−0.86	1.00	0.99	0.78	0.69	0.77	0.61	0.72	0.46	0.27	0.12	0.24
rtt_i	0.79	0.86	−0.86	0.99	1.00	0.78	0.69	0.80	0.62	0.71	0.45	0.27	0.13	0.26
jh_i	0.87	0.85	−0.81	0.78	0.78	1.00	0.94	0.74	0.77	0.82	0.46	0.51	0.23	0.45
ajh_i	0.86	0.78	−0.75	0.69	0.69	0.94	1.00	0.72	0.72	0.59	0.55	0.60	0.32	0.56
ra_i	0.69	0.69	−0.65	0.77	0.80	0.74	0.72	1.00	0.57	0.56	0.40	0.33	0.20	0.59
cjh_i	0.65	0.62	−0.57	0.61	0.62	0.77	0.72	0.57	1.00	0.53	0.53	0.38	0.15	0.46
mjh_i	0.66	0.73	−0.70	0.72	0.71	0.82	0.59	0.56	0.53	1.00	0.12	0.18	0.02	0.10
apv_i	0.52	0.45	−0.42	0.46	0.45	0.46	0.55	0.40	0.53	0.12	1.00	0.39	0.16	0.36
ejh_i	0.49	0.36	−0.33	0.27	0.27	0.51	0.60	0.33	0.38	0.18	0.39	1.00	0.69	0.43
ujh_i	0.27	0.11	−0.10	0.12	0.13	0.23	0.32	0.20	0.15	0.02	0.16	0.69	1.00	0.40
hsr_i	0.31	0.18	−0.15	0.24	0.26	0.45	0.56	0.59	0.46	0.10	0.36	0.43	0.40	1.00

Table 4. Information on the main models specified (models 1–8).

	Model #
Variable	1	2	3	4	5	6	7	8
aipc_i	∙	•	•	•	•	•	•	•
or_i
ur_i
rd_i		•				•		•
rtt_i			•				•
jh_i
ajh_i
ra_i
cjh_i
mjh_i
apv_i					•	•	•
ejh_i
ujh_i
hsr_i				•				•
df	1	2	2	2	2	3	3	3
R²	0.699	0.728	0.726	0.715	0.713	0.739	0.738	0.744
R_adj²	0.696	0.723	0.721	0.709	0.708	0.732	0.730	0.737
F-test	2.21 × 10⁻²⁹	1.90 × 10⁻³⁰	2.81 × 10⁻³⁰	2.50 × 10⁻²⁹	3.30 × 10⁻²⁹	3.00 × 10⁻³⁰	4.23 × 10⁻³⁰	1.19 × 10⁻³⁰
Intercept	−65.839	−67.329	−67.557	−68.151	−66.682	−67.983	−68.209	−69.604
Coeff. 1	0.005	0.004	0.004	0.006	0.005	0.004	0.004	0.004
Coeff. 2		327.934	349.107	−0.142	0.006	309.458	329.670	325.831
Coeff. 3						0.005	0.005	−0.140
Coeff. 4
t₉₅	6.314	2.920	2.920	2.920	2.920	2.353	2.353	2.353
t-test_1	15.682	7.385	7.464	15.997	12.489	6.587	6.644	7.924
t-test_2		3.386	3.255	−2.426	2.302	3.232	3.113	3.448
t-test_3						2.096	2.117	−2.518
t-test_4
Significant	Yes	Yes	Yes	No	No	No	No	Yes
Sign	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes
Valid model	Yes	Yes	Yes	No	No	No	No	Yes

Table 5. Information on the main models specified (models 9–16).

	Model #
Variable	9	10	11	12	13	14	15	16
aipc_i	•	•	•		•
or_i				•	•	•	•	•
ur_i
rd_i		•				•		•
rtt_i	•		•				•
jh_i
ajh_i
ra_i
cjh_i
mjh_i
apv_i		•	•
ejh_i
ujh_i
hsr_i	•	•	•					•
df	3	4	4	1	2	2	2	3
R²	0.743	0.764	0.764	0.695	0.732	0.701	0.703	0.702
R_adj²	0.736	0.754	0.755	0.692	0.727	0.695	0.697	0.693
F-test	1.36 × 10⁻³⁰	2.20 × 10⁻³¹	2.19 × 10⁻³¹	4.54 × 10⁻²⁹	8.97 × 10⁻³¹	2.98 × 10⁻²⁸	2.16 × 10⁻²⁸	3.29 × 10⁻²⁷
Intercept	−69.973	−71.155	−71.538	−94.717	−84.219	−89.478	−89.510	−89.539
Coeff. 1	0.004	0.004	0.004	2.275	1.162	1.887	1.861	1.878
Coeff. 2	355.520	299.761	330.425		0.003	189.793	226.889	200.915
Coeff. 3	−0.146	0.007	0.007					−0.030
Coeff. 4		−0.180	−0.187
t₉₅	2.353	2.131	2.131	6.314	2.920	2.920	2.920	2.353
t-test_1	8.017	7.250	7.283	15.530	3.625	6.316	6.554	6.256
t-test_2	3.406	3.270	3.271		3.840	1.487	1.693	1.547
t-test_3	−2.624	2.925	2.974					−0.512
t-test_4		−3.250	−3.365
Significant	Yes	Yes	Yes	Yes	Yes	No	No	No
Sign	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes
Valid model	Yes	Yes	Yes	Yes	Yes	No	No	No

Table 6. Information on the main models specified (models 17–25).

	Model #
Variable	17	18	19	20	21	22	23	24	25
aipc_i
or_i	•	•	•	•	•
ur_i
rd_i		•		•		•
rtt_i	•		•		•		•
jh_i
ajh_i
ra_i
cjh_i
mjh_i
apv_i		•	•	•	•				•
ejh_i
ujh_i
hsr_i	•			•	•			•
df	3	3	3	4	4	1	1	1	1
R²	0.704	0.729	0.731	0.736	0.738	0.587	0.581	0.020	0.287
R_adj²	0.695	0.721	0.723	0.726	0.728	0.583	0.577	0.010	0.281
F-test	2.27 × 10⁻²⁷	2.17 × 10⁻²⁹	1.69 × 10⁻²⁹	6.42 × 10⁻²⁹	4.34 × 10⁻²⁹	4.22 × 10⁻²²	9.30 × 10⁻²²	1.48 × 10⁻⁰¹	2.26 × 10⁻⁰⁹
Intercept	−89.563	−90.284	−90.111	−90.596	−90.352	−50.345	−50.808	3.986	−21.573
Coeff. 1	1.848	1.771	1.739	1.727	1.682	893.553	981.678	0.149	0.023
Coeff. 2	243.770	131.224	165.698	156.982	202.483
Coeff. 3	−0.036	0.008	0.008	−0.094	−0.100
Coeff. 4				0.009	0.009
t₉₅	2.353	2.353	2.353	2.131	2.131	6.314	6.314	6.314	6.314
t-test_1	6.466	6.154	6.342	6.019	6.142	12.284	12.129	1.455	6.537
t-test_2	1.777	1.064	1.279	1.272	1.555
t-test_3	−0.611	3.298	3.275	−1.611	−1.702
t-test_4				3.653	3.660
Significant	No	No	No	No	No	Yes	Yes	No	Yes
Sign	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No	Yes
Valid model	No	No	No	No	No	Yes	Yes	No	Yes

Table 7. Main features of the calibrated model.

Variable	Coefficient	t-Test
Intercept	−71.538
aipc_i (average per capita income)	+0.003767	7.283
rtt_i (road travel time accessibility)	+33.042	3.271
apv_i (average property value)	+0.007256	2.974
hsr_i (high speed rail services)	−0.187	−3.365
R²		0.764
R²_adj		0.755
F-test		≈0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Basile, A.; Gallo, M. Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models. World 2025, 6, 46. https://doi.org/10.3390/world6020046

AMA Style

Basile A, Gallo M. Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models. World. 2025; 6(2):46. https://doi.org/10.3390/world6020046

Chicago/Turabian Style

Basile, Antonio, and Mariano Gallo. 2025. "Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models" World 6, no. 2: 46. https://doi.org/10.3390/world6020046

APA Style

Basile, A., & Gallo, M. (2025). Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models. World, 6(2), 46. https://doi.org/10.3390/world6020046

Article Menu

Investigating the Impact of Accessibility on Internal Migration Flows in Italy Through the Calibration of Multiple Linear Regression Models

Abstract

1. Introduction

2. Background

3. Data and Methods

3.1. Socio-Economic Variables

3.2. Property Value Variables

3.3. Accessibility Variables

3.3.1. Availability of High-Speed Rail Services

3.3.2. Distance from Rome Leonardo da Vinci Airport

3.3.3. Distance from the Nearest International Airport

3.3.4. Population-Weighted Road Accessibility

3.3.5. Road Distance Indicator

3.3.6. Road Travel Time Indicator

4. Regression Models

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI