Estimating Land-Use Change Using Machine Learning: A Case Study on Five Central Coastal Provinces of Vietnam

: Population growth is one factor relevant to land-use transformation and expansion in urban areas. This creates a regular mission for local governments in evaluating land resources and proposing plans based on various scenarios. This paper discussed the future trend of three kinds of land-use in the ﬁve central coast provinces. Afterwards, the paper deployed machine learning such as Multivariate Adaptive Regression Splines (MARS), Random Forest Regression (RFR), and Lasso Linear Regression (LLR) to analyze the trend of rural land use and industrial land-use to urban land-use in the Central Coast Region of Vietnam. The input variables of land-use from 2010 to 2020 were obtained by the ﬁve provinces of the Department of Natural Resources and Environment (DONRE). The results showed that these models provided pieces of information about the relationship between urban, rural, and industrial land-use change data. Furthermore, the MARS model proved to be accurate in the Quang Binh, Quang Tri, and Quang Nam provinces, whereas RFR demonstrated efﬁciency in the Thua Thien-Hue province and Da Nang city in the ﬁelds of land change prediction. Furthermore, the result enables to support land-use planners and decision-makers to propose strategies for urban development.


Introduction
Estimating land-use change provides valuable information about potential conversion that might occur over time in the earth's surface landscapes. Furthermore, the prediction enables support for land-use planners and land resource managers to propose land-use change strategies, urban planning through modeling rural development, and selecting areas for setting industrial zones [1]; therefore, simulating models to observe and examine land-use change related to landscape dynamics over time is an interesting issue at both global and local scales [2].
Three major approaches are commonly applied in land-use change prediction, which are spatial pattern, statistical analysis, and artificial intelligence [3][4][5][6][7][8][9][10][11][12][13]. Models based on the simulation of the spatial pattern of land-use change processes, such as the Markov model, are deployed to know and interpret regional land changes and trends of regional land-use in effective ways [14]. Using machine learning based on quarterly or annual land-use statistics in localities to analyze land-use changes is also widely applied, such as Multivariate Adaptive Regression Splines (MARS) methodology, which is a kind of

Study Area
The Middle Central Coast region comprises five provinces and cities that serve as a link between the country's three primary socioeconomic hubs: the Red River Delta, the South Central Coast, and the Central Highlands (see Figure 2). The topography of this area is divided into two parts: the flat to gently rolling plains in the east and the most rugged forest-covered mountains in the western two-thirds. The delta area has a coastalmountainous nature, and it is separated by mountain branches close to the sea as Hoanh Son mountain range-Ngang pass, Bach Ma mountain range-Hai Van pass [26,27]. Furthermore, this region plays a critical role in Vietnam's maritime economic development strategy in terms of tourism development, research, and technology, the Vietnamese seaport system, and vital logistics. When it comes to administrative matters, the North Central region is divided into six provinces with a total size of 2,948,430 hectares (9.8% of the country's total area) and a population of 5,366,500 people (5.70 percent of the total population); the rates of average population growth of the region is 1.1% [28]. These points show that the urbanization rate in this area is not high, and the population growth rate is only low compared to Hanoi and Ho Chi Minh City regions. Furthermore, there are two first-grade cities in the region (Da Nang and Hue), two second-grade cities (Tam Ky and Dong Hoi), two third-grade cities (Dong Ha and Hoi An), and eight fourth-grade cities in total, and 35 fifth-grade cities (see Appendix Table A1) [29][30][31][32][33].
On the other hand, the region's average urbanization rate is 47.26 percent, with Da Nang having the highest rate at 87 percent and Quang Binh having the lowest at roughly 30 percent. Following that, agriculture, forestry, fisheries, and industry-construction, services, and product tax minus product subsidies accounted for 15%, 28%, 48%, and 9% of the region's GRDP in 2021, respectively. Per capita income based on GRDP is 2643 USD/person, with Da Nang having the most at 3822 USD/person and Quang Tri province having the lowest at 2087 USD/person. As this field expands, vocational training is designed to help people change occupations. Feeding a portion of the rural population and those who no longer have fertile land is a critical challenge for this region. By 2021, those with vocational training will make up approximately 66 percent of the working-age population. Their earnings will rise because they are well-educated, lowering the poverty rate to around 3.5 percent in 2021 [29][30][31][32][33]; therefore, vocational training demonstrates that urbanization stimulates and expands children's opportunities. People are more dynamic and innovative in their search for, and selection of, methods and forms of production, and organizations rise to become wealthy legally. The main trend and best aspect of urbanization is economic development, which improves employees' living standards. The rapid

Study Area
The Middle Central Coast region comprises five provinces and cities that serve as a link between the country's three primary socioeconomic hubs: the Red River Delta, the South Central Coast, and the Central Highlands (see Figure 2). The topography of this area is divided into two parts: the flat to gently rolling plains in the east and the most rugged forestcovered mountains in the western two-thirds. The delta area has a coastal-mountainous nature, and it is separated by mountain branches close to the sea as Hoanh Son mountain range-Ngang pass, Bach Ma mountain range-Hai Van pass [26,27]. Furthermore, this region plays a critical role in Vietnam's maritime economic development strategy in terms of tourism development, research, and technology, the Vietnamese seaport system, and vital logistics. When it comes to administrative matters, the North Central region is divided into six provinces with a total size of 2,948,430 hectares (9.8% of the country's total area) and a population of 5,366,500 people (5.70 percent of the total population); the rates of average population growth of the region is 1.1% [28]. These points show that the urbanization rate in this area is not high, and the population growth rate is only low compared to Hanoi and Ho Chi Minh City regions. Furthermore, there are two first-grade cities in the region (Da Nang and Hue), two second-grade cities (Tam Ky and Dong Hoi), two third-grade cities (Dong Ha and Hoi An), and eight fourth-grade cities in total, and 35 fifth-grade cities (see Appendix A Table A1) [29][30][31][32][33].
On the other hand, the region's average urbanization rate is 47.26 percent, with Da Nang having the highest rate at 87 percent and Quang Binh having the lowest at roughly 30 percent. Following that, agriculture, forestry, fisheries, and industry-construction, services, and product tax minus product subsidies accounted for 15%, 28%, 48%, and 9% of the region's GRDP in 2021, respectively. Per capita income based on GRDP is 2643 USD/person, with Da Nang having the most at 3822 USD/person and Quang Tri province having the lowest at 2087 USD/person. As this field expands, vocational training is designed to help people change occupations. Feeding a portion of the rural population and those who no longer have fertile land is a critical challenge for this region. By 2021, those with vocational training will make up approximately 66 percent of the working-age population. Their earnings will rise because they are well-educated, lowering the poverty rate to around 3.5 percent in 2021 [29][30][31][32][33]; therefore, vocational training demonstrates that urbanization stimulates and expands children's opportunities. People are more dynamic and innovative in their search for, and selection of, methods and forms of production, and organizations rise to become wealthy legally. The main trend and best aspect of urbanization is economic development, which improves employees' living standards. The rapid development of non-manufacturing industries is also aided by urbanization. Large cities also provide more work options, greater pay, better social services, and increased labor productivity. It is a driving force for economic transformation in both urban and rural areas, contributing to further economic development. At the same time, the metropolitan region serves as a big and diverse consumer of goods, a location to employ a skilled workforce and a hub for sophisticated technology and infrastructure facilities that draw significant domestic and foreign investment. development of non-manufacturing industries is also aided by urbanization. Large cities also provide more work options, greater pay, better social services, and increased labor productivity. It is a driving force for economic transformation in both urban and rural areas, contributing to further economic development. At the same time, the metropolitan region serves as a big and diverse consumer of goods, a location to employ a skilled workforce and a hub for sophisticated technology and infrastructure facilities that draw significant domestic and foreign investment. Urbanization in these provinces is established mainly in two ways, as follows. Firstly, the land-cover transforms rural areas into urban areas, then the villages and communes surrounding urban centers are gradually merged into urban areas. Alternatively, some rural areas have developed enough infrastructure, and the total population of the rural region meets the criterion of an urban population (higher than 50,000 people), that the place will be recognized as an urban area. Lastly, the developing industrial, commercial, and tourism zones promote neighboring suburb areas to develop as urban areas.
The total land-use of rural, urban, and industry areas within these provinces estimate about 71,067 ha in 2020 (see Table 1), in which Quang Binh, Quang Tri, Thua Thien-Hue, Da Nang, and Quang Nam occupy 9973 ha, 6341 ha, 14,510 ha, 11,834 ha, and 28,409 ha, respectively. In addition, Figure 3 indicates the square of land-use change in this study scope from 2010 to 2020. Figure 3a shows that the most change in urban land-use occurs in the Thua Thien Hue province, followed by Da Nang city, and the lowest variation is in the Quang Binh and Quang Tri provinces. Figure 3b indicates that the variation in rural land-use use in the Quang Nam province is the most, followed by the change in Thua Thien Hue and Quang Binh provinces, with the lowest variation occurring in the Da Nang and Quang Tri provinces. Finally, Figure 3c points out that the change in land-use of industrial zones in Quang Nam province is the highest, followed by Da Nang, then Thua Thien Hue, with the lowest being in the Quang Binh and Quang Tri provinces. Urbanization in these provinces is established mainly in two ways, as follows. Firstly, the land-cover transforms rural areas into urban areas, then the villages and communes surrounding urban centers are gradually merged into urban areas. Alternatively, some rural areas have developed enough infrastructure, and the total population of the rural region meets the criterion of an urban population (higher than 50,000 people), that the place will be recognized as an urban area. Lastly, the developing industrial, commercial, and tourism zones promote neighboring suburb areas to develop as urban areas.
The total land-use of rural, urban, and industry areas within these provinces estimate about 71,067 ha in 2020 (see Table 1), in which Quang Binh, Quang Tri, Thua Thien-Hue, Da Nang, and Quang Nam occupy 9973 ha, 6341 ha, 14,510 ha, 11,834 ha, and 28,409 ha, respectively. In addition, Figure 3 indicates the square of land-use change in this study scope from 2010 to 2020. Figure 3a shows that the most change in urban land-use occurs in the Thua Thien Hue province, followed by Da Nang city, and the lowest variation is in the Quang Binh and Quang Tri provinces. Figure 3b indicates that the variation in rural land-use use in the Quang Nam province is the most, followed by the change in Thua Thien Hue and Quang Binh provinces, with the lowest variation occurring in the Da Nang and Quang Tri provinces. Finally, Figure 3c points out that the change in land-use of industrial zones in Quang Nam province is the highest, followed by Da Nang, then Thua Thien Hue, with the lowest being in the Quang Binh and Quang Tri provinces.

Database
This paper contains a database containing 44 quarters of land-use change from 2010 to 2020 (4 quarters for each year) for five provinces obtained from the Department of Natural Resources and Environment (DONRE). Three input variables include rural landuse, industrial land-use, and urban land-use, which were collected from five provinces. Furthermore, the characteristic statistical results for urban land-use in Table 2 demonstrate that the mean ranges from 878 ha in Quang Binh to 4317 ha in Da Nang, the standard deviation (St Dev) ranges from 84 ha in Quang Tri to 959 ha in Da Nang, and the minimum (Min) and maximum (Max) range from 608 ha and 1238 ha in Quang Binh to 4093 ha and 4634 ha in Quang Nam. The ranges of skewness (Skew) and kurtosis (Kurt) parameters of the five provinces fluctuate from 0.17 and −1.72 to 1.03 and 1.65. These Skew and Kurt indicators approach low values that are highly appropriate for modeling [34]. The input data patterns of five provinces were randomly selected in two parts. About 70% of the dataset was selected for the training sample, whereas 30% was used for the testing sample. MARS is a nonparametric regression model, and it was introduced by Friedman [35]. MARS seems like a method for a fitted relationship between prediction and dependent variables. MARS is fast and based on a divide-and-conquer strategy, which divides the training dataset into distinct regions, each with its regression line [36][37][38]. The MARS algorithm feature is the procedure of the backward and forwards stepwise and may explain and control the complex nonlinear mapping between the inputs and output variables. This function predicts the new output y and the input variable x that uses either of the two base functions [39] and deploys a value or knot of variables that demonstrates the point of inflection along with the range of the inputs [40]. The general form of MARS forecasting is as below: where y is the dependent variable predicted by the function f (x) ; β 0 is the constant value; P is the number of terms, each of them formed by a coefficient α j , j ∈ {1, . . . , P}; x j is predictor variable; β j is an individual base function. The base functions of Max(0, x − H) and Max(0, H − x) are univariate and do not have to each be present if their β coefficients are 0; the H values are called "hinges" or "knots"; x is an independent variable. The function of backward stepwise relates to removing basis functions one at a time until the criterion of "lack of fit" is a minimum. In the deleting stage of backward stepwise, the last crucial important base functions are demolished one at a time. The lack of an applied fitting measurement is leaned on in the Generalized Cross-Validation (GCV) [41,42]: where N is a number of data; and C(M) = (M + 1) + dM are the complexity function [35]; d is a penalty for each basis function included in the model; M is the number of base functions in Equation (1). The criterion of GCV is examined for the average residual error multiplied by a penalty to modify the variability associated with more indicator prediction in the model [39,43].

Lasso Linear Regression (LLR)
The lasso linear regression method is widely used in domains with massive datasets, and it is also necessary to use when algorithms are efficient and quick [44]; however, the lasso is not vigorous in terms of determining the high correlation between predictors; it will randomly choose one and ignore the others, and split when all predictors are identical [44]. Moreover, the lasso penalty looks at many coefficients that are close to zero and only a small subset that is larger (and non-zero). The lasso estimator [45,46] uses the l 1 penalized least-squares criterion to get a sparse solution to the problem of optimization as below: β j is the l 1 -norm penalty on β, which is the cause of the sparse solution, and γ ≥ 0 is a tuning parameter. The l 1 penalty allows the lasso to simultaneously fit the smallest squares to and shrink some components ofβ(Lasso) to zero for a suitably chosen γ [44]. The cyclic coordinate reduction algorithm [44] efficiently computes the entire path of the Lasso solution paths for γ for the Lasso estimator and is faster than the Generalized Least Angle Regression (LARS) well-known algorithm. These properties make Lasso an attractive and popular method of variable selection.

Random Forest Regression (RFR)
Random forest is a regression technique that associates the performance of multiple decision tree algorithms to classify or forecast the value of a variable [47][48][49]. When an x input vector is received by random forest, in conjunction with the different evidential features values analyzed for a given training area, a number K of regression trees, on averages, the results are built by random forest. After K, such trees {T(x)} K 1 are grown, and the random forest regression predictor is as follows: To avoid the correlation of different trees, the random forest raises the tree's diversity by improving from different subsets of training data generated through a procedure called bagging [50]. Bagging is a technique used to generate training data by randomly resampling the original dataset with a replacement. As a result, some data may be used multiple times during training, whereas others may never be used. Thus, greater stability is achieved, as it makes it more robust in the face of small variations in the input data, and at the same time, it increases the accuracy of the prediction [47,51] Correlation Coefficient (R), and Correlation of Determination (R 2 ). Furthermore, the error metrics are defined as follows [52][53][54]: where x t , x t are the observed and estimated values in the period time t, and n is the number of the observed values in the testing data.
x, x are mean of the observed and estimated value. The R 2 and R (correlation coefficient) should be approaching 1 to indicate strong model performance, and the MSE, MAE, and RMSE should be as close to zero as possible.

Results Analysis
Regarding this platform, the output function of MARS and LLR of land-use in five provinces are presented as below: Quang The MARS Quang Nam analysis indicates that the most important is in bellowing order industrial land-use and rural land-use.
Furthermore, the output function for RFR does not occur. The data in Figures 4a-c, 5a-c, 6a-c, 7a-c and 8a-c present the relationship between the three types of land-use, and the area of industrial and rural land-use increases as the square of urban land-use increases. LLR makes a forecasting form that resembles a flat surface of paper. Additionally, the RFR and MARS charts are the same as the image of papers with some folds, and the folds enable a better fit to the data. In addition, Figures 4a-c, 5a-c and 8a-c demonstrate that the area of land usage in the Quang Nam province increased significantly from 2010 to 2020. Moreover, the MARS predicted algorithm shows that the red dots are evenly distributed on the surface, and the LLR and RFR forecasting algorithms demonstrate that the red dots are relatively far from the surfaces. The data in Figure 6a-c imply that the area used for the three categories of land experienced an upward tendency between 2010 and 2013; however, the urban land use area decreased dramatically and tended to be saturated, whereas the rural land use area and industrial zones grew steadily per annum from 2014 to 2020. The output in Figure 7a-c illustrates that the urban and industrial land use increased significantly from 2010 to 2020, whereas rural land use grew slowly and even reached its saturation in 2019. Additionally, the RFR forecasting model of Figures 6a and 7a shows that these red dots are moderately distributed closer to the surface compared with the two algorithms in Figures 6b,c and 7b,c; however, it is difficult to investigate the difference between the three models. Hence, this study performed an accurate metric to explore the potential models for each province. From comparing the three models, Table 3 shows that the MARS model supplies a better fit than the other models for Quang Binh, where R 2 value = 0.91 was the largest along with MSE and MAE, whereas RMSE obtained the lowest value out of the established models. According to the implementation of the other models, the hierarchical carrying out considers the order of MARS > RFR > LLR. The simultaneous determination of urban land-use change prediction of Quang Nam and Quang Tri also proves the forecasting skills of these models in urban land-use change prediction in these provinces. Using the experiment results from the three provinces, the MARS model indicated to supply the best forecasting accuracy, the hierarchical order of the models, and other models for three provinces are MARS > RFR > LLR in Quang Tri and Quang Nam. Moreover, the prediction result of urban land-use change prediction in Thua Thien-Hue and Da Nang presented in Table 3 denotes that the order of hierarchical models with performance accuracy is RFR > MARS > LLR models. Regarding the GVC parameter for MARS, it generates an equilibrium between flexibility and generalization capability of the MARS model function [55]. The data in Table 3 also indicates that the order of hierarchical models with the accuracy of the GVC value of the MARS model in five provinces are GVC Thua Thien-Hue > GVC Da Nang > GVC Quang Tri > GVC Quang Binh > GVC Quang Nam . Furthermore, the scatter charts in Figure 9a,b,e proved that the line of MARS models (red plus lines) of urban land used change estimations fit better than the line of RFR models (green triangle lines) and the line of LLR models (purple multiply lines) for the Quang Binh, Quang Tri, and Quang Nam provinces. In contrast, Figure 9c,d showed that the line of RFR models of urban land used change prediction in Thua Thien-Hue which showed that Da Nang has the best fit, followed by the line of MARS models and line of LLR models, respectively. These points may explain that the distribution of land-use change data with random selection for training and testing data is suitable for the MARS model of Quang Binh, Quang Tri, and Quang Nam, and is comfortable for the RFR model in Thua Thien-Hue and Da Nang.
is suitable for the MARS model of Quang Binh, Quang Tri, and Quang Nam, and is comfortable for the RFR model in Thua Thien-Hue and Da Nang.          Furthermore, the predicting performance of the model models is also visualized and examined by the Taylor diagram. The diagram summarizes the St Dev and correlation coefficient (CC) that is comprised concomitantly in assessing the respective model [56,57]. The St Dev, and CC between the observed and predicted datasets for all the land-use models of the provinces are described in the Taylor diagram. Figure 10a demonstrates that these models were optimal accuracies of almost all models' outcomes and were significantly closer to 1. Moreover, the LLR model of land-use in Thua Thien-Hue with CC LLR = 0.67 indicates that the level of accuracy achieved is only above medium. Furthermore, the predicting performance of the model models is also visualized and examined by the Taylor diagram. The diagram summarizes the St Dev and correlation coefficient (CC) that is comprised concomitantly in assessing the respective model [56,57]. The St Dev, and CC between the observed and predicted datasets for all the land-use

Discussion
Urban areas in the five central coastal provinces are organized evenly along the coast, primarily on urban beam space. Rural areas still account for a much larger proportion of land than urban areas, and approximately 15% to 20% of the land use belongs to the urban administrative boundary, and the population accounts for over 60%. The urbanization process creates a sharp change in land-use in peri-urban and rural areas. The conversion of a large part of agricultural land to land for the construction of industrial, service, and urban residential areas. The process of expanding urban space along with the appearance of housing projects, real estate, concentrated industrial parks, large-scale commercial service works in the peri-urban communes has caused a sharp decline in production land funds, natural land, and spatial change of the rural ecological landscape, which causes the mechanical population of peri-urban communes to increase, despite insufficient infrastructure, leading to rapid overcrowding. Moreover, it also has an impact upon technical infrastructure, social infrastructure, especially traffic, education, water supply, and drainage, environmental sanitation; however, urbanization has lagged behind the growth of industrial zones, and industrial development lacks a vision for future urbanization. In industrial zones, a large number of workers leaving the agricultural production area tend to move to the industrial production area, forming areas with high population density,

Discussion
Urban areas in the five central coastal provinces are organized evenly along the coast, primarily on urban beam space. Rural areas still account for a much larger proportion of land than urban areas, and approximately 15% to 20% of the land use belongs to the urban administrative boundary, and the population accounts for over 60%. The urbanization process creates a sharp change in land-use in peri-urban and rural areas. The conversion of a large part of agricultural land to land for the construction of industrial, service, and urban residential areas. The process of expanding urban space along with the appearance of housing projects, real estate, concentrated industrial parks, large-scale commercial service works in the peri-urban communes has caused a sharp decline in production land funds, natural land, and spatial change of the rural ecological landscape, which causes the mechanical population of peri-urban communes to increase, despite insufficient infrastructure, leading to rapid overcrowding. Moreover, it also has an impact upon technical infrastructure, social infrastructure, especially traffic, education, water supply, and drainage, environmental sanitation; however, urbanization has lagged behind the growth of industrial zones, and industrial development lacks a vision for future urbanization. In industrial zones, a large number of workers leaving the agricultural production area tend to move to the industrial production area, forming areas with high population density, centralization, creating demand for services such as food, accommodation, living, studying, and commuting purposes which are the premise for the initial formation of an industrial residential area, an industrial town, and in the future, it will become an industrial city. Hence, the following are the primary types of the relationship between the growth of industrial parks and the process of urbanization. Firstly, many industrial parks are located in rural areas but not in urban areas. Secondly, several industrial parks were formerly located in rural areas, but now it is still within urban areas' boundaries. Following that, there are many industrial parks in rural areas, which are now within the proposed expansion boundaries of the neighboring urban master plan. Consequently, the above situation shows that assessing the influence of rural land-use and industrial zone land-use is significantly vital for urban land-use and urbanization in this area.
This study implemented three machine learning models for land-use change to assess the speed of urbanization taking place in the Central Coast Region in Vietnam. Three models gave high accurate results for predicting urban land-use fluctuations, in which the MARS and RFR models showed more accuracy for Quang Binh, Quang Tri, Quang Nam, and Thua Thien-Hue, Da Nang, respectively, compared with the LLR model. In addition, the estimated values of types of land-use changes made by the LLR model also provided acceptable results. The data of this study was based on the statistics of land-use types that have been measured quarterly; therefore, these estimation values supplied the total types of land-use change that have been urbanized based on the process of forming urban areas in industrial and rural areas.
To evaluate the predicted accuracy of these study models, spatial models are needed to estimate accuracy parameter values. Comparing RMSE and MAE using the MARS model of this study result with the study result of Yilmaz et al. (2018) [21] about suspended sediment load, their RMSE = 3592, and MAE = 3483 are found to be greater than the values in this study with RMSE Average = 47.9, MAE Average = 50.8. Jamali (2019) [58] deployed RFR to predict land-use/land-cover mapping using Landsat 8 OLI in the northern region of Iran. The RMSE and MAE for the model are 5 and 5, respectively; these points are lower than this result study. Finally, the result of the study Adab et al. (2020) [59] concerns Estimate Surface Soil Moisture in the semi-arid region of west Khorasan-Razavi province of Iran, and it shows that RMSE and MAE of LRR model (at 7 March 2017) are 6.67 and 5.55, respectively. These points also indicate that the prediction model is lower than this study model. Duong et al. (2018) [60] deployed the kernel density estimation and remotely sensed data from multiple sensors to generate the land cover maps over Central Vietnam during the period of 2007 to 2017. The result indicated that the overall accuracies of the maps for 2007 and 2017 are 90.5% (kappa coefficient of 90%) and 90.6% (kappa coefficient of 90%), respectively, in which the urban prediction was approximately 91%. This point also proved that using machine learning to show the results of this study consider equivalent to the remote sensing method for estimating the land use/ land cover for the Center of Vietnam; however, the study deploys machine learning models and statistical algorithms that majorly focus on land use transition/change. Due to sufficient published literature relating to other aspects of land use planning such as zoning, land allocation, and land restrictions, land-use mapping was not mentioned in this study; therefore, combining multiple methods for land-use information would be useful in future research.
Although classification accuracies for land-use were not particularly large, estimating urban, rural, and industrial land-use change is still useful for five central coastal provinces of Vietnam. This study result will assist the provinces' authorities and other stakeholders in decision-making and planning regarding three kinds of land-use. The usual practice is for the Ministry of Natural Resources and Environment (MONRE) to carry out urban inventory and set up urban land-use change maps every five years. Then, the DONRE provinces obtain the predictive data and update them manually. In addition, many jobs are created as a result of the development of the service, commerce, and manufacturing industries in cities. At the same time, a lot of individuals lose arable land due to urbanization to make way for industrial parks, handicrafts, or concentrated craft villages. Moreover, urbanization will affect policymakers regarding labor reorganization, changing production methods, and enhancing human resource training solutions to adapt to new employment standards. Moreover, new industries and services drive economic growth. Furthermore, sustainable urbanization development is a concern, and several criteria need to be proposed as below. Firstly, harmonious development of the economy, society, environmental protection, and ecological balance is required. Secondly, the municipality must ensure that the amount of space available for activities, the infrastructure engineering system, and social infrastructure are all up to par with high-quality standards. Thirdly, cities must have a well-organized population distribution system to close the gap between urban, rural, and industrial zones. Fourthly, urban development must balance the ecology in the inner city and suburbs. Finally, the authorities have to enforce appropriate policies related to population, land use, technical infrastructure development, environmental protection, and preservation of natural and social ecosystems.
Therefore, sustainable urban development for urban provinces can be suggested as follows: 1.
Da Nang city, the most developed urban area in the region, has industrial parks equivalent to the urban land-use area. As a result, it is critical to relocate industrial zones in the ancient city, rationalize land use functions, employ high-tech equipment, and create a green environment. More importantly, to accommodate the influx of migrants from all over the country into the city's working streets, local authorities must plan to build land funding and infrastructure in industrial zones or rural areas near industrial zones, lowering stress in Da Nang's central city.

2.
The Quang Nam and Thua Thien Hue provinces, two provinces with many tangible cultural heritage sites such as Hue City, Hoi An Ancient Town, and My Son Holyland, need to build satellite urban areas to relieve the pressure on infrastructure and population for urban heritage areas. In addition, because the land fund for rural use is enormous, a strategy for converting agricultural land to industrial and commercial zones in rural areas is required to support rural growth and urban areas while also creating jobs for rural residents.

3.
Quang Binh and Quang Tri are two provinces with slower urban and industrial zone development than the Da Nang, Quang Nam, and Thua Thien-Hue provinces; however, plenty of rural land use and industrial land use funds are being used in these two provinces. As a result, these two provinces will need to construct satellite cities based on highly populated areas near industrial parks. In addition, it is necessary to form sub-regional centers in the district in the direction of commodity production with high technology.

Conclusions
Urbanization is an inevitable process for the economic and social development of the five central coastal provinces; therefore, this study has shown the role of rural landuse and industrial zone land-use in informing and expanding urban areas. However, this development has not been synchronized and has not yet ensured the infrastructure of an urban area; hence, this study used the MARS, RFR, and LLR models to estimate urban land-use change based on rural and industrial land-use from the five provinces. In addition, the projected and observed values were compared using five widely used statistical parameters (i.e., RMSE, MAE, MSE, R, and R 2 ). The results of the study of the models also show that the MARS model improves the accuracy of performance more than RFR, LLR in the Quang Binh, Quang Tri, Quang Nam provinces, and the RFR model gives a more accurate forecasting implementation than the MARS, LLR models in the Da Nang and Thua Thien-Hue provinces. The accuracy of the models may depend on the distribution of land-use change data with random selection for training and testing data. The prediction of land-use change may support the authorities' land-use planning and decision-making. Furthermore, the research also suggested sustainable urban development for each specific province, and the region in general. The future of the current work consists of using a hybrid of MARS and LLR in modeling land-use change and other studies to enhance the model estimating capability, or these methods may combine with the spatial pattern models to estimate land-use change.