Agricultural Drought Risk Evaluation Based on an Optimized Comprehensive Index System

In this study, a new optimized comprehensive drought index system (OCDIS) was developed based on pressure-state-response (PSR) and random forest (RF). Then the pressure, state, response, and integrated agricultural drought risk were evaluated according to the synthetic-weight variable fuzzy set (SW-VFS) model. Finally, the countermeasures in terms of pressure, state, and response were discussed. The proposed index has been implemented in Qujing, Yunnan Province, China. The results showed that of the 10 indices included in the OCDIS, the four most important indices for agricultural drought risk management are reservoir storage capacity, precipitation anomaly percentage, soil moisture, and per capita annual income. The pressure risk and response risk of Malong are relatively higher than other counties. The integrated results indicated that most counties of Quijng have moderate drought risk. The assessment results are consistent with the actual situation of Qujing. The proposed model provides a scientific and objective way to develop the risk index system of agricultural drought. This study can potentially assist government agencies with information on the most important drought impacts and provide the basis for science-informed decision-making.


Introduction
Drought is a water shortage phenomenon that affects agricultural production, food safety, social stability, and ecological harmony [1,2].China is one of the countries most affected by droughts.According to a bulletin from the Chinese government, from 1956 to 2016, the annual average drought-affected area was more than 200,000 km 2 ; the annual average economic loss related to drought was over tens of billions of dollars [3]. With the intensification of global climate change, development of society and economy, and rapid increase of population, the demand for water resources has been continuously increasing over the past years.Hence, the assessment of drought is important for the sustainable development of water resources and the management of grain safety [4].The grain safety refers to the sufficient, stable food supply, and people can obtain food according to their demands [5].Drought leads to fluctuations in food production and supply, and then resulting in regional grain safety problem.The study area is one of the most critical crop-producing regions in Yunnan province.From 2009 to 2012, it suffered the most severe drought, which affected more than 4000 km 2 of farmland and caused more than 400 million dollars of economic loss [6].Therefore, drought research in this area is vital to reduce loss and to improve drought relief ability.
Several drought indices have been developed to assess the risk of drought, such as precipitation anomaly percentage, Palmer drought severity index (PDSI) [7], crop moisture index (CMI) [8], standardized precipitation index (SPI) [9], standardized precipitation evapotranspiration index (SPEI) [10], Z-index [11], etc.The PDSI, relying on multi-source data (precipitation, temperature, soil moisture, and evapotranspiration), is one of the most commonly used indices for the severity and extent of drought disasters [12].However, the calculation of PDSI requires a large number of parameters, and the accuracy of the index is limited in extreme climate conditions [13,14].The SPI, which is based on historical precipitation data, is another widely used index [15].It allows drought risk assessment at multi-time scales from 1 month to 48 months, which better reflects the intensity and duration of droughts [8].Based on SPI, Vicente-Serrano proposed the improved drought index SPEI.The SPEI takes precipitation and evapotranspiration into account, and has become one of the most useful indices for drought management [16].
It is noteworthy that many drought indices only reflect one aspect of drought phenomenon.This may not be sufficient for reliable drought risk assessment.Hence, it is essential to develop comprehensive indices that combine many drought-related parameters to improve evaluation results.Hao et al. [17] proposed a linearly combined drought index (LDI) based on the multivariate ensemble streamflow prediction (MESP).Keyantash et al. [18] developed an aggregate drought index (ADI), including precipitation, evapotranspiration, streamflow, and other physical forms of drought.In the constructed of ADI, the principal component analysis (PCA) was used to aggregate all physical variables of drought into a single time series.Kim et al. [19] applied a Bayesian network to establish a composite drought index, which included SPI, standardized runoff index, and normalized storage volume index.Based on the entropy theory, Rajsekhar et al. [20] developed a multivariate drought index (MDI), which contained precipitation, runoff, evapotranspiration, and soil moisture; Zhu et al. also [21] proposed a hybrid drought index in accordance with entropy.The linear combination, PCA, Bayesian, and the entropy method all assume a linear relationship among drought factors.Therefore, the copulas function has been applied to analyze the drought phenomenon by analyzing the nonlinear relationship between multiple variables [22].Hao et al. [23] proposed a multivariate standardized drought index (MSDI) using the concept of copulas.The MSDI can forecast the drought disaster as early as SPI and can describe the persistence of drought disaster according to the joint state of both precipitation and soil moisture.Yang et al. [24] also constructed a nonlinear multivariate drought index (NMDI) using copulas.However, copulas highly rely on the assumption that the samples obey a given probability density [25].With the development of artificial intelligence (AI), the AI algorithms, such as support vector machine (SVM), decision trees (DT), and artificial neural network (ANN) have been widely used [26][27][28].They can better resolve the non-linearity problems, but, still, have some weakness.For example, the SVM is difficult to resolve the multi-classification problems.The DT is sensitive to missing data and is prone to over-fitting.The ANN is easy to fall into local optimization.Importantly, the contribution of each variable to the total drought risk cannot be effectively estimated by these methods.
The random forest (RF) model proposed by Breiman, is a data-driven model that has many benefits [29].For example, it is not easy to fall into over-fitting, can handle missing data well, and is capable of measuring the importance of each variable to the total drought risk.The RF model has been extensively used in biomedical [30], environmental science [31], economic management [32], and other fields.These studies prove that random forest can reduce data dimension effectively.The purpose of this study is to construct an optimized comprehensive drought index system (OCDIS) from a number of input variables based on the random forest model.The input variables are determined by the PSR (pressure-state-response) model.The PSR is a cyclical model that is based on the interaction of human and eco-system factors [33].In this model, climate change and human activities put pressure on the environment and natural resources.Then the state of the environment that results from the pressure prompts humans to take decision or actions to reduce the adverse impact.The response is the actions taken by society.The PSR model answers "what", "why", and "how", the three basic issues of sustainable development.It has been widely used for evaluation of ecological systems and assessment of land quality [34].Therefore, the input variables of the RF model determined by PSR have comprehensive and systematic characteristics which include natural resource, ecological environment, and social economy.In this study, the input variables of the RF model are named as the original index system of drought risk assessment.The "optimized" originates from the concept of "survival of the fittest" in biology.In the OCDIS, it refers to the indices that are selected based on the contribution to the results.The unimportant indices will be discarded.Hence, the index system constructed in this study is named as optimized comprehensive drought index system.Based on the above knowledge, constructing the OCDIS by combining PSR with RF is an effective solution for drought risk assessment.However, few studies have been conducted in this field.Hence, the focus of this study is to evaluate agricultural drought risk based on the OCDIS.
Drought risk is the probability of drought event and the comprehensive measurement results of adverse effects.It can be expressed as "risk probabilistic" or "risk degree" [1].Drought is a fuzzy phenomenon, and the assessment of drought risk has the characteristics of fuzziness, randomness, and uncertainness.The variable fuzzy set (VFS) which can describe vague phenomena was used in this study to assess the agricultural drought risk.Considering the difference between data and the contribution of each data to the results, synthetic-weight based on entropy and random forest was developed to determine the weight of VFS, and establish the synthetic-weight variable fuzzy set (SW-VFS) model.Then the pressure, state, response, and comprehensive risk can be calculated based on the SW-VFS model.The motivation of the paper is to construct an optimized comprehensive drought index system (OCDIS); then focus on the established index system and synthetic-weight variable fuzzy set (SW-VFS) model to evaluate the agricultural drought risk (ADR); and finally, analyze the ADR of Qujing.

Study Area
Qujing (Figure 1) is in the east of Yunnan province.It covers an area of 2.89 × 10 4 km 2 , with Guizhou province and Guanxi Zhuang Autonomous Region in the east, Wenshan and Honghe in the south, Kunming in the west, and Zhaotong in the north.The city is administratively divided into nine counties, including Qilin, Malong, Luliang, Shizong, Luoping, Fuyuan, Zhanyi, Huize, and Xuanwei.Qujing is the primary grain-producing area in Yunnan province.The annual grain output accounts for more than 14% of the province's total grain output [35].Hence, Qujing is known as the "Granary of Eastern Yunnan".The regional climate of Qujing is dominated by subtropical plateau monsoon climate.The annual average precipitation is approximately 1000 mm, and the mean annual temperature is around 14.5 • C. The precipitation from May to October accounts for more than 80% of the annual precipitation [6].Recently global climate change, the destruction of the environment, and the uneven spatiotemporal distribution of precipitation have made this city prone to drought.From 1960 to 2009, for almost 30% of the time different levels of drought disasters occurred; from 2009 to 2012, Qujing was affected by the most severe drought, affecting more than 3,000,000 people, with more than 900,000 people having difficulty with drinking water [6].In recent years, with the development of economy and population, water demand is increasing year by year.The conflict between water supply and water demand, results in frequent drought disasters.Thus, it is essential to assess the agricultural drought risk for the sustainable development of water resource and grain safety.

Data Sources
The meteorological data (e.g., the precipitation, temperature, soil moisture) were collected from "Meteorological Bureau of Qujing" and "National Meteorological Information Center" (http://data.cma.cn/data/cdcdetail/dataCode/A.0029.0005.html).The water resource data was gathered from "Yunnan Bureau of Hydrology and Water Resources" (http://www.ynswj.gov.cn/news_list.aspx?category_id=142).The socio-economic data, like population, gross domestic production (GDP), agricultural output, and per capita annual income were collected from the statistical yearbooks of Qujing [36].The cultivated land, grain yield, reservoir storage, sown area, water infrastructure investment, and irrigation area were collected from water conservancy statistical yearbooks of Yunnan [37].The drought relief expenditure and the drought relief capacity were provided by the Qujing Flood Control and Drought Relief Headquarters.In the collection of data set, the data were checked by the boxplot of SPSS.Then, the suspicious and erroneous data were corrected manually.

Random Forest
Random forest (RF) is a combination classification method proposed by Breiman.An RF is an ensemble of tree-structured classifiers { ( , ), is an unpruned tree constructed by CART (Classification and Regression Tree); each ( , ) casts a unit vote for the most popular class [38].
Suppose the length of the training data is N , the number of variables is M, then the principle of RF is summarized as follows: (1) Select a bootstrap sample { }, of size N from the training data.
(2) For each bootstrap sample, grow a maximum tree with no further splits.Randomly select mtry ( mtry M  ) variables out of M at each node of the tree; then choose the best split on these mtry variables.The value of mtry is constant during the growth of the forest.
(3) Repeat the above steps until the ensemble of trees is grown up.

Data Sources
The meteorological data (e.g., the precipitation, temperature, soil moisture) were collected from "Meteorological Bureau of Qujing" and "National Meteorological Information Center" (http://data.cma.cn/data/cdcdetail/dataCode/A.0029.0005.html).The water resource data was gathered from "Yunnan Bureau of Hydrology and Water Resources" (http://www.ynswj.gov.cn/news_list.aspx?category_id=142).The socio-economic data, like population, gross domestic production (GDP), agricultural output, and per capita annual income were collected from the statistical yearbooks of Qujing [36].The cultivated land, grain yield, reservoir storage, sown area, water infrastructure investment, and irrigation area were collected from water conservancy statistical yearbooks of Yunnan [37].The drought relief expenditure and the drought relief capacity were provided by the Qujing Flood Control and Drought Relief Headquarters.In the collection of data set, the data were checked by the boxplot of SPSS.Then, the suspicious and erroneous data were corrected manually.

Random Forest
Random forest (RF) is a combination classification method proposed by Breiman.An RF is an ensemble of tree-structured classifiers {h(x, Θ k ), k = 1, 2, • • • }, where {Θ k } is the independent, identically distributed random vector; h(x, Θ k ) is an unpruned tree constructed by CART (Classification and Regression Tree); each h(x, Θ k ) casts a unit vote for the most popular class [38].Suppose the length of the training data is N, the number of variables is M, then the principle of RF is summarized as follows: (1) Select a bootstrap sample {S k }, k ∈ {1, 2, • • • , k} of size N from the training data.
(2) For each bootstrap sample, grow a maximum tree with no further splits.Randomly select mtry (mtry M) variables out of M at each node of the tree; then choose the best split on these mtry variables.The value of mtry is constant during the growth of the forest.
(3) Repeat the above steps until the ensemble of trees is grown up.
(4) Build forest by the grown trees.Each tree casts a unit vote for the most popular class; then the optimal results are obtained.For the classification problem, the decision function is as follows: where I(•) is indicator function, Y is the prediction object, T is the number of trees.Then the probability of sample S belongs to Y C is as follows: Suppose the length of training data is N, the proximity function is prox(n, k), the average proximity between the case n and other cases in class j is prox 2 (n, k), the number of cases belongs to the class j is N j , the raw outlier measure for n is rawom = N j /P (n), then the final outlier measure is as follows: where µ and σ are the median and standard deviation of each raw outlier measure, respectively.The RF model can measure the importance of variables to help the deciders to identify critical factors in the evaluation problem.It provides two different importance measurement methods: Mean decrease in accuracy (MDA) and mean decrease in GINI (MDG) [39].MDA measures the importance of variables by calculating the change in prediction accuracy.Usually, the larger the value of the score, the higher the importance of the variable.MDG calculates the average decreases in GINI impurity due to a given variable (when this variable is adopted to construct a split).The larger the average value, the higher the importance of the variable.Generally, the results of MDG are consistent with MDA, but MDG is more robust [40].Thus, this study adopts MDG to measure the importance of ADR indices.The importance of each index to the total drought risk can be measured by the following equation: where m, n, and t are the number of indices, classification trees, and nodes, respectively.D Gkij is the GINI decrease value of the jth node in the ith tree that belongs to the kth index.

Variable Fuzzy Set
Suppose A is a fuzzy concept of the natural events, such as precipitation or temperature.
u is an element of U, denoting the research object.A and A c are attractability of U and repellency of U, respectively.µ A (u) is the relative membership degree (RMD) of A, and They satisfy µ A (u) + µ A c (u) = 1, and ranges from 0 to 1 [41].
For any element u, V 0 is defined as the variable fuzzy sets (VFS).Suppose X 0 = [a, b] and X = [c, d] are the attracting sets and respelling sets of V 0 , respectively.The point of M satisfies D A (u) = 1.
x is a random point in the interval X.If x lies at the left side of M, D A (u) can be calculated as follows: If x lies at the right side of M, D A (u) can be calculated as follows: In which β ≥ 0, usually we set β = 1.Suppose X = (x ij ) m×n is the sample set of ADR assessment, x j = (x 1j , x 2j , • • • , x mj ) is the index eigenvalue of the sample j, where n is the number of the sample set, m is the number of indices.Assume the drought grades is h, then the RMD µ A (u), and the non-normalized integrated RMD i u h can be calculated as below, . α is the optimal rule parameter, p is the distance parameter.ω i is the weight of index.Usually, it is determined by the experts' evaluation method or the entropy method.
In this study, a synthetic-weight is proposed based on information entropy theory and random forest.Information entropy, the concept of measuring the information produced by data, can reflect the difference between data.It determines the weight of indices based on the carried information.Generally, the more difference between data, the more information produced, and the larger weight of the index.Random forest measures the weight of index depending on the contribution to the results.The greater contribution to the results, the larger the weight of the index.Considering the difference between data and the contribution to the results, this study developed a synthetic weight based on entropy and random forest (see Equation ( 10)).
where En ω is the weight determined by entropy.RF ω is the weight decided by random forest.
Syn ω is the combined weight determined by entropy and random forest.α is the coefficient, which ranges from 0 to 1. Considering the difference between data and the contribution to the results, this study sets α = 0.5.The normalized integrated RMD and the grade characteristic value H can be calculated as follows: where U is the normalized RMD degree matrix.Finally, based on the rounding-off method, the corresponding risk levels of agricultural drought can be calculated.

Drought Risk Assessment Based on OCDIS and SW-VFS
Based on the PSR model, the original index system which includes human actions, environment, and other social-economy factors were established.Then, referring to the concept of "survival of the fittest" in biology, random forest was adopted to measure the importance of each index, and to establish the optimized comprehensive drought index system (OCDIS).Finally, synthetic-weight variable fuzzy set (SW-VFS) model was proposed to assess the drought risk based on OCDIS.The main steps of ADR assessment based on OCDIS and SW-VFS are as follows: (1) Establish the original index system based on the PSR model.where U is the normalized RMD degree matrix.Finally, based on the rounding-off method, the corresponding risk levels of agricultural drought can be calculated.

Drought Risk Assessment Based on OCDIS and SW-VFS
Based on the PSR model, the original index system which includes human actions, environment, and other social-economy factors were established.Then, referring to the concept of "survival of the fittest" in biology, random forest was adopted to measure the importance of each index, and to establish the optimized comprehensive drought index system (OCDIS).Finally, synthetic-weight variable fuzzy set (SW-VFS) model was proposed to assess the drought risk based on OCDIS.The main steps of ADR assessment based on OCDIS and SW-VFS are as follows: (1) Establish the original index system based on the PSR model.

Establishment of Original Index System
Drought is a fuzzy phenomenon affected by various factors.The drought index is critically important for agricultural drought risk (ADR) assessment.The scientific index system can describe the frequency, severity, duration, and extent of droughts accurately.The PSR model is a cyclical

Establishment of Original Index System
Drought is a fuzzy phenomenon affected by various factors.The drought index is critically important for agricultural drought risk (ADR) assessment.The scientific index system can describe the frequency, severity, duration, and extent of droughts accurately.The PSR model is a cyclical model that can express the relationship between social, economy, and ecosystem scientifically.The model divides the drought indices into pressure, state, and response of three subsystems.The pressure subsystem represents the source of risk, which can reflect the pressure caused by climate change and human activities.It includes the abnormal precipitation, low soil moisture, high temperature, etc.The state subsystem describes the state or trend of the ecosystem under various pressure.The response subsystem contains the countermeasures or changes taken by humans or the ecosystem under various pressures and states.It includes drought relief expenditure, drought resistance planning, irrigation facilities, etc.Based on the PSR model and field investigation, this study develops the original ADR index system in terms of pressure, state, and response of the three subsystems (see Table 1).

Index Meaning
Pressure soil moisture (%, X 1 ) It reflects the water content of the soil.
precipitation anomaly percentage (%, X 2 ) It reflects the degree of precipitation deviating from the average level.Suppose P is the precipitation, _ P is the mean annual precipitation, the value of X 2 is calculated by It means annual mean temperature.
agricultural population density (person/km 2 , X 4 ) It is determined by the amount of agricultural population and the region area.
agricultural output value proportion (%, X 5 ) It reflects the proportion of agriculture output in the gross domestic product (GDP).

State
per capita annual income ($, X 6 ) It reflects the economic situation in rural areas, which obtained from the regional statistical yearbook.
grain yield per mu (kg/mu, X 7 ) It is determined by crop yields and sown area.
per capita water resources (m 3 /person, X 8 ) It is equal to the gross amount of water resource divide by the amount of population.
drought rate (%, X 9 ) The proportion of the drought area to sown area.
irrigation rate of cultivated land (%, X 10 ) It is equal to irrigation area divided by cultivated land area.
sown area ratio (%, X 11 ) The proportion of sown areas to region area.

Response
irrigation facilities ratio (%, X 12 ) It reflects the electromechanical drainage and irrigation capability.It is the proportion of electromechanical drainage and irrigation area to the cultivated land area.
drought relief expenditure rate (%, X 13 ) The proportion of drought relief expenditure to the gross regional product.
disaster relief capacity (X 14 ) It refers to the ability of response to drought, which is determined by the expert assessment, which ranges from 0 to 100.
reservoir storage capacity (%, X 15 ) It reflects the abnormal degree of reservoir water storage.It is the proportion of reservoir storage capacity at the end of the year to annual average reservoir storage capacity.
According to the characteristics of each evaluation index, annual average temperature X 3 , agricultural population density X 4 , agricultural output value proportion X 5 , drought rate X 9 , and sown area ratio X 11 are positive indices, of which the larger of the values, the higher of the drought risk; the soil moisture X 1 , precipitation anomaly percentage X 2 , per capita annual income X 6 , gain yield per mu X 7 , per capita water resources X 8 , irrigation rate of cultivated land X 10 , irrigation facilities ratio X 12 , drought relief expenditure rate X 13 , disaster relief capacity X 14 , and reservoir storage capacity X 15 are negative indices, of which the larger the values, the lower the drought risk.
Among the indices, some originate from raw data of statistical materials; some are generated based on the calculation of raw data.For example, the precipitation data is collected from "Meteorological Bureau of Qujing".The precipitation anomaly percentage is calculated based on the precipitation.The irrigation facilities ratio is determined by the area of electromechanical drainage and the area of cultivated land, which are gathered from "Water Conservancy Statistics Yearbook of Yunnan".The data of reservoir storage capacity is also gathered from "Water Conservancy Statistics Yearbook of Yunnan".The drought relief expenditure and the disaster relief capacity are provided by the Qujing Flood Control and Drought Relief Headquarters.

Data Collection and Preprocess
Data selection is the first step for drought risk assessment.Qujing is vulnerable to drought due to seasonal precipitation.It has suffered various levels of droughts (e.g., slight drought, moderate drought, serious drought, and extremely serious drought) from 2000 to 2014 [36].Hence, this study chooses Qujing from 2000 to 2014 as the research object.For demonstration purposes, the data of Qilin was shown in pressure, state, and response, respectively (See Tables 2-4).
After data collection, it is important to preprocess data and eliminate the abnormal samples.An abnormal sample is distant from other observations.It may indicate an experimental, sampling error or a novelty.It should be understood why the novelty appeared.The error samples will be discarded or replaced with a statistic that is robust to the outlier.In this study, the random forest was adopted to identify abnormal samples.To have a direct understanding of samples, the proximity matrix projection diagram of Qujing from 2000 to 2014 was drawn (see Figure 3).
From Figure 3, the drought data is divided into five levels and relatively clustered in every level.It implies that the abnormal samples are small, i.e., the quality of data is relatively high.However, there are still some samples distant from the corresponding population.These samples may have some problems.In this study, we adopted the outlier measure degrees to identify the abnormal samples.The diagram of outlier measure degrees was shown in Figure 4.After data collection, it is important to preprocess data and eliminate the abnormal samples.An abnormal sample is distant from other observations.It may indicate an experimental, sampling error or a novelty.It should be understood why the novelty appeared.The error samples will be discarded or replaced with a statistic that is robust to the outlier.In this study, the random forest was adopted to identify abnormal samples.To have a direct understanding of samples, the proximity matrix projection diagram of Qujing from 2000 to 2014 was drawn (see Figure 3).From Figure 3, the drought data is divided into five levels and relatively clustered in every level.It implies that the abnormal samples are small, i.e., the quality of data is relatively high.However, there are still some samples distant from the corresponding population.These samples may have some problems.In this study, we adopted the outlier measure degrees to identify the abnormal samples.The diagram of outlier measure degrees was shown in Figure 4.In Figure 4, the X-axis means the serial number of samples and the Y-axis stands for the deviation value between the sample and the mean of the corresponding class.From Figure 4, the majority of the outlier measure degree is below four, i.e., the samples with the outlier degrees more than four have a lower proximity to the other samples in the corresponding class.In this study, those abnormal samples would be eliminated (the samples are Qilin of 2010 and Fuyuan of 2010, respectively).The proximity matrix projection matrix without those abnormal samples is shown in Figure 5.

Optimization of RF Parameters
There are two hyper-parameters need to optimize when using the RF model.They are the number of decision trees ( ntree ) and the number of chosen indices ( mtry ) when splitting a node.The default value of ntree is 500 and the default value of mtry is the square root of the number of indices [29].Generally, the larger the number of ntree , the higher the predictive accuracy.However, the return is diminishing once ntree grows up to several hundred.To obtain the optimal value of ntree , we set =4 mtry , and test the OOB (out of bag) errors under different ntrees (see Figure 6).In Figure 4, the X-axis means the serial number of samples and the Y-axis stands for the deviation value between the sample and the mean of the corresponding class.From Figure 4, the majority of the outlier measure degree is below four, i.e., the samples with the outlier degrees more than four have a lower proximity to the other samples in the corresponding class.In this study, those abnormal samples would be eliminated (the samples are Qilin of 2010 and Fuyuan of 2010, respectively).The proximity matrix projection matrix without those abnormal samples is shown in Figure 5.In Figure 4, the X-axis means the serial number of samples and the Y-axis stands for the deviation value between the sample and the mean of the corresponding class.From Figure 4, the majority of the outlier measure degree is below four, i.e., the samples with the outlier degrees more than four have a lower proximity to the other samples in the corresponding class.In this study, those abnormal samples would be eliminated (the samples are Qilin of 2010 and Fuyuan of 2010, respectively).The proximity matrix projection matrix without those abnormal samples is shown in Figure 5.

Optimization of RF Parameters
There are two hyper-parameters need to optimize when using the RF model.They are the number of decision trees ( ntree ) and the number of chosen indices ( mtry ) when splitting a node.The default value of ntree is 500 and the default value of mtry is the square root of the number of indices [29].Generally, the larger the number of ntree , the higher the predictive accuracy.However, the return is diminishing once ntree grows up to several hundred.To obtain the optimal value of ntree , we set =4 mtry , and test the OOB (out of bag) errors under different ntrees (see Figure 6).

Optimization of RF Parameters
There are two hyper-parameters need to optimize when using the RF model.They are the number of decision trees (ntree) and the number of chosen indices (mtry) when splitting a node.The default value of ntree is 500 and the default value of mtry is the square root of the number of indices [29].Generally, the larger the number of ntree, the higher the predictive accuracy.However, the return is diminishing once ntree grows up to several hundred.To obtain the optimal value of ntree, we set mtry = 4, and test the OOB (out of bag) errors under different ntrees (see Figure 6).In Figure 6, the X-axis indicates the number of ntree ; the Y-axis represents the value of OOB error.The diagrams show the trend of OOB error under different ntrees .From the diagrams, up to 500 ntree = , the OOB error tends to stable, i.e., 500 ntree = is the optimization value.mtry is optimized by caret and - K fold cross-validation.Firstly, this study divided the training data into K subsets randomly, then trained the subsets K times to obtain K evaluation values, and eventually averaged the K evaluation values as the performance evaluation criterion.This study set 5 K = , the repeat times 10 m = .In Figure 7, the X-axis is the value of mtry and the Y-axis is the accuracy.The higher the accuracy, the better the results.From Figure 7, when = 4 mtry , the accuracy is the highest.Hence, the best value of mtry is four.In Figure 6, the X-axis indicates the number of ntree; the Y-axis represents the value of OOB error.The diagrams show the trend of OOB error under different ntrees.From the diagrams, up to ntree = 500, the OOB error tends to stable, i.e., ntree = 500 is the optimization value.mtry is optimized by caret and K − f old cross-validation.Firstly, this study divided the training data into K subsets randomly, then trained the subsets K times to obtain K evaluation values, and eventually averaged the K evaluation values as the performance evaluation criterion.This study set K = 5, the repeat times m = 10.
In Figure 7, the X-axis is the value of mtry and the Y-axis is the accuracy.The higher the accuracy, the better the results.From Figure 7, when mtry = 4, the accuracy is the highest.Hence, the best value of mtry is four.In Figure 6, the X-axis indicates the number of ntree ; the Y-axis represents the value of OOB error.The diagrams show the trend of OOB error under different ntrees .From the diagrams, up to 500 ntree = , the OOB error tends to stable, i.e., 500 ntree = is the optimization value.mtry is optimized by caret and - K fold cross-validation.Firstly, this study divided the training data into K subsets randomly, then trained the subsets K times to obtain K evaluation values, and eventually averaged the K evaluation values as the performance evaluation criterion.This study set 5 K = , the repeat times 10 m = .In Figure 7, the X-axis is the value of mtry and the Y-axis is the accuracy.The higher the accuracy, the better the results.From Figure 7, when = 4 mtry , the accuracy is the highest.Hence, the best value of mtry is four.

Establishment of OCDIS
Measuring the importance of indices can help the decision makers gain insights into the importance of every index to the total drought risk.The RF model adopts MDA and MDG to measure the importance of indices.Generally, the larger value of MDA or MDG, the more important the index is.The results were shown in Figure 8.

Establishment of OCDIS
Measuring the importance of indices can help the decision makers gain insights into the importance of every index to the total drought risk.The RF model adopts MDA and MDG to measure the importance of indices.Generally, the larger value of MDA or MDG, the more important the index is.The results were shown in Figure 8.In Figure 8, the X-axis means the value of MDA and MDG and the Y-axis is the name of indices.The higher value of MDA or MDG is, the more important of the index is.From Figure 8, the reservoir storage capacity X15, precipitation anomaly percentage X2, soil moisture X1, and per capita annual income X6 are the four most important indices among the fifteen risk indices.That means that these indices contribute more to the total drought risk.The agricultural population density X14, annual average temperature X3, and disaster relief capacity X14 are less consequential to the total drought risk.The results also show that there is a minor difference between MDG and MDA.However, MDG is more stable than MDA and has a faster calculation speed [40].Thus, MDG was combined with AUC (area under the ROC curve) to establish the risk index system of OCDIS.
AUC evaluates the accuracy of the RF model by computing the OOB-AUC values under different numbers of variables.Firstly, the indices were sorted based on the MDG measurement results; secondly, a certain proportion of the indices were selected to calculate the OOB-AUC values; finally, the highest OOB-AUC value and the corresponding indices were obtained.In this study, we set 5 K = and the repeat times 10 m = .Figure 9 shows the relationship between OOB-AUC and the selected number of indices.In Figure 8, the X-axis means the value of MDA and MDG and the Y-axis is the name of indices.The higher value of MDA or MDG is, the more important of the index is.From Figure 8, the reservoir storage capacity X 15 , precipitation anomaly percentage X 2 , soil moisture X 1 , and per capita annual income X 6 are the four most important indices among the fifteen risk indices.That means that these indices contribute more to the total drought risk.The agricultural population density X 14 , annual average temperature X 3 , and disaster relief capacity X 14 are less consequential to the total drought risk.The results also show that there is a minor difference between MDG and MDA.However, MDG is more stable than MDA and has a faster calculation speed [40].Thus, MDG was combined with AUC (area under the ROC curve) to establish the risk index system of OCDIS.
AUC evaluates the accuracy of the RF model by computing the OOB-AUC values under different numbers of variables.Firstly, the indices were sorted based on the MDG measurement results; secondly, a certain proportion of the indices were selected to calculate the OOB-AUC values; finally, the highest OOB-AUC value and the corresponding indices were obtained.In this study, we set K = 5 and the repeat times m = 10. Figure 9 shows the relationship between OOB-AUC and the selected number of indices.In Figure 9, the X-axis is the number of chosen indices; the Y-axis is an OOB-AUC error.Figure 9 shows the trend of OOB-AUC error under different numbers of indices.From Figure 9, the OOB-AUC has the highest values when the selection number of indices is ten.It means the accuracy assessment results can be obtained by adopting ten indices.The importance of each index to the total drought risk is shown in Figure 10.In Figure 10, the X-axis is the name of indices and the Y-axis means the importance value of the corresponding index.From Figure 10, the importance of top 10 indices is over 5%, indicating they contribute significantly to drought risk.The total importance of top 10 indices is more than 80%, representing these indices play a decisive role in the final drought risk.Hence, we can assess the drought risk according to the top 10 indices.Thus, the optimized comprehensive drought index system (OCDIS) was developed, which includes soil moisture X1, precipitation anomaly percentage X2, agricultural output value proportion X5, per capita annual income X6, drought rate X9, irrigation rate of cultivated land X10, sown area ratio X11, irrigation facilities ratio X12, drought relief expenditure rate X13, and reservoir storage capacity X15.

Risk Assessment Based on SW-VFS and OCDIS
The ADR is divided into five grades, namely, lowest risk, lower risk, moderate risk, higher risk, and highest risk [42].The ranges and grade standards of OCDIS are determined by the historical data of Qujing from 2000 to 2014.For example, the average annual income and drought relief expenditure are estimated by the society and economic development; the sown area and agricultural output are calculated based on the development of agriculture and the climate change.
Based on Table 5, the attracting set ab I , the boundary set c d I , and the point value set In Figure 9, the X-axis is the number of chosen indices; the Y-axis is an OOB-AUC error.Figure 9 shows the trend of OOB-AUC error under different numbers of indices.From Figure 9, the OOB-AUC has the highest values when the selection number of indices is ten.It means the accuracy assessment results can be obtained by adopting ten indices.The importance of each index to the total drought risk is shown in Figure 10.In Figure 9, the X-axis is the number of chosen indices; the Y-axis is an OOB-AUC error.Figure 9 shows the trend of OOB-AUC error under different numbers of indices.From Figure 9, the OOB-AUC has the highest values when the selection number of indices is ten.It means the accuracy assessment results can be obtained by adopting ten indices.The importance of each index to the total drought risk is shown in Figure 10.In Figure 10, the X-axis is the name of indices and the Y-axis means the importance value of the corresponding index.From Figure 10, the importance of top 10 indices is over 5%, indicating they contribute significantly to drought risk.The total importance of top 10 indices is more than 80%, representing these indices play a decisive role in the final drought risk.Hence, we can assess the drought risk according to the top 10 indices.Thus, the optimized comprehensive drought index system (OCDIS) was developed, which includes soil moisture X1, precipitation anomaly percentage X2, agricultural output value proportion X5, per capita annual income X6, drought rate X9, irrigation rate of cultivated land X10, sown area ratio X11, irrigation facilities ratio X12, drought relief expenditure rate X13, and reservoir storage capacity X15.

Risk Assessment Based on SW-VFS and OCDIS
The ADR is divided into five grades, namely, lowest risk, lower risk, moderate risk, higher risk, and highest risk [42].The ranges and grade standards of OCDIS are determined by the historical data of Qujing from 2000 to 2014.For example, the average annual income and drought relief expenditure are estimated by the society and economic development; the sown area and agricultural output are calculated based on the development of agriculture and the climate change.
Based on Table 5, the attracting set ab I , the boundary set c d I , and the point value set In Figure 10, the X-axis is the name of indices and the Y-axis means the importance value of the corresponding index.From Figure 10, the importance of top 10 indices is over 5%, indicating they contribute significantly to drought risk.The total importance of top 10 indices is more than 80%, representing these indices play a decisive role in the final drought risk.Hence, we can assess the drought risk according to the top 10 indices.Thus, the optimized comprehensive drought index system (OCDIS) was developed, which includes soil moisture X 1 , precipitation anomaly percentage X 2 , agricultural output value proportion X 5 , per capita annual income X 6 , drought rate X 9 , irrigation rate of cultivated land X 10 , sown area ratio X 11 , irrigation facilities ratio X 12 , drought relief expenditure rate X 13 , and reservoir storage capacity X 15 .

Risk Assessment Based on SW-VFS and OCDIS
The ADR is divided into five grades, namely, lowest risk, lower risk, moderate risk, higher risk, and highest risk [42].The ranges and grade standards of OCDIS are determined by the historical data of Qujing from 2000 to 2014.For example, the average annual income and drought relief expenditure are estimated by the society and economic development; the sown area and agricultural output are calculated based on the development of agriculture and the climate change.
Based on Table 5, the attracting set I ab , the boundary set I cd , and the point value set M sh were obtained.The I cd was determined by the upper and lower values of the adjoin intervals, respectively.
The level I and V of M sh were determined by the left side and right side of the corresponding level of I ab , respectively.The other levels of M sh were determined by the middle values of the corresponding level of I ab .The weight of each index was determined by the synthetic-weight method.The weights are calculated as follows.En ω is the weight determined by entropy.RF ω is the weight decided by random forest.Syn ω is the combined weight that was calculated by Equation (10).From Table 6, on the basis of the En ω method, the weight of X 11 is the largest, suggesting that the difference in X 11 is larger than other indices.However, based on the results of RF ω , the weight of X 11 is only 0.06, which indicates the contribution of X 11 is small.Meanwhile, on the basis of RF ω , the weight of X 15 is the largest, though it is only 0.10 based on the En ω method.Thus, the Syn ω weight, which considers the difference between data and the contributions to results is more reasonable.Then, the normalized integrated relative membership degree, and the grade characteristic values were calculated according to Equations (11) and (12).The risk values of agricultural drought risk (ADR) were obtained according to the rounding-off method.Set the optimization parameters α = 1, p = 2.The evaluation results have been achieved in terms of pressure, state, and response, respectively.
The abnormal precipitation, soil moisture, and agricultural economic proportion are the primary inducers of pressure risk.From Figure 11, Qilin and Fuyuan have lower pressure risk while Luliang and Malong have higher pressure risk.The pressure risk is around three in most counties, i.e., the integrated pressure risk of Qujing is moderate.Qujing is one of the major grain-producing cities in Yunnan province.The agricultural output proportion of many counties occupies more than 30% of the GDP.Agriculture is sensitive to precipitation and soil moisture.According to historical data, the soil moisture and precipitation anomaly percentage decrease obviously when drought disasters occur.In 2011's extremely serious drought, the precipitation of many counties was lower than 30% of the normal level.The low precipitation and low soil moisture resulted in high pressure risk.The pressure risk made the state system abnormal, and then indirectly affected the actions taken by the response system.Recently, with the overexploitation of land resources, the soil water storage capacity decreased obviously.Thus, to reduce the pressure risk, the local government needs to formulate reasonable land use planning and pay attention to the proportion of ecological land.Meanwhile, promoting the development of service industry and advanced manufacturing industry will also reduce the pressure risk.
promoting the development of service industry and advanced manufacturing industry will also reduce the pressure risk.The state of ADR indicates the vulnerability of the ecosystem and the exposure of property.Generally, the higher the risk of state, the more losses caused when encountering droughts.From Figure 12, the state risk in most areas is lower than three, illustrating the state risk in Qujing is under control.However, the state risk of Xuanwei and Luliang are more than three, which indicate that the potential drought risk is relatively high in the two counties.According to statistical data, Luliang and Xuanwei have lower irrigation proportion and more than 15% of sown area ratio.That means the agricultural production in these two counties is highly reliant on precipitation.Once the precipitation is abnormal, the crop will be affected.The pressure risk and state risk of Luliang are more than three, representing the agricultural drought resistant ability of Luliang is relatively low.Though the per capita annual income increases year by year, the growth rate slows down during the drought-affected years.Meanwhile, affected by abnormal precipitation and lower soil moisture, the drought rate is increased in the drought-affected years.To reduce state risk, the local government can adopt watersaving measures and increase the drought prevention investment.The response risk is determined by the investment of drought relief, irrigation facilities ratio, and reservoir storage capacity.From Figure 13, the response risk of Luoping is 1.27 times than Qilin.It indicates that Qilin has more effective measures to mitigate the agricultural drought risk than Luoping.In most counties, the response risk is more than three, indicating that the integrated response risk of Qujing is relatively high.According to statistical data, the irrigation facilities ratio in The state of ADR indicates the vulnerability of the ecosystem and the exposure of property.Generally, the higher the risk of state, the more losses caused when encountering droughts.From Figure 12, the state risk in most areas is lower than three, illustrating the state risk in Qujing is under control.However, the state risk of Xuanwei and Luliang are more than three, which indicate that the potential drought risk is relatively high in the two counties.According to statistical data, Luliang and Xuanwei have lower irrigation proportion and more than 15% of sown area ratio.That means the agricultural production in these two counties is highly reliant on precipitation.Once the precipitation is abnormal, the crop will be affected.The pressure risk and state risk of Luliang are more than three, representing the agricultural drought resistant ability of Luliang is relatively low.Though the per capita annual income increases year by year, the growth rate slows down during the drought-affected years.Meanwhile, affected by abnormal precipitation and lower soil moisture, the drought rate is increased in the drought-affected years.To reduce state risk, the local government can adopt water-saving measures and increase the drought prevention investment.The state of ADR indicates the vulnerability of the ecosystem and the exposure of property.Generally, the higher the risk of state, the more losses caused when encountering droughts.From Figure 12, the state risk in most areas is lower than three, illustrating the state risk in Qujing is under control.However, the state risk of Xuanwei and Luliang are more than three, which indicate that the potential drought risk is relatively high in the two counties.According to statistical data, Luliang and Xuanwei have lower irrigation proportion and more than 15% of sown area ratio.That means the agricultural production in these two counties is highly reliant on precipitation.Once the precipitation is abnormal, the crop will be affected.The pressure risk and state risk of Luliang are more than three, representing the agricultural drought resistant ability of Luliang is relatively low.Though the per capita annual income increases year by year, the growth rate slows down during the drought-affected years.Meanwhile, affected by abnormal precipitation and lower soil moisture, the drought rate is increased in the drought-affected years.To reduce state risk, the local government can adopt watersaving measures and increase the drought prevention investment.The response risk is determined by the investment of drought relief, irrigation facilities ratio, and reservoir storage capacity.From Figure 13, the response risk of Luoping is 1.27 times than Qilin.It indicates that Qilin has more effective measures to mitigate the agricultural drought risk than Luoping.In most counties, the response risk is more than three, indicating that the integrated response risk of Qujing is relatively high.According to statistical data, the irrigation facilities ratio in The response risk is determined by the investment of drought relief, irrigation facilities ratio, and reservoir storage capacity.From Figure 13, the response risk of Luoping is 1.27 times than Qilin.It indicates that Qilin has more effective measures to mitigate the agricultural drought risk than Luoping.In most counties, the response risk is more than three, indicating that the integrated response risk of Qujing is relatively high.According to statistical data, the irrigation facilities ratio in Huize, Fuyuan, Malong, and Luoping is smaller than other counties.In other words, if abnormal precipitation occurs, the irrigation of farmland in those counties will be affected.Although the reservoir storage capacity is over 75% in normal years, it decreases significantly in drought-affected years.For example, in 2011, the reservoir storage capacity was lower than 30% in most counties.The decrease of reservoir storage capacity results in the shortage of water resource and high response risk.Huize, Fuyuan, Malong, and Luoping is smaller than other counties.In other words, if abnormal precipitation occurs, the irrigation of farmland in those counties will be affected.Although the reservoir storage capacity is over 75% in normal years, it decreases significantly in drought-affected years.For example, in 2011, the reservoir storage capacity was lower than 30% in most counties.The decrease of reservoir storage capacity results in the shortage of water resource and high response risk.In conclusion, the agricultural drought risk is the comprehensive results of pressure, state, and response.The precipitation is one of the primary sources of water resource, which affects the soil moisture and water supply directly.The high agricultural output proportion and low drought relief investment result in high pressure risk and response risk.According to Figures 11-13, the pressure risk and response risk of Malong are relatively higher than other counties.The pressure risk of Fuyuan is lower than other counties while the response risk is higher than other places.The pressure risk, state risk, and response risk of Xuanwei are around three, indicating it will have an increasing agricultural drought risk in the future if no countermeasures are taken.
To make the results more reliable and scientific, this paper changed the parameters a and p , and then averaged them as the assessment results.The integrated ADR assessment results of Qujing is shown in Table 7.In conclusion, the agricultural drought risk is the comprehensive results of pressure, state, and response.The precipitation is one of the primary sources of water resource, which affects the soil moisture and water supply directly.The high agricultural output proportion and low drought relief investment result in high pressure risk and response risk.According to Figures 11-13, the pressure risk and response risk of Malong are relatively higher than other counties.The pressure risk of Fuyuan is lower than other counties while the response risk is higher than other places.The pressure risk, state risk, and response risk of Xuanwei are around three, indicating it will have an increasing agricultural drought risk in the future if no countermeasures are taken.
To make the results more reliable and scientific, this paper changed the parameters α and p, and then averaged them as the assessment results.The integrated ADR assessment results of Qujing is shown in Table 7.The risk values are the quantitative expression of the possible consequence of drought disasters during a period.According to the rounding-off method, in this study, we computed the risk levels based on the average values of different parameters combination.II and III in  According to Figure 14, the pressure risk in the east-central, the state risk in Midwest and south, the response risk in Central, are lower than other regions, respectively.The integrated risk in most counties of Qujing is moderate, indicating Qujing is prone to drought.From the distribution, the pressure risk, response risk, and integrated risk of Qilin are lower than other regions.Therefore, the drought prevention and relief measures should have regional applicability.To the lower state risk, but high pressure and response risk areas, the decision makers need to pay more attention to abnormally high temperature and precipitation.It is better to increase the ratio of electromechanical drainage and irrigation, and improve the capacity of reservoir storage in these regions.Due to most places of Qujing experiencing moderate integrated drought level, the government needs to increase efforts for effective drought mitigation, like adjusting the crop-planting structure and improving irrigation and water-saving technology.

The Countermeasures of Drought Relief
According to the risk zoning maps, the integrated agricultural drought risk level in the most regions of Qujing is moderate.However, the risk levels in the three subsystems have regional characteristics.Therefore, each region should formulate drought prevention and resistance countermeasures according to the actual situation based on the analysis results of the three subsystems.According to Figure 14, the pressure risk in the east-central, the state risk in Midwest and south, the response risk in Central, are lower than other regions, respectively.The integrated risk in most counties of Qujing is moderate, indicating Qujing is prone to drought.From the distribution, the pressure risk, response risk, and integrated risk of Qilin are lower than other regions.Therefore, the drought prevention and relief measures should have regional applicability.To the lower state risk, but high pressure and response risk areas, the decision makers need to pay more attention to abnormally high temperature and precipitation.It is better to increase the ratio of electromechanical drainage and irrigation, and improve the capacity of reservoir storage in these regions.Due to most places of Qujing experiencing moderate integrated drought level, the government needs to increase efforts for effective drought mitigation, like adjusting the crop-planting structure and improving irrigation and water-saving technology.

The Countermeasures of Drought Relief
According to the risk zoning maps, the integrated agricultural drought risk level in the most regions of Qujing is moderate.However, the risk levels in the three subsystems have regional characteristics.Therefore, each region should formulate drought prevention and resistance countermeasures according to the actual situation based on the analysis results of the three subsystems.
In the pressure aspect, the risk of many counties is moderate.The pressure risk mainly depends on soil moisture, precipitation, and agricultural output proportion.Generally, the low precipitation easily causes low soil moisture, then affects the growth of crop and results in high-pressure risk.Though the annual precipitation is sufficient in most area of Qujing, the temporal distribution is uneven, which resulting in seasonal drought.Therefore, it is essential to make full use of the wet season precipitation and improve the soil water storage capacity in the high-pressure risk regions.The agricultural output proportion of Malong, Luliang, and Zhanyi is above 30%, indicating that those counties are vulnerable to droughts.Therefore, to reduce the pressure risk, the local government need to formulate reasonable land use planning, pay attention to the proportion of crop-planting and ecological land, and promote the development of the service industry and advanced manufacture industry.
In the state aspect, although the pressure risk of Fuyuan and Qilin is low, the state risk for them is relatively high.The drought-resistant ability and the irrigation ratio are low in these two counties.The decision-makers need to improve the drought prevention and resistance capability to reduce the state risk.The state risk of Luliang and Xuanwei is more than three.According to historical data, the sown area ratios are more than 15%, but the irrigation proportions are relatively low in the two counties.Thus, it is essential to strengthen the research of irrigation management mechanism and encourage farmers to adopt advanced irrigation methods, such as drip irrigation, sprinkler irrigation, and so on.Meanwhile, planting drought-tolerant crops and raising the annual income of farmers through multiple channels will also reduce state risk.
In the response aspect, the risk in most counties is moderate, indicating the drought prevention and resistance investment is insufficient in most counties.According to the statistical data, the irrigation facilities ratios of Luoping, Fuyuan, and Huize are lower than 10%.The government agencies in those counties need to increase the investment in water conservancy infrastructure construction to reduce response risk.Meanwhile, adopting remote sensing, deep learning, and other technologies to monitor the dynamic situation of crops will help farmers change passive drought resistance to proactive drought management.The measures will also effetely reduce the total drought risk.

Conclusions
Agricultural drought risk (ADR) is affected by many factors, e.g., precipitation, soil moisture, and reservoir storage capacity.However, the contribution of each factor to the total drought risk is different.The random forest (RF) is a data-driven model that can measure the importance of each factor to the results.The pressure-state-response (PSR) model can reflect the relationship between various factors.Therefore, in this study, an optimized comprehensive index system (OCDIS) was developed based on PSR and RF models.The PSR model was adopted to construct the original index system of agricultural drought risk, i.e., the input variables of RF; the RF model was used to measure the importance of each index, and to develop the OCDIS.Due to the randomness, fuzziness, and uncertainty of agricultural drought risk assessment, the synthetic-weight variable fuzzy set (SW-VFS) model was proposed to assess the pressure, state, response, and integrated risk of agricultural drought based on OCDIS.The weight of SW-VFS model was determined by entropy and random forest.It can reflect the difference between data and the contribution of each index to the risk results.
The agricultural drought risk index system OCDIS and SW-VFS model were applied in Qujing.
The results showed that of the 10 indices included in the OCDIS, the four most important indices for agricultural drought risk management are reservoir storage capacity, precipitation anomaly percentage, soil moisture, and per capita annual income.Thus, the local government can take targeted measures to reduce agricultural drought risk.For example, increasing conservancy investment and establishing an agricultural irrigation system which includes large, medium, and small reservoirs to improve the reservoir storage capacity; protecting forests, vegetation, and wetlands to increase the water storage capacity of soil; formulating agricultural insurance and planting drought-tolerance crops to reduce agricultural drought risk and increase farmers' income.Then, the SW-VFS model was adopted to assess the pressure, state, response, and integrated risk of agricultural drought in Qujing.

Figure 1 .
Figure 1.The geographic location of Qujing in Yunnan Province, China.

Figure 1 .
Figure 1.The geographic location of Qujing in Yunnan Province, China.

( 2 )
Preprocess the data and visualize them.(3) Optimize the parameters of random forest.(4) Measure the importance of indices and establish the OCDIS based on random forest model.(5) Optimize the weight of variable fuzzy set model and propose the SW-VFS model.(6) Calculate the pressure, state, response, and integrated risk of agricultural drought based on SW-VFS model and OCDIS.The flow of agricultural drought risk based on OCDIS and SW-VFS was shown in Figure 2.

( 2 )
Preprocess the data and visualize them.(3) Optimize the parameters of random forest.(4) Measure the importance of indices and establish the OCDIS based on random forest model.(5) Optimize the weight of variable fuzzy set model and propose the SW-VFS model.(6) Calculate the pressure, state, response, and integrated risk of agricultural drought based on SW-VFS model and OCDIS.The flow of agricultural drought risk based on OCDIS and SW-VFS was shown in Figure 2.

Figure 2 .
Figure 2. The flow diagram of agricultural drought risk (ADR) assessment.

Figure 2 .
Figure 2. The flow diagram of agricultural drought risk (ADR) assessment.

Figure 4 .
Figure 4.The diagnosis of abnormal samples.

Figure 4 .
Figure 4.The diagnosis of abnormal samples.

Figure 4 .
Figure 4.The diagnosis of abnormal samples.

Figure 7 .
Figure 7.The best value of mtry .

Figure 7 .
Figure 7.The best value of mtry .

Figure 7 .
Figure 7.The best value of mtry.

Figure 8 .
Figure 8.The importance measure of indices based on mean decrease in accuracy (MDA) and MDG.

Figure 8 .
Figure 8.The importance measure of indices based on mean decrease in accuracy (MDA) and MDG.

Figure 10 .
Figure 10.The importance of each index to the total drought risk.

Figure 10 .
Figure 10.The importance of each index to the total drought risk.

Figure 10 .
Figure 10.The importance of each index to the total drought risk.

Figure 11 .
Figure 11.The pressure assessment of Qujing from 2000 to 2014.

Figure 12 .
Figure 12.The state assessment of Qujing from 2000 to 2014.

Figure 11 .
Figure 11.The pressure assessment of Qujing from 2000 to 2014.

Sustainability 2018 ,
10, x FOR PEER REVIEW 17 of 23promoting the development of service industry and advanced manufacturing industry will also reduce the pressure risk.

Figure 11 .
Figure 11.The pressure assessment of Qujing from 2000 to 2014.

Figure 12 .
Figure 12.The state assessment of Qujing from 2000 to 2014.

Figure 12 .
Figure 12.The state assessment of Qujing from 2000 to 2014.

Figure 13 .
Figure 13.The response assessment of Qujing from 2000 to 2014.

Figure 13 .
Figure 13.The response assessment of Qujing from 2000 to 2014.

Figure 14 .
Figure 14.The pressure, state, response, and integrated drought risk zoning maps in Qujing.

Figure 14 .
Figure 14.The pressure, state, response, and integrated drought risk zoning maps in Qujing.

Table 1 .
Original index system of ADR assessment.

Table 2 .
The data of the pressure subsystem.

Table 3 .
The data of the state subsystem.

Table 4 .
The data of the response subsystem.

Table 4 .
The data of the response subsystem.

Table 6 .
The weight of OCDIS.

Table 7 .
Results of ADR assessment of Qujing from 2000 to 2014.The risk values are the quantitative expression of the possible consequence of drought disasters during a period.According to the rounding-off method, in this study, we computed the risk levels based on the average values of different parameters combination.II and III in Table7represent the lower risk and moderate risk of agricultural drought, which corresponds to Table5.From Table7, the agricultural drought risk (ADR) of most areas in Quijng from 2000 to 2014 is moderate.That

Table 7 .
Results of ADR assessment of Qujing from 2000 to 2014.
Table 7 represent the lower risk and moderate risk of agricultural drought, which corresponds to Table 5.From Table 7, the agricultural drought risk (ADR) of most areas in Quijng from 2000 to 2014 is moderate.That means Qujing is prone to drought.The integrated ADR in Qilin, Shizong, Zhanyi, and Fuyuan are low, while in Xuanwei, Luliang, Luoping, Malong, and Huize are high.Based on ArcGIS technology, the pressure, state, response, and integrated ADR zoning maps in Qujing are shown in Figure14.Qujing is prone to drought.The integrated ADR in Qilin, Shizong, Zhanyi, and Fuyuan are low, while in Xuanwei, Luliang, Luoping, Malong, and Huize are high.Based on ArcGIS technology, the pressure, state, response, and integrated ADR zoning maps in Qujing are shown in Figure14. means