Evaluating and Predicting Urban Greenness for Sustainable Environmental Development
Abstract
1. Introduction
- Integrate a new set of green cities indicators by referring to green city indicators in the literature and apply them to predict whether a city is green or not.
- Compare different prediction methods to learn the strengths and weaknesses of the prediction models.
- Recognize the correlation between the indicators and the level of a green city based on the XGBoost and Logistic Regression. The aim of the results is for the government to understand the crucial factors and to improve the greenness level of cities.
2. Literature Review
2.1. Research on Green Cities
2.2. Indicators of a Green City
2.3. Prediction Model
3. Solution Approach
3.1. Data Pre-Process
3.2. Evaluation of Green City Indicators
- In the transportation category, railway passenger volume, defined as the number of passengers transported by rail multiplied by kilometers traveled, serves as an effective proxy for sustainable urban mobility. High volumes reflect efficient public transit infrastructure, reduced dependency on private vehicles, and lower per capita emissions—all of which align with green city evaluation.
- For water resources, renewable rainwater and river resources are included, as mentioned in the Sustainable Development Goals and the Green Growth Index, using per capita renewable freshwater resources as the standard. The availability and sustainable management of renewable rainwater and river resources reflect a city’s capacity for self-sufficient, low-impact water usage. This criterion supports climate adaptation, reduces carbon footprints related to water supply, and enhances ecological health—key attributes of a green city.
- Finally, due to the increasing severity of climate change in recent years, the European Green Deal and Sustainable Development Goals have also proposed related policies and plans as indicators for climate change measures. Therefore, the degree of disaster risk reduction policies is added as an indicator to assess whether the strategies developed by countries to reduce disaster risk can truly mitigate risks, which is considered a policy-related category in the prediction. The degree of disaster risk reduction (DRR) policy implementation reflects a city’s ability to manage environmental hazards and adapt to climate change. Integrating DRR as a criterion for evaluating green cities underscores the importance of resilience, sustainability, and equitable protection of urban populations and ecosystems.
3.3. XGBoost
3.4. Evaluation Criteria
- Accuracy represents the ratio of correctly predicted cases to the total number of cases. It is not suitable when the proportion of actual cases representing true events is low. The formula is as follows:
- Recall represents the proportion of actual true cases that are correctly predicted as true. This metric is primarily used when the focus is on correctly identifying all true cases, ensuring that no cases are missed. Industries like healthcare often emphasize this metric. The formula is as follows:
- Precision is the proportion of correctly predicted true cases among all cases predicted as true. It is often used when the focus is on the accuracy of positive predictions, such as in situations where the cost of false positives is high. The formula is as follows:
- The F1-score is a combined metric of Precision and Recall. The formula is as follows, where is an adjustable parameter. Ifis 0.5, Precision is more important than Recall; if β is 2, the opposite is true. Ifis 1, Precision and Recall are equally important.
4. Case Study
4.1. Dataset
4.2. Pre-Process
4.3. Results of Prediction
- XGBoost
- 2.
- Logic Regression
4.4. Comparison and Discussion
- XGBt achieved the highest Recall (0.80), indicating its strong ability to correctly identify true positive cases (i.e., correctly classifying green cities). Its F1-score (0.7442) and Accuracy (0.7843) are also the highest among all models, suggesting a good balance between precision and recall as well as overall predictive power. These results indicate that XGBt balances sensitivity (Recall) and Precision effectively, making it the most robust model for predicting green cities.
- The Neural Network model also performs robustly, with an F1-score of 0.71 and a Recall of 0.75, indicating it is effective at identifying green cities while maintaining reasonable precision. The SVC model shows balanced performance as well, slightly outperforming RF in F1-score and Recall.
- In contrast, Logistic Regression shows the weakest performance, with the lowest Recall (0.40), F1-score (0.48), and Accuracy (64.7%). This suggests that while LR provides a simple and interpretable model, it may not capture the complexity of relationships among indicators as well as the ensemble and nonlinear models.
- The XGBl model, while having a lower Recall (0.50), demonstrated the highest Precision (0.83), meaning it is more conservative but reliable when identifying a city as green.
- Overall, tree-based models (XGBt and Random Forest) and Neural Networks show superior predictive capabilities, likely due to their ability to model complex, nonlinear relationships and feature interactions. These findings support the use of advanced machine learning methods—particularly gradient boosting trees—for predicting urban greenness with higher reliability.
- By cross-referencing the feature importance results from XGBoost with the variable correlations from Logistic Regression, each variable can be determined as positively or negatively correlated.
- XGBoost with gbtree emerged as the most effective model, offering a strong balance of Accuracy, Precision, and Recall. It shows superior performance in predicting urban greenness. This finding aligns with the case study conducted by Chen et al. [34]. Similar results have been reported in other domains. For example, in a study on veneer quality prediction in plywood manufacturing, XGBoost achieved comparable Accuracy and Recall to RF and outperformed SVM [71]. In cardiovascular disease prediction using ensemble learning, XGBoost demonstrated higher Precision than RF, SVM, LR, and NN [72]. Additionally, studies on urban forest carbon estimation [73], cropland SOC prediction [74], and a review by Nguyen and Saha [75]—which found that XGBoost outperformed other models, including NN, in 75% of comparisons—further support its superior performance.
- Based on the model results, it is recommended that for predicting green cities, the tree-based XGBoost method should be used to train the model for predicting whether a city is green. However, it is also suggested to use a regression model to understand the correlation coefficients between various indicators. The importance of the indicators and how to improve them to achieve a green city can be determined.
- The results of the XGBoost model reveal that Greenhouse Gas Emissions per Person and Transport Emissions are the most influential indicators (Figure 1). These findings are consistent with the existing literature. For instance, [76] recognizes Greenhouse Gas Emissions per Person as a widely adopted benchmark in evaluating national and urban sustainability. Similarly, the United Nations Human Settlements Programme [77] emphasizes this indicator as central to assessing urban sustainability. In addition, Creutzig et al. [78] identified emissions from the transport sector as a major barrier to green development, while the European Environment Agency [18] employed Transport Emissions as a key metric in evaluating urban environmental strategies.
- In a study by [15], which examined correlations between sector-specific green performance and overall green city performance, sanitation and air quality were found to have the highest correlation coefficients. Our results align with this, showing relatively stronger correlations for air-quality-related indicators. In both their findings and ours, energy-related indicators exhibited lower correlation with overall green city classification, particularly in the category of CO2 and energy usage.
- While green initiatives are commonly associated with both energy conservation and carbon reduction, our findings suggest that carbon reduction plays a more critical role. This is reflected in the ranking of top indicators, where Greenhouse Gas Emissions per Person and Transport Emissions per Capita emerge as the two most significant variables, followed by energy-related indicators. The prioritization of carbon metrics over energy consumption implies that cities aiming for green status should emphasize decarbonization strategies.
- Policy-related indicators, such as those measuring disaster risk reduction (DRR), are inherently more difficult to quantify and interpret. Nonetheless, UNDRR [79] provides national- and urban-level scoring frameworks for assessing DRR strategies and benchmarking urban resilience. Likewise, Cardona et al. [80] incorporate DRR strategy implementation as a core metric of sustainability and resilience in urban systems.
- For cities confirmed to be green, quartiles and mean values were calculated (Table 10), and these were used to derive empirical thresholds (Table 11). These thresholds include the following: greenhouse gas emissions of 7.35, carbon dioxide emissions of 11.23, transportation emissions of 1.58, a railway passenger volume of 305,825,000, renewable energy generation of 410.75, primary energy consumption of 42,257.25, proportion of renewable energy of 5.8775, green area of 13,448,250, agricultural land of 7,860,104.5, proportion of clean water resources of 94.1025, renewable inland freshwater of 9970.95, and a disaster mitigation policy level of 0.443. These values can serve as preliminary benchmarks for classifying cities as green or non-green based on objective indicator thresholds. To achieve green city status, it is recommended that governments ensure certain environmental indicators meet or remain below specific thresholds. Among these, per capita greenhouse gas (GHG) emissions is the most critical. Analysis of actual green cities shows that the third quartile value for this indicator is 7.35. Using this value as a threshold yields the highest classification accuracy in identifying green cities. Therefore, it can serve as a practical benchmark. For instance, if a city is predicted—and confirmed—to be non-green, its per capita GHG emissions can be evaluated. If the value exceeds 7.35, policymakers can be advised to prioritize emission reduction strategies. By focusing on this key indicator, cities can more effectively progress toward meeting green city criteria.
5. Conclusions
- Few studies were developed to evaluate and predict urban greenness, since the amount of data in the dataset is insufficient. Exploring more and consistent data for machine learning is emergent. In the future, larger and more varied datasets are used to make the results stronger and more general.
- The current predictive model still has errors. Future work should explore how to adjust the model or whether other machine learning methods can yield better results.
- Evaluate if there are any missing indicators that should be considered to ensure comprehensive assessment.
- Seek more complete data and further subdivide indicators to enhance accuracy. This will help governments establish clearer standards for achieving green city status.
- Establish separate standards for developing and developed countries. The capacity for investment and the level of infrastructure development vary significantly between these groups, making short-term improvements challenging. Thus, future work should focus on developing green city indicators tailored for developing countries or non-EU governmental authorities.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Ma, T.; Wang, S. Green Innovation under the Constraint of Economic Growth Targets: Evidence from Prefecture Level Cities in China. Processes 2023, 11, 1197. [Google Scholar] [CrossRef]
- Zhang, W.; Wu, C.; Zhong, H.; Li, Y.; Wang, L. Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization. Geosci. Front. 2021, 12, 469–477. [Google Scholar] [CrossRef]
- Lucchi, E.; Buda, A. Urban green rating systems: Insights for balancing sustainable principles and heritage conservation for neighbourhood and cities renovation planning. Renew. Sustain. Energy Rev. 2022, 161, 112324. [Google Scholar] [CrossRef]
- Liu, F.P. A Planning Method for City Logistics Considering Carbon Emission and Time Windows. Master Thesis, National Central University, Taoyuan City, Taiwan.
- Freytag, T.; Gössling, S.; Mössner, S. Living the green city: Freiburg’s Solarsiedlung between narratives and practices of sustainable urban development. Local Environ. 2014, 19, 644–659. [Google Scholar] [CrossRef]
- Pham, T.T.H.; Lynch, M.; Turner, S. Creative counter-discourses to the “green city” narrative: Practices of small-scale urban agriculture in Hanoi, Vietnam. Local Environ. 2023, 28, 169–188. [Google Scholar] [CrossRef]
- Qiang, G.; Tang, S.; Hao, J.; Di Sarno, L.; Wu, G.; Ren, S. Building automation systems for energy and comfort management in green buildings: A critical review and future directions. Renew. Sustain. Energy Rev. 2023, 179, 113301. [Google Scholar] [CrossRef]
- UNEP U. Towards A Green Economy: Pathways to Sustainable Development and Poverty Eradication; UNEP: Nairobi, Kenya, 2011. [Google Scholar]
- Xia, W.; Liu, B.; Xiang, H. Prediction of Liquid Accumulation Height in Gas Well Tubing Using Integration of Crayfish Optimization Algorithm and XGBoost. Processes 2024, 12, 1788. [Google Scholar] [CrossRef]
- Suorsa, A.; Multas, A.M.; Rönkkö, E.; Juuti, E.; Lammi, A.; Enwald, H. Information practices in multi-professional work in urban planning. Inf. Res. Int. Electron. J. 2024, 29, 557–572. [Google Scholar] [CrossRef]
- Pace, R.; Churkina, G.; Rivera, M.; Grote, R. How Green is A “Green City”? A Review of Existing Indicators; Karlsruhe Institute of Technology: Karlsruhe, Germany, 2016; Available online: https://publikationen.bibliothek.kit.edu/1000064236 (accessed on 1 August 2025).
- Siemens, A.G. European Green City Index. Assessing the Environmental Impact of Europe’s Major Cities, a Research Project Conducted by the Economist Intelligence Unit, Sponsored by Siemens, Munich. Sponsored by Siemens. 2009. Available online: https://assets.new.siemens.com/siemens/assets/api/uuid:cf26889b-3254-4dcb-bc50-fef7e99cb3c7/gci-report-summary.pdfm (accessed on 1 August 2025).
- Ranaboldo, M.; Aragüés-Peñalba, M.; Arica, E.; Bade, A.; Bullich-Massagué, E.; Burgio, A.; Tani, R. A comprehensive overview of industrial demand response status in Europe. Renew. Sustain. Energy Rev. 2024, 203, 114797. [Google Scholar] [CrossRef]
- Du, X.; Khan, M.N.; Thakur, G.C. Machine Learning in Carbon Capture, Utilization, Storage, and Transportation: A Review of Applications in Greenhouse Gas Emissions Reduction. Processes 2025, 13, 1160. [Google Scholar] [CrossRef]
- Brilhante, O.; Klaas, J. Green city concept and a method to measure green city performance over time applied to fifty cities globally: Influence of GDP, population size and energy efficiency. Sustainability 2018, 10, 2031. [Google Scholar] [CrossRef]
- Kahn, M.E. Green Cities: Urban Growth and the Environment; Brookings Institution Press: Washington, DC, USA, 2007. [Google Scholar]
- Sharifi, A. Co-benefits and synergies between urban climate change mitigation and adaptation measures: A literature review. Sci. Total Environ. 2021, 750, 141642. [Google Scholar] [CrossRef] [PubMed]
- European Environment Agency (EEA). Greenhouse Gas Emissions from Transport in Europe. 2020. Available online: https://www.eea.europa.eu/ims/greenhouse-gas-emissions-from-transport (accessed on 1 July 2025).
- Olivier, J.G.J.; Muntean, M. Trends in Global CO2 Emissions: 2014 Report; PBL Netherlands Environmental Assessment Agency and Institute for Environment and Sustainability of the European Commission’s Joint Research Centre: The Hague, The Netherlands, 2014; Available online: https://www.pbl.nl/en/publications/trends-in-global-co2-emissions-2014-report (accessed on 1 August 2025).
- Santos, G. Road transport and CO2 emissions: What are the challenges? Transp. Policy 2017, 59, 71–74. [Google Scholar] [CrossRef]
- Hannan, M.A.; Begum, R.A.; Al-Shetwi, A.Q.; Ker, P.J.; Al Mamun, M.A.; Hussain, A.; Basri, H.; Mahlia, T.M.I. Waste collection route optimisation model for linking cost saving and emission reduction to achieve sustainable development goals. Sustain. Cities Soc. 2020, 62, 102393. [Google Scholar] [CrossRef]
- Kazancoglu, Y.; Ozbiltekin-Pala, M.; Ozkan-Ozen, Y.D. Prediction and evaluation of greenhouse gas emissions for sustainable road transport within Europe. Sustain. Cities Soc. 2021, 70, 102924. [Google Scholar] [CrossRef]
- Economist Intelligence Unit. The Green City Index: A Summary of the Green City Index Research Series; Economist Intelligence Unit: Munich, Germany, 2012. [Google Scholar]
- Venkatesh, G. A critique of the European green city index. J. Environ. Plan. Manag. 2014, 57, 317–328. [Google Scholar] [CrossRef]
- Wang, X.; Wang, N.; Liu, X.; Shi, R. Energy-saving analysis for the Modern Wing of the Art Institute of Chicago and green city strategies. Renew. Sustain. Energy Rev. 2017, 73, 714–729. [Google Scholar] [CrossRef]
- URL-GS. Available online: https://zero.co.uk/greenscore#:~:text=GreenScore%C2%AE%20is%20a%20number,lower%20impact%20on%20the%20planet (accessed on 1 July 2025).
- Miola, A.; Paccagnan, V.; Papadimitriou, E.; Mandrici, A. Climate resilient development index: Theoretical framework, selection criteria and fit-for-purpose indicators. In JRC Science and Policy Reports; Publications Office of the European Union: Luxembourg, 2015. [Google Scholar] [CrossRef]
- Renthlei, E.; George, A. Toward holistic neighbourhood sustainability assessment: Integrating fuzzy Delphi method for sustainable indicator selection for Aizawl city. Local Environ. 2024, 30, 622–641. [Google Scholar] [CrossRef]
- Mcalpine, P.; Birnie, A. Is there a correct way of establishing sustainability indicators? The case of sustainability indicator development on the Island of Guernsey. Local Environ. 2005, 10, 243–257. [Google Scholar] [CrossRef]
- Krank, S.; Wallbaum, H.; Grêt-Regamey, A. Constraints to implementation of sustainability indicator systems in five Asian cities. Local Environ. 2010, 15, 731–742. [Google Scholar] [CrossRef]
- Mihi, A.; Ghazela, R.; Wissal, D. Mapping potential desertification-prone areas in North-Eastern Algeria using logistic regression model, GIS, and remote sensing techniques. Environ. Earth Sci. 2022, 81, 385. [Google Scholar] [CrossRef]
- Charbuty, B.; Abdulazeez, A. Classification based on decision tree algorithm for machine learning. J. Appl. Sci. Technol. Trends 2021, 2, 20–28. [Google Scholar] [CrossRef]
- Chen, T.; He, T. xgboost: Extreme Gradient Boosting. R Package Version 0.4-2. 2015. Available online: https://cran.r-project.org/web/packages/xgboost/index.html (accessed on 1 August 2025).
- Chen, Z.; Jiang, F.; Cheng, Y.; Gu, X.; Liu, W.; Peng, J. XGBoost classifier for DDoS attack detection and analysis in SDN-based cloud. In Proceedings of the 2018 IEEE International Conference on Big Data and Smart Computing (Bigcomp), Shanghai, China, 15–18 January 2018; IEEE: New York, NY, USA, 2018; pp. 251–256. [Google Scholar] [CrossRef]
- Zhao, Y.; Li, Y.; Li, J.; Wang, Y. Comparative study of XGBoost and other machine learning models for diabetes prediction. Healthcare 2021, 9, 1748. [Google Scholar] [CrossRef]
- Zhang, D.; Zhou, L.; Zhang, D. Credit risk evaluation using machine learning models. Appl. Soft Comput. 2019, 86, 105934. [Google Scholar] [CrossRef]
- Naseem, U.; Razzak, I.; Musial, K.; Imran, M. Transformer based deep intelligent contextual embedding for Twitter sentiment classification. Future Gener. Comput. Syst. 2020, 113, 58–69. [Google Scholar] [CrossRef]
- Li, C.; Zhang, S. Assessing and explaining rising global carbon sink capacity in karst ecosystems. J. Clean. Prod. 2024, 477, 143862. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Yan, H.; Yan, K.; Ji, G. Optimization and prediction in the early design stage of office buildings using genetic and XGBoost algorithms. Build. Environ. 2022, 218, 109081. [Google Scholar] [CrossRef]
- Li, J.; An, X.; Li, Q.; Wang, C.; Yu, H.; Zhou, X.; Geng, Y.A. Application of XGBoost algorithm in the optimization of pollutant concentration. Atmos. Res. 2022, 276, 106238. [Google Scholar] [CrossRef]
- Wu, Y.; Zhang, Q.; Hu, Y.; Sun-Woo, K.; Zhang, X.; Zhu, H.; Li, S. Novel binary logistic regression model based on feature transformation of XGBoost for type 2 Diabetes Mellitus prediction in healthcare systems. Future Gener. Comput. Syst. 2022, 129, 1–12. [Google Scholar] [CrossRef]
- Poongodi, M.; Malviya, M.; Kumar, C.; Hamdi, M.; Vijayakumar, V.; Nebhen, J.; Alyamani, H. New York City taxi trip duration prediction using MLP and XGBoost. Int. J. Syst. Assur. Eng. Manag. 2022, 13, 16–27. [Google Scholar] [CrossRef]
- Opesade, A.O. Data mining of Internet users’ interests in diverse, hierarchical information needs. Inf. Res. Int. Electron. J. 2020, 25, 557–572. [Google Scholar] [CrossRef]
- Zhang, S.; Li, X.; Zong, M.; Zhu, X.; Cheng, D. Learning k for knn classification. ACM Trans. Intell. Syst. Technol. (TIST) 2017, 8, 1–19. [Google Scholar] [CrossRef]
- URL-XGB1. Available online: https://cyeninesky3.medium.com/xgboost-a-scalable-tree-boosting-system-%E8%AB%96%E6%96%87%E7%AD%86%E8%A8%98%E8%88%87%E5%AF%A6%E4%BD%9C-2b3291e0d1fe (accessed on 1 July 2025).
- Zhang, D.; Qian, L.; Mao, B.; Huang, C.; Huang, B.; Si, Y. A data-driven design for fault detection of wind turbines using random forests and XGboost. IEEE Access 2018, 6, 21020–21031. [Google Scholar] [CrossRef]
- Huang, J.C.; Tsai, Y.C.; Wu, P.Y.; Lien, Y.H.; Chien, C.Y.; Kuo, C.F.; Kuo, C.H. Predictive modeling of blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO regression and ensemble method. Comput. Methods Programs Biomed. 2020, 195, 105536. [Google Scholar] [CrossRef]
- URL-XGB. Available online: https://xgboost.readthedocs.io/en/stable/python/python_api.html (accessed on 1 July 2025).
- URL-PAR. Available online: https://www.analyticsvidhya.com/blog/2016/03/complete-guide-parameter-tuning-xgboost-with-codes-python/ (accessed on 1 July 2025).
- Torlay, L.; Perrone-Bertolotti, M.; Thomas, E.; Baciu, M. Machine learning–XGBoost analysis of language networks to classify patients with epilepsy. Brain Inform. 2017, 4, 159–169. [Google Scholar] [CrossRef]
- Xu, Z.; Wang, Z. A risk prediction model for type 2 diabetes based on weighted feature selection of random forest and xgboost ensemble classifier. In Proceedings of the 2019 Eleventh International Conference on Advanced Computational Intelligence (ICACI), Guilin, China, 7–9 June 2019; IEEE: New York, NY, USA, 2019; pp. 278–283. [Google Scholar]
- Dhaliwal, S.S.; Nahid, A.A.; Abbas, R. Effective intrusion detection system using XGBoost. Information 2018, 9, 149. [Google Scholar] [CrossRef]
- Kabiraj, S.; Raihan, M.; Alvi, N.; Afrin, M.; Akter, L.; Sohagi, S.A.; Podder, E. Breast cancer risk prediction using XGBoost and random forest algorithm. In Proceedings of the 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kharagpur, India, 1–3 July 2020; IEEE: New York, NY, USA, 2020; pp. 1–4. [Google Scholar] [CrossRef]
- Lee, T.; Lee, H. Collaborative governance and climate change: A case study of Seoul’s metropolitan government. Asian J. Political Sci. 2014, 22, 261–279. [Google Scholar]
- Castán Broto, V.; Bulkeley, H. A survey of urban climate change experiments in 100 cities. Glob. Environ. Change 2013, 23, 92–102. [Google Scholar] [CrossRef]
- Delmas, M.A.; Burbano, V.C. The drivers of greenwashing. Calif. Manag. Rev. 2011, 54, 64–87. [Google Scholar] [CrossRef]
- Anguelovski, I.; Connolly, J.J.; Masip, L.; Pearsall, H. Assessing green gentrification in historically disenfranchised neighborhoods: A longitudinal and spatial analysis of Barcelona. Urban Geogr. 2018, 39, 458–491. [Google Scholar] [CrossRef]
- Sæbø, Ø.; Rose, J.; Flak, L.S. The shape of eParticipation: Characterizing an emerging research area. Gov. Inf. Q. 2020, 27, 265–273. [Google Scholar] [CrossRef]
- Auld, G. Constructing Private Governance: The Rise and Evolution of Forest, Coffee, and Fisheries Certification; Yale University Press: New Haven, CT, USA, 2014. [Google Scholar]
- Bulkeley, H.; Castán Broto, V.; Hodson, M.; Marvin, S. Cities and Low Carbon Transitions; Routledge: Oxfordshire, UK, 2014. [Google Scholar]
- UN-Habitat. World Cities Report 2020: The Value of Sustainable Urbanization; United Nations Human Settlements Programme: Nairobi, Kenya, 2020. [Google Scholar]
- OECD. Compact City Policies: A Comparative Assessment; OECD Green Growth Studies: Paris, France, 2012. [Google Scholar]
- Ghosh, A. False solutions to climate change: How greenwashing undermines climate justice. Globalizations 2020, 17, 1046–1059. [Google Scholar]
- URL-OWD. Available online: https://ourworldindata.org/air-pollution (accessed on 1 July 2025).
- URL-GCA23. Available online: https://environment.ec.europa.eu/topics/urban-environment/european-green-capital-award/winning-cities/tallinn-2023_en (accessed on 1 July 2025).
- URL-GCA22. Available online: https://environment.ec.europa.eu/topics/urban-environment/european-green-capital-award/winning-cities/grenoble-2022_en (accessed on 1 July 2025).
- URL-GCA21. Available online: https://environment.ec.europa.eu/topics/urban-environment/european-green-capital-award/winning-cities/lahti-2021_en (accessed on 1 July 2025).
- URL-GCA. Available online: https://environment.ec.europa.eu/topics/urban-environment/european-green-capital-award_en (accessed on 1 July 2025).
- URL-LOG. Available online: https://scikitlearn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html (accessed on 1 July 2025).
- Ramos-Maldonado, M.; Gutiérrez, F.; Gallardo-Venegas, R.; Bustos-Avila, C.; Contreras, E.; Lagos, L. Machine learning and industrial data for veneer quality optimization in plywood manufacturing. Processes 2025, 13, 1229. [Google Scholar] [CrossRef]
- Liu, J.; Dong, X.; Zhao, H.; Tian, Y. Predictive classifier for cardiovascular disease based on stacking model fusion. Processes 2022, 10, 749. [Google Scholar] [CrossRef]
- Uniyal, S.; Purohit, S.; Chaurasia, K.; Rao, S.S.; Amminedu, E. Quantification of carbon sequestration by urban forest using Landsat 8 OLI and machine learning algorithms in Jodhpur, India. Urban For. Urban Green. 2022, 67, 127445. [Google Scholar] [CrossRef]
- Chen, F.; Feng, P.; Harrison, M.T.; Wang, B.; Liu, K.; Zhang, C.; Hu, K. Cropland carbon stocks driven by soil characteristics, rainfall and elevation. Sci. Total Environ. 2023, 862, 160602. [Google Scholar] [CrossRef]
- Nguyen, A.; Saha, S. Machine learning and multi-source remote sensing in forest carbon stock estimation: A review. arXiv 2024, arXiv:2411.17624. [Google Scholar] [CrossRef]
- OECD. Greenhouse Gas Emissions. OECD Environment Statistics. 2020. Available online: https://data-explorer.oecd.org/vis?fs[0]=Topic%2C1%7CEnvironment%23ENV%23%7CAir%20and%20climate%23ENV_AC%23&pg=0&fc=Topic&bp=true&snb=12&df[ds]=dsDisseminateFinalDMZ&df[id]=DSD_AIR_GHG%40DF_AIR_GHG&df[ag]=OECD.ENV.EPI&df[vs]=1.0&dq=.A.GHG._T.KG_CO2E_PS&pd=2014%2C&to[TIME_PERIOD]=false (accessed on 1 July 2025).
- UN-Habitat. Hot Cities: Battleground for Climate Change. United Nations Human Settlements Programme. 2011. Available online: https://unhabitat.org/sites/default/files/2012/06/P1HotCities.pdf (accessed on 1 July 2025).
- Creutzig, F.; Jochem, P.; Edelenbosch, O.Y.; Mattauch, L.; van Vuuren, D.P.; McCollum, D.; Minx, J. Transport: A roadblock to climate change mitigation? Science 2015, 350, 911–912. [Google Scholar] [CrossRef]
- UNDRR. Annual Report 2022. Available online: https://www.undrr.org/media/87504/download?startDownload=20250709 (accessed on 9 July 2025).
- Cardona, O.D.; Ordaz, M.; Reinoso, E.; Yamín, L.E.; Barbat, A.H. CAPRA–Comprehensive Approach to Probabilistic Risk Assessment: International Initiative for Risk Management Effectiveness. In Proceedings of the 15th World Conference on Earthquake Engineering, Lisbon, Portugal, 24–28 September 2012; Volume 1. [Google Scholar]
Category | Indicator | Category | Indicator |
---|---|---|---|
CO2 | CO2 emissions | Water | Water consumption |
CO2 intensity | Water system leakages | ||
Energy | Energy consumption | Wastewater treatment | |
Energy intensity | Waste and land use | Municipal waste production | |
Renewable energy consumption | Waste recycling | ||
Buildings | Energy consumption of residential buildings | Air quality | Nitrogen dioxide |
Transport | Use of non-car transport | Ozone | |
Size of non-car transport network | Particulate matter | ||
Sulphur dioxide |
Category | Indicators | Category | Indicators |
---|---|---|---|
CO2 | Air pollution emissions | Water and Land Use | Percentage of park area |
Climate sensitivity | Number of parks | ||
Energy | Availability of renewable energy | Percentage of wilderness areas | |
Transportation | Walking, cycling, and public transport usage | Tons of solid waste | |
Solo commuting distance | Recycling diversion rate | ||
Workforce commuting outside the city | Tons of organic waste | ||
Water | Residential water consumption | Air Quality | Greenhouse gas emissions |
Categories | Indicators | Description | Types |
---|---|---|---|
Air Quality | Greenhouse gas emissions per person * −80% (F1) | Greenhouse gas emissions refer to carbon dioxide, methane, and nitrous oxide, and encompass all sources, including agriculture and land use. | Integer |
Annual CO2 emissions * −80% (F2) | Including per capita carbon dioxide emissions from fossil fuels and industry. | Integer | |
Transportation | Transport emissions * −60% (F3) | Annual per capita carbon dioxide emissions from transportation, including road, rail, bus, and domestic air travel, but excluding international aviation and shipping. | Decimal |
Railway passengers carried (F4) | Rail passenger traffic is calculated by multiplying the number of passengers by the distance traveled in kilometers. | Integer | |
Energy | Renewable electricity * −80% (F5) | Renewable energy consists of hydropower, wind energy, solar energy, geothermal energy, wave energy, tidal energy, and biomass energy. | Integer |
Primary energy consumption * −60% (F6) | Per capita primary energy consumption includes electricity as well as other consumption areas such as transportation, heating, and cooking. | Integer | |
Energy from renewable sources −40% (F7) | Renewable energy includes hydropower, solar energy, wind energy, geothermal energy, biomass energy, wave energy, and tidal energy. | % | |
Green Field | Forest area −60% (F8) | Refers to natural or artificially planted forests with trees at least 5 m tall, excluding tree cover in agricultural production systems. | Decimal |
Agricultural land use −60% (F9) | The sum of arable land and pastures. | Integer | |
Water | Safely managed water −80% (F10) | The proportion of water resources within a location that can be accessed as needed and is free from contamination. | % |
Renewable internal freshwater (F11) | Refers to the country’s internal renewable resources, including internal river flow and groundwater from precipitation. | Integer | |
Policy | Score of DRR strategies (F12) | The numerical value indicates the extent to which countries have developed policies to reduce disaster risk, with higher values reflecting more comprehensive policy implementation. | % |
Hyperparameter | Description | Value Range |
---|---|---|
min_child_weight | Minimum sum of instance weight (hessian) in a child | [0, ∞], default is 1 Typically in [3–10], increments of 1 |
max_depth | Maximum depth of the decision tree | [1, ∞], default is 6 Typically in [1–6], increments of 1 |
gamma | Minimum loss reduction required for a split | [0, ∞], default is 0 Typically in [0–5], increments of 0.5 |
subsample | Fraction of samples used for training | (0, 1], default is 1 Typically in [0–1], increments of 0.1 |
colsample_bytree | Fraction of features randomly sampled for each tree | (0, 1], default is 1 Typically in [0.5–1], increments of 0.1 |
reg_alpha | L1 regularization term on weights | (0, 1], default is 0, Typically in [0–1] increments of 0.1 |
reg_lambda | L2 regularization term on weights | (0, 1], default is 0, Typically in [0–1] increments of 0.1 |
n_estimators | Number of trees (estimators) | Typically in the range of several hundred to thousands, default is 100. Usually in [100–2000], increments of 100 |
learning_rate | Learning rate | [0, 1], default is 0.3 Typically in [0.01–0.2] increments of 0.01 |
Prediction | |||
---|---|---|---|
True | False | ||
Actual | True | TP (true positive) | FN (false negative) |
False | FP (false positive) | TN (true negative) |
City, Country, Awarded Year | Awarded Reasons |
---|---|
Valencia, Spain, 2024 |
|
Tallinn, Estonia, 2023 |
|
Grenoble, France, 2022 |
|
Lahti, Finland, 2021 |
|
Iceland | 2019 | 140,592.8 | Iceland | 2019 | 140,592.8 |
Iceland | 2020 | 135,248.8 | Iceland | 2020 | 135,248.8 |
Iceland | 2021 | NA | Iceland | 2021 | 129,904.7 |
Training Data | Testing Data | |
---|---|---|
0 | 22 | 31 |
1 | 18 | 20 |
Total | 40 | 51 |
LR | XGBl | XGBt | Random Forest | SVC | NN | |
---|---|---|---|---|---|---|
Accuracy | 64.7 | 76.4 | 0.7843 | 0.7255 | 0.7843 | 0.7647 |
Recall test | 0.4 | 0.5 | 0.8000 | 0.6500 | 0.7500 | 0.75 |
Precision test | 0.57 | 0.83 | 0.6957 | 0.6500 | 0.7143 | 0.68 |
F1-score test | 0.48 | 0.63 | 0.7442 | 0.6500 | 0.7317 | 0.71 |
F1 | F2 | F3 | F4 (1010) | F5 | F6 (104) | F7 | F8 (106) | F9 (106) | F10 | F11 | F12 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Mean | 5.75 | 6.70 | 1.901 | 3.4772 | 5651 | 4.79 | 17.3 | 8.34 | 7.860 | 94.1 | 9970 | 0.443 |
First quartile | 3.50 | 0.04 | 1.58 | 3.0582 | 410 | 4.22 | 5.87 | 1.26 | 1.64 | 95.7 | 1297 | 0 |
Second quartile | 5.96 | 8.60 | 2.02 | 0.8586 | 1106 | 4.53 | 8.76 | 3.13 | 3.16 | 98.6 | 3373 | 0.235 |
Third quartile | 7.35 | 11.2 | 2.295 | 1.9104 | 4465 | 5.26 | 28.5 | 1.34 | 8.52 | 99.7 | 9081 | 0.9475 |
F1 | F2 | F3 | F4 | F5 | F6 | F7 | F8 | F9 | F10 | F11 | F12 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Mean | 10 | 8 | 12 | 4 | 5 | 7 | 8 | 12 | 15 | 17 | 16 | 10 |
First quartile | 5 | 4 | 15 | 15 | 15 | 15 | 15 | 5 | 5 | 14 | 5 | 10 |
Second quartile | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 |
Third quartile | 15 | 14 | 5 | 5 | 5 | 5 | 5 | 15 | 15 | 5 | 14 | 5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Huang, C.-C.; Liang, W.-Y.; Tseng, T.-L.; Chan, C.-Y. Evaluating and Predicting Urban Greenness for Sustainable Environmental Development. Processes 2025, 13, 2465. https://doi.org/10.3390/pr13082465
Huang C-C, Liang W-Y, Tseng T-L, Chan C-Y. Evaluating and Predicting Urban Greenness for Sustainable Environmental Development. Processes. 2025; 13(8):2465. https://doi.org/10.3390/pr13082465
Chicago/Turabian StyleHuang, Chun-Che, Wen-Yau Liang, Tzu-Liang (Bill) Tseng, and Chia-Ying Chan. 2025. "Evaluating and Predicting Urban Greenness for Sustainable Environmental Development" Processes 13, no. 8: 2465. https://doi.org/10.3390/pr13082465
APA StyleHuang, C.-C., Liang, W.-Y., Tseng, T.-L., & Chan, C.-Y. (2025). Evaluating and Predicting Urban Greenness for Sustainable Environmental Development. Processes, 13(8), 2465. https://doi.org/10.3390/pr13082465