Optimizing Urban Land-Use Through Deep Reinforcement Learning: A Case Study in Hangzhou for Reducing Carbon Emissions
Abstract
1. Introduction
1.1. Built Environment and Carbon Emissions
1.2. AI in Carbon Emissions
1.3. Research Gap
1.4. Main Contributions
2. Materials and Methods
2.1. Study Region
2.2. Datasets
2.2.1. Area Data
| Land-Use Type | Energy Consumption (kWh/m2·year) | Carbon Emission Factor (kg CO2/m2·year) | Data Source | Notes |
|---|---|---|---|---|
| Residential | 71.2 | 28.5 | [70,71,72] | Based on 40 m2 household |
| Commercial | 120.5 | 48.2 | [70] | Office buildings |
| Medical | 159.1 | 63.6 | [70] | Includes natural gas |
| Industrial | 145.3 | 58.1 | [68] | Manufacturing zones |
| Educational | 95.4 | 38.2 | [73] | Schools/universities |
| Parks/Green Space | – | -0.82 | [74] | 327.67 kg CO2 per 400 m2 annually |
| Other | – | Variable | – | Mixed-use spaces |
2.2.2. Point Data
2.2.3. Line Data
2.3. Methods
2.3.1. Environmental Model
2.3.2. Training Dataset Preparation
2.3.3. Deep Reinforcement Learning Architecture
2.3.4. Reward Function Design
3. Results
3.1. Actor-Critic Framework
3.2. Reward Function Design
- Base Emissions: The base emissions are calculated based on the land-use types within each cell. Each land-use type i has an associated emission factor αi, which is multiplied by the area’s ratio of that land-use type Aij in cell j to compute its emission contribution. The sum of these contributions across all land-use types gives the base emissions for each cell, as show in Equation (5):
- Path Emissions: Path emissions are calculated based on transportation activities between different cells. We use a discrete choice model to determine the probabilities Pkm of using different modes of transportation m for traveling between cells k and l. The emissions for each transport mode m are given by βm, and the distance between cells k and l is denoted as dkl, as shown in Equation (6):
3.3. Model Training
3.3.1. Algorithm Selection
3.3.2. Training Process and Loss Analysis
- Rewards: The reward plot exhibits an overall increasing trend, albeit with significant variability across episodes. In the early stages of training, the high fluctuations in rewards suggest an active exploration phase where the agent is learning the environmental dynamics. As the training progresses, the rewards begin to show a more consistent upward trajectory, indicating that the agent is becoming increasingly proficient at optimizing its actions to maximize rewards. This upward trend, despite ongoing fluctuations, underscores the efficacy of the PPO algorithm in enhancing the agent’s performance over time.
- Actor Loss: The actor loss, which quantifies the policy network’s error in selecting actions, also shows considerable variability but generally trends towards stabilization. Despite occasional spikes due to the stochastic nature of policy updates and the exploration-exploitation trade-off, the overall trend is a decrease in actor loss, signifying that the policy network is learning to make more accurate action selections. The stabilization of actor loss as training advances reflects the network’s improved ability to refine its policy.
- Critic Loss: The critic loss, representing the value network’s error in estimating the value of states, shows a pronounced decrease over time, albeit with some variability. High initial loss values highlight the challenges of accurately estimating state values at the beginning of training. As training continues, the critic loss decreases, indicating better accuracy in value estimation. A notable spike around episode 100 suggests a temporary deviation in value estimation, likely due to a significant policy update. Nevertheless, the critic loss quickly stabilizes, demonstrating the network’s resilience and its capacity to recover from such deviations.
3.3.3. Output Analysis
3.4. Baseline Comparisons
3.4.1. Linear Regression Model
3.4.2. Genetic Algorithm
- Initialization of the Population: A population of individuals is generated, with each individual containing a series of random solutions. Each solution represents the index of a cell and the corresponding state modification vector. Each solution (or action) consists of two parts: the selected cell index (cell_idx) and the corresponding state modification (action). An individual is represented as:
- Fitness Evaluation: The quality of each solution is evaluated using a fitness function. The fitness function calculates the score of each solution based on its performance in the environment. In this study, the difference in carbon emissions before and after modifying the environment state is used as the fitness value, as shown in the following formula:
- Selection, Crossover, and Mutation: Based on the fitness values, individuals with better performance are selected to undergo crossover and mutation operations, generating new solutions.
3.4.3. Comparison with Baseline Models
4. Discussion and Conclusions
4.1. Comparison with Similar Studies and Methodological Critique
4.2. Uncertainties and Model Parameters
4.3. Research Limitations
4.4. Future Directions and Recommendations
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Arabi Aliabad, F.; Ghafarian Malamiri, H.; Sarsangi, A.; Sekertekin, A.; Ghaderpour, E. Identifying and Monitoring Gardens in Urban Areas Using Aerial and Satellite Imagery. Remote Sens. 2023, 15, 4053. [Google Scholar] [CrossRef]
- Zhao, Y.; Xie, J.; Zhu, H.; Luo, T.; Xiong, Y.; Fan, C.; Xia, H.; Chen, Y.; Zhang, F. Land-Unet: A deep learning network for precise segmentation and identification of non-structured land use types in rural areas for green urban space analysis. Ecol. Inform. 2025, 87, 103078. [Google Scholar] [CrossRef]
- Wang, S.; Liu, X.; Zhou, C.; Hu, J.; Ou, J. Examining the impacts of socioeconomic factors, urban form, and transportation networks on CO2 emissions in China’s megacities. Appl. Energy 2017, 185, 189–200. [Google Scholar] [CrossRef]
- Qiao, R.; Wu, Z.; Jiang, Q.; Liu, X.; Gao, S.; Xia, L.; Yang, T. The nonlinear influence of land conveyance on urban carbon emissions: An interpretable ensemble learning-based approach. Land Use Policy 2024, 140, 107117. [Google Scholar] [CrossRef]
- Wu, C.; Li, G.; Yue, W.; Lu, R.; Lu, Z.; You, H. Effects of Endogenous Factors on Regional Land-Use Carbon Emissions Based on the Grossman Decomposition Model: A Case Study of Zhejiang Province, China. Environ. Manag. 2015, 55, 467–478. [Google Scholar] [CrossRef] [PubMed]
- Seto, K.C.; Güneralp, B.; Hutyra, L.R. Global forecasts of urban expansion to 2030 and direct impacts on biodiversity and carbon pools. Proc. Natl. Acad. Sci. USA 2012, 109, 16083–16088. [Google Scholar] [CrossRef] [PubMed]
- Pouyat, R.V.; Yesilonis, I.D.; Nowak, D.J. Carbon Storage by Urban Soils in the United States. J. Environ. Qual. 2006, 35, 1566–1575. [Google Scholar] [CrossRef] [PubMed]
- Lv, T.; Hu, H.; Zhang, X.; Xie, H.; Fu, S.; Wang, L. Spatiotemporal pattern of regional carbon emissions and its influencing factors in the Yangtze River Delta urban agglomeration of China. Environ. Monit. Assess. 2022, 194, 515. [Google Scholar] [CrossRef]
- Liu, S.; Zhang, X.; Feng, Y.; Xie, H.; Jiang, L.; Lei, Z. Spatiotemporal Dynamics of Urban Green Space Influenced by Rapid Urbanization and Land Use Policies in Shanghai. Forests 2021, 12, 476. [Google Scholar] [CrossRef]
- Mu, W.; Zhu, X.; Ma, W.; Han, Y.; Huang, H.; Huang, X. Impact assessment of urbanization on vegetation net primary productivity: A case study of the core development area in central plains urban agglomeration, China. Environ. Res. 2023, 229, 115995. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Gao, C.; Lu, Y. The impact of urbanization on GHG emissions in China: The role of population density. J. Clean. Prod. 2017, 157, 299–309. [Google Scholar] [CrossRef]
- Liu, X.; Li, T.; Zhang, S.; Jia, Y.; Li, Y.; Xu, X. The role of land use, construction and road on terrestrial carbon stocks in a newly urbanized area of western Chengdu, China. Landsc. Urban Plan. 2016, 147, 88–95. [Google Scholar] [CrossRef]
- Shi, W.; Xiang, Y.; Ying, Y.; Jiao, Y.; Zhao, R.; Qiu, W. Predicting Neighborhood-Level Residential Carbon Emissions from Street View Images Using Computer Vision and Machine Learning. Remote Sens. 2024, 16, 1312. [Google Scholar] [CrossRef]
- Murray, M.; Greer, J.; Houston, D.; McKay, S.; Murtagh, B. Bridging Top Down and Bottom Up: Modelling Community Preferences for a Dispersed Rural Settlement Pattern. Eur. Plan. Stud. 2009, 17, 441–462. [Google Scholar] [CrossRef]
- Healey, P. Collaborative Planning: Shaping Places in Fragmented Societies, 1st ed.; Macmillan: Houndmills, UK, 1997. [Google Scholar]
- Han, Z.; Yan, W.; Liu, G. A Performance-Based Urban Block Generative Design Using Deep Reinforcement Learning and Computer Vision. In Proceedings of the 2020 DigitalFUTURES, Proceedings of the 2nd International Conference on Computational Design and Robotic Fabrication (CDRF 2020), Shanghai, China, 5–6 July 2020; Yuan, P.F., Yao, J., Yan, C., Wang, X., Leach, N., Eds.; Springer: Singapore, 2021; pp. 134–143. [Google Scholar] [CrossRef]
- Yubo, Z.; Zhuoran, Y.; Jiuchun, Y.; Yuanyuan, Y.; Dongyan, W.; Yucong, Z.; Fengqin, Y.; Lingxue, Y.; Liping, C.; Shuwen, Z. A Novel Model Integrating Deep Learning for Land Use/Cover Change Reconstruction: A Case Study of Zhenlai County, Northeast China. Remote Sens. 2020, 12, 3314. [Google Scholar] [CrossRef]
- Lillicrap, T.P.; Hunt, J.J.; Pritzel, A.; Heess, N.; Erez, T.; Tassa, Y.; Silver, D.; Wierstra, D. Continuous control with deep reinforcement learning. arXiv 2019, arXiv:1509.02971. [Google Scholar] [CrossRef]
- Lee, J.-S.; Nam, J.; Lee, S.-S. Built Environment Impacts on Individual Mode Choice: An Empirical Study of the Houston-Galveston Metropolitan Area. Int. J. Sustain. Transp. 2014, 8, 447–470. [Google Scholar] [CrossRef]
- Khan, M.; Kockelman, K.M.; Xiong, X. Models for anticipating non-motorized travel choices, and the role of the built environment. Transp. Policy 2014, 35, 117–126. [Google Scholar] [CrossRef]
- Munshi, T. Built environment and mode choice relationship for commute travel in the city of Rajkot, India. Transp. Res. Part D Transp. Environ. 2016, 44, 239–253. [Google Scholar] [CrossRef]
- Antipova, A.; Wang, F.; Wilmot, C. Urban land uses, socio-demographic attributes and commuting: A multilevel modeling approach. Appl. Geogr. 2011, 31, 1010–1018. [Google Scholar] [CrossRef]
- Wang, D.; Lin, T. Residential self-selection, built environment, and travel behavior in the Chinese context. J. Transp. Land Use 2014, 7, 5–14. [Google Scholar] [CrossRef]
- Cao, J. Examining the impacts of neighborhood design and residential self-selection on active travel: A methodological assessment. Urban Geogr. 2015, 36, 236–255. [Google Scholar] [CrossRef]
- Thao, V.T.; Ohnmacht, T. The impact of the built environment on travel behavior: The Swiss experience based on two National Travel Surveys. Res. Transp. Bus. Manag. 2020, 36, 100386. [Google Scholar] [CrossRef]
- Faber, R.; Merkies, R.; Damen, W.; Oirbans, L.; Massa, D.; Kroesen, M.; Molin, E. The role of travel-related reasons for location choice in residential self-selection. Travel Behav. Soc. 2021, 25, 120–132. [Google Scholar] [CrossRef]
- Næss, P. Causality and self-selection. In Handbook on Transport and Land Use; De Abreu E Silva, J., Currans, K., Van Acker, V., Schneider, R., Eds.; Edward Elgar Publishing: Cheltenham, UK, 2023; pp. 107–128. [Google Scholar] [CrossRef]
- Van Wee, B.; Cao, X.J. Residential self-selection in the relationship between the built environment and travel behavior: A literature review and research agenda. In Advances in Transport Policy and Planning; Elsevier: Amsterdam, The Netherlands, 2022; Volume 9, pp. 75–94. [Google Scholar] [CrossRef]
- Mondal, A.; Bhat, C.R. Investigating Residential Built Environment Effects on Rank-Based Modal Preferences and Auto-Ownership. Transp. Res. Rec. J. Transp. Res. Board 2023, 2677, 777–796. [Google Scholar] [CrossRef]
- Seong, E.Y.; Lee, N.H.; Choi, C.G. Relationship between Land Use Mix and Walking Choice in High-Density Cities: A Review of Walking in Seoul, South Korea. Sustainability 2021, 13, 810. [Google Scholar] [CrossRef]
- Schlossberg, M.; Greene, J.; Phillips, P.P.; Johnson, B.; Parker, B. School Trips: Effects of Urban Form and Distance on Travel Mode. J. Am. Plan. Assoc. 2006, 72, 337–346. [Google Scholar] [CrossRef]
- Hu, Y.; Sobhani, A.; Ettema, D. Exploring commute mode choice in dual-earner households in a small Chinese city. Transp. Res. Part D Transp. Environ. 2022, 102, 103148. [Google Scholar] [CrossRef]
- Li, J.; Walker, J.L.; Srinivasan, S.; Anderson, W.P. Modeling Private Car Ownership in China: Investigation of Urban Form Impact Across Megacities. Transp. Res. Rec. J. Transp. Res. Board 2010, 2193, 76–84. [Google Scholar] [CrossRef]
- Manaugh, K.; Miranda-Moreno, L.F.; El-Geneidy, A.M. The effect of neighbourhood characteristics, accessibility, home–work location, and demographics on commuting distances. Transportation 2010, 37, 627–646. [Google Scholar] [CrossRef]
- Mo, B.; Zheng, Y.; Guo, X.; Ma, R.; Zhao, J. Robust Discrete Choice Model for Travel Behavior Prediction with Data Uncertainties (Version 1). arXiv 2024, arXiv:2401.03276. [Google Scholar] [CrossRef]
- Harz, J.; Sommer, C. Mode choice of city tourists: Discrete choice modeling based on survey data from a major German city. Transp. Res. Interdiscip. Perspect. 2022, 16, 100704. [Google Scholar] [CrossRef]
- Jeong, J.; Lee, J.; Gim, T.T. Travel mode choice as a representation of travel utility: A multilevel approach reflecting the hierarchical structure of trip, individual, and neighborhood characteristics. Pap. Reg. Sci. 2022, 101, 745–766. [Google Scholar] [CrossRef]
- Liu, Y. Transportation Mode Choice and Built Environment Around Metro Stations. In Built Environment and Walking & Cycling Around Metro Stations; Springer Nature: Singapore, 2023; pp. 17–26. [Google Scholar] [CrossRef]
- Nguyen, T.M.C.; Kato, H.; Phan, L.B. Is Built Environment Associated with Travel Mode Choice in Developing Cities? Evidence from Hanoi. Sustainability 2020, 12, 5773. [Google Scholar] [CrossRef]
- Chen, P.; Gao, J.; Ji, Z.; Liang, H.; Peng, Y. Do Artificial Intelligence Applications Affect Carbon Emission Performance?—Evidence from Panel Data Analysis of Chinese Cities. Energies 2022, 15, 5730. [Google Scholar] [CrossRef]
- Zhang, M.; Kafy, A.-A.; Xiao, P.; Han, S.; Zou, S.; Saha, M.; Zhang, C.; Tan, S. Impact of urban expansion on land surface temperature and carbon emissions using machine learning algorithms in Wuhan, China. Urban Clim. 2023, 47, 101347. [Google Scholar] [CrossRef]
- Nassef, A.M.; Olabi, A.G.; Rezk, H.; Abdelkareem, M.A. Application of Artificial Intelligence to Predict CO2 Emissions: Critical Step towards Sustainable Environment. Sustainability 2023, 15, 7648. [Google Scholar] [CrossRef]
- Dong, M.; Wang, G.; Han, X. Artificial Intelligence, Industrial Structure Optimization, and CO2 Emissions. Environ. Sci. Pollut. Res. 2023, 30, 108757–108773. [Google Scholar] [CrossRef] [PubMed]
- Chen, S.; Zhang, S.; Zeng, Q.; Ao, J.; Chen, X.; Zhang, S. Can artificial intelligence achieve carbon neutrality? Evidence from a quasi-natural experiment. Front. Ecol. Evol. 2023, 11, 1151017. [Google Scholar] [CrossRef]
- Li, X.; Piao, Z.; Zheng, Y.; Han, J.; Cong, R. Smart Urban Carbon Emission Management Platform Based on Energy Big Data. In Innovative Computing Vol 1—Emerging Topics in Artificial Intelligence; Hung, J.C., Chang, J.-W., Pei, Y., Eds.; Springer Nature: Singapore, 2023; Volume 1044, pp. 831–840. [Google Scholar] [CrossRef]
- Priya, S.D.; Saranya, K.G. Significance of artificial intelligence in the development of sustainable transportation. Sci. Temper 2023, 14, 418–425. [Google Scholar] [CrossRef]
- Montague, P.R. Reinforcement Learning Models Then-and-Now: From Single Cells to Modern Neuroimaging. In 20 Years of Computational Neuroscience; Bower, J.M., Ed.; Springer: New York, NY, USA, 2013; pp. 271–277. [Google Scholar] [CrossRef]
- Buffet, O.; Pietquin, O.; Weng, P. Reinforcement Learning (Version 2). arXiv 2020, arXiv:2005.14419. [Google Scholar] [CrossRef]
- Lyu, L.; Shen, Y.; Zhang, S. The Advance of Reinforcement Learning and Deep Reinforcement Learning. In Proceedings of the 2022 IEEE International Conference on Electrical Engineering, Big Data and Algorithms (EEBDA), Changchun, China, 25–27 February 2022; pp. 644–648. [Google Scholar] [CrossRef]
- Tsantekidis, A.; Passalis, N.; Tefas, A. Deep reinforcement learning. In Deep Learning for Robot Perception and Cognition; Elsevier: Amsterdam, The Netherlands, 2022; pp. 117–129. [Google Scholar] [CrossRef]
- François-Lavet, V.; Henderson, P.; Islam, R.; Bellemare, M.G.; Pineau, J. An Introduction to Deep Reinforcement Learning. Found. Trends® Mach. Learn. 2018, 11, 219–354. [Google Scholar] [CrossRef]
- Renna, L. Deep Reinforcement Learning for 2D Physics-Based Object Manipulation in Clutter (Version 1). arXiv 2023, arXiv:2312.04570. [Google Scholar] [CrossRef]
- Li, S.E. Deep Reinforcement Learning. In Reinforcement Learning for Sequential Decision and Optimal Control; Springer Nature: Singapore, 2023; pp. 365–402. [Google Scholar] [CrossRef]
- Liu, Y. Applications of deep reinforcement learning Alphago. Appl. Comput. Eng. 2023, 5, 637–641. [Google Scholar] [CrossRef]
- Zheng, Y.; Lin, Y.; Zhao, L.; Wu, T.; Jin, D.; Li, Y. Spatial planning of urban communities via deep reinforcement learning. Nat. Comput. Sci. 2023, 3, 748–762. [Google Scholar] [CrossRef] [PubMed]
- Shen, J.; Zheng, F.; Ma, Y.; Deng, W.; Zhang, Z. Urban travel carbon emission mitigation approach using deep reinforcement learning. Sci. Rep. 2024, 14, 27778. [Google Scholar] [CrossRef] [PubMed]
- Di Napoli, C.; Paragliola, G.; Ribino, P.; Serino, L. Deep-Reinforcement-Learning-Based Planner for City Tours for Cruise Passengers. Algorithms 2023, 16, 362. [Google Scholar] [CrossRef]
- Nguyen, T.T.; Nguyen, C.M.; Huynh-The, T.; Pham, Q.-V.; Nguyen, Q.V.H.; Razzak, I.; Reddi, V.J. Solving Complex Sequential Decision-Making Problems by Deep Reinforcement Learning with Heuristic Rules. In Computational Science—ICCS 2023; Mikyška, J., De Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A., Eds.; Springer Nature: Cham, Switzerland, 2023; Volume 14074, pp. 298–305. [Google Scholar] [CrossRef]
- Janner, M. Deep Generative Models for Decision-Making and Control (Version 2). arXiv 2023, arXiv:2306.08810. [Google Scholar] [CrossRef]
- Liang, X.; Li, X. Carbon emission causal discovery and multi-step forecasting for global cities. Cities 2024, 148, 104881. [Google Scholar] [CrossRef]
- Zhu, E.; Deng, J.; Zhou, M.; Gan, M.; Jiang, R.; Wang, K.; Shahtahmassebi, A. Carbon emissions induced by land-use and land-cover change from 1970 to 2010 in Zhejiang, China. Sci. Total Environ. 2019, 646, 930–939. [Google Scholar] [CrossRef]
- Ali, G.; Nitivattananon, V. Exercising multidisciplinary approach to assess interrelationship between energy use, carbon emission and land use change in a metropolitan city of Pakistan. Renew. Sustain. Energy Rev. 2012, 16, 775–786. [Google Scholar] [CrossRef]
- Zhi, D.; Zhao, H.; Chen, Y.; Song, W.; Song, D.; Yang, Y. Quantifying the heterogeneous impacts of the urban built environment on traffic carbon emissions: New insights from machine learning techniques. Urban Clim. 2024, 53, 101765. [Google Scholar] [CrossRef]
- Ren, Y.; Yang, J.; Shen, Y.; Wang, L.; Zhang, Z.; Zhao, Z. Multidimensional effects of history, neighborhood, and proximity on urban land growth: A dynamic spatiotemporal rolling prediction model (STRM). Trans. GIS 2024, 28, 1928–1956. [Google Scholar] [CrossRef]
- Li, Y. Reinforcement Learning in Practice: Opportunities and Challenges. arXiv 2022, arXiv:2202.11296. [Google Scholar] [CrossRef]
- Yang, T.; Cao, Y.; Sartoretti, G. Intent-based Deep Reinforcement Learning for Multi-agent Informative Path Planning. arXiv 2023, arXiv:2303.05351. [Google Scholar] [CrossRef]
- Zhu, W.; Li, Y. Reinforcement learning-based genetic algorithm for solving low carbon multimodal transportation path planning problem. In Proceedings of the International Conference on Smart Transportation and City Engineering (STCE 2023), Chongqing, China, 16–18 December 2023; Mikusova, M., Ed.; SPIE: Bellingham, WA, USA, 2024; p. 77. [Google Scholar] [CrossRef]
- Liu, H.; Yan, F.; Tian, H. A Vector Map of Carbon Emission Based on Point-Line-Area Carbon Emission Classified Allocation Method. Sustainability 2020, 12, 10058. [Google Scholar] [CrossRef]
- Guo, Y.; Zheng, X.; Wei, W.; He, Y.; Peng, X.; Zhao, F.; Wu, H.; Bi, W.; Yan, H.; Ren, X. Construction of Multi-Sample Public Building Carbon Emission Database Model Based on Energy Activity Data. Energies 2025, 18, 3635. [Google Scholar] [CrossRef]
- Shen, C.; Zhao, K.; Ge, J.; Zhou, Q. Analysis of Building Energy Consumption in a Hospital in the Hot Summer and Cold Winter Area. Energy Procedia 2019, 158, 3735–3740. [Google Scholar] [CrossRef]
- Jing, L.; Wang, J. A Study on the Characteristics of Energy Consumption Index of Office Buildings Hot-summer and Cold-winter Zone. Refrig. Air Cond. 2022, 36, 263–268. [Google Scholar]
- Wang, Y.; Zhao, P. Survey Research on Residential Building Energy Consumption in Urban and Rural Area of China. Acta Sci. Nat. Univ. Pekin. 2018, 1, 162–170. [Google Scholar]
- GB/T 51161-2016; Standard for Energy Consumption of Building. China Architecture & Building Press: Beijing, China, 2016.
- Zhang, X.; Huang, H.; Tu, K.; Li, R.; Zhang, X.; Wang, P.; Li, Y.; Yang, Q.; Acerman, A.C.; Guo, N.; et al. Effects of plant community structural characteristics on carbon sequestration in urban green spaces. Sci. Rep. 2024, 14, 7382. [Google Scholar] [CrossRef] [PubMed]
- Ewing, R.; Cervero, R. Travel and the Built Environment: A Meta-Analysis. J. Am. Plan. Assoc. 2010, 76, 265–294. [Google Scholar] [CrossRef]
- Lu, J. The Influencing Mechanism of Urban Travel Carbon Emissions from the Perspective of Built Environment: The Case of Guangzhou, China. Atmosphere 2023, 14, 547. [Google Scholar] [CrossRef]
- Tayarani, M.; Poorfakhraei, A.; Nadafianshahamabadi, R.; Rowangould, G. Can regional transportation and land-use planning achieve deep reductions in GHG emissions from vehicles? Transp. Res. Part D Transp. Environ. 2018, 63, 222–235. [Google Scholar] [CrossRef]
- Rawlings, J.O.; Pantula, S.G.; Dickey, D.A. (Eds.) Applied Regression Analysis; Springer: Berlin/Heidelberg, Germany, 1998. [Google Scholar] [CrossRef]
- Reeves, C.R.; Rowe, J.E. Genetic Algorithms—Principles and Perspectives: A Guide to GA Theory; Springer: New York, NY, USA, 2002; Volume 20. [Google Scholar] [CrossRef]
- Liu, X.; Wang, S.; Wu, P.; Feng, K.; Hubacek, K.; Li, X.; Sun, L. Impacts of Urban Expansion on Terrestrial Carbon Storage in China. Environ. Sci. Technol. 2019, 53, 6834–6844. [Google Scholar] [CrossRef] [PubMed]
- Kolat, M.; Kővári, B.; Bécsi, T.; Aradi, S. Multi-Agent Reinforcement Learning for Traffic Signal Control: A Cooperative Approach. Sustainability 2023, 15, 3479. [Google Scholar] [CrossRef]
- Semeraro, T.; Nicola, Z.; Lara, A.; Sergi Cucinelli, F.; Aretano, R. A Bottom-Up and Top-Down Participatory Approach to Planning and Designing Local Urban Development: Evidence from an Urban University Center. Land 2020, 9, 98. [Google Scholar] [CrossRef]













| Land-Use Category | Subcategory | POI Count | Data Source | Notes |
|---|---|---|---|---|
| Residential | Communities | 6758 | Lianjia.com | Rental housing data |
| Companies/Enterprises | All commercial entities | 12,229 | Amap.com | Gaode Map platform |
| Factory/Industrial Zone | Manufacturing facilities | 245 | Amap.com | Gaode Map platform |
| Science and Education | Schools, universities | 7460 | Amap.com | Gaode Map platform |
| Daily Leisure—Dining | Restaurants, cafes | 26,980 | Amap.com | Gaode Map platform |
| Daily Leisure—Shopping | Retail, malls | 13,414 | Amap.com | Gaode Map platform |
| Scenic Spots | Parks, attractions | 2755 | Amap.com | Gaode Map platform |
| Transportation—Train | Railway stations | 146 | Amap.com | Gaode Map platform |
| Transportation—Airport | Airports, terminals | 30 | Amap.com | Gaode Map platform |
| Medical Care | Hospitals, clinics | 6846 | Amap.com | Gaode Map platform |
| TOTAL | 76,863 | Multiple | Central built-up area |
| Act. No. | X | Y | COND. | Residential | Company | Sci.& Edu. | Dining | Shopping | Scenic Spots | Factory | Medical | Rwd. |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 14 | 17 | BEF. | 29 | 39 | 31 | 106 | 118 | 4 | 0 | 14 | 0.32 |
| AFT. | 35 | 43 | 32 | 118 | 147 | 3 | 0 | 15 | ||||
| 2 | 14 | 17 | BEF. | 35 | 43 | 32 | 118 | 147 | 3 | 0 | 15 | 0.31 |
| AFT. | 43 | 49 | 35 | 135 | 152 | 2 | 0 | 16 | ||||
| 3 | 23 | 39 | BEF. | 1 | 7 | 2 | 12 | 22 | 1 | 1 | 3 | 0.18 |
| AFT. | 0 | 6 | 2 | 11 | 27 | 0 | 0 | 2 | ||||
| 4 | 14 | 17 | BEF. | 43 | 49 | 35 | 135 | 152 | 2 | 0 | 16 | 0.40 |
| AFT. | 53 | 54 | 37 | 156 | 172 | 1 | 0 | 18 | ||||
| 5 | 14 | 23 | BEF. | 24 | 12 | 10 | 115 | 1 | 8 | 0 | 16 | 0.19 |
| AFT. | 30 | 9 | 8 | 136 | 1 | 6 | 0 | 19 | ||||
| 6 | 14 | 17 | BEF. | 53 | 54 | 37 | 156 | 172 | 1 | 0 | 18 | 0.37 |
| AFT. | 58 | 60 | 42 | 178 | 208 | 0 | 0 | 21 | ||||
| 7 | 14 | 17 | BEF. | 58 | 60 | 42 | 178 | 208 | 0 | 0 | 21 | 0.23 |
| AFT. | 58 | 68 | 42 | 212 | 235 | 0 | 0 | 25 | ||||
| 8 | 24 | 12 | BEF. | 5 | 8 | 4 | 29 | 18 | 1 | 1 | 4 | 0.01 |
| AFT. | 4 | 9 | 4 | 29 | 19 | 0 | 1 | 3 | ||||
| 9 | 14 | 17 | BEF. | 58 | 68 | 42 | 212 | 235 | 0 | 0 | 25 | 0.05 |
| AFT. | 58 | 78 | 45 | 212 | 236 | 0 | 0 | 28 | ||||
| 10 | 14 | 17 | BEF. | 58 | 78 | 45 | 212 | 236 | 0 | 0 | 28 | 0.00 |
| AFT. | 58 | 78 | 46 | 212 | 236 | 0 | 0 | 28 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shen, J.; Zheng, F.; Chen, T.; Deng, W.; Bellotti, A.; Tesema, F.B.; Lucchi, E. Optimizing Urban Land-Use Through Deep Reinforcement Learning: A Case Study in Hangzhou for Reducing Carbon Emissions. Land 2025, 14, 2368. https://doi.org/10.3390/land14122368
Shen J, Zheng F, Chen T, Deng W, Bellotti A, Tesema FB, Lucchi E. Optimizing Urban Land-Use Through Deep Reinforcement Learning: A Case Study in Hangzhou for Reducing Carbon Emissions. Land. 2025; 14(12):2368. https://doi.org/10.3390/land14122368
Chicago/Turabian StyleShen, Jie, Fanghao Zheng, Tianyi Chen, Wu Deng, Anthony Bellotti, Fiseha Berhanu Tesema, and Elena Lucchi. 2025. "Optimizing Urban Land-Use Through Deep Reinforcement Learning: A Case Study in Hangzhou for Reducing Carbon Emissions" Land 14, no. 12: 2368. https://doi.org/10.3390/land14122368
APA StyleShen, J., Zheng, F., Chen, T., Deng, W., Bellotti, A., Tesema, F. B., & Lucchi, E. (2025). Optimizing Urban Land-Use Through Deep Reinforcement Learning: A Case Study in Hangzhou for Reducing Carbon Emissions. Land, 14(12), 2368. https://doi.org/10.3390/land14122368

