Decoding Urban Riverscape Perception: An Interpretable Machine Learning Approach Integrating Computer Vision and High-Fidelity 3D Models
Abstract
1. Introduction
2. Background and Related Work
2.1. Visual Perception Theory and Indicators in Built Environments
2.2. Evolution of Environmental Representation: From Static 2D Imagery to Immersive 3D Environments
2.3. Analytical Approaches: From Linear Prediction to Interpretable Machine Learning
2.4. Summary and Research Gaps
- Beyond 2D Indicators: Existing indicator systems, predominantly based on 2D visual elements, have yet to fully incorporate 3D morphological metrics—such as the river canyon ratio and 3D viewshed—which are essential for quantifying the unique spatial experience of water-based perspectives.
- Enhancing Ecological Validity: Mainstream research relies heavily on static 2D imagery. While efficient, this approach faces limitations in representing depth and immersion. Adopting High-Fidelity 3D Reality Models and IVR offers a pathway to better simulate the spatial depth and “sense of presence” required for ecologically valid perception research.
- From Prediction to Interpretation: Current analytical tools often struggle to balance non-linear fitting capability with interpretability. There is a need for an integrated analytical framework that not only predicts perception outcomes accurately but also transparently reveals the driving mechanisms to inform specific urban design interventions.
3. Methodology
3.1. Study Areas and Data Sources
3.1.1. Rationale for Site Selection and Study Areas
3.1.2. Data Acquisition and Processing
3.2. Analysis of Objective Visual Characteristics
3.2.1. Quantification of 2D Visual Composition
3.2.2. Quantification of 3D Spatial Configuration
3.3. Subjective Visual Perception
3.3.1. Experimental Apparatus and Environmental Standardization
3.3.2. Selection of Perceptual Indicators and Rationale for IVR
- Sense of Affluence: In this study, this is defined as the perceived level of economic prosperity and urban prestige conveyed by the built environment. The use of IVR is critical here, as the perception of wealth in high-density waterfronts is often driven by the “vertical scale” and “spatial enclosure” of skyscrapers. 2D images tend to flatten this verticality, potentially leading to misinterpretation as mere crowding, whereas VR restores the imposing atmospheric quality of the skyline.
- Vibrancy: Defined as the perceived intensity of human activity and dynamic energy. This construct captures the “social liveliness” of the waterfront, distinct from static aesthetics. IVR enhances the validity of this metric by immersing the observer in the scene, making the presence of dynamic elements (e.g., moving boats, flowing water, crowds on the banks) feel physically proximal and engaging rather than just visual symbols.
- Scenic Beauty: Defined as the aesthetic harmony and attractiveness of the visual composition, focusing on the synergy between natural and artificial elements. Unlike framed photographs that can be carefully composed to hide eyesores, IVR forces a holistic evaluation of the 360-degree environment, ensuring that the “beauty” score reflects the authentic, uncurated reality of the riverscape.
- Sense of Boredom: Defined as the level of psychological under-stimulation or monotony induced by the environment. This serves as a negative control variable to identify river sections lacking visual foci or rhythmic variation. VR is uniquely suited to measure this, as the sensation of boredom often arises from the “emptiness” of spatial scale (e.g., a vast, featureless water surface) that is difficult to convey through a cropped 2D image.
3.3.3. Participants and Experimental Procedure
3.4. Statistical Analysis and Interpretable Machine Learning Strategy
4. Results
4.1. Analysis of Objective Characteristics
4.1.1. Model Performance and Data Summary
4.1.2. Comparative Analysis of Urban Morphologies
4.1.3. Spatial Sequence and Evolutionary Patterns
4.2. Analysis of Subjective Visual Perception Results
4.3. Correlation Analysis and Model Construction Results
4.3.1. Correlation Analysis Results
4.3.2. Model Construction Results
5. Discussion
5.1. Critical Reflection on Model Performance and Predictability
5.2. Core Driving Mechanisms of Riverscape Perception
- (1)
- The “Water-Buffered” Canyon Effect: Redefining Density and Prosperity. Contrary to traditional urban design wisdom, our findings challenge the assumption that spatial enclosure invariably leads to psychological oppression. The SHAP analysis identifies a counter-intuitive mechanism where high “spatial enclosure” (high Building View Index, low Sky View Index) and “vertical scale” (Building Height) act as primary drivers for Sense of Affluence and Vibrancy. In terrestrial street canyons, high H/W ratios are often associated with stress and pollution [8]. However, our results suggest that in riverscapes, the expansive water surface acts as a “Blue Buffer,” mitigating the oppressive feeling of density while framing the skyline as a visual symbol of economic power. This is exemplified by the heterogeneous skyline of the Thames: the juxtaposition of historic landmarks and ultra-modern skyscrapers (e.g., The Shard) creates a dramatic “urban canyon” that is interpreted by the public not as crowdedness, but as a manifestation of vitality and prosperity. This finding refines the application of “Street Canyon Theory” in waterfront contexts, suggesting that moderate enclosure is not a defect but a catalyst for shaping the image of a Global City [7].
- (2)
- Synergistic Aesthetics: The Interdependence of Nature and Artifact. While “green ecology” remains the cornerstone of aesthetic experience, this study reveals that Scenic Beauty in urban riverscapes is not derived from wilderness alone but from the synergy between high-quality artificial interfaces and natural elements. The dominance of the Green View Index supports Kaplan’s Attention Restoration Theory [68]. However, the significant positive contribution of the Building View Index highlights a critical nuance: the “riverscape beauty” aspired to by the public is a “civilized nature.” This is vividly illustrated by the River Seine, where the Haussmannian planning philosophy creates a continuous, rhythmic architectural façade interwoven with orderly tree-lined embankments. Unlike the fragmented green patches of the Thames, the Seine’s beauty stems from this “Nature–Artifact Synergy,” where the built environment is not an intrusion but a harmonious frame for the water. This aligns with Nasar’s theory of “Order and Complexity,” suggesting that visual coherence in the built interface significantly amplifies the aesthetic value of blue–green spaces [69].
- (3)
- The Paradox of Prospect: Why Openness Can Be Boring. A crucial theoretical contribution of this study is identifying the root cause of negative perception through the Sense of Boredom. Our model reveals a “Paradox of Prospect”: while openness is generally valued, “unfocused emptiness” (excessive Viewshed Area without visual focal points) is the primary driver of boredom. This finding adds a critical layer to Appleton’s Prospect-Refuge Theory [70]. While humans desire “prospect” (open views), an unobstructed view loses its psychological appeal if it lacks “content” or “complexity” to engage visual attention. This explains the spatial heterogeneity observed in the results: the River Thames, despite its grandeur, suffers from high boredom scores in its downstream sections where the spatial fabric becomes loose and lacks vertical landmarks. In contrast, the River Seine maintains a low boredom level throughout due to its continuous visual rhythm and appropriate H/W ratio. Thus, avoiding “visual vacuums” is as important as creating openness in riverscape design.
5.3. Practical Implications for Urban Waterfront Renewal
5.4. Limitations and Future Prospects
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Li, X.; Wang, X.; Jiang, X.; Han, J.; Wang, Z.; Wu, D.; Lin, Q.; Li, L.; Zhang, S.; Dong, Y. Prediction of riverside greenway landscape aesthetic quality of urban canalized rivers using environmental modeling. J. Clean. Prod. 2022, 367, 133066. [Google Scholar] [CrossRef]
- Luo, J.; Yuan, Z.; Xu, L.; Xu, W. Assessing the Impact of Waterfront Environments on Public Well-Being Through Digital Twin Technology. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 4536–4553. [Google Scholar] [CrossRef]
- Luo, J.; Liu, P.; Kong, X.; Shen, J.; Wu, Q.; Xu, D. Urban digital twins for citizen-centric planning: A systematic review of built environment perception and public participation. Int. J. Appl. Earth Obs. Geoinf. 2025, 143, 104746. [Google Scholar] [CrossRef]
- Gu, Y.; Quintana, M.; Liang, X.; Ito, K.; Yap, W.; Biljecki, F. Designing effective image-based surveys for urban visual perception. Landsc. Urban Plan. 2025, 260, 105368. [Google Scholar] [CrossRef]
- Wronkowski, A. Towards discovering human urban activity—Tactics of human spatial behavior. J. Hum. Behav. Soc. Environ. 2025, 35, 624–642. [Google Scholar] [CrossRef]
- Tavakoli, A.; Douglas, I.P.; Noh, H.Y.; Hwang, J.; Billington, S.L. Psycho-behavioral responses to urban scenes: An exploration through eye-tracking. Cities 2025, 156, 105568. [Google Scholar] [CrossRef]
- Luo, J.; Zhao, T.; Cao, L.; Biljecki, F. Semantic Riverscapes: Perception and evaluation of linear landscapes from oblique imagery using computer vision. Landsc. Urban Plan. 2022, 228, 104569. [Google Scholar] [CrossRef]
- Luo, J.; Zhao, T.; Cao, L.; Biljecki, F. Water View Imagery: Perception and evaluation of urban waterscapes worldwide. Ecol. Indic. 2022, 145, 109615. [Google Scholar] [CrossRef]
- Chiciudean, V.; Florea, H.; Beche, R.; Oniga, F.; Nedevschi, S. Data augmentation for environment perception with unmanned aerial vehicles. IEEE Trans. Intell. Veh. 2024, 10, 2334–2348. [Google Scholar] [CrossRef]
- Pradilla, G.; Hack, J. An urban rivers renaissance? Stream restoration and green–blue infrastructure in Latin America–Insights from urban planning in Colombia. Urban Ecosyst. 2024, 27, 2245–2265. [Google Scholar] [CrossRef]
- Zhou, M.; Wang, F. The driving factors of recreational utilization of ecological space in urban agglomerations: The perspective of urban political ecology. Ecol. Indic. 2024, 158, 111409. [Google Scholar] [CrossRef]
- Grzyb, T.; Kulczyk, S. How do ephemeral factors shape recreation along the urban river? A social media perspective. Landsc. Urban Plan. 2023, 230, 104638. [Google Scholar] [CrossRef]
- Yuan, Z.; Luo, J.; Liu, Y.; Xu, L.; Chen, L.; Xu, W. Toward climate-adaptive waterfront walking spaces: Thresholds and cross-modal mechanisms of multi-sensory comfort. Urban For. Urban Green. 2026, 118, 129278. [Google Scholar] [CrossRef]
- Garau, E.; Torralba, M.; Pueyo-Ros, J. What is a river basin? Assessing and understanding the sociocultural mental constructs of landscapes from different stakeholders across a river basin. Landsc. Urban Plan. 2021, 214, 104192. [Google Scholar] [CrossRef]
- Kim, B.C.; Kim, S.N.; Joo, Y. Visual impact control of urban waterfront development on the background mountain view: Examining its justifiability through two types of immersive virtual reality experiments. Environ. Impact Assess. Rev. 2024, 106, 107500. [Google Scholar] [CrossRef]
- Jiang, F.; Ma, J.; Webster, C.J.; Chiaradia, A.J.; Zhou, Y.; Zhao, Z.; Zhang, X. Generative urban design: A systematic review on problem formulation, design generation, and decision-making. Prog. Plan. 2024, 180, 100795. [Google Scholar] [CrossRef]
- Puspitasari, A.W.; Kwon, J. Analysis of the visual quality of riverfront skyline through the feature of height and spatial arrangement of tall building. Archit. Res. 2019, 21, 91–98. [Google Scholar]
- Wang, Z.; Ito, K.; Biljecki, F. Assessing the equity and evolution of urban visual perceptual quality with time series street view imagery. Cities 2024, 145, 104704. [Google Scholar] [CrossRef]
- Ogawa, Y.; Oki, T.; Zhao, C.; Sekimoto, Y.; Shimizu, C. Evaluating the subjective perceptions of streetscapes using street-view images. Landsc. Urban Plan. 2024, 247, 105073. [Google Scholar] [CrossRef]
- Xu, Z.; Jiang, T.; Zheng, N. Developing and analyzing eco-driving strategies for on-road emission reduction in urban transport systems-A VR-enabled digital-twin approach. Chemosphere 2022, 305, 135372. [Google Scholar] [CrossRef]
- D’Urso, F.; Santoro, C.; Santoro, F.F. An integrated framework for the realistic simulation of multi-UAV applications. Comput. Electr. Eng. 2019, 74, 196–209. [Google Scholar] [CrossRef]
- Liu, Y.; Yuan, Z.; Luo, J.; Shen, Y.; Yao, X.; Xu, W. The Impact of 3D Urban Landscapes on Multidimensional Human Experience: An Interpretable Machine Learning Approach. Build. Environ. 2026, 293, 114320. [Google Scholar] [CrossRef]
- Ardiny, H.; Beigzadeh, A.; Mahani, H. Applications of unmanned aerial vehicles in radiological monitoring: A review. Nucl. Eng. Des. 2024, 422, 113110. [Google Scholar] [CrossRef]
- Zhu, J.; Wang, S.; Ma, H.; Shan, T.; Xu, D.; Sun, F. Nonlinear effect of urban visual environment on residents’ psychological perception—An analysis based on XGBoost and SHAP interpretation model. City Environ. Interact. 2025, 27, 100202. [Google Scholar] [CrossRef]
- Li, C.; Managi, S. Impacts of community attachment and community livability on environmental activity according to XGBoost and SHAP. Cities 2025, 156, 105559. [Google Scholar] [CrossRef]
- Collins, S.A.; McDonnell, A.S.; Scott, E.E.; McNay, G.D.; Shannon, M.F.; Augustin, L.; Hoffmann, J.N.; Johnson, S.; Strayer, D.; LoTemplio, S. Nature imagery’s influence on ERN amplitude: An examination of Attention Restoration Theory using EEG. Front. Hum. Neurosci. 2025, 19, 1567689. [Google Scholar] [CrossRef]
- Sniderman, S. The Design of Emotionally Resonant Learning Experiences: Prospect-Refuge, Framing, and Friction. Int. J. Adv. Corp. Learn. 2025, 18, 82. [Google Scholar] [CrossRef]
- Haeri, S.; Masnavi, M.R. Analyzing and Developing Strategies for the Ecological Restoration of Urban Rivers in the Framework of Ecological Urbanism. MANZAR Sci. J. Landsc. 2023, 15, 54–71. [Google Scholar]
- Wantzen, K.M. River culture: How socio-ecological linkages to the rhythm of the waters develop, how they are lost, and how they can be regained. Geogr. J. 2024, 190, e12476. [Google Scholar] [CrossRef]
- Li, X.; Li, Y.; Zhang, S.; Lin, R.; Chen, M.; Feng, L. Driving effects of land use and landscape pattern on different spontaneous plant life forms along urban river corridors in a fast-growing city. Sci. Total Environ. 2023, 876, 162775. [Google Scholar] [CrossRef]
- Bush, J.; Doyon, A. Planning a just nature-based city: Listening for the voice of an urban river. Environ. Sci. Policy 2023, 143, 55–63. [Google Scholar] [CrossRef]
- Çınar, E.; Tuncal, A. The Future of UAVs in Urban Air Mobility: Public Perception and Concerns. Türkiye İnsansız Hava Araçları Derg. 2023, 5, 50–58. [Google Scholar] [CrossRef]
- Kamarulzaman, A.M.M.; Jaafar, W.M.; Said, M.; SNM; Saad; Mohan, M. UAV Implementations in Urban Planning and Related Sectors of Rapidly Developing Nations: A Review and Future Perspectives for Malaysia. Remote Sens. 2023, 15, 2845. [Google Scholar] [CrossRef]
- Chen, C.; Li, H.; Luo, W.; Xie, J.; Yao, J.; Wu, L.; Xia, Y. Predicting the effect of street environment on residents’ mood states in large urban areas using machine learning and street view images. Sci. Total Environ. 2022, 816, 151605. [Google Scholar] [CrossRef]
- Li, X.; Huang, K.; Zhang, R.; Chen, Y.; Dong, Y. Visual Perception Optimization of Residential Landscape Spaces in Cold Regions Using Virtual Reality and Machine Learning. Land 2024, 13, 367. [Google Scholar] [CrossRef]
- Clay, G.R.; Daniel, T.C. Scenic landscape assessment: The effects of land management jurisdiction on public perception of scenic beauty. Landsc. Urban Plan. 2000, 49, 1–13. [Google Scholar] [CrossRef]
- Wu, C.; Ye, Y.; Gao, F.; Ye, X. Using street view images to examine the association between human perceptions of locale and urban vitality in Shenzhen, China. Sustain. Cities Soc. 2023, 88, 104291. [Google Scholar] [CrossRef]
- Liu, Y.; Abbasabadi, N. Enhancing urban building energy models with Vision Transformers: A Case study in material classification from Google street view. Energy Build. 2025, 333, 115457. [Google Scholar] [CrossRef]
- Danish, M.; Labib, S.M.; Ricker, B.; Helbich, M. A citizen science toolkit to collect human perceptions of urban environments using open street view images. Comput. Environ. Urban Syst. 2025, 116, 102207. [Google Scholar] [CrossRef]
- Liu, M.; Gou, Z.; Zhao, Q. Enhancing building performance evaluation through Google street View: A multimodal transfer learning framework. Energy Build. 2025, 344, 115968. [Google Scholar] [CrossRef]
- Huang, Y.; Sanatani, R.P.; Liu, C.; Kang, Y.; Zhang, F.; Liu, Y.; Duarte, F.; Ratti, C. No “true” greenery: Deciphering the bias of satellite and street view imagery in urban greenery measurement. Build. Environ. 2025, 269, 112395. [Google Scholar] [CrossRef]
- Nathvani, R.; Cavanaugh, A.; Suel, E.; Bixby, H.; Clark, S.N.; Metzler, A.B.; Nimo, J.; Moses, J.B.; Baah, S.; Arku, R.E.; et al. Measurement of urban vitality with time-lapsed street-view images and object-detection for scalable assessment of pedestrian-sidewalk dynamics. ISPRS J. Photogramm. Remote Sens. 2025, 221, 251–264. [Google Scholar] [CrossRef]
- Mehta, D.; Gopalakrishnan, P. Real Environment or Virtual Environment?: Perception Bias Evaluation of Perceived Security in Urban Parks. J. Park Recreat. Adm. 2025, 43, 22–38. [Google Scholar] [CrossRef]
- Li, F.; Zhang, Z.; Xu, L.; Yin, J. The effects of professional design training on urban public space perception: A virtual reality study with physiological and psychological measurements. Cities 2025, 158, 105654. [Google Scholar] [CrossRef]
- Alkhresheh, M. Effects of Levels of Realism on Perceived Distance in Computer-Simulated Urban Spaces. Buildings 2025, 15, 3565. [Google Scholar] [CrossRef]
- Ju, H.; Bulbul, T.; Yang, X.; Withers, J. Risk awareness assessment of construction workers wearing AR head-mounted displays using EEG signals. J. Saf. Res. 2025, 95, 237–250. [Google Scholar] [CrossRef]
- Figueiredo, M.; Eloy, S.; Marques, S. Age-friendly cities and active mobility: A thematic analysis based on immersive 360-degree video elicitation. Cities 2025, 166, 106260. [Google Scholar] [CrossRef]
- Jiang, Y.; Yao, Y.; Yu, Q.; Jiang, Z.; Liu, X.; Li, J.; Zhang, H. Integrating Virtual Reality-based eye-tracking with urban digital twin for unveiling pedestrian visual attention in wayfinding tasks. Travel Behav. Soc. 2026, 42, 101120. [Google Scholar] [CrossRef]
- Juan, Y.K.; Chen, Y. Not or Yes in My Back Yard? A physiological and psychological measurement of urban residents in Taiwan. Landsc. Urban Plan. 2025, 255, 105256. [Google Scholar] [CrossRef]
- Li, T.; Hu, H.; Ma, H.; Ma, J.; Li, Q. Using Virtual Reality to Enhance Learning Performance and Address Educational Resource Disparities in Architectural History Courses. Sustainability 2025, 17, 866. [Google Scholar] [CrossRef]
- Huang, D.; Gong, W.; Wang, X.; Liu, S.; Zhang, J.; Li, Y. A Cognition–Affect–Behavior Framework for Assessing Street Space Quality in Historic Cultural Districts and Its Impact on Tourist Experience. Buildings 2025, 15, 2739. [Google Scholar] [CrossRef]
- Hong, T.; Yim, S.H.; Heo, Y. Interpreting complex relationships between urban and meteorological factors and street-level urban heat islands: Application of random forest and SHAP method. Sustain. Cities Soc. 2025, 126, 106353. [Google Scholar] [CrossRef]
- Qiao, Y.; Sun, H.; Qi, J.; Liu, S.; Li, J.; Ji, Y.; Wang, H.; Peng, Y. Examining water bodies’ cooling effect in urban parks with buffer analysis and random forest regression. Urban Clim. 2025, 59, 102301. [Google Scholar] [CrossRef]
- Eshraghi, P.; Talami, R.; Dehnavi, A.N.; Mirdamadi, M.; Zomorodian, Z.S. Adopting explainable-AI to investigate the impact of urban morphology design on energy and environmental performance in dry-arid climates. Adv. Build. Energy Res. 2025, 19, 497–531. [Google Scholar] [CrossRef]
- Sheng, T.; Zhang, Z.; Qian, Z.; Ma, P.; Xie, W.; Zeng, Y.; Zhang, K.; Sun, Z.; Yu, J.; Chen, M. Examining urban agglomeration heat island with explainable AI: An enhanced consideration of anthropogenic heat emissions. Urban Clim. 2025, 59, 102251. [Google Scholar] [CrossRef]
- Liu, Y.; Wang, H.; Guan, X.; Meng, Y.; Xu, H. Urban flood depth prediction and visualization based on the XGBoost-SHAP model. Water Resour. Manag. 2025, 39, 1353–1375. [Google Scholar] [CrossRef]
- Zhang, X.; Zhao, P.; Hu, Q.; Ai, M.; Hu, D.; Li, J. A UAV-based panoramic oblique photogrammetry (POP) approach using spherical projection. ISPRS J. Photogramm. Remote Sens. 2020, 159, 198–219. [Google Scholar] [CrossRef]
- Alvarez-Vanhard, E.; Corpetti, T.; Houet, T. UAV & satellite synergies for optical remote sensing applications: A literature review. Sci. Remote Sens. 2021, 3, 100019. [Google Scholar]
- Ding, L.; Zhou, J.; Meng, L.; Long, Z. A practical cross-view image matching method between UAV and Satellite for UAV-based geo-localization. Remote Sens. 2020, 13, 47. [Google Scholar] [CrossRef]
- Larkin, A.; Huang, T.; Chen, L.; Lin, P.I.; Hart, J.E.; Zhang, W.; Coull, B.A.; Yi, L.; Suel, E.; Hankey, S.; et al. Developing Nationwide Estimates of Built Environment Quality Characteristics Using Street-View Imagery and Computer Vision. Environ. Sci. Technol. 2025, 59, 13638–13646. [Google Scholar] [CrossRef]
- Li, S.; Tan, Y.; Zhou, Z.; Chen, P.; Li, C.; Zhou, C. Enhancing 3D building reconstruction quality using UAV multi-view photogrammetry and multi-modal large models. Autom. Constr. 2026, 181, 106664. [Google Scholar] [CrossRef]
- Maniee, S.; Zomorodian, Z.S.; Tahsildoost, M. Assessing Lighting Design in Urban Open Spaces: A Virtual Reality Experiment. Build. Environ. 2025, 283, 113362. [Google Scholar] [CrossRef]
- Zhang, H. Understanding Human Perceptions of Street View Images Using MIT Place Pulse 2.0 Dataset. Ph.D. Thesis, Waseda University, Shinjuku, Japan, 2023. [Google Scholar]
- Reza, S.A.; Hasan, M.S.; Amjad, M.H.; Islam, M.S.; Rabbi, M.M.; Hossain, A.; Shovon, M.S.; Jakir, T. Predicting energy consumption patterns with advanced machine learning techniques for sustainable urban development. J. Comput. Sci. Technol. Stud. 2025, 7, 265–282. [Google Scholar] [CrossRef]
- Zhang, A.; Tariq, A.; Quddoos, A.; Naz, I.; Aslam, R.W.; Barboza, E.; Ullah, S.; Abdullah-Al-Wadud, M. Spatio-temporal analysis of urban expansion and land use dynamics using google earth engine and predictive models. Sci. Rep. 2025, 15, 6993. [Google Scholar] [CrossRef] [PubMed]
- Li, X.; Wang, Z.; Chen, Y.; Wang, Z.; Kuang, D. Exploring the impact of land use on bird diversity in high-density urban areas using explainable machine learning models. J. Environ. Manag. 2025, 374, 124080. [Google Scholar] [CrossRef]
- Hao, J.; Ho, T.K. Machine learning made easy: A review of scikit-learn package in python programming language. J. Educ. Behav. Stat. 2019, 44, 348–361. [Google Scholar] [CrossRef]
- Neilson, B.N.; Craig, C.M.; Travis, A.T.; Klein, M.I. A review of the limitations of Attention Restoration Theory and the importance of its future research for the improvement of well-being in urban living. Vis. Sustain. 2019, 11, 59–67. [Google Scholar]
- Akcelik, G.N.; Choe, K.W.; Rosenberg, M.D.; Schertz, K.E.; Meidenbauer, K.L.; Zhang, T.; Rim, N.; Tucker, R.; Talen, E.; Berman, M. Quantifying urban environments: Aesthetic preference through the lens of prospect-refuge theory. J. Environ. Psychol. 2024, 97, 102344. [Google Scholar] [CrossRef]
- Dosen, A.S.; Ostwald, M.J. Evidence for prospect-refuge theory: A meta-analysis of the findings of environmental preference research. City Territ. Archit. 2016, 3, 4. [Google Scholar] [CrossRef]
- de Faria, L.C. Airflow in urban canyons: A comparison of CFD calculated results with values from physical models with reduced scale tested in a wind tunnel. Arte 21 2023, 20, 63–112. [Google Scholar] [CrossRef]
- Li, J.; Huang, Z.; Zheng, D.; Zhao, Y.; Huang, P.; Huang, S.; Fang, W.; Fu, W.; Zhu, Z. Effect of landscape elements on public psychology in urban park waterfront green space: A quantitative study by semantic segmentation. Forests 2023, 14, 244. [Google Scholar] [CrossRef]
- Han, T.; Tang, L.; Liu, J.; Jiang, S.; Yan, J. The influence of multi-sensory perception on public activity in urban street spaces: An empirical study grounded in landsenses ecology. Land 2024, 14, 50. [Google Scholar] [CrossRef]
- Foellmer, J.; Tyler, N. Multisensory approaches to urban pollution: A controlled study of sound-odour interactions and their impact on physiology and perception. Build. Environ. 2025, 285, 113506. [Google Scholar] [CrossRef]
- Soleimanpour, M.; Alizadeh, O.; Sabetghadam, S. Analysis of diurnal to seasonal variations and trends in air pollution potential in an urban area. Sci. Rep. 2023, 13, 21065. [Google Scholar] [CrossRef]






| Data Category | Specific Data Type | Source/Instrument | Resolution/Scale | Extent/Quantity | Purpose |
|---|---|---|---|---|---|
| 3D Spatial Data | Oblique Photography Images | DJI Mavic 3 Drone | GSD = 5 cm | Core Riverfront Zone | High-fidelity texture for visual focus |
| Contextual 3D Data | Google Earth | - | Peripheral Urban Background | Skyline context & Geometric validation | |
| Integrated Reality Model | ContextCapture v4.4.10 | Relative Accuracy Verified | 2 Models (Thames & Seine) | 3D Metric Extraction | |
| 2D Visual Data | Water-based Panoramic Images | Google Street View API | 16,384 × 8192 pixels | 200 Sampling Points | Segmentation of Visual Elements |
| Dimension | Indicator Name (Abbr.) | Operational Definition/Formula | Data Source | Brief Rationale |
|---|---|---|---|---|
| 2D Visual | Green View Index (GVI) | Ratio of pixel area identified as ‘Tree’ and ‘Grass’ to total image pixels. | SegFormer (Panoramic Image) | Proxy for “naturalness” and ecological visual quality. |
| Sky View Index (SVI) | Ratio of pixel area identified as ‘Sky’ to total image pixels. | SegFormer (Panoramic Image) | Indicator of spatial openness and potential psychological relief. | |
| Building View Index (BVI) | Sum of ‘Traditional’ and ‘Modern Building’ pixel ratios. | SegFormer (Panoramic Image) | Represents the visual density and dominance of the artificial interface. | |
| Revetment View Index (RVI) | Ratio of pixel area identified as ‘Revetment’ (riverbank walls). | SegFormer (Panoramic Image) | Quantifies the hardness of the immediate water-land boundary. | |
| Dynamic Object Index (DOI) | Aggregate ratio of mobile element pixels (People, Boats, Cars, etc.). | SegFormer (Panoramic Image) | Proxy for visual vibrancy and social activity intensity. | |
| 3D Spatial | River Width (W) | Shortest perpendicular distance between riverbanks at the sampling point. | Grasshopper (3D Model) | Defines the fundamental horizontal scale of the corridor. |
| Building Height (Havg) | Average vertical height of the first-line buildings along the banks. | Grasshopper (3D Model) | Key metric for the vertical scale of the urban interface. | |
| Height-to-Width Ratio (H/W) | Ratio of average building height to river channel width at the cross-section. | Grasshopper (3D Model) | Quantifies the “Canyon Effect” and degree of spatial enclosure. | |
| Viewshed Area (V3D) | Total visible area (km2) from the observer’s viewpoint. | Grasshopper (3D Model) | Measures 3D spatial permeability and visual scope. |
| Category | Indicator | River Thames | River Seine | Characteristic Difference |
|---|---|---|---|---|
| 2D Visual | Green View Index (GVI) | M = 3.3% SD = 3.1% | M = 9.0% SD = 4.3% | Seine is significantly greener. |
| Sky View Index (SVI) | M = 45.5% SD = 4.9% | M = 38.6% SD = 3.1% | Thames is more open. | |
| Building View Index (BVI) | M = 5.7% SD = 1.4% | M = 5.4% SD = 2.3% | Thames is uniform; Seine varies at nodes. | |
| Revetment View Index (RVI) | M = 2.8% SD = 0.8% | M = 5.0% SD = 1.9% | Seine has higher revetment visibility. | |
| Dynamic Object Index (DOI) | M = 0.5% SD = 0.7% | M = 1.2% SD = 0.9% | Seine has higher vibrancy/activity. | |
| 3D Spatial | River Width (W) | M = 83.6 m SD = 8.8 m | M = 79.5 m SD = 17.7 m | Thames is wider and more uniform. |
| Building Height (Havg) | M = 57.1 m SD = 44.7 m | M = 45.8 m SD = 13.6 m | Thames has extreme vertical fluctuations. | |
| Height-to-Width Ratio (H/W) | M = 0.7 SD = 0.6 | M = 0.6 SD = 0.2 | Thames creates dramatic “urban canyons”. | |
| Viewshed Area (V3D) | M = 0.7 km2 SD = 0.18 km2 | M = 0.2 km2 SD = 0.05 km2 | Thames has significantly higher permeability. |
| Building View Index | Green View Index | Sky View Index | Revetment View Index | Dynamic Object Index | Viewshed Area | H/W Ratio | Building Height | River Width | |
|---|---|---|---|---|---|---|---|---|---|
| Sense of Affluence | 0.719 ** | 0.407 ** | −0.715 ** | 0.249 ** | 0.112 | −0.523 ** | 0.683 ** | 0.637 ** | −0.369 ** |
| Vibrancy | 0.367 ** | −0.005 | −0.280 ** | 0.155 * | 0.592 ** | 0.386 ** | 0.421 ** | 0.638 ** | 0.082 |
| Sense of Boredom | −0.173 ** | 0.250 | 0.130 ** | 0.238 | −0.392 ** | −0.499 ** | −0.200 | −0.410 | −0.262 |
| Scenic Beauty | 0.595 ** | 0.560 ** | −0.688 ** | −0.238 * | 0.286 * | −0.287 * | 0.181 | 0.216 ** | −0.097 |
| Sense of Affluence | Vibrancy | Sense of Boredom | Scenic Beauty | ||
|---|---|---|---|---|---|
| Model optimal parameters | max_depth | 10 | 10 | 10 | 5 |
| min_samples_split | 6 | 2 | 6 | 2 | |
| n_estimators | 100 | 50 | 50 | 100 | |
| R2 | 0.509 | 0.667 | 0.425 | 0.619 | |
| MSE | 0.080 | 0.107 | 0.274 | 0.134 | |
| MAE | 0.234 | 0.238 | 0.415 | 0.281 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Tang, Y.; Chen, S.; Xu, W.; Ren, J.; Luo, J. Decoding Urban Riverscape Perception: An Interpretable Machine Learning Approach Integrating Computer Vision and High-Fidelity 3D Models. ISPRS Int. J. Geo-Inf. 2026, 15, 91. https://doi.org/10.3390/ijgi15020091
Tang Y, Chen S, Xu W, Ren J, Luo J. Decoding Urban Riverscape Perception: An Interpretable Machine Learning Approach Integrating Computer Vision and High-Fidelity 3D Models. ISPRS International Journal of Geo-Information. 2026; 15(2):91. https://doi.org/10.3390/ijgi15020091
Chicago/Turabian StyleTang, Yuzhen, Shensheng Chen, Wenhui Xu, Jinxuan Ren, and Junjie Luo. 2026. "Decoding Urban Riverscape Perception: An Interpretable Machine Learning Approach Integrating Computer Vision and High-Fidelity 3D Models" ISPRS International Journal of Geo-Information 15, no. 2: 91. https://doi.org/10.3390/ijgi15020091
APA StyleTang, Y., Chen, S., Xu, W., Ren, J., & Luo, J. (2026). Decoding Urban Riverscape Perception: An Interpretable Machine Learning Approach Integrating Computer Vision and High-Fidelity 3D Models. ISPRS International Journal of Geo-Information, 15(2), 91. https://doi.org/10.3390/ijgi15020091
