1. Introduction
Accelerating global urbanization has elevated the design of urban vitality spaces to a central concern in contemporary urban planning and architectural practice [
1,
2]. While conventional approaches prioritize functional efficiency and visual appeal, they frequently overlook people’s psychological needs and profound emotional attachment with built environments [
3,
4]. Empirical evidence demonstrates that human–space interactions and their associated emotional responses constitute fundamental determinants of spatial desirability and place attachment [
5,
6,
7,
8]. Research underscores that emotional experiences generated through these interactions significantly enhance people’s belongingness while shaping a space’s perceived attractiveness [
9]. Nevertheless, current methodologies lack systematic frameworks for incorporating emotional science into spatial design practices or implementing technological solutions for targeted emotional design methodology [
10,
11,
12]. This gap persists despite emerging capabilities in emotion-aware technologies, limiting designers’ capacity to create psychologically responsive built environments [
13]. The complexity and interdisciplinary nature of emotional attachment hinder its quantification and systematic study, let alone the translation of relevant findings into actionable design methods [
3,
4,
14,
15,
16,
17]. While the big data methodology now enables multimodal data analysis, converting two-dimensional results into three-dimensional spatial designs remains challenging. The recent maturity of AI-generative design presents an opportunity to address this limitation [
18,
19].
Meanwhile, generative artificial intelligence (AI) models, such as Stable Diffusion and Midjourney, have shown significant potential in automating visual content creation [
20,
21]. AI-generative design is currently employed by practitioners for early scheme comparison and final rendering production in architectural and urban design [
13,
22,
23]. However, their integration with psychological principles remains underdeveloped, often yielding outputs with a limited scientific validation of emotional relevance. This disconnect underscores the critical need to establish intelligent, emotion-driven design methodologies [
23]. Current emotional design research predominantly employs subjective evaluations or qualitative analyses, lacking systematic, quantifiable, and replicable technical frameworks, which can be explored with the help of AI’s generative design capabilities based on massive data [
13,
20]. Conversely, while generative AI has advanced image synthesis capabilities, existing models prioritize formal aesthetics over psychological resonance, rendering them inadequate for addressing the nuanced emotional requirements of multifunctional urban environments. Therefore, this study further aims to address limitations in applying AI-generative technology to urban design, enabling its meaningful application for advancing human-centered spatial design rather than mere image generation. Bridging these interdisciplinary and technological divides to create a unified design and evaluation framework has emerged as a critical challenge for fostering emotionally intelligent urban vitality spaces.
To address these challenges, this study introduces an emotionally responsive design methodology leveraging low-rank adaptation (LoRA) technology, establishing a novel integration of generative AI with psychological evaluation systems through a dual-phase “intelligent generation–emotional quantification” framework. Taking a representative urban vitality complex, THE BOX in Beijing’s Chaoyang District, as an example, this study intends to explore the following to operate this framework:
- (1)
The way to fine-tuning the Stable Diffusion model via LoRA to optimize output fidelity for context-specific urban scenarios is explored.
- (2)
The possibility to automate text-to-panoramic design generation, combining ChatGPT 4.0’s semantic parsing with CycleGAN’s cross-domain style, is explored.
- (3)
The construction and implementation of emotional attachment scales to quantitatively assess people’s attachment to original and AI-generated environments are explored.
- (4)
The similarities and differences between multi-dimensional features of people’s emotional attachment to original and AI-generated environments are explored.
In this way, this study systematically integrates LoRA fine-tuning, multimodal text-image synthesis, and cross-domain style transfer techniques to address the limitations of single-model architectures in generating urban environments aligned with people’s emotional needs. By developing a dual-phase framework that synergizes generative AI with emotional attachment theory, it establishes an empirically verifiable technical framework for affectively optimized urban design. The results advance human-centric urban vitality design strategies while providing actionable insights for computationally augmented, psychologically attuned urban development in the AI era. Furthermore, the methodology offers urban planners and designers a systematic workflow to strategically implement AI tools for evidence-based built environment design and evaluation.
1.1. Design and Research of Urban Vitality Spaces in AI Era
The conceptual evolution of urban vitality space remains intrinsically tied to the interpretation of the core value of “vibrancy.” Initially framed by Jane Jacobs through the lens of built environment design as spatial attractiveness and activity driving force [
24], this notion was expanded by Montgomery through emphasizing its experiential dimension of creating “liveliness” [
25] perceived by people. After over half a century of interdisciplinary research, scholarly consensus acknowledges that urban vitality universally correlates with three elements across various perspectives and scales: people density, diverse behavior, and mixed function [
14,
26,
27,
28]. This understanding has directed the recent research focus toward urban vibrant complexes [
29], which has also provided the contextual framework and reference for the research object selection for this study [
14]. Though lacking a universal definition, the core concept of “vibrancy” inherently prioritizes the creation of life-sustaining built environments and human-centered spatial design methodology. In the digital era, achieving such vitality increasingly demands attention to advanced human needs, particularly emotional dimensions, during spatial design processes. The emergence and evolution of vibrant spaces fundamentally stem from interactions between people and built environments, where fulfilling emotional spatial needs constitutes the essence of vibrancy.
However, the advancement of digital technologies, particularly the maturation of data mining and analytical methods, has driven interdisciplinary scholars in urban vitality research to prioritize data-driven identification of vitality-influencing factors. Their focus predominantly centers on non-material dimensions such as socioeconomic and cultural dimensions [
1,
30,
31,
32], while spatial studies often reduce people’s three-dimensional behavioral patterns to simplified two-dimensional heatmaps or one-dimensional metrics through diverse data mining and analysis approaches [
33]. Current research and design in the field of built environments rarely extend beyond street-scale analyses [
32,
34,
35,
36], with minimal attention to architectural spatial characteristics that are close to human scale and help shape people’s real emotional experiences. Existing discussions predominantly offer either qualitative descriptions of built environments or quantified indicators, failing to capture the interactive essence of vitality. This deficiency may stem from two critical gaps: first, the challenge of representing human-scale three-dimensional spatial interactions within conventional two-dimensional mapping frameworks common in geographical and urban planning studies; second, the absence of interdisciplinary theoretical frameworks capable of objectively evaluating human perceptions and emotional responses in urban vitality studies.
While urban vitality research has yielded substantial insights, it necessitates further methodological advancements in quantifying and evaluating emotional dimensions—the core essence of vitality. Meanwhile, the rapid development of AI-generative design tools has prompted some urban planners and architects to experiment with algorithmically generated vibrant urban complexes. However, the critical question of whether such AI-generated spaces authentically align with people’s emotional needs for urban vitality spaces remains unanswered, directly determining whether these tools genuinely enhance design quality or merely perpetuate technical formalism. Consequently, systematically evaluating the emotional attachment characteristics of both existing and AI-generated vibrant spaces holds dual significance: methodologically, it advances urban vitality research by integrating humanized evaluations into design frameworks; ethically, it establishes value-oriented guidelines for redefining design logic in an era where AI increasingly mediates the creation of urban environments [
13,
23]. This interdisciplinary inquiry bridges the gap between computational design efficiency and human experiential depth, ultimately steering AI’s role in urban design and research toward emotionally intelligent spatial solutions.
In this case, the low-rank adaptation (LoRA) model, a parameter-efficient fine-tuning technique, offers enhanced computational efficiency and output precision in generative AI systems. This capability positions generative AI as a transformative tool for urban vitality space design, enabling the data-driven optimization of complex built environments while addressing their emotional dimensions. Designers and researchers globally have employed this methodology across diverse spatial design dimensions [
37,
38,
39]. Recent practices and findings have yielded particularly notable results, proving its effectiveness and offering foundational reference for this study. However, current research remains confined to generated forms and technical workflows of LoRA, largely unexplored in AI-generative design’s impact on real users’ spatial perception, which is the precise gap this study addresses.
1.2. Development and Application of AI-Generated Design in Urban Design and Related Fields
To evaluate the commonalities and distinctions in emotional attachment characteristics between real urban vitality spaces and those generated through AI-driven design methodologies, thereby optimizing the application of generative AI in urban vitality space design, it is imperative to systematically examine existing research and practical applications of this methodology in urban design contexts. AI-assisted design has successfully promoted the creation of various types of works in the field of art, including images, videos, music, and so on, and has received positive feedback from the public [
40,
41,
42]. However, prior to deployment, generative AI systems require substantial model training to translate human design intent and expertise into computationally tractable representations [
43,
44]. The low-rank adaptation (LoRA) technique addresses this challenge through the parameter-efficient fine-tuning of foundation models, employing a trainable low-rank matrix decomposition to update network weights [
37,
38,
39,
45]. This approach achieves targeted model specialization while preserving baseline performance, drastically reducing computational and memory overhead compared to full-parameter retraining. By decoupling adaptation complexity from original model architecture, LoRA enables the resource-efficient optimization of generative systems [
46]—a critical advancement for scalable applications in design domains where balancing computational pragmatism with output fidelity remains paramount [
39].This LoRA-driven spatial image generation is widely adopted across disciplines such as industrial design, game development, and animation production [
47,
48]. Its rapid iteration capability offers a novel methodological approach to design exploration, which also influences urban design field [
13].
Recent advancements demonstrate the growing efficacy of low-rank adaptation (LoRA) in generative urban design applications. Some researchers have employed LoRA to enhance text-guided architectural façade generation, minimizing dependence on extensive training datasets while integrating ControlNet for constraint-based refinement, thereby improving output precision and controllability [
49,
50,
51]. Parallel research has systematically compared fine-tuning strategies for Stable Diffusion, validating LoRA’s superiority in achieving parameter-efficient model adaptation without compromising generative fidelity [
52,
53,
54]. These studies collectively underscore LoRA’s dual strengths: (1) enabling high-quality domain-specific synthesis through low-rank matrix decomposition that preserves baseline model integrity, and (2) drastically reducing computational overhead via selective weight updates—a critical advantage for iterative design workflows. While LoRA’s technical merits are well-established in image generation, its application to specific urban vitality spaces remains underexplored, particularly in reconciling computational efficiency with the psychosocial complexity of public environments. The methodology’s capacity for rapid, targeted adjustments positions it as a transformative tool for AI-generated urban design, yet the absence of empirical implementations necessitates further research to optimize its integration with vitality space design and research, as well as human-centered emotional evaluation frameworks.
1.3. Definition and Measurement of People–Environment Emotional Attachment
The study of people–environment emotional attachment has evolved through interdisciplinary contributions spanning psychology, human geography, architecture, and social ecology. Pioneering concepts such as Wright and Tuan’s Topophilia (1960s–70s) [
55] and Ralph’s sense of place (1970s) [
56] have laid the foundation for later expansions by scholars including Krupat, Hidalgo, and Norberg-Schulz, who articulated multi-dimensional frameworks encompassing place identity [
57], place dependence, place attachment [
5], and genius loci [
58]. Defined as the dynamic interplay of emotional, cognitive, and behavioral bonds between people and specific environments (both built and social), emotional attachment theory has continuously gained traction in urban design field as cities prioritize spaces that enhance public well-being [
6,
8]. Urban vitality spaces—critical loci for promoting activity and fostering happiness—exemplify this shift, serving as perfect research objects for adapting attachment principles (e.g., place-based rehabilitation strategies), while addressing complex socio-cognitive needs. This theoretical progression directly informs the focus of this study on vitality spaces as catalysts for emotionally responsive urban design. Meanwhile, this study further advocates integrating affective considerations into assessment frameworks for vital urban spaces.
Based on this, scholars have developed multi-dimensional measurement frameworks for emotional attachment, among which the Positive and Negative Affect Scale [
59] (PANAS) for measuring emotional orientation is often integrated with the new problem-oriented scale for measuring emotional degrees. For the former, the reliability, validity, and rationality for PANAS’ application to architecture, landscape, and urban-related research have been confirmed in relevant studies [
60,
61]. For the latter, some researchers have constructed emotional attachment measurement scales for specific characteristics of the built environment based on relevant theoretical methods, and have explored their application in various types of cities and landscape spaces at home and abroad [
3,
14]. Among them, the characteristics are divided into material ones such as materials, structure, form, etc., and immaterial ones such as privacy describing the spatial experience, territoriality describing the culture and history, and changeability-related characteristics emerging with the digital age. Their results have demonstrated the effectiveness of the measurement methods. This methodological evolution not only confirms the efficacy of hybrid emotional attachment evaluation approaches but directly informs the current study’s development of semi-structured evaluation tools, where emotional attachment theory operates dually as a conceptual foundation and operational framework for systematizing people–environment emotional interactions. This approach not only addresses the existing demand for quantitative emotional analysis in urban vitality space research, but also establishes theoretical frameworks and value-oriented guidelines for AI-generated urban space design and evaluation, thereby bridging methodological gaps while steering technological applications toward emotion-oriented urban innovation.
4. Conclusions
This study explored an emotion-driven design generation method based on LoRA (low-rank adaptation) technology, comparing LoRA-generated designs with the emotional attachment characteristics of the real built environments of the representative vitality space, THE BOX, in Beijing. The results show that, in general, both images of generative spaces and original spaces can bring positive emotional experience to people, but generative designs still cannot replace the positive emotional impact of the original real space, especially in the characteristics of forming a true three-dimensional experience of space, such as form and structure. At the same time, the ability of generative designs to convey spatial culture and help people perceive the interactivity of the space through vision is still weak, which will cause people to have a negative experience. The findings indicate that, compared to the original space, LoRA-generated designs need to enhance more spatial dynamism and interactivity by optimizing visual elements, spatial formation and structural layouts, attractive color schemes, and material selection. In the future, this approach may possibly hold promise for broader applications in urban vitality space design, further exploring how to integrate deeply with built environments to create richer and more multi-dimensional spatial experiences for users.
Generally speaking, AI-generated design methods enhance real-world urban design workflows and participatory processes through accelerated ideation, data-informed iteration, and scalable engagement. In design workflows of urban vitality spaces, generative tools such as LoRA technology rapidly produce prototypes or spatial configurations based on emotional-oriented parametric constraints, augmenting designers’ creativity with combinatorial efficiency. For public engagement, AI promotes the ability to synthesize community input into visualizable space alternatives—democratizing feedback loops while mitigating resource barriers. In participatory planning, these methods enable dynamic scenario modeling techniques that respond to emotional needs and foster inclusive deliberation by translating diverse stakeholder priorities into tangible options. However, critical oversight remains essential to address embedded biases, ensure cultural resonance, and preserve human agency in final decision-making. Specifically, from the theoretical and methodological perspectives, the conclusions of this study are as follows:
4.1. Practical Implementations of AI-Generated Technology in the Field of Urban Design
This study applied AI-generated LoRA technology to the field of emotional-oriented design for urban vitality spaces, significantly aiming at enhancing the emotional attachment effectiveness of LoRA-generated designs. Spatially, LoRA-generated designs improve physical attributes and play a key role in enhancing spatial interactivity and emotional attachment. They not only visually attract participants’ interest and positive responses but also effectively improve interactivity and comfort, significantly boosting urban vitality. In terms of detail, LoRA fine-tuning allows for precise adjustments of emotional elements, optimizing visual components, structural layouts, and appealing color and material selections. This fosters stronger emotional attachments between participants and spaces, greatly enhancing their sense of emotional attachment and psychological satisfaction. Overall, generative AI technology optimizes designs that have the potential to provide new intelligent solutions for emotional spatial designs and promote the application of cross-individual intelligence in the field. Specifically, AI-generated design methods to promote the formation of emotionally attached vibrant spaces can focus specifically on two levels: material and non-material generation methods.
4.1.1. Material-Level Generation: Focusing on Spatial Structural Characteristics That Elicit Embodied Experiences
At the level of the material characteristics of vitality space, based on the above results, it can be inferred that the influencing ability of generative designs on emotional attachment in describing the three-dimensional structural characteristics of space is still weaker than that of real spaces to a certain extent. Such characteristics often enhance people’s positive emotional attachment to space by triggering people’s perception of the embodiment of spatial experience. Therefore, in the future construction and adjustment of emotional-oriented generative design models, further attention can be paid to the generation methods of characteristics such as spatial materials, spatial structure, and spatial forms.
4.1.2. Immaterial-Level Generation: Focusing on Cultural and Interactive Characteristics Generation That Enhance Sense of Place
At the level of the immaterial characteristics of vitality space, the results show that generative designs lead to people’s negative experience to a certain extent due to the lack of the depiction of local cultural and historical characteristics of a certain space, as well as flexible interactive characteristics. This indicates that even when the spatial image is static, the cultural characteristics such as the territoriality and the interactive characteristics such as the playability of the spatial scene depicted are still valuable in enhancing a positive emotional attachment. Therefore, in the improvement of generative design models in the future, more technologies that can help generate characteristics to express spatial culture and interactivity should be added.
4.2. Theoretical Value of AI-Driven Emotional-Oriented Paradigm for Urban Vitality Space
By integrating the data analysis with emotional attachment, this study proposed a new paradigm for urban vitality space design that surpasses the limitations of traditional design approaches. Through the construction of a series of emotional attachment scales, this study compared the emotional attachment characteristics of LoRA-generated designs with those of built environments, further assessing their advantages and disadvantages compared with traditional design methods. Taking THE BOX in Chaoyang, Beijing, as an example, the analysis of 115 valid datasets validated the effectiveness of the “Generative AI—Emotional Attachment” dual-driven model in creating emotionally resonant spatial designs.
In the process of spatial design generation, iterative optimization models serve as a crucial approach to enhancing design quality and ensuring emotional attachment. To continuously refine the design scheme and improve its emotional guidance, this study employed a multi-round iterative optimization process based on LoRA technology. Through a cyclic feedback mechanism, the design scheme and research model underwent repeated optimization and adjustment, gradually achieving precise capture and response to users’ emotional needs.
Specifically, emotional evaluation and feedback were derived from a combination of interviews and surveys, leading to adjustments in LoRA parameters to maximize the generation of positive affect. After each iteration, the design was not only visually optimized but also fine-tuned in terms of emotional elements—such as color, material, and layout—based on emotional assessment results. This approach ensured that the design continuously approached an ideal state aligned with users’ emotional expectations, ultimately maximizing spatial vitality.
Through this iterative optimization mechanism, this study not only enhanced the emotional attachment of the design scheme but also expanded the depth and breadth of LoRA technology in emotion-driven designs. Each iteration provided valuable data and emotional insights for subsequent design generations, forming a positive feedback loop that offers a reliable theoretical and practical foundation for the emotional design of urban vitality spaces. In the future, generative AI can play an active role in more urban design projects, precisely meeting residents’ emotional needs and enhancing spatial vitality and comfort. The introduction of this method provides new perspectives for urban space design, driving it toward greater intelligence and emotional resonance.
4.3. Research Limitations and Future Work
Although this study provides strong empirical results, it is limited by its focus on a single case (THE BOX in Beijing), small number of subjects, and insufficient diversity, lacking discussions on diverse geographic and socio-cultural contexts, as mentioned above. This may restrict the generalizability of the findings. Additionally, the emotional scales used in this study may contain potential cultural biases, as they do not fully account for how different cultural backgrounds influence emotional responses. Emotional expressions toward the same scene may vary under different cultural influences, potentially leading to measurement validity issues in some emotional dimensions within existing scales, which could affect the reliability of conclusions. Furthermore, emotional complexity necessitates multi-dimensional scales and multimodal measurement methodologies.
Therefore, to advance both practical applications and theoretical frameworks, future work should prioritize the following directions. As for the research object, future research should validate findings across diverse urban vibrant spaces of varying scales and cultural contexts. Meanwhile, multimodal emotional measurement methods should be used to explore the differences and commonalities of emotional attachment across various cultural contexts. Human factor engineering, which studies emotional attachment characteristics through physiological feedback measurement, offers significant potential for application in emotion-focused built environment research. At the same time, how to further adjust the path of generating vitality spaces by the AI model to obtain emotional design results that are more conducive to people’s emotional attachment also needs further exploration. Integrating with the immersive environment through VR and AR devices to experience AI-generated spatial design would also help improve the diversity of evaluation perspectives. At the same time, the time dimension should also be included in the emotional evaluation of AI-generative design results in future research. For example, long-term user research methods from the field of industrial design can help achieve this goal.