Research on the Application Effectiveness of Generative AI in Design Projects from Data-Driven and Sustainable Perspectives
Abstract
1. Introduction
1.1. Development History of Generative AI in Design
1.2. Research Questions
- (1)
- Through comparative analysis of common large generative AI models in the current industry, which large generative AI model is more suitable as the core application tool in the practice of architectural design projects [12]?
- (2)
- How can suitable application scenarios and standardized operational processes for generative AI in the early, middle, and late stages of architectural projects be constructed?
- (3)
- Can the actual effectiveness of this technology across multiple dimensions such as design efficiency, scheme diversity, and sustainability be quantitatively verified through data science methods (cluster analysis and statistical testing)?
1.3. Research Scope and Objectives
2. Materials and Methods
2.1. Definition of Traditional and AI-Assisted Design Processes
2.2. Case Selection Criteria
- (1)
- In terms of type selection, empirical research needs to cover at least 2–3 different types of buildings to exclude interference from the particularity of a single type [16]. Among them, office buildings, cultural buildings, and industrial buildings are the three major typical project types in China’s current architectural design field. According to the “2024 China Architectural Design Industry Report”, these three project types account for 62% of the annual design volume, making them representative and ensuring that the research conclusions can be transferred to mainstream industry scenarios.
- (2)
- Regarding data completeness, the comparative experiment needs to obtain information such as modeling duration and discussion/modification frequency in the traditional design process, as well as generation time and prompt parameters for generative AI-assisted design. The lack of any item would prevent establishing a fair comparison baseline [17]. Among the considered types, commercial complex buildings were excluded due to missing iteration records of hand-drawn sketches.
- (3)
- Regarding stage matching, the core task in the initial stage of schematic design is ‘creative divergence—form exploration’, which is the efficiency bottleneck of the traditional process (accounting for 58–65% of the total schematic design cycle), providing a theoretical basis for this criterion in our study [18]. Among the potential cases, residential projects encountered by the author’s design institute had already entered the mid-design stage, and the building forms of residential types were relatively simple, making them unsuitable for complete application in the comparative experiment and unable to maximize the use of generative AI tools for scenario application; thus, residential building types were excluded.
- (4)
- For variable control, projects from the same region were selected as much as possible, adhering to the same set of local architectural design standards (e.g., “Zhejiang Province Green Building Design Standard” DB33/1092-2021 [19]), to exclude the impact of cross-regional standard differences in the design process.
2.3. Generative AI Model Selection and Adaptation
2.3.1. Analysis of Commonly Used Large Generative AI Models
2.3.2. Introduction to the Technical Principles of Stable Diffusion
- (1)
- Core Technical Architecture
- (2)
- Analysis of Adapted Functions for design projects
2.4. Research Methodology
2.4.1. Preliminary Analysis
2.4.2. Case Study
- (1)
- In the Early Stage of Architectural Design Projects
- (2)
- In the Mid- and Late Stages of Architectural Design Projects
2.4.3. Data Analysis
2.4.4. Multi-Dimensional Evaluation
- (1)
- Regarding Questionnaire Design and Implementation
- (a)
- Questionnaire Conception: The content primarily focuses on the quality evaluation of design outcomes, aiming to compare the quality of renderings produced by AI-assisted design vs. the traditional workflow. According to the different focuses of the mid- and late stages, the mid-stage focuses on design expression and creative inspiration, while the late stage focuses on visual realism and Detail Completion. Referencing multiple relevant pieces of literature [1,36,37], such as the four core dimensions proposed by Albaghajati ZM et al. in “Exploring text-to-image application in architectural design: Insights and implications”, potential dimensional indicators were initially designed. Three frontline architectural designers (working in office, cultural, and industrial design, respectively, with over 15 years of experience) and two architectural theory research scholars (university professors) were invited for consultation and discussion to optimize the survey questionnaire, for example, by deleting non-core dimensional content such as construction feasibility, forming an evaluation system adapted to this case project.
- (b)
- Definition of Evaluation Dimensions and Corresponding Indicators: The specific content of the survey questionnaire is shown in Appendix A. The scoring mechanism follows a 7-point Likert scale, making the obtained data closer to continuous variables and the scale more scientific [37]. The specific scoring criteria are defined as 1 point = “Very poor/Completely disagree”, 2 points = “Poor/Basically disagree”, 3 points = “Slightly poor/Partially disagree”, 4 points = “Neutral/Neutral”, 5 points = “Slightly good/Partially agree”, 6 points = “Good/Basically agree”, and 7 points = “Very good/Completely agree” (for example, in the “Visual Realism” dimension, 1 point means “The image differs greatly from the real scene, lighting, shadows, and proportions are completely distorted”, and 7 points means “The image is close to a real photo, with no obvious deviations in lighting direction, material reflection, or perspective relationship”).
- (c)
- Reasons for choosing a 7-point scale instead of 5-point or 10-point scales are as follows: Data continuity—the normality distribution fit of data from a 7-point scale is significantly higher than that of a 5-point scale, allowing direct use in parametric statistical methods like independent samples t-test and ANOVA, meeting the need of this study to “quantitatively compare the differences between AI and traditional workflows”; secondly, regarding evaluation discrimination—5-point scales are prone to “central tendency stacking” (score 4 accounting for over 40%), while the 7-point scale refines the scoring gradient (e.g., the “Detail Completion” dimension can distinguish between “6 points (details are relatively rich, only a few components missing)” and “7 points (details are complete, no obvious defects)”), better fitting the professional need for “fine-grained evaluation” of renderings; additionally, the operational cost is controllable, as this scale can keep the completion time for a single questionnaire within 10–15 min, with a final effective recovery rate of 79.1% (meeting the academic survey standard of ≥70%).
- (d)
- Respondent Characteristics and Quantity Statistics: The survey was conducted on the final renderings for both the mid- and late stages, using online and offline distribution of subjective questionnaires. A total of 212 valid questionnaires were collected. The invited personnel were mainly senior architects from the company, owner representatives, and residents around the company. Specific respondent characteristics are as follows: 82 architectural designers (38.7%), among whom 45 were senior architects with ≥8 years of experience (accounting for 54.9% of the architect group, ensuring the authority of professional scoring); 45 project investors/owner representatives (21.2%, focusing on the practicality of outcomes); 57 non-architectural professional surrounding residents (26.9%, providing a public perspective); additionally, 28 academic researchers in the architectural field (13.2%, supplementing theoretical evaluation dimensions).
- (e)
- Supplementary Details on Survey Implementation: The survey period was from 20 February 2025 to 20 March 2025. At this stage, the late-stage scheme design of the cases was completed, ensuring that respondents could score based on the project’s final outcome drawings (non-simulated diagram). The distribution form adopted a mixed “online + offline” mode. Online distribution was through widely used local social and office software such as WeChat 4.1.5, QQ v9.9.23, and DingTalk 7.6.11. Offline distribution was conducted in research meeting rooms in Jinhua and Yiwu, owner’s office spaces, and near the author’s residence, distributing paper questionnaires (accompanied by color-printed renderings). A total of 268 questionnaires were distributed, and 212 valid questionnaires were recovered.
- (2)
- Integration with Sustainable DevelopmentThe “sustainable development” mentioned in this study is not a broad environmental protection concept but rather performed to excavate the low-carbon potential throughout the life cycle at the architectural schematic design stage, with the linkage of “Form Parameters–Performance Indicators–Low-Carbon Goals” as the core, integrating sustainability requirements into the entire design process through quantitative tools, reducing environmental impact, and improving resource utilization efficiency. By extracting architectural form parameters such as floor area ratio, window-to-wall ratio, building shape coefficient, etc., and using professional simulation tools to evaluate the energy consumption, daylighting, and other performance of the schemes, design outcomes that combine creativity and low-carbon attributes are selected. Specifically, the covered dimensions are as follows:
- (a)
- Efficient use of natural resources. For example, using Ecotect software to evaluate the daylight factor of main functional areas (≥3%, complying with the “Standard for daylighting design of buildings” GB 50033-2013 [38]), calculating natural ventilation efficiency (≥0.7, complying with the “Assessment standard for green building” GB/T 50378-2019 [39]).
- (b)
- Controlled energy consumption and carbon emissions. For example, based on the building shape coefficient (≤0.40, complying with the “Design standard for energy efficiency of public buildings” GB/T 50189-2015 [40]), window-to-wall ratio (≤0.7, complying with the “General code for building energy efficiency and renewable energy utilization” GB/T 55015-2021 [41]), using EnergyPlus software to simulate the annual comprehensive energy consumption (Class A office building ≤65 kWh/m2·year, cultural building ≤55 kWh/m2·year, industrial building ≤50 kWh/m2·year, complying with the “Zhejiang Province Green Building Design Standard” DB33/1092-2021).
- (c)
- Adaptation of low-carbon materials and technologies. For example, focusing on the selection of building facade materials, collecting the proportion of low-carbon material usage in the design scheme (≥30% (based on the “Zhejiang Province Green Building Design Standard” DB33/1092-2021)); by determining the layout area of PV modules, solar collectors, etc., in the scheme design, using relevant green building assessment software (PKPM-GBS Green Building Design Software V3.3) to quickly calculate the contribution ratio of renewable energy.
3. Results
3.1. Online Opinion Analysis Results
3.2. Application and Demonstration of Generative AI in the Early Stage of Design Projects
3.2.1. Demonstration Method: Efficiency Data Comparison
3.2.2. Demonstration Method: Diversification Quantitative Analysis
- (1)
- Jinhua Information Economy Industrial Park (Figure 7)
- (2)
- Jinhua Library (Figure 8)
- (3)
- Yiwu Comprehensive Bonded Industrial Park (Figure 9)
3.2.3. Preliminary Assessment of Sustainability Indicators
3.3. Mid- and Late-Stage Application Effectiveness Results
3.3.1. Demonstration Method: Efficiency Analysis
3.3.2. Demonstration Method: Output Quality Analysis
4. Research Discussion
4.1. Generative AI Technology Reshaping Architectural Workflows
4.2. Limitations and Future Work
4.2.1. Limitations
- (1)
- Limitations of the questionnaire. Considering operability, the questionnaire content for mid-to-late-stage architectural design should not be excessive. The comparative analysis was only representative, using “high-rise office” type projects, resulting in slightly less coverage of project types. Future work needs to further verify performance for other project types.
- (2)
- Hardware acceleration limitations. Commonly used large generative AI models like Stable Diffusion require high-performance GPUs to generate high-resolution (2048 × 2048) images, resulting in high hardware costs. Furthermore, the refinement and realism need improvement. Currently, this poses an operational barrier for small architectural firms lacking professional computing infrastructure.
- (3)
- Insufficient interoperability within the software ecosystem. Currently, apart from Inspiration Rendering being an internal plugin for SketchUp pro 2025, allowing interoperability and real-time rendering, the integration of mainstream generative AI models with industry-standard BIM and CAD platforms (e.g., Revit and ArchiCAD) is limited. This hinders the seamless flow and real-time iteration of design data, constraining the scalability of technology application.
- (4)
- Insufficient Domain-Specific Knowledge Embedding: Current generative models lack embedded architectural code compliance checks and structural logic, requiring manual post-processing and external validation for regulatory adherence and constructability.
- (5)
- Prompt Sensitivity and Operational Threshold: Output quality and semantic alignment are highly sensitive to prompt engineering and parameter tuning (CFG Scale, Denoising Strength, ControlNet module selection, etc.), necessitating advanced user expertise and increasing operational complexity.
- (6)
- Data Provenance and Copyright Risk: Training datasets sourced from publicly available architectural imagery may introduce copyright infringement risks and high-similarity outputs, necessitating improved dataset curation and provenance tracking [16].
4.2.2. Future Work
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A
- Questionnaire Preface and Informed Consent
- II
- Respondent Background Information
- What is your primary professional field? (Single choice)A. Architectural DesignerB. Academic Research/Education in ArchitectureC. Project Investor/Project Decision ControllerD. Not related to construction engineeringG. Other (Please specify) _________
- What is your professional experience in architecture and related fields? (Single choice)A. Less than 3 yearsB. 3–5 yearsC. 6–10 yearsD. More than 10 years
- What is your practical experience with AI image generation tools (e.g., Stable Diffusion)? (Single choice)A. Never used, and not familiarB. Somewhat familiar, but have not usedC. Beginner, have tried simplyD. Intermediate user, can apply in some projectsE. Advanced user, can skillfully integrate into workflow
- III
- Evaluation Instructions
- IV
- Main Evaluation Section
- (1)
- Quality Comparison of Mid-Stage Architectural Project RenderingsDesign Scenario Description: This project is in the mid-stage of scheme deepening. Please evaluate Image A and Image B of the mid-stage renderings separately according to the following detailed indicators.Figure A1. (Left): Image A, (Right): Image B.
Evaluation Dimension and Specific Indicators Indicator Description Image A Score (1–7) Image B Score (1–7) 1. Clarity of Design Concept Expression 1.1 Concept Legibility Whether the core design concept (e.g., “financial district ambiance”, “rhythmic facade repetition”) can be intuitively perceived and understood. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 1.2 Functional Layout Expression Whether the main building entrance and pedestrian/circulation flows are clearly suggested or expressed. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 1.3 Design Intent Communication Whether the image successfully conveys the intended atmosphere or experience the project aims to create (e.g., dignified and solemn, strong financial ambiance). 1 2 3 4 5 6 7 1 2 3 4 5 6 7 2. Spatial and Volumetric Relationship 2.1 Proportion and Sense of Scale The correctness of proportions between architectural elements, and between the building and reference objects (people, vehicles); the clarity of the building’s scale. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 2.2 Structural Massing Expression The clarity in expressing the building’s form, silhouette, volume, and transitional relationships. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 2.3 Spatial Hierarchy The richness of spatial organization and the variation between solid/void, near/far elements; the clarity of spatial layers. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 3. Overall Atmosphere and Esthetic Value 3.1 Composition and Balance The esthetic quality of the image composition (e.g., viewpoint selection, subject placement); the balance and stability of elements. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 3.2 Color and Tone The harmony of the color palette; whether the tonal control aligns with the project’s intended character. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 3.3 Atmosphere Rendering Whether the specific atmosphere (e.g., commercial, leisure, residential) intended for the scene is effectively and clearly conveyed. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 4. Potential to Inspire Further Design Development 4.1 Creative Inspirational Value Whether the image is inspiring and can effectively stimulate further thinking within the design team, promoting scheme optimization. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 The convenience of using this image as a base for exploring design variants and making rapid modifications. The convenience of using this image as a base for exploring design variants and making rapid modifications. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 - (2)
- Quality Comparison of Late-Stage Architectural Project RenderingsDesign Scenario Description: This project is located in Jinhua City, Zhejiang Province, and is in the late stage of final delivery. This rendering serves as the final output for reporting, approval, or promotion. Please evaluate the late-stage rendering (Image A) and (Image B) separately based on the following detailed criteria.Figure A2. (Left): Image A; (Right): Image B.
Evaluation Dimension and Specific Indicators Indicator Description Image A Score (1–7) Image B Score (1–7) 1. Visual Realism 1.1 Lighting and Shadow Rationality Whether the direction, intensity, quality (hard/soft) of light, and resulting shadows conform to natural laws or the logic of the scene setting. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 1.2 Perceived Visual Realism The extent to which the image resembles a real photograph in terms of lighting, materials, perspective, and proportions. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 1.3 Camera Angle and Composition The appropriateness of the chosen camera angle; whether the image composition is well-framed and effectively highlights the subject. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 2. Detail Depiction and Completeness 2.1 Architectural Detail Richness The level of detailing in architectural elements like facade articulation, window frames, railings, etc., avoiding simplistic or crude representations. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 2.2 Fineness of Contextual Elements (Entourage) The diversity and refinement of contextual elements (e.g., vegetation, people, vehicles), avoiding simplistic “copy-paste” appearances. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 2.3 Scene Detailing The presence of realistic light effects and rich details throughout the scene; subtle variations in environment and vegetation. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 3. Material and Texture Representation 3.1 Materiality Representation The believability and authenticity of surface materials for both the main building and environmental elements (e.g., stone, metal, glass). 1 2 3 4 5 6 7 1 2 3 4 5 6 7 3.2 Material Texture Realism The accurate depiction of material textures and surface qualities, including glossiness, reflections, and refractions. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 4. Image Artistry and Harmony 4.1 Foreground/Middleground/Background Layering The distinct separation of visual planes (foreground, middleground, background); appropriate use of focus/blur and sense of aerial perspective. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 4.2 Primary-Secondary Relationship Whether the intended subject of the image is prominent and the visual center is clearly defined. 1 2 3 4 5 6 7 1 2 3 4 5 6 7 4.3 Overall Harmony and Cohesion The degree to which all elements in the image, including shadows, are harmonious, unified, and naturally integrated in terms of color, lighting, etc. 1 2 3 4 5 6 7 1 2 3 4 5 6 7
- V
- Closing Remarks
References
- Albaghajati, Z.M.; Bettaieb, D.M.; Malek, R.B. Exploring text-to-image application in architectural design: Insights and implications. Archit. Struct. Constr. 2023, 3, 475–497. [Google Scholar] [CrossRef]
- Petráková, L.; Šimkovič, V. Architectural alchemy: Leveraging Artificial Intelligence for inspired design–a comprehensive study of creativity, control, and collaboration. Archit. Pap. Fac. Archit. Des. STU 2023, 28, 3–14. [Google Scholar] [CrossRef]
- Manovich, L. Diffusions in Architecture: Artificial Intelligence and Image Generators; John Wiley & Sons: Hoboken, NJ, USA, 2024. [Google Scholar]
- Wu, J.; Gan, W.; Chen, Z.; Wan, S.; Lin, H. Ai-generated content (aigc): A survey. arXiv 2023, arXiv:2304.06632. [Google Scholar] [CrossRef]
- Chen, Y.; She, S.; Sun, Y. Is AI-generated content better? A study based on ant Forest game content recommendation. In Proceedings of the 2024 Guangdong-Hong Kong-Macao Greater Bay Area International Conference on Digital Economy and Artificial Intelligence, Dongguan, China, 14–16 March 2024; pp. 749–755. [Google Scholar]
- Deamer, P.; Bernstein, P. Building (in) the Future: Recasting Labor in Architecture; Princeton Architectural Press: New York, NY, USA, 2010. [Google Scholar]
- Negroponte, N. Toward a theory of architecture machines. J. Archit. Educ. 1969, 23, 9–12. [Google Scholar]
- Steinfeld, K. Clever little tricks: A socio-technical history of text-to-image generative models. Int. J. Archit. Comput. 2023, 21, 211–241. [Google Scholar] [CrossRef]
- Thampanichwat, C.; Wongvorachan, T.; Sirisakdi, L.; Chunhajinda, P.; Bunyarittikit, S.; Wongmahasiri, R. Mindful architecture from text-to-image AI perspectives: A case study of DALL-E, Midjourney, and stable diffusion. Buildings 2025, 15, 972. [Google Scholar] [CrossRef]
- Ploennigs, J.; Berger, M. AI art in architecture. AI Civ. Eng. 2023, 2, 8. [Google Scholar] [CrossRef]
- Alyıldız, E.C. Generative Text-to-Image Models in Architectural Design: A Study on Relationship of Language, Architectural Quality and Creativity. Icontech. Int. J. 2023, 7, 12–26. [Google Scholar]
- Cai, Y.; Zhou, Y.; Yu, X.; Kim, W. Identifying Usability Challenges in Text-to-Image AI: A Comprehensive Comparison Among Mainstream Platforms. In Proceedings of the International Conference on Human-Computer Interaction, Gothenburg, Sweden, 22–27 June 2025; Springer: Berlin/Heidelberg, Germany, 2025; pp. 163–174. [Google Scholar]
- Bansal, G.; Nawal, A.; Chamola, V.; Herencsar, N. Revolutionizing visuals: The role of generative AI in modern image generation. ACM Trans. Multimed. Comput. Commun. Appl. 2024, 20, 1–22. [Google Scholar] [CrossRef]
- Cross, N.; Roozenburg, N. Modelling the design process in engineering and in architecture. J. Eng. Des. 1992, 3, 325–337. [Google Scholar] [CrossRef]
- Archer, L.B. The structure of the design process. Des. Methods Archit. 1969, 76–102. [Google Scholar]
- Li, Y.; Chen, H.; Yu, P.; Yang, L. A review of artificial intelligence in enhancing architectural design efficiency. Appl. Sci. 2025, 15, 1476. [Google Scholar] [CrossRef]
- Chen, J.; Wang, D.; Shao, Z.; Zhang, X.; Ruan, M.; Li, H.; Li, J. Using artificial intelligence to generate master-quality architectural designs from text descriptions. Buildings 2023, 13, 2285. [Google Scholar] [CrossRef]
- Camburn, B.; Dunlap, B.; Gurjar, T.; Hamon, C.; Green, M.; Jensen, D.; Crawford, R.; Otto, K.; Wood, K. A systematic method for design prototyping. J. Mech. Des. 2015, 137, 081102. [Google Scholar] [CrossRef]
- DB33/1092-2021; Zhejiang Province Green Building Design Standard. Zhejiang Provincial Department of Housing and Urban-Rural Development: Hangzhou, China, 2021.
- Somepalli, G.; Gupta, A.; Gupta, K.; Palta, S.; Goldblum, M.; Geiping, J.; Shrivastava, A.; Goldstein, T. Investigating Style Similarity in Diffusion Models. In Proceedings of the European Conference on Computer Vision, Milan, Italy, 29 September–4 October 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 143–160. [Google Scholar]
- Tan, L.; Luhrs, M. Using Generative AI Midjourney to enhance divergent and convergent thinking in an architect’s creative design process. Des. J. 2024, 27, 677–699. [Google Scholar] [CrossRef]
- Paananen, V.; Oppenlaender, J.; Visuri, A. Using text-to-image generation for architectural design ideation. Int. J. Archit. Comput. 2024, 22, 458–474. [Google Scholar] [CrossRef]
- Hedlin, E.; Sharma, G.; Mahajan, S.; Isack, H.; Kar, A.; Tagliasacchi, A.; Yi, K.M. Unsupervised semantic correspondence using stable diffusion. Adv. Neural Inf. Process. Syst. 2023, 36, 8266–8279. [Google Scholar]
- Wang, C. Research on AI painting generation technology based on the [stable diffusion]. Int. J. Adv. Smart Converg. 2023, 90–95. [Google Scholar]
- Zhang, L.; Rao, A.; Agrawala, M. Adding conditional control to text-to-image diffusion models. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 2–3 October 2023; pp. 3836–3847. [Google Scholar]
- Dehouche, N.; Dehouche, K. What’s in a text-to-image prompt? The potential of stable diffusion in visual arts education. Heliyon 2023, 9, e16757. [Google Scholar] [CrossRef]
- Lipp, B. Modern Data Architectures with Python; Packt Publishing: Birmingham, UK, 2023. [Google Scholar]
- Kim, S.-K.; Kim, T.-Y.; Kim, K. Development and effectiveness verification of AI education data sets based on constructivist learning principles for enhancing AI literacy. Sci. Rep. 2025, 15, 10725. [Google Scholar] [CrossRef]
- Bao, Y.; Xiang, C. Exploration of Conceptual Design Generation Based on the Deep Learning Model–Discussing the Application of AI Generator to the Preliminary Architectural Design Process. In Proceedings of the xArch–Creativity in the Age of Digital Reproduction Symposium, Suzhou, China, 11–12 November 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 171–178. [Google Scholar]
- GB/T 51366-2019; Standard for Building Carbon Emission Calculation. Ministry of Housing and Urban-Rural Development of the People’s Republic of China: Beijing, China, 2019.
- Bonomolo, M.; Di Lisi, S.; Leone, G. Building information modelling and energy simulation for architecture design. Appl. Sci. 2021, 11, 2252. [Google Scholar] [CrossRef]
- Topuz, B.; Alp, N.Ç. Machine learning in architecture. Autom. Constr. 2023, 154, 105012. [Google Scholar] [CrossRef]
- Terol, R.M.; Reina, A.R.; Ziaei, S.; Gil, D. A machine learning approach to reduce dimensional space in large datasets. IEEE Access 2020, 8, 148181–148192. [Google Scholar] [CrossRef]
- Piras, D.; Peiris, H.V.; Pontzen, A.; Lucie-Smith, L.; Guo, N.; Nord, B. A robust estimator of mutual information for deep learning interpretability. Mach. Learn. Sci. Technol. 2023, 4, 025006. [Google Scholar] [CrossRef]
- Trisnawan, D. Ecotect design simulation on existing building to enhance its energy efficiency. In Proceedings of the IOP Conference Series: Earth and Environmental Science, Paris, France, 7–9 February 2018; IOP Publishing: Bristol, UK, 2018; p. 012117. [Google Scholar]
- Daemei, A.B.; Safari, H. Factors affecting creativity in the architectural education process based on computer-aided design. Front. Archit. Res. 2018, 7, 100–106. [Google Scholar] [CrossRef]
- Krishna Kumar, T.; Ramesh, S.; Pallavi, D.; Ramachandran, M.; Saravanan, V.; Selvam, M. Individual and group behavior based customer product Recommendation to Designing Information Systems SPSS statistics. In Proceedings of the International Conference on Intelligent Computing & Optimization, Phnom Penh, Cambodia, 26–27 October 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 292–299. [Google Scholar]
- GB 50033-2013; Standard for Daylighting Design of Buildings. National Standard of the People’s Republic of China: Beijing, China, 2013.
- GB/T 50378-2019; Assessment Standard for Green Building. Ministry of Housing and Urban-Rural Development of the People’s Republic of China: Beijing, China, 2019.
- GB/T 50189-2015; Design Standard for Energy Efficiency of Public Buildings. Beijing Municipal Commission of Planning and Natural Resources: Beijing, China, 2015.
- GB/T 55015-2021; General Code for Building Energy Efficiency and Renewable Energy Utilization. Ministry of Housing and Urban-Rural Development of the People’s Republic of China: Beijing, China, 2021.
- Chen, F.; Mai, M.; Huang, X.; Li, Y. Enhancing the sustainability of AI technology in architectural design: Improving the matching accuracy of Chinese-style buildings. Sustainability 2024, 16, 8414. [Google Scholar] [CrossRef]
- Sebestyen, A.; Tyc, J. Machine learning methods in energy simulations for architects and designers. In Proceedings of the 38th eCAADe Conference, Berlin, Germany, 16–18 September 2020; pp. 613–622. [Google Scholar]
- Onatayo, D.; Onososen, A.; Oyediran, A.O.; Oyediran, H.; Arowoiya, V.; Onatayo, E. Generative AI applications in architecture, engineering, and construction: Trends, implications for practice, education & imperatives for upskilling—A review. Architecture 2024, 4, 877–902. [Google Scholar] [CrossRef]
- El Moussaoui, M. Architectural practice process and artificial intelligence–an evolving practice. Open Eng. 2025, 15, 20240098. [Google Scholar] [CrossRef]
- Jin, S.; Tu, H.; Li, J.; Fang, Y.; Qu, Z.; Xu, F.; Liu, K.; Lin, Y. Enhancing architectural education through artificial intelligence: A case study of an AI-assisted architectural programming and design course. Buildings 2024, 14, 1613. [Google Scholar] [CrossRef]

















| Time | Before 2020 | 2020 | 2022 | 2023 | 2025 | 2030 |
|---|---|---|---|---|---|---|
| Text | Spam detection; translation; basic Q&A, a | Basic copywriting; first draft, b | Longer form; draft deepening, b | Fine-tuning in vertical fields (scientific papers, etc.) to achieve better results, c | The quality of the final draft is higher than the human average, c | The quality of the final draft is higher than that of professional authors, c |
| Programming | Single-line auto-completion, a | Multi-line generation, b | Longer form; better accuracy, b | Adapt to more calculator languages; apply to more vertical fields, c | Text generation (software) product, c | Ultimate text generation (software) product, with quality higher than that of full-time developers, c |
| Image | Art, logos, photos, b | First draft (product design, architectural design, etc.), b | Final draft (product design, architectural design, etc.), c | The quality of the final draft is higher than that of professional artists, designers, and photographers, c |
| Traditional Design Process | AI-Assisted Design Process | ||||||
|---|---|---|---|---|---|---|---|
| Phase | Core Task | Tools/Methods | Outputs | Core Task | Tools/Methods | Outputs | |
| Preliminary Phase | Requirement Analysis | Sort out functions, planning conditions, owner preferences, etc., to form a requirements list | Review materials, site surveys and owner interviews | Feasibility analysis report and site analysis report | Sort out functions, planning conditions, owner preferences, etc., to form a requirements list | Review materials, site surveys, owner interviews | Feasibility analysis report and site analysis report |
| Concept Design | Hand-drawn design sketches to explore architectural forms, styles, spatial organization, etc., based on professional experience, with multiple rounds of modification and discussion | Drawing paper, marker hand-drawing, scheme discussion meetings | 2–5 sets of hand-drawn concept sketches | Use generative AI’s text-to-image function, select appropriate prompts, multiple iterations, and generate various design style sketches | Text-to-image in Stable Diffusion | 10–30 design images of different styles and types | |
| Mid-Phase | Form Expression | Convert selected sketches into 3D models, clarify building mass relationships, facade textures, and spatial layout; modifications require constant model readjustment; this phase is time-consuming | SketchUp pro 2025, Rhino 8, and other 3D modeling software, AutoCAD 2020 (for plan drawing) | 3D models, plan layout drawings, elevation sketches | Determine the general design style, use ControlNet, LoRA, and other large model plugins for precise control, front-load the energy simulation phase, use AI-related software for testing, adjust building form and plan layout at any time, provide multiple design drawings for owner comparison and selection, and draw corresponding plan sketches | Image-to-image in Stable Diffusion, EnergyPlus, XiaoKu AI, and other energy simulation-related software, AutoCAD 2020 (for plan drawing) | Generate a set of series renderings with consistent volume and diverse details, plan layout drawings, and energy consumption assessment |
| Deepening and Optimization | Develop detailed design drawings based on structural feasibility, regulatory codes, and analysis of factors such as natural lighting | Use SketchUp pro 2025 to deepen the model and Excel for energy consumption estimation | Scheme model deepened drawings and energy consumption estimation report | ||||
| Later Phase | Results Output | Deliver the finalized deepened model to professional rendering agencies, and use professional rendering software to produce final results, requiring multiple rounds of adjustments and detail modifications during the process, forming presentable reporting results files | Lumion 12/3ds Max 2025 (used by rendering companies) and PPT (reporting files) | Final renderings and reporting files | Further deepen the finalized deepened model or selected design drawings from the previous stage using AI, form detail-rich renderings, and produce reporting results files | Inspiration rendering plugins and PPT (reporting files) | Final renderings and reporting files |
| Project Category | Project Name | Selected | Reason |
|---|---|---|---|
| Commercial | A Commercial Complex Project in Hangzhou | No | Incomplete Data |
| Office | Jinhua Information Economy Industrial Park | Yes | Meets Selection Criteria |
| Renovation | Key Building Enhancement Design North of Jinhua Huhaitang Park | No | Phase Mismatch; Renovation Projects Have Multiple Design Constraints |
| Residential | Residential Plot Project North of Jinhua Jinchuang Green Valley | No | Phase Mismatch |
| Industrial | Chengdu E-Workshop Project | No | Located in Chengdu; Regional Differences Exist |
| Cultural | Jinhua New Library | Yes | Meets Selection Criteria |
| Healthcare | A Medical Center in Yunnan Province | No | Located in Yunnan; Regional Differences Exist and Data is Incomplete |
| Industrial | Yiwu Comprehensive Bonded Industrial Park | Yes | Meets Selection Criteria |
| Education | The Supporting Secondary School in Shanghai Lingang New Area | No | Phase Mismatch; Multiple Design Constraints and Regional Differences Exist |
| Office | Yiwu Cross-Border E-Commerce Industrial Park | No | Weak Phase Compatibility; Prolonged Design Cycle Hinders Experiment Progress |
| Project Category | Case Name | Location | Current Project Progress | Final Scheme Effect | Site Area | Height Limit | Sustainability |
|---|---|---|---|---|---|---|---|
| Office | Jinhua Information Economy Industrial Park | Jinhua City, Zhejiang Province, China | Under Construction | ![]() | 1.68 hectares | ≤80 m | Yes |
| Cultural | Jinhua Library | Jinhua City, Zhejiang Province, China | Completed | ![]() | 2.15 hectares | ≤80 m | Yes |
| Industrial | Yiwu Comprehensive Bonded Industrial Park | Yiwu City, Zhejiang Province, China | Under Construction | ![]() | 45.84 hectares | ≤35 m | Yes |
| Platform Name | Midjourney | Stable Diffusion | Modai Cloud AIi | SketchUp pro 2025 Inspiration Rendering (Plugin) | ||
|---|---|---|---|---|---|---|
| Core Principle Basis | Transformer-based deep learning generative model | Latent Diffusion Model (LDM) | Combines GAN and Transformer architecture | Ray tracing rendering engine + lightweight generative AI model | ||
| Applicable Stage | Early stage | Primarily mid-stage, also early and late stages | Early stage and mid-stage | Mid-stage and late stage | ||
| Image Generation Comparison | Usage Threshold | Hardware Requirements | Pure cloud deployment, no special configuration required | Local deployment preferred: requires mid-to-high-end GPU | Pure cloud deployment, no special configuration required | Relies on SketchUp pro 2025 runtime environment, moderate configuration requirements |
| Ease of Use | Relatively low | Relatively high | Relatively low | Medium | ||
| Feature Richness | Content Quality | Relatively high | High | Relatively high | High, excels in realistic style | |
| Architectural Industry Adaptability | Low | High | Relatively high | High | ||
| Supports Text Prompts | Yes | Yes | Yes | Yes | ||
| Strict Generation Control | No | Yes | No | Yes | ||
| Extensibility | Low, only uses official templates | High, open-source, supports developing architecture-specific plugins | Medium, does not support third-party extensions | Medium, not extensible | ||
| Model Training Comparison | Usage Threshold | No custom training function | Medium difficulty | Medium difficulty | Requires official updates | |
| Training Feature Richness | None | High, supports Lora, Dreambooth, etc. | Low | None | ||
| Training Effect (Architectural Scenes) | None | High | Medium | None | ||
| Models/Scripts | Uses |
|---|---|
| LoRA Mode | Generate images with specific styles |
| X/Y/Z Plot | Generate comparison charts to find the best parameters |
| controlnet v1.1 | Add image conditions to control pictures |
| Ultimate Plugin | Render images in blocks and add image details |
| Research Phase | Research Objective | Method | Tools |
|---|---|---|---|
| Preliminary Analysis | Understand current industry perspectives on AI applications in architecture | Web data crawling; semantic analysis | Python3.10; Large Model Prompt Engineering |
| Case Study | Compare the efficacy of traditional vs. AI-assisted workflows | A/B testing; case study | Stable Diffusion, AI-related plugins in SketchUp pro 2025, manual recording of |
| Data Analysis | Quantify scheme diversity and efficacy metrics | Cluster analysis; statistical testing | Scikit-learn, SPSS19, K-means, t-SNE, PCA |
| Multi-Dimensional Evaluation | Comprehensive evaluation of subjective and objective indicators | Questionnaire surveys; performance simulation | EnergyPlus8.1.0, Autodesk Ecotect Analysis 2013, SPSS19 |
| Metric Category | Metric Name | Operational Definition | Traditional Process Measurement Method | AI Process Measurement Method |
|---|---|---|---|---|
| Time Efficiency | Preliminary Scheme Generation Cycle | Pure design time from “completion of requirements analysis” to the “first version of design sketches”, excluding time for requirement communication and presentation preparation. | Record the cumulative duration of all stages, including hand-drawing and modification. | Cumulative duration including prompt debugging, model generation, and design drawing selection. |
| Mid-Phase Iteration Response Speed | The average time required to complete a single round of scheme modification and optimization, from form expression to deepening and optimization. | Calculate the cumulative duration of multiple typical modification rounds and take the average. | Calculate the cumulative duration of multiple typical modification rounds and take the average. | |
| Final Phase Generation Cycle | The time required to convert the optimized model into the final presentable design drawings. | The time spent by the rendering company on model optimization and processing. | The time taken to use generative AI-assisted software for rendering and generating the final design drawings. | |
| Decision Iterations | The number of modifications made during the design cycle, based on presentation rounds and internal discussions. | Record the number of complete modeling versions. | Count the number of modification rounds. | |
| Resource Input | Input Cost | Labor cost is calculated based on daily salary; hardware energy consumption cost is the fee calculated from cumulative power usage; fees paid to rendering companies are calculated according to industry standards. | Calculate the fees paid to rendering companies and the labor cost of personnel reviewing the renderings. | Calculate labor costs and hardware energy consumption. |
| Project Category | Case Name | Positive Prompt Description | Negative Prompt Description |
|---|---|---|---|
| Office | Jinhua Information Economy Industrial Park | (masterpiece, best quality, 8 k, detailed), modern office building, glass curtain wall, (daylighting: 1.3), rooftop solar panels, sustainable design, energy efficient LED lighting, natural ventilation, (employees walking: 1.1), surrounding trees, clear blue sky, isometric view, architectural rendering | (worst quality, low quality: 1.4), blurry, cartoon, anime, deformed, ugly, bad anatomy, extra limbs, (traditional style: 1.3), polluted sky, empty, no people |
| Cultural | Jinhua Library | (masterpiece, best quality, 8 k, detailed), modern public library, cultural building, large glass windows, (green roof: 1.3), rainwater collection system, (sunlit atrium: 1.2), eco-friendly wooden facade, public space, (reading citizens: 1.1), plaza, trees, low-rise building, top view, architectural visualization | (worst quality, low quality: 1.4), cartoon, sketch, overly ornate, classical style, dark interior, gloomy, crowded, chaotic, industrial park |
| Industrial | Yiwu Comprehensive Bonded Industrial Park | (masterpiece, best quality, 8 k, detailed), modern industrial factory, steel structure, large span, (energy-efficient skylights: 1.2), insulated walls, (solar panels on roof: 1.3), clean and tidy, ventilation, heat recovery, sustainable materials, minimalist design, daytime, aerial view | (worst quality, low quality: 1.4), dirty, rusty, broken windows, smoke, pollution, outdated, cramped, residential building, ornate decoration, text, human |
| Project Case | Traditional Design Time (Control Group A) | Stable Diffusion-Assisted Time (Experimental Group B) | Efficiency Improvement |
|---|---|---|---|
| Mean ± SD (Median) | Mean ± SD (Median) | (Control Group A—Experimental Group B)/Control Group A × 100 | |
| Jinhua Information Economy Industrial Park | 42 ± 4.5 (42) | 2.8 ± 0.6 (3.0) | 93.33% |
| Jinhua Library | 45 ± 5.1 (45) | 3.1 ± 0.7 (2.5) | 93.11% |
| Yiwu Comprehensive Bonded Industrial Park | 38 ± 3.2 (38) | 2.3 ± 0.5 (2.4) | 93.95% |
| Category | Scheme Concept |
|---|---|
| Scheme 1 Conference Center and Office Building Combined | ![]() |
| Scheme 2 Independent Conference Center along the Street | ![]() |
| Scheme 3 Continuous Financial Institutions along the Street | ![]() |
| Case Category | Color Saturation (0–100) | Contour Regularity (0–100) | Detail Richness (0–100) | Spatial Hierarchy (0–10) | Block Dispersion (1–100) | Greening Level (0–100) | Light–Shadow Contrast (0–100) | Material Complexity (0–100) | Void–Solid Relationship (0–1) |
|---|---|---|---|---|---|---|---|---|---|
| High-Rise Office Building | 65 | 85 | 78 | 7 | 15 | 30 | 80 | 85 | 0.82 |
| High-Rise Office Building | 68 | 88 | 82 | 7 | 25 | 28 | 82 | 88 | 0.78 |
| High-Rise Office Building | 60 | 82 | 75 | 6 | 10 | 32 | 78 | 80 | 0.86 |
| Code | Window-to-Wall Ratio | Building Shape Coefficient | Facade Design Strategy | Average Daylight Factor in Main Functional Areas (%) | Estimated Annual Comprehensive Energy Consumption (KWh/m2) |
|---|---|---|---|---|---|
| A-1 | 0.58 | 0.30 | performance insulating LOW-E glass curtain wall | 3.7 | 66.3 |
| A-2 | 0.50 | 0.33 | Comprehensive external shading + metal curtain wall | 3.1 | 62.1 |
| A-3 | 0.62 | 0.29 | Double-skin facade (breathing facade) | 4.0 | 64.8 |
| A-4 | 0.47 | 0.35 | Vertical shading grille + point windows | 2.8 | 59.5 |
| A-5 | 0.53 | 0.31 | Glass curtain wall + integrated vertical PV panels | 3.5 | 63.4 |
| Reference Standard | ≤0.70 | ≤0.40 | - | ≥3.0 | ≤65 |
| Project Case | Single Traditional Design Deepening Cycle (Hours) | Stable Diffusion Assisted Deepening Cycle (Hours) | Traditional Design Decision Rounds | Stable Diffusion-Assisted Decision Rounds |
|---|---|---|---|---|
| Jinhua Information Economy Industrial Park | 40–56 | 12 | 4 | 2 |
| Jinhua Library | 24–40 | 9.6 | 5 | 2 |
| Yiwu Comprehensive Bonded Industrial Park | 24–40 | 14.4 | 4 | 2 |
| Evaluation Dimension | Traditional Design (Rendering Company) | AI-Assisted Design with Inspiration Rendering Plugin |
|---|---|---|
| Single-Image Production Cycle (Hours) | 48 ± 6 | 1 ± 0.5 |
| Single-Image Cost (CNY) | 1200 ± 200 | 100 ± 10 (Labor and Hardware Energy Cost) |
| Analysis Scope | Number of Items | Sample Size | Cronbach’s α | |
|---|---|---|---|---|
| Mid-stage | Clarity of Design Concept Expression | 3 | 212 | 0.792 |
| Spatial and Volumetric Relationship | 3 | 212 | 0.886 | |
| Overall Atmosphere and Esthetic Value | 3 | 212 | 0.908 | |
| Potential to Inspire Deepening Design | 2 | 212 | 0.898 | |
| Late stage | Visual Realism | 3 | 212 | 0.905 |
| Detail Depiction and Completion | 3 | 212 | 0.912 | |
| Material and Texture Representation | 2 | 212 | 0.904 | |
| Image Artistry and Harmony | 3 | 212 | 0.807 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cao, Q.; Zhou, Y. Research on the Application Effectiveness of Generative AI in Design Projects from Data-Driven and Sustainable Perspectives. Sustainability 2025, 17, 10643. https://doi.org/10.3390/su172310643
Cao Q, Zhou Y. Research on the Application Effectiveness of Generative AI in Design Projects from Data-Driven and Sustainable Perspectives. Sustainability. 2025; 17(23):10643. https://doi.org/10.3390/su172310643
Chicago/Turabian StyleCao, Qiran, and Ying Zhou. 2025. "Research on the Application Effectiveness of Generative AI in Design Projects from Data-Driven and Sustainable Perspectives" Sustainability 17, no. 23: 10643. https://doi.org/10.3390/su172310643
APA StyleCao, Q., & Zhou, Y. (2025). Research on the Application Effectiveness of Generative AI in Design Projects from Data-Driven and Sustainable Perspectives. Sustainability, 17(23), 10643. https://doi.org/10.3390/su172310643







