Abstract
In the transition toward a more digital and data-driven construction industry, understanding how Artificial Intelligence (AI) and Building Information Modeling (BIM) are integrated is key to planning, delivering, and operating projects effectively. This review examines recent studies to identify usage patterns of AI and BIM. Searches were conducted on the Web of Science Core Collection from 2022 to 2025. After running a reproducible review protocol aligned with PRISMA 2020, which began with 1212 articles, and after a funneling process, 12 studies met the predefined eligibility criteria. In the present study, the synthesis was non-meta-analytic; instead, the information was analyzed by using standardized tabulation with a consistent format and compared using a two-level weighting scheme. The methodological approach combines full-text reading and descriptive coding with a reproducible weighting scheme that accounts for mentions per study and integrates them at the corpus level using open-source tools. The results show a strong focus on Deep Learning (DL), with a greater presence in Digital Twins (DT) and BIM Modeling (BIMM); Multidimensional BIM (4D/5D) appears as a secondary line, while the Common Data Environment (CDE) and Clash detection (CD) are sporadic. The coupling of DL-DT and DL-BIMM predominates. Simultaneously, Machine Learning (ML) provides explainable analysis on structured data, and Generative Adversarial Networks (GAN) and Automated Machine Learning (AutoML) with Machine Learning Operations (MLOps) act as enablers for data generation/adaptation and deployment with traceability. It is concluded that advancing metrics and shared datasets, especially for CDE and CD, along with developing reproducible workflows oriented toward MLOps, are key to scaling AI in real-world environments.
1. Introduction
The Architecture, Engineering, Construction, and Operations (AECO) industry is under pressure to deliver safer, faster, and lower-carbon projects while integrating fragmented data across design, construction, and operations. In this context, BIM provides the information backbone, and AI supplies perception, prediction, and automation, together enabling near-real-time decisions at scale. However, current adoption remains uneven, and evidence is scattered. This review addresses that need by examining where AI is actually coupling with BIM in recent studies and the intensity of this coupling.
1.1. Context: Digital Transformation in AECO
The AECO industry is undergoing a sustained process of digitization and automation, aligned with the principles of Industry 4.0, which has been conceptualized as Construction 4.0 []. This transition drives the intensive use of data and the integration of technologies such as Building Information Modeling (BIM), the Internet of Things (IoT), advanced analytics, and Artificial Intelligence (AI), with the aim of improving coordination, production control, and asset operation []. Various recent reviews and frameworks indicate that this evolution depends on reliable information flows and the ability to close control loops almost in real time through measurements, automatic interpretations, and continuous feedback [].
In this context, BIM acts as the informational backbone, while AI provides analytical, predictive, and prescriptive functions that enable new levels of automation and evidence-based decision-making []. The integration of these technologies enables the closure of the data, model, and decision cycle, enabling predictive analytics, such as energy optimization, and the orchestration of services on assets in near real-time, which is vital for efficiency in project management [,].
1.2. Fundamentals of BIM and AI
BIM is understood as a set of technologies, processes, and policies to generate, manage, and share digital representations of assets throughout their lifecycle []. Methodologically, BIM has been widely systematized in academic literature and international standards; for example, the ISO standards family 19650 defines principles and processes for information management, including the operation of a Common Data Environment (CDE) as an ‘agreed source of information’ for project teams [].
An updated and sufficient framework to support the integration of BIM with AI is the ISO 19650 series, which defines principles and processes for information management and establishes the CDE as the ‘agreed source’ of data; achieving maturity in these processes and operating robust CDEs is a condition for reproducible, traceable, and auditable workflows in projects and assets [,].
Meanwhile, AI in AECO has shown rapid and heterogeneous advances: from Machine Learning (ML) and Deep Learning (DL) for classification, detection, and prediction, to generative models, such as Generative Adversarial Networks (GAN), for data synthesis and training set augmentation []. At the deployment frontier, Automated Machine Learning (AutoML) and Machine Learning Operations (MLOps) practices emerge as mechanisms to standardize, automate, and scale the model lifecycle in industrial environments []. Likewise, digital twins (DTs) represent a critical pathway to close the loop between operational data and analytical models, connecting representation, simulation, and control [,,].
1.3. AI-BIM Knowledge Gap
Prior syntheses have mapped prominent AI applications in BIM and documented the rise in deep-learning pipelines for perception and forecasting in AECO, as well as the growing role of Digital Twins and information-management standards [,,,]. Focused surveys also describe generative methods for data augmentation and adaptation and outline emerging operational practices such as AutoML and MLOps [,,,]. However, a gap remains: existing reviews seldom align AI families with specific BIM application ambits under a standard, comparable metric; reporting is heterogeneous (tasks, datasets, outcomes), hindering cross-study comparison; CDE and clash-detection evidence is under-specified in terms of governance metrics; and lifecycle operationalization (AutoML/MLOps) is rarely quantified or linked to reproducible deployment.
This review aims to help close this gap by proposing a minimal common taxonomy of nine strategic ambits at the intersection of AI-BIM, five BIM applications, BIM Modeling (BIMM) 4D/5D, CDE, Clash Detection (CD), Multidimensional BIM 4D/5D (MBIM 4D/5D), (DT), and four families of AI (ML, DL, GAN, AutoML & MLOps), and by systematizing recent evidence (2022–2025) through a reproducible review protocol aligned with PRISMA 2020 [].
Building on the gap identified above, the research gap will be addressed through a PRISMA-guided, two-level quantitative framework that standardizes heterogeneous findings into comparable scores: (i) model-class weights within each study and (ii) link weights between AI families and BIM application axes. This explicit bridge from gap to method yields decision-relevant rankings and reveals underexplored AI–BIM intersections that merit investment. Section 2 details the search strategy, eligibility criteria, and weighting procedure; Section 3 presents the comparative synthesis and key rankings.
1.4. Objective and Research Questions
The overall objective of this research is to characterize, through a systematic review of recent literature (2022–2025) following PRISMA 2020, the mechanisms of integrating artificial intelligence into the workflows and artifacts of BIM in the AECO sector. The analysis examines the principal AI families across key BIM applications to identify adoption and performance trends, document methodological and reporting gaps, and delineate opportunities for technology transfer and operational scaling.
Guided by the overall objective, the study structures its research questions for quantitative and qualitative analysis of the 2022–2025 corpus into two groups that drive the analysis. The first group, focused on AI methods and integration patterns, poses the following questions: which BIM flows and artifacts, BIMM, CDE, CD, MBIM 4D/5D, and DT, concentrate the highest integration with AI in the 2022–2025 literature, and with what relative intensity (e.g., standardized metrics like URW per study)? which AI families (ML, DL, GAN, and AutoML & MLOps) predominate in this integration, and for what purposes (perception/vision, prediction/time series, data synthesis, operational automation), and how do they vary according to the project lifecycle phase and context? The second group, focused on governance and reproducibility, proposes: what information management and governance requirements (e.g., ISO 19650, CDE practices, traceability/versioning, interoperability) influence the adoption, reproducibility, and scaling of AI-BIM in practice, including its operationalization through MLOps?
2. Materials and Methods
2.1. Context
A first tentative classification of the nine keywords shown in Table 1 was made to start bibliographical analysis based on authors such as Fujii et al. [], Liu et al. [], and Pan & Zhang [], who have served as a basis to define the scope of the pillars that build the framework which characterize the relationship between AI and BIM. These keywords were then confirmed, as seen in Table 2. These descriptors have been selected for their ability to reflect, in an integrated manner, both the informational, parametric, and collaborative dimensions inherent to BIM, as well as the computational and algorithmic developments associated with various AI families. The combination of these categories enables a comprehensive analysis of the indexed literature from 2022 to 2025, focusing on domains that are driving substantial transformations in the AECO industry. The inclusion of each term is justified not only by its recurrence in high-impact publications but also by its potential to articulate convergent research lines and provide a coherent comparative framework for future academic explorations and professional applications.
Table 1.
Strategic keywords for the integrated analysis of BIM applications and Artificial Intelligence (AI) technologies.
2.1.1. BIM Applications
This subsection defines the five BIM application ambits used for analysis: BIMM, CDE, CD, MBIM 4D/5D, and DT. These categories define where AI integrates with BIM and provide a basis for collecting evidence, comparing studies, and organizing results across the asset lifecycle.
- BIM Modeling: The BIMM is conceived as a digital, three-dimensional, and parametric representation of the asset, which integrates geometry, semantic metadata, and behavior rules throughout its lifecycle. This model serves as the core to coordinate design, information management, and activity coordination []. Its consideration as a keyword in this study is unavoidable, as it serves as the informational backbone upon which the main use cases and integration flows are based. The growing convergence of BIM and AI enables data-driven systems that address complex AECO problems more effectively [].
- Common Data Environment: The CDE is conceived as the ‘source of information agreement’ for a project, that is, the collaborative environment where documents, models, and data are gathered, managed, and shared throughout the lifecycle, with traceability, version control, and standardized exchanges as guiding principles []. The UK BIM Framework specifies CDE workflows (states, approvals, metadata, automation) that improve consistency and transparency, while noting barriers such as maturity, tool integration, and governance []. In summary, the CDE serves as an information management infrastructure that enables reproducible and auditable flows and is a prerequisite for scaling AI-BIM integrations in real-world environments.
- Clash Detection: CD is defined as the automated process that identifies geometric or logical conflicts between model components to prevent problems before construction and reduce rework and costs. Recent literature incorporates the use of machine learning to filter out irrelevant clashes and prioritize those with greater impact, thereby decreasing false positives and coordination effort []. These investigations demonstrate that the combination of rule-based reasoning and supervised learning algorithms significantly enhances the classification and management of collisions in construction projects, leading to more efficient coordination.
- Multidimensional BIM 4D/5D: MBIM 4D/5D greatly enhances the planning and management of construction projects by incorporating time and cost factors into the digital model. BIM 4D incorporates time into the 3D representation, enabling construction sequence simulations and progress control, which allows for more effective visualization and better planning []. On the other hand, BIM 5D, which incorporates the dimension of costs, has proven to improve the accuracy of cost estimation and project management, overcoming barriers such as tool integration and staff training, which is crucial in complex environments []. MBIM links models to schedules and costs, improving planning and estimation in complex projects.
- Digital Twins: DTs are conceived as rich, semantic virtual representations of assets and processes that are bidirectionally synchronized with their physical counterparts through sensors, historization, and control systems, extending the scope of BIM into operation and maintenance. This BIM-DT integration enables closing the data-model-decision cycle, allowing for predictive analyses, such as degradation and failure, energy optimization, and service orchestration on the asset in near real-time. At the same time, it requires semantic alignment, interoperability, and an information management infrastructure to ensure traceability and governance [,,].
2.1.2. AI Technologies
As mentioned in Table 1, this subsection introduces the four AI families considered in the review: ML, DL, GAN, and AutoML & MLOps. This set covers the main patterns of learning and data generation, as well as the operationalization of the model lifecycle. It will allow us to map, using comparable criteria, how each family aligns with the BIM ambits and the tasks for which they are used in the analyzed studies.
- 6.
- Machine Learning: ML involves algorithmic methods that can learn patterns from large volumes of data to predict or classify without specifying rules in advance. In the AECO field, ML has established itself as an entry point for AI due to its versatility with structured and semi-structured data, with recurring applications in cost estimation, risk prediction, schedule optimization, as well as productivity and performance monitoring; literature mappings identify it as one of the most widespread families across the sector, along with computer vision and deep learning []. In particular, recent reviews on productivity document the systematic use of ML throughout the entire workflow, from data collection to modeling and evaluation, to improve estimation, monitoring, and decision-making on-site, reinforcing its role as an enabling technology in operational tasks, e.g., neural networks, Support Vector Machines, and DL approaches in scenarios related to construction labor and equipment productivity [].
- 7.
- Deep Learning: DL, as a subfield of machine learning based on deep neural networks, learns hierarchical representations of complex data (images, point clouds, video, and time series), enabling significant advances in computer vision, recognition, and forecasting relevant to AECO. In the construction industry, literature reviews show that DL has become a methodological pillar for tasks such as automatic progress tracking, site safety, defect detection/segmentation, activity recognition, and point cloud analysis, outperforming traditional approaches in various scenarios when sufficient labeled data is available []. These syntheses also emphasize that the performance of DL depends on the quality of data, consistent annotations, and reproducible pipeline design, as well as its integration with BIM workflows to close the data-model-decision cycle in real-world contexts [].
- 8.
- Generative Adversarial Networks: GANs consist of a generator and a discriminator that compete to produce realistic synthetic data and have become key tools for data augmentation/synthesis, domain adaptation, and improving robustness in typical AI-BIM tasks, e.g., semantic segmentation of images/point clouds, defect detection, closing gaps between simulated and real data. In the AECO field, a recent review in Automation in Construction systematizes their use cases and benefits in construction, including improvements in generalization when there is a scarcity of labels and heterogeneity among projects, and highlighting their cross-cutting potential to enable more reliable pipelines in AI-BIM integration []. Additionally, in civil engineering, a focused review reveals that GANs generate plausible data for training and reduce the domain shift between structural conditions, reinforcing their role as an enabling technology where real data is costly or difficult to obtain [].
- 9.
- Automated Machine Learning & Machine Learning Operations: AutoML automates key tasks in the modeling cycle, including algorithm selection, hyperparameter search, and pipeline design (including neural architecture search), thereby reducing dependence on manual expertise and standardizing experimentation. MLOps, on the other hand, applies DevOps principles to the model lifecycle for deployment, versioning, monitoring, and governance (model/data registry, traceability, drift control, Continuous Integration/Continuous Delivery). In the context of AI-BIM, the synergy between AutoML and MLOps is critical for transitioning prototypes into operational environments, such as CDE and DT, ensuring reproducibility, auditability, and maintainability of analytical artifacts throughout the asset’s lifecycle. Specialized literature synthesizes both the fundamentals and systems of AutoML as well as the architectures and best practices of MLOps for industrialization, providing a solid framework to scale AI-BIM workflows in a governed and sustainable manner [].
Together, the nine keywords selected outline a thematic map that links the comprehensive digitalization of the asset lifecycle through enriched BIM models, CDE, and multidimensional analyses, with AI techniques that enhance automation, knowledge generation, and evidence-based decision-making. This shared taxonomy facilitates the identification of research gaps, guides the formulation of future questions, and provides the scientific community with a cohesive terminological reference for the interdisciplinary advancement of the AI-BIM agenda.
Below is Table 2. Summary of AI Technologies and BIM applications, which summarizes the main issues addressed in recent studies and the correspondence between AI techniques and BIM workflows. This synthesis illustrates how strategies such as ML, DL, GAN, and AutoML & MLOps practices are integrated in a complementary manner to enhance the efficiency and quality of processes throughout the digital asset lifecycle.
Table 2.
Summarizes how AI families map to BIM applications/problems across recent studies.
Table 2.
Summarizes how AI families map to BIM applications/problems across recent studies.
| AI/BIM | ML | DL | GAN | AutoML & MLOps | Citations |
|---|---|---|---|---|---|
| BIMM | Supervised learning on structured BIM attributes (e.g., quantity/attribute-driven estimation, classification). | Deep models for geometric/semantic feature extraction and recognition over images/point clouds tied to BIM. | Synthetic data to augment training for segmentation/classification of BIM-related imagery/point clouds. | Pipeline/search automation and governed experimentation for BIM-centric analytics (registries, versioning). | [,,] |
| CDE | Prioritization/triage of metadata and documents aligned with ISO 19650 states; quality signals for governance. | Assisted validation for consistency checks (when applicable) under standardized exchange/workflow states. | Data synthesis to balance datasets for document/image-based tasks within governed repositories. | Model/data registries, CI/CD, lineage, and monitoring integrated with CDE processes (ISO 19650, UK guidance). | [,,] |
| CD | Hybrid rule-based + ML to filter irrelevant clashes and prioritize high-impact conflicts. | 3D/point-cloud perception and learned representations to support clash identification/triage when scans are available. | Scenario synthesis/augmentation to improve the robustness of detection pipelines in varied projects. | Continuous retraining/versioned datasets to adapt detectors across projects with traceable metrics. | [,,] |
| MBIM 4D/5D | Prediction of schedule/cost deviations and parametric what-if analyses linked to the BIM/WBS. | Sequence-aware models for schedule/progress signals (when labeled timelines are available). | Synthetic schedules/sequences to stress-test plans and support robustness under uncertainty. | Comparative model selection and deployment as services for planning/control dashboards. | [,,] |
| DT | Predictive analytics and supervisory control using operational data streams mapped to the asset model. | Perception/time-series modeling for state estimation, forecasting, and anomaly detection within the twin. | Stress-testing and domain adaptation (simulation-to-real) to improve twin robustness. | Lifecycle management (data/model versioning, monitoring) for deployed analytics in DT workflows. | [,,] |
The matrix provides an initial, structured overview of how AI impacts key BIM concerns, serving as a baseline for the systematic analysis developed in this study. Building on this foundation, the review proceeds to examine empirical evidence, pinpoint methodological gaps, and outline opportunities for technology transfer to advance the AI-BIM agenda in the AECO industry.
2.2. Review Process
This review was conducted following the PRISMA 2020 guidelines []. The search was performed in the Web of Science (WoS) Core Collection—Index Expanded, enabling Enriched Cited References to maximize the retrieval of records with extended bibliographic traceability. The last search was conducted on 11 August 2025. The exact search string used was:
TS = ((“BIM Modeling” OR “Common Data Environment” OR “Clash detection” OR “Multidimensional BIM” OR “Digital Twin”)
AND (“Machine Learning” OR “Deep Learning” OR “Generative Adversarial Network” OR “AutoML” OR “MLOps”))
The inquiry was limited to the thematic areas of Engineering and Construction Building Technology, in line with the focus on AI + BIM in the building industry.
2.3. PRISMA Analysis
To ensure transparency and reproducibility, the PRISMA 2020 workflow for this AI-BIM review is reported in Figure 1. The process involves record search and consolidation, relevance and quality filtering, full-text evaluation using explicit criteria, and final inclusion decisions, structured across four stages: Identification, Screening, Eligibility, and Included. Each stage is detailed in the following subsections.
Figure 1.
Flowchart of the PRISMA methodology for literature selection.
Screening at all stages was conducted independently by two experts, both of whom were experienced in BIM and AI; any disagreements were resolved by consensus, and no automation tools were utilized in this part of the review process.
2.3.1. Preliminary Identification
The keyword search was designed to capture, in parallel, the principal BIM integration vectors (BIMM, CDE, CD, MBIM, DT) and the most representative AI families (ML, DL, GAN, AutoML & MLOps). Beginning with 1212 articles, and after an initial retrieval in the WoS database and automatic deduplication, 59 unique records were identified. This set underpins the PRISMA flow and reflects the breadth of the AI-BIM spectrum over the study period.
2.3.2. Screening Process
Minimally restrictive methodological filters were used to preserve sensitivity while maintaining relevance:
- Years of publication (2022–2025). Seven documents were excluded (n = 7) because they fell outside the established time window, leaving 52 records. This temporal cutoff prioritizes recent publications in a rapidly evolving technological domain, ensuring that the findings reflect the current state of the art.
- Language: English. No exclusions were recorded (n = 0), maintaining 52 articles. The linguistic homogeneity facilitates synthesis and comparative evaluation, while also representing the publication standard in the field.
- Document type: Article. No exclusions were made (n = 0), so 52 was retained. The restriction to original or review articles ensures methodological rigor, availability of peer review, and completeness of metadata and results.
2.3.3. Eligibility Process
The 52 articles were evaluated in their entirety according to three explicit criteria: (i) journal prestige in the Journal Impact Factor (Q1), (ii) number of citations in the Web of Science as an indicator of influence and dissemination, and (iii) thematic relevance to a genuine intersection between AI and BIM. This combination of sources provided a balanced framework for judging quality, influence, and content. Based on this, 40 reports were excluded. These included those that did not meet the journal category criterion (JCR) (n = 17), had insufficient citations in the WoS (n = 22), and lacked a robust or explicit methodological integration between AI and BIM (n = 1).
2.3.4. Included Articles
After screening and eligibility assessment, 12 studies (n = 12) were included for in-depth analysis. Table 3 details each article, including its corresponding identifier, authors, DOI, and publication date.
Table 3.
Included Articles (n = 12), 2022–2025, in the AI-BIM Review.
2.3.5. Risk of Bias and Language Considerations
This review conducts a non-meta-analytic mapping of evidence using a two-level comparative weighting (MCW/LW) approach, rather than estimating effect sizes; therefore, a formal assessment of the risk of bias at the study level does not apply to its objectives. This limitation is explicitly acknowledged, and transparency is emphasized through the protocolized search, independent double-checking, and reproducible coding and normalization.
Although the search was limited to English, the PRISMA flow indicates that this filter did not exclude any records (n = 52 before and after; articles excluded by language, n = 0), as shown in Figure 1. Furthermore, it is recognized that future updates should consider multilingual sources to broaden coverage.
2.4. Quantitative and Qualitative Analyses
A mixed-methods approach was employed to characterize the state-of-the-art of AI-BIM, as outlined in the corpus of 12 articles presented in Table 3. Quantitatively, weights for AI families and BIM subcriteria were estimated and subsequently integrated at the corpus level to obtain comparable indicators across studies. Qualitatively, problems, applications, tools, and methodological features were coded and triangulated with numerical patterns. The design follows the principles of multicriteria decision analysis (MCDA), articulating preferences and trade-offs among competing objectives [].
2.4.1. Data Extraction and Initial Weight Processing
All analyses were conducted in Python (v.3.13.5) within Jupyter notebooks (v.7.3.2), relying exclusively on open-source software to ensure reproducibility. The 12 articles were ingested as PDFs, and text was extracted using pdfminer.six. Linguistic preprocessing employed spaCy (en_core_sci_sm) and PyTextRank, which comprised sentence and token segmentation, basic normalization (including case unification and removal of non-informative characters), and key-phrase extraction.
A seed dictionary for AI-BIM was constructed from the extracted text to guide the identification of substantive mentions. The AI dimension encompasses ML, DL, GAN, and AutoML & MLOps, while the BIM dimension includes BIMM, CD, MBIM 4D/5D, and DT. The dictionary incorporates common synonyms and acronyms and is applied case-insensitively to avoid formatting bias. Lemmatization is used solely to reinforce co-occurrence analysis and semantic consistency; it does not affect the frequency counts that underpin the weightings.
For each study, two input vectors are derived from the text counts: (i) a vector of AI families and (ii) a vector of BIM integration sub-criteria. When a study contains no explicit mention of any AI families, a neutrality rule assigns equal weights to all families to preserve comparability across studies. If no mentions are found for a given integration subcriterion, it receives a weight of zero in that study. These vectors support the subsequent steps: study-level normalization and corpus-level aggregation.
To aid readability, we define the symbols used in this subsection before formal expressions. We index AI families by i ∈ {ML, DL, GAN, AutoML/MLOps} and BIM sub-criteria by j ∈ {BIMM, CDE, CD, MBIM, DT}. Let n and m denote their counts, and N the number of papers. MCW refers to the study-level weight that summarizes the relative prominence of each AI family within a paper. LW denotes the within-family distribution that indicates which BIM application axes that family is directed toward in the study. GW captures the overall contribution of a specific AI family × BIM axis pair, reflecting the combined emphasis of the family and its focus on that axis in the study’s evidence. RW expresses this contribution in percentage terms to facilitate straightforward, like-for-like comparisons across rows and tables.
2.4.2. Calculation of Global Weights (GW) and Relative Weights (RW)
Two weights are defined at the study level: the model-class weight (MCW), which distributes weight across AI families within a study, and the link-weight (LW), which distributes weight across BIM integration subcriteria within the same study. These levels are combined via multiplicative aggregation to yield the global weight (GW) of each AI-BIM pair, as formalized in Equation (1):
where i goes through AI families and j goes through BIM sub-criteria.
A study-level normalization is established. To ensure comparability across articles with differing lengths and terminological densities, weights at each level are imposed to sum to 1 within each study, as shown in Equations (2) and (3):
where n is the number of AI criteria IA.
where m is the number of BIM sub-criteria.
From Equations (1)–(3), it follows that the total global weights of the study also sum to 1, as seen in Equation (4):
To facilitate interpretation, we express the overall weight as a percentage called relative weight (RW), as shown in Equation (5):
In the absence of mentions of AI families in a study (total count: zero), we apply a neutral rule assigning equal weights in MCW to avoid introducing arbitrary preferences. If a BIM subcriterion has no mentions, its component in LW is set to zero, so the corresponding pair does not contribute to GW or RW. These decisions preserve the normalization constraints.
Equations (2)–(4) prevent biases when specific terms do not appear in the text. Table 4 presents the pseudocode for the two-level weighting per paper, including inputs, fallback rules (uniform MCW and LW = 0), normalization of MCW/LW, and the computation of GW and RW, along with the per-study consistency checks.
Table 4.
The proposed AI-BIM two-level weighting.
2.4.3. Aggregation of Results and Unified Relative Weight (URW)
With the global weights by study, the results are integrated at the corpus level to obtain the unified relative weight (URW) for each BIM subcriterion. The GWi,j of all studies are summed and projected onto each subcriterion j (summing over i) then normalized to 100%, as shown in Equation (6):
where N is the number of papers analyzed, in this case, 12 units.
Because weights sum uniformly per study, Equation (6) reduces to averaging GW by subcriterion across studies (including zeros), as outlined in Table 5.
Table 5.
Corpus-level unified relative weight (URW).
2.4.4. Qualitative Analysis
A preliminary study was conducted to define the vocabulary of interest; drawing on a recent exploratory review, a set of keywords was identified to operationalize four AI families and five BIM application areas. This group guided both the search and selection of the 12-article corpus, as well as the mention-extraction procedure described in Section 2.1.
Each study was read in full and descriptively coded; we then triangulated these notes with MCW, LW, GW, RW, and URW to relate narratives to weights.
Finally, we compared corpus patterns with preliminary expectations to identify convergences, gaps, and shifts, thereby guiding the discussion without altering the calculations.
2.4.5. Quality Control and Numerical Consistency
The identities in Equations (2) and (3) are verified in each study; at the corpus level, . In the absence of evidence in AI or BIM, the procedures described in Section 2.4.1 and Section 2.4.2 are applied, with equal weights across AI families and/or zero weights for BIM subcriteria without mentions, to maintain consistency and avoid biases due to terminological silence.
2.4.6. Export and Traceability
After computing the weights, three outputs are generated for downstream analysis and visualization:
- Per study tables with MCW, LW, GW, and RW.
- Corpus-level summaries of URW by AI family and by BIM subcriterion.
- The IA-BIM crossing matrix (URW, %) used in Section 3.6.
Traceability is ensured by keeping the study identifier (P00k) in every table and in the exported filenames, which provides a one-to-one link to the bibliographic record in Table 3 and allows any figure to be reconstructed from its source.
3. Results
This section presents the findings of the comparative analysis described in the previous Materials and Methods section.
3.1. Criteria Matrix AI-BIM
Table 6 presents the AI–BIM matrix. Columns are arranged in pairs: for each study P00k, the first column shows its Methodological Composite Weight (P00k_MCW), and the adjacent column shows its Result Weight (P00k_RW). Rows are organized in blocks. Each block begins with the AI family (ML, DL, GAN, AutoML/MLOps) and is followed by the five BIM application rows (BIMM, CDE, CD, MBIM, and DT). This paired-column layout, with rows grouped into blocks, makes the structure explicit and enables quick, like-for-like comparisons.
Table 6.
(A) AI-BIM weighting results by (normalized MCW and RW)—Panel A: P001–P006. (B) AI-BIM weighting results by (normalized MCW and RW)—Panel B: P007–P012.
It should be noted that the empty cells shown in Table 6 (A and B) are equivalent to 0, as the study does not report information on the evaluated aspect. The sum of all RW values in a column, across all families and ambits, is approximately 100%, and the MCW values of the four families sum to 1 in each paper.
The above Table 6 (A and B) are recommended to be read in three complementary ways:
- Intra-paper (vertical): For each article, there are two columns in Table 6B: P00k_MCW and P00k_RW. The first column shows the relative weight of each AI family in a study, while the second column indicates the BIM axis to which the study focuses on that family.
Example 1.
For paper P007, column P007_MCW indicates that with an MCW of 0.91, DL is the predominant AI. Within this block, examining the second column, P007_RW, reveals that with an RW of 91.18%, the emphasis is on DT. This finding indicates that the DT pipeline is strongly driven by DL. The result is illustrated graphically in Figure 2.
Figure 2.
E1-P007: DL to DT, DL-Driven Digital Twins (MCW = 0.91; DT RW = 91.18%).
Example 2.
When looking at paper P006 in Table 6A, column P006_MCW assigns a value of 0.96 to DL, and within that block, in column P006_RW, we observe that 73.33% is associated with the BIMM sub-criterion, which is consistent with the perception/model-centered workflows linked to the BIM model. The result is illustrated graphically in Figure 3.
Figure 3.
E2-P006: DL to BIMM, Perception/Model Focus (MCW = 0.96; BIMM RW = 73.30%).
Example 3.
As shown in Table 6B, for paper P010, ML contributes an MCW of 0.49, and BIMM carries the emphasis with an RW of 48.72%, typical of analyses on structured BIM attributes. The result is illustrated graphically in Figure 4.
Figure 4.
E3-P010: ML to BIMM, Structured Attribute Analytics (MCW = 0.49; BIMM RW = 48.72%).
- Intra-axis (horizontal, by sub-row): Based on the information shown in Table 6 (A and B), to see which AI family contributes more to a given BIM axis, select one sub-row and scan across studies within each AI block. In the DT subrow, the analysis from P001 to P012 reveals that the DL family consistently exhibits the presence of DT, with ML and GAN as secondary families, and AutoML/MLOps as occasional contributors.
Example 4.
In paper P007 (Table 6B), the weight RW = 91.18% of DT is concentrated in DL, with a negligible contribution, RW = 8.82%, from GAN; while in paper P008 (Table 6B), DL remains the main component (36.45%), but GAN (17.55%) and AutoML/MLOps (18.90%) provide significant support for DT. The result is illustrated graphically in Figure 5.
Figure 5.
E4-DT Axis (P007 vs. P008): DL Main Component with GAN/AutoML Support.
- Between papers (horizontal by family row): An analysis of the MCW row of a given AI family reveals how its relevance varies across studies.
Example 5.
When analyzing Table 6 (A and B), in the DL row, the MCW is moderate from P001 to P003 with values between 0.25 and 0.42 but increases sharply from P004 to P007 (0.90 to 0.96) and remains high from P010 to P012 (0.85 to 0.89), indicating DL-centric workflows. Conversely, in the ML row, the strongest signals appear at P002 (0.50), P003 (0.50), and P010 (0.49), while ML is marginal from P004 to P007 (≤0.03), showing a shift away from ML in those studies. The result is illustrated graphically in Figure 6.
Figure 6.
E5-MCW Across Papers: DL vs. ML Family Prominence.
3.2. Comparative Analysis of the Weights by Criterion for Each Paper
Figure 7 displays, for each paper (P001-P012), the MCW distribution across the four AI families (ML, DL, GAN, AutoML, and MLOps), with segments normalized and labeled for comparison. DL predominates in most studies, consistent with vision and time-series workflows on BIM artifacts and Digital Twins [,].
Figure 7.
Distribution of MCW by Main Criterion in each Paper.
A second group of works presents hybrid ML-DL profiles, featuring a relatively balanced distribution between the two families. Notable examples of these approaches are projects P002 and P003, where ML achieves a feature mixing coefficient MCW of 0.50 and coexists with DL, which has coefficients of 0.25 and 0.42, respectively. ML handles structured attributes, while DL processes images, point clouds, and time series; both layers integrate within the BIM/DT flow [].
Among the DL-centric profiles, several studies show an almost exclusive concentration; for example, P004 (MCW_DL = 0.91) and P005-P007 (values visually close to the upper end in Figure 7), which is consistent with vision and time-series workflows in BIM artifacts and digital twins [].
A distinctive feature of the set is the GAN, present in P001 (MCW_GAN = 0.36) and P002 (0.25), and perceptible although in lesser preponderance in other cases. These contributions often serve as a complement or synthesis of data to strengthen deep learning models against label scarcity or domain changes between works and projects [].
In terms of scalability and operation, the AutoML & MLOps family appears to be in its early stages in most of the studies shown in Figure 7 but gains greater prominence in study P008 (MCW_AutoML&MLOps = 0.21), where the lifecycle automation component acquires significant relative weight, signaling a shift from prototyping to governed deployment and monitoring [].
Overall, the observed profiles reveal three behaviors: (i) DL predominates in perception and operation scenarios (DT, reality capture), (ii) hybrid ML+DL when tabular attributes and rich signals coexist, and (iii) occasional emergence of GAN/AutoML as accelerators, the former to address data limitations and the latter to professionalize the model lifecycle. The low systematic signal of AutoML & MLOps, as well as the heterogeneity in the use of CDE/ISO 19650, highlight areas of opportunity to scale the AI-BIM integration with greater traceability and reproducibility.
3.3. Aggregation and URW by Criterion
In this section, the corpus-level aggregation by AI families is presented. Table 7 summarizes the unified relative weight of each family (ML, DL, GAN, and AutoML & MLOps), computed from the per-study GW matrices and normalized according to Equations (5) and (6) so that the total equals 100%. This table enables a comparison, under a standard metric, of the average contribution of each family across the 12 studies.
Table 7.
Unified relative weight (URW, %) by AI criteria.
Figure 8, based on the information in Table 7, is a normalized URW graph. It illustrates how the total weight is distributed among the five AI families (DL, ML, GAN, AutoML, and MLOps), with their proportions totaling 1, and each value is labeled for easy comparison. As shown in Figure 8, there is a clear bias towards DL, which accounts for 68.40% of the URW. This finding is consistent with the recent shift in the AECO sector toward perception tasks (2D-3D detection/segmentation, feature extraction) and time series analysis on BIM artifacts and Digital Twins, where DL performs better than purely tabular approaches. The added priority in the Digital Twin and BIM Modeling subcriteria is expected: these are the areas that benefit most from deep representations and high-frequency operational data [].
Figure 8.
Global distribution by AI Criterion (URW %).
Overall, three profiles emerge: DL-dominant pipelines for perception and operations (digital twins, reality capture); hybrid ML-DL when structured attributes and rich signals coexist; and tactical GAN-AutoML, where GAN mitigates data scarcity and AutoML & MLOps professionalize the model lifecycle. The weak and inconsistent signal of AutoML & MLOps, together with heterogeneous use of CDE/ISO 19650, points to clear opportunities to scale AI-BIM with stronger traceability and reproducibility [,,].
3.4. Comparative Analysis of Sub-Criteria Weights for Each Paper
Figure 9 is a bar chart with one column per item (P001–P012), dividing the normalized weight of each item (LW, 0–1) across the five BIM sub-criteria (BIMM, CDE, CD, BIM, MBIM 4D/5D, and DT), with segment values labeled for easy comparison. A bimodal pattern is observed in the corpus, in which the weights tend to concentrate in DT and BIMM. In the first group, DT visually dominates in several studies, such as P003, P005-P006, P009, and P011–P012, which is consistent with flows oriented toward operation and monitoring, as well as the integration of sensor data or reality capture. This preference aligns with the literature linking digital twins to perception, diagnosis, and prognosis tasks supported by AI, especially when BIM artifacts are coupled with data streams for asset management and performance [].
Figure 9.
Distribution of LW by Main Subcriterion in each Paper.
The second pole corresponds to BIMM, with peaks at P001, P009, and, notably, P010, as shown in Figure 9. This emphasis suggests cases where the parametric model is central to the flow (extraction/management of quantities, semantic classification, and consistency validations), as well as scenarios where the structured attributes of the model are decisive, and data traceability takes precedence over unstructured signals [,].
The complementary sub-criteria appear with less weight and a selective nature. MBIM 4D/5D exhibits non-trivial contributions in P006, P009, and P012, which are compatible with planning and cost analyses linked to the model. CD mainly emerges in P006 and P012, indicating that, although coordination uses are consolidated in practice, they occupy a more limited methodological space when the focus of the study shifts toward DT or other operational ambits []. Meanwhile, CDE only records sporadic signals, with greater prominence at P010, suggesting that information governance and versioning are present to support the flow, but not as the primary analytical focus [,].
The combinatorial study reinforces three archetypes within the set: (i) central DT cases, focused on operational performance and continuous updating of the asset’s status; (ii) central BIMM cases, where the model and its structured attributes explain most of the weight; and (iii) hybrid cases that combine BIMM with 4D/5D or specific coordination (clash) to solve planning/execution problems. This qualitative interpretation, derived from full-text inspection and contrasted with the weights, provides explanatory context about why certain sub-criteria have relative importance and how they manifest in the reported flows.
3.5. Aggregation and URW by Subcriterion
The corpus-level aggregation by BIM subcriteria is presented in Table 8, which reports the unified relative weight (URW) for BIMM, CDE, CD, MBIM 4D/5D, and DT. The values are obtained by summing the GW matrices across the 12 studies, projecting onto each subcriterion, and normalizing to 100% using Equations (5) and (6).
Table 8.
Unified relative weight (URW, %) by BIM subcriterion.
Figure 10 shows a normalized horizontal bar chart (URW%) that ranks five BIM sub-criteria (BIMM, CDE, CD, MBIM 4D/5D, and DT) from highest to lowest, with percentages labeled at the end of each bar for direct comparison. A higher corpus concentration is observed in DT (56.58%), followed by BIMM (34.38%). The 4D/5D multidimensional MBIM has an intermediate share (7.43%), while CDE (1.53%) and CD (0.08%) are marginal.
Figure 10.
Global distribution by Subcriterion BIM (URW %).
The dominance of DT and BIMM also aligns with evidence that deep learning and supervised learning deliver greater returns in perception and modeling tasks, such as vision on images/point clouds, and state prediction over operational data, than in traditional uses, such as clash coordination []. In contrast, the low weights for CDE and clash detection most likely reflect under-reporting and the lack of standardized metrics rather than lesser practical relevance. This interpretation aligns with the empirical profiling of BIM users, which reveals gaps in adoption and process standardization that obscure information-management practices, with CDE serving as the source of truth and a backbone for traceability [].
3.6. Cross Between Criteria and Subcriteria
Table 9 is presented below, presenting the URW% between AI applications and BIM applications, which consolidates the AI-BIM matrix at the corpus level, based on the rows by AI families (ML, DL, GAN, AutoML, and MLOps) and the columns by BIM subcriteria (BIMM, CDE, CD, MBIM, DT). Each cell indicates the unified relative weight of the AI-BIM pair, obtained from Table 6 (A and B) (summing GW per study and normalizing to 100% according to Equations (5) and (6)). This matrix provides the numerical basis for subsequent visualizations, allowing for the interpretation of the dominant and secondary couplings in the corpus under a standard metric.
Table 9.
URW (%) by AI family and BIM subcriterion.
The Sankey diagram in Figure 11, as well as the heatmap in Figure 12, shows that the flow mass primarily originates from DL and converges in DT and BIMM. This interaction suggests pipelines where DL processes rich signals, such as images, point clouds, and time series, and feeds digital twins for sensing, simulation, and prediction during operation. At the same time, BIM Modeling provides the semantic and geometric framework to align data and decisions. The direction from DL to DT and from DL to BIMM is consistent with the field’s evolution toward data-driven operational flows and with the role of DT as a computational container connected to the physical asset [,].
Figure 11.
Sankey diagram. AI-BIM with global percentages (URW).
Figure 12.
Heatmap of Average AI-BIM Integration.
The Sankey diagram also highlights the secondary contributions of ML to BIMM and DT, as well as the tactical role of GAN in data synthesis, augmentation, and simulation, which strengthens DL pipelines []. In contrast, the AutoML and MLOps streams refer to lifecycle orchestration and governance, including automated search of model pipelines, model and data registries, versioning, monitoring, and continuous integration and continuous delivery processes that automatically build, test, and deploy models safely into production, enabling traceability and reproducibility in DT/CDE workflows [,].
These patterns reflect multimodal architectures: DL addresses perception and asset status, while ML exploits structured model attributes for classification and estimation; both interact with BIM as semantic support for tasks and with the DT as an execution environment and feedback [,].
The weak couplings toward CDE and CD, with values close to zero across all families, suggest a reporting gap rather than a lack of relevance: the management of information and traceability enabled by the CDE is rarely quantified with comparable metrics. This scenario occurs even though the CDE acts as an authorized repository and single point of reference for data and models within the information governance framework. In turn, the still-nascent signal of AutoML & MLOps indicates that the integration of the model lifecycle (training, deployment, and monitoring) into the CDE is still under development, a necessary condition for scaling AI-BIM with reproducibility [,].
In line with the second analysis group, the corpus shows limited and heterogeneous reporting of ISO 19650/CDE artifacts and MLOps deliverables. When Digital Twin implementations are described, performance metrics dominate, while traceability and governance are rarely documented, including state transitions, version control, approvals, experiment tracking, and implementation and monitoring logs. This pattern helps explain the scarcity of comparable evidence on CDE and CD despite their practical relevance. The CDE is positioned as the contractual backbone and MLOps as the operational layer required to scale AI-BIM with traceability and auditability, in accordance with the International Organization for Standardization 2018 and with engineering guidance on maintainable machine-learning systems [].
4. Discussion
This section consolidates the interpretation of the findings, first by contrasting the predefined conceptual map (AI families and BIM application axes) with the quantitative evidence (MCW, LW, GW/RW, URW) and then by synthesizing cross-cutting patterns, practical implications, limitations, and needs for future research.
4.1. Key Concepts Analysis
The nine keywords (BIMM, CDE, CD, MBIM 4D/5D, DT, DL, ML, GAN, and AutoML & MLOps) are contrasted with the analyses presented in Section 3. For each element, the expected role in the AI-BIM integration, the quantitative evidence from the study, and the implications and gaps are reported.
4.1.1. AI Technologies Analysis
The AI dimension is framed into four categories spanning the value cycle: ML for explainable analytics on structured data; DL for perception and representation in images, point clouds, and time series; GAN for data synthesis and adaptation; and AutoML & MLOps for operationalizing models (search, deployment, and monitoring).
- Machine Learning:
ML provides explainable and efficient models for tasks based on BIM attributes (classification, estimation, and control) and integrates with vision outputs or sensors when the input is tabular. In the corpus, it reaches URW = 16.43%, with weights concentrated in ML-DT = 9.04% and ML-BIMM = 6.89%, while MBIM = 0.37% and CDE with CD ≈ 0 are the residual components. The pattern suggests a complementary role to DL, which is useful when interpretability and data governance are required. It is advisable to standardize features derived from the model and establish comparable protocols between projects [,,].
The prominence of ML-BIMM and ML-DT reflects contexts where project teams favor transparent models built on tabular BIM attributes and monitoring data for cost, delay, and failure risk predictions that must be explainable for contractual and managerial use [,].
- Deep Learning
DL addresses dense perception and representation across images, point clouds, and time series, and supports end-to-end pipelines within digital twins []. In our corpus, DL predominates (URW = 68.40%), with the strongest crossings in DL-DT (38.01%) and DL-BIMM (22.47%); the latter is exemplified by DL-based generation of BIM models directly from 2D plans []. An intermediate signal appears in DL-MBIM (6.54%), while CDE (1.31%) and CD (0.07%) remain low. This profile highlights DL’s reliance on labeled data and robust traceability, underscoring the need for standardized baselines and metrics to ensure reproducibility. In parallel, securing and auditing information flows in DT/CDE pipelines is crucial for trustworthy deployment [].
The strong DL-DT and DL-BIMM couplings correspond to use cases where convolutional and sequence-based models absorb images, point clouds, and dense sensor streams anchored to BIM/DT, enabling automated defect detection, occupancy inference, and performance optimization in data-rich assets [,].
- Generative Adversarial Networks
Generative approaches play a crucial role in enhancement and synthesis, as well as domain adaptation, to strengthen perception models. In the corpus, for GANs, an URW of 11.45% is recorded, with focuses on GAN-DT at 7.02% and GAN-BIMM at 3.99%, with the rest being marginal. The evidence is consistent with scenarios of label scarcity or variability between works; however, a reporting gap persists. Clear protocols are necessary to evaluate the quality of synthetic data (fidelity, diversity, and bias) and to assess their impact on subsequent tasks reproducibly. This fact includes recording the origin and transformations of data and models, as well as maintaining version control for each delivery [,].
The limited GAN-BIMM/DT links indicate a focused but realistic application: synthetic scenarios and data augmentation, where labeled records are scarce, supporting the more robust training of inspection and monitoring models without altering the built asset [,].
- Automated Machine Learning & Machine Learning Operations
Low weight is not due to a lack of relevance, but to limited implementation and documentation: few works describe pipelines with version control for data and models, as well as service metrics. Integrating these elements within the CDE/DT is a condition for scaling with reproducibility [,].
The emerging connections between AutoML/MLOps and CDE, BIMM, and DT signal early efforts to formalize experiment tracking, deployment, and monitoring in governed environments, a prerequisite for scalable and auditable AI-BIM services in practice [,].
4.1.2. BIM Applications Analysis
This subsection addresses five BIM ambits (BIMM, CDE, CD, MBIM 4D/5D, and DT), where integration with AI is expressed. Quantitative patterns and qualitative evidence are synthesized to compare the role of each axis and identify implications and gaps.
- BIM Modeling
BIMM acts as a semantic-geometric backbone to structure AI data and support attribute extraction, labeling, and validation. It accounts for URW = 34.38%, primarily driven by DL = 22.47% and ML = 6.89%. Its guiding role is confirmed; however, there is a need for a greater explicit connection with CDE for traceability and with MBIM to close the planning-operation cycle [,].
The strong intersections with ML and DL demonstrate that many real-world implementations utilize BIMM as the stable, queryable layer from which quantities, states, and risks are computed, before extending to more complex DT-based control loops [,].
- Common Data Environment
The CDE should be the agreed-upon and traceable repository for data and model governance; however, its signal is weak (URW = 1.53%) and dispersed. This fact suggests a deficit in comparable metrics and reports, not a lack of importance. There is a need to operationalize within the CDE a provenance and traceability scheme (recording the origin and transformations of data and models), apply systematic versioning of datasets/code/models, and establish mechanisms for auditing analytical products (recording experiments and reproducible evidence), in line with ISO 19650 and the CDE guidelines [,,].
- Clash Detection
As a coordination ambit, CD could benefit from supervised learning and graphs; however, it appears to be residual (URW = 0.08%). The focus of the corpus shifts toward perception/operation; there remains an opportunity for standardized datasets and tasks that measure the impact and prioritization of interferences with AI [,].
- Multidimensional BIM 4D/5D
MBIM 4D/5D links the model with time and cost to support planning, monitoring, and control, consolidating BIM as the semantic backbone of schedule and budget []. In the corpus, MBIM achieves a URW of 7.42%, with DL-MBIM accounting for 6.54%, while ML, GAN, and AutoML each contribute less than 1%. Functionally, MBIM acts as a hinge between modeling and operations; however, the scale of sequential learning in projects remains limited by inconsistent time series and labeling. These constraints are mitigated when MBIM is embedded in CDE/DT-centric digital management frameworks that integrate IoT and analytics to standardize WBS/Gantt, define control points, and produce reproducible temporal traces []. Complementarily, recent advances in 4D scheduling for modular construction reinforce the operational value of MBIM [].
The moderate ML/DL–MBIM weights are consistent with applications where predictive models convert simulated 4D/5D scenarios into decisions on sequencing, logistics, and contingencies, improving schedule reliability and cost control beyond purely visual planning [,].
- Digital Twins
DT functions as an operational container for sensing, simulation, and prediction, and is the natural deployment destination for AI. It leads with URW = 56.57%, driven by DL-DT = 38.01% and, to a lesser extent, ML-DT = 9.04%, GAN-DT = 7.02%, and AutoML-DT = 2.50%. The domain confirms the shift toward operational exploitation and the need to consolidate DT-CDE-MLOps for traceability and continuous improvement [,,].
Why does deep learning tend to dominate traditional learning? The leading DL-DT crossings suggest that twins become effective decision platforms when high-frequency sensing is combined with deep models for anomaly detection, predictive maintenance, and energy-control actions executed directly on the asset, rather than relying on static visualization tools [,].
4.2. Cross-Study Synthesis of Primary Couplings
Our findings generally confirm the suitability of the keywords defined in the preliminary phase, as well as four families of AI tools and five BIM applications to characterize the AI-BIM interaction in the corpus. The initial selection effectively guided the formation of the 12 articles set and the extraction of mentions. The nine-keyword scheme strikes a balance between precision and comprehensiveness; however, terminological variation in the literature (e.g., Building Information Modeling/Modeling, Digital Twin(s), Common Data Environment/CDE, 4D/5D planning/costing, conflict/coordination) can influence both retrieval and subsequent weighting. To mitigate bias, case-insensitive matching and controlled synonym lists were used.
Across heatmaps, Sankey flows, and crossing matrices, the interaction exhibits a coherent architecture: DL to DT and DL to BIMM carry the majority of the mass, while ML to DT and ML to BIMM provide a structured and interpretable layer. This pattern aligns with perception and time-series pipelines that are anchored to the asset model and complemented by those of feature-level analytics on BIM attributes [,]. Corpus-level averages, for example, DL to DT approximately 0.38 and DL to BIMM approximately 0.225, support a layered design in which DL addresses state estimation, forecasting, and anomaly detection within DT. In contrast, ML operates on semantically grounded BIM features, with BIM as the semantic backbone and DT as the operational container []. This reading aligns with field trends, including a BIM-enabled multi-objective design that couples simulation with random-forest metamodeling and NSGA-II search, and DT-based HVAC maintenance that integrates rule-based diagnostics with ML [,].
Conversely, the low signals observed in CDE and CD reflect measurement and reporting gaps rather than marginality. MBIM 4D/5D often acts as a bridge between modeling and operations, reinforcing the need to operationalize comparable governance indicators for CDE and CD without altering the taxonomy. Finally, GAN and AutoML/MLOps appear smaller but strategic; GAN as a lever for data synthesis and augmentation, and AutoML/MLOps as the path to governed operations with traceability—warranting the prioritization of studies that explicitly connect CDE, BIMM, and DT with these enablers to scale reproducibility [,].
In practice, generative approaches, such as GANs, remain underutilized due to practical barriers rather than limited potential: data scarcity and IP/privacy constraints, training instability with ambiguous evaluation, weak CDE-level provenance, and limited MLOps reporting, as well as high compute demands for 3D and sequential models. Addressing these with standardized datasets and metrics, explicit CDE lineage for synthetic data, and MLOps-ready assessment would enable disciplined, broader adoption and make GANs a more reliable complement to DL-centric pipelines.
In summary, the confrontation between what is defined in Section 2.1, where the keywords are justified, and the empirical findings confirms the core of the taxonomy: DT and BIMM as dominant applications, DL with ML as support, as analytical engines, and GAN, along with AutoML/MLOps, as data and operational enablers. The areas to strengthen do not require changing the categories but rather clarifying and applying comparable metrics for CDE and CD, as well as explicitly linking CDE-BIMM-DT with these enablers, to increase traceability, reproducibility, and scalability in AI-BIM integration.
4.3. Practical Implications
Looking ahead, the MCW/LW framework can be implemented by project lifecycle phase, maintaining standardization at the study level and aggregation at the corpus level (URW). This phase-based perspective reveals where AI families best integrate with BIM sub-criteria at each stage and how these patterns evolve, enabling direct comparisons under a transparent weighting scheme. Based on this evidence map, the implications translate AI-BIM signals into practical steps for AECO stakeholders. These include the need to adopt governance practices through robust CDE, operationalize reproducible workflows through MLOps, and prioritize use cases aligned with phases (e.g., DL to DT and ML to BIMM). Practical actions for owners, contractors/designers, operators, and PMO/procurement departments are then presented.
Based on the study’s findings, it is recommended that owners and public agencies formalize the CDE framework as the contractual backbone, including model/data registries, version control, procedure tracking, and consider MLOps deliverables, with deployment and monitoring records to maintain auditable and maintainable development technology DT. General contractors and designers align use cases with data reality, DL with DT for vision/time series, and ML with BIMM for attribute analysis, label planning, and documenting workflows for reproducibility. Facility/asset operators can establish results-based KPIs, such as energy usage and fault resolution time, linked to DT dashboards and institute retraining programs with rollback policies. Collectively, these steps translate the evidence into implementable policy, de-risking AI-BIM adoption. They ensure traceable, comparable pipelines with measurable cost, schedule, and sustainability impact.
4.4. Limitations and Future Research
Despite its contributions, this review has limitations. First, the results depend on a unique WoS corpus and the period from 2022 to 2025. Expanding the dataset could alter the relative weighting, particularly in poorly documented areas such as CDE and CD, by improving comprehensiveness and coverage. However, the workflow is fully reproducible and will be rerun with expanded datasets, recording changes in inclusion and classification. Second, the synthesis is non-meta-analytic and does not include a formal study-level risk-of-bias appraisal, consistent with our mapping and weighting aims rather than effect estimation. Third, the two-level weighting is derived from full-text, keyword-based mention counts, which can be influenced by reporting heterogeneity across studies. Finally, the inclusion thresholds, JCR quartile, and minimum citation count may skew the corpus toward more visible venues. Nevertheless, the framework is reproducible and can be strengthened in future updates by adding new databases (e.g., Scopus), incorporating multi-lingual sources, running sensitivity analyses on weights, and adopting standardized benchmarks to enhance comparability and coverage.
In terms of future research, three priorities are identified: (i) standardizing metrics and datasets for CDE and CD to enable evaluation of governance, traceability, and coordination under comparable criteria; (ii) strengthening data generation/adaptation (e.g., generative approaches or federated learning) in response to the scarcity and heterogeneity inherent in AI-BIM; and (iii) operationalizing AI within the CDE through MLOps practices (data/model versioning, automated testing, drift monitoring) and real-scale longitudinal assessments that quantify impacts on productivity, costs, energy, and emissions. These priorities follow directly from the empirical patterns observed in the corpus.
5. Conclusions
This study characterized the interaction between AI tools and BIM applications using a reproducible procedure, based on a systematic literature review. Full-text reading and descriptive coding were integrated with a two-level quantitative scheme, which assigns weights by AI family and BIM subcriteria and aggregates at the corpus level. This approach enabled the comparison of heterogeneous frameworks under a common metric basis (MCW, LW, GW, RW, URW). Normalizations were performed per study, and traceable export facilitated subsequent verification.
Taken together, the evidence indicates that research activity clusters around DT and BIMM because these tasks align with data-rich, model-centric workflows in which DL excels at perception and feature extraction, while ML supports attribute-level analytics. By contrast, CDE and CD appear underrepresented, not due to a lack of relevance, but rather because of fragmented reporting, missing shared protocols, and outcome measures that obscure the benefits of governance and integration. Generative approaches and AutoML-MLOps remain emergent and underdocumented, particularly in terms of data lineage, retraining, and deployment. The study highlights these asymmetries and outlines a practical agenda, consisting of consolidating evaluation resources for CDE and CD, aligning DL use cases with DT operational needs, and institutionalizing MLOps to move from prototypes to maintainable services.
In terms of contribution, a minimal taxonomy, five BIM ambits, and four AI families are provided, together with a standardized methodology for weighting and aggregation (MCW/LW/GW/RW/URW), enabling transparent comparisons between studies and facilitating joint interpretation of hybrid ML-DL profiles and emerging signals GAN, MLOps. The calculation structure and its default policies ensure consistency by assigning equal weights across AI families when mentions are missing, and by assigning zeros to BIM subcriteria when evidence is absent.
The findings underscore that governance and reproducibility are prerequisites for scalable AI-BIM. It is therefore recommended to operationalize ISO 19650 within the CDE through versioned information containers, explicit approval states, and documented lineage, and to institutionalize MLOps as standard deliverables, including registries, deployment logs, drift monitoring, and defined retraining and rollback procedures. Adopting this minimal indicator set will improve the identification of relevant texts, strengthen reproducibility, and increase the generalizability of the proposed weighting framework.
A further step involves operationalizing the framework as a phased, integrated dashboard that links the weights of the AI and BIM axes to impact metrics, such as energy consumption intensity and CO2e for sustainability, schedule reliability and cost variance for 4D/5D, and fault resolution time for operations. Reassessing the weights by phase (design, construction, delivery, O&M) and combining them with these key performance indicators (KPIs) would generate comparable cost–carbon-performance ratios to guide decision-making and procurement. As the datasets expand, the same tools enable sensitivity analysis, allowing teams to track how AI and BIM contributions vary across phases and prioritize interventions with the highest verified value.
Author Contributions
Conceptualization, E.F. and R.V.; methodology, E.F.; software, R.V.; validation, E.F. and R.V.; formal analysis, E.F.; investigation, R.V.; resources, E.F.; data curation, R.V.; writing—original draft preparation, R.V.; writing—review and editing, E.F.; visualization, R.V.; supervision, E.F.; project administration, R.V.; funding acquisition, E.F. All authors have read and agreed to the published version of the manuscript.
Funding
The authors would like to acknowledge the support provided by the Universidad San Sebastián, Chile, Universidad Técnica Federico Santa María, Chile, and the National Agency for Research and Development (ANID) of the Government of Chile under the National Fund for Scientific and Technological Development (FONDECYT) under grant number [1251708].
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Begić, H.; Galić, M. A Systematic Review of Construction 4.0 in the Context of the BIM 4.0 Premise. Buildings 2021, 11, 337. [Google Scholar] [CrossRef]
- Sacks, R.; Brilakis, I.; Pikas, E.; Xie, H.S.; Girolami, M. Construction with Digital Twin Information Systems. Data-Centric Eng. 2020, 1, e14. [Google Scholar] [CrossRef]
- Darko, A.; Chan, A.P.C.; Adabre, M.A.; Edwards, D.J.; Hosseini, M.R.; Ameyaw, E.E. Artificial Intelligence in the AEC Industry: Scientometric Analysis and Visualization of Research Activities. Autom. Constr. 2020, 112, 103081. [Google Scholar] [CrossRef]
- Eastman, C.; Teicholz, P.; Sacks, R.; Liston, K. BIM Handbook: A Guide to Building Information Modeling for Owners, Designers, Engineers, Contractors, and Facility Managers, 3rd ed.; Wiley: Hoboken, NJ, USA, 2018. [Google Scholar]
- ISO 19650-1:2018; Organization and Digitization of Information about Buildings and Civil Engineering Works, Including Building Information Modelling (BIM)—Information Management Using Building Information Modelling—Part 1: Concepts and Principles. International Organization for Standardization (ISO): Geneva, Switzerland, 2018.
- Ford, J. UK BIM Framework Guidance—Facilitating the Common Data Environment; Centre for Digital Built Britain: London, UK, 2020. [Google Scholar]
- Kreuzberger, D.; Kühl, N.; Hirschl, S. Machine Learning Operations (MLOps): Overview, Definition, and Architecture. IEEE Access 2023, 11, 31866–31879. [Google Scholar] [CrossRef]
- Boje, C.; Guerriero, A.; Kubicki, S.; Rezgui, Y. Towards a Semantic Construction Digital Twin: Directions for Future Research. Autom. Constr. 2020, 114, 103179. [Google Scholar] [CrossRef]
- Zhao, Y.; Wang, N.; Liu, Z.; Mu, E. Construction Theory for a Building Intelligent Operation and Maintenance System Based on Digital Twins and Machine Learning. Buildings 2022, 12, 87. [Google Scholar] [CrossRef]
- Akinosho, T.D.; Oyedele, L.O.; Bilal, M.; Ajayi, A.O.; Delgado, M.D.; Akinade, O.O.; Ahmed, A.A. Deep Learning in the Construction Industry: A Review of Present Status and Future Innovations. J. Build. Eng. 2020, 32, 101827. [Google Scholar] [CrossRef]
- Chai, P.; Hou, L.; Zhang, G.; Tushar, Q.; Zou, Y. Generative Adversarial Networks in Construction Applications. Autom. Constr. 2024, 159, 105265. [Google Scholar] [CrossRef]
- Luleci, F.; Catbas, F.N.; Avci, O. A Literature Review: Generative Adversarial Networks for Civil Structural Health Monitoring. Front. Built Environ. 2022, 8, 1027379. [Google Scholar] [CrossRef]
- Hutter, F.; Kotthoff, L.; Vanschoren, J. Automated Machine Learning; Vanschoren, J., Ed.; Springer International Publishing: Cham, Switzerland, 2019; ISBN 978-3-030-05317-8. [Google Scholar]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 Statement: An Updated Guideline for Reporting Systematic Reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
- Fujii, T.Y.; Hayashi, V.T.; Arakaki, R.; Ruggiero, W.V.; Bulla, R.; Hayashi, F.H.; Khalil, K.A. A Digital Twin Architecture Model Applied with MLOps Techniques to Improve Short-Term Energy Consumption Prediction. Machines 2021, 10, 23. [Google Scholar] [CrossRef]
- Liu, Y.; Li, T.; Xu, W.; Wang, Q.; Huang, H.; He, B.-J. Building Information Modelling-Enabled Multi-Objective Optimization for Energy Consumption Parametric Analysis in Green Buildings Design Using Hybrid Machine Learning Algorithms. Energy Build. 2023, 300, 113665. [Google Scholar] [CrossRef]
- Pan, Y.; Zhang, L. A BIM-Data Mining Integrated Digital Twin Framework for Advanced Project Management. Autom. Constr. 2021, 124, 103564. [Google Scholar] [CrossRef]
- Pacheco, A.; Pacheco-Pumaleque, L.; Uribe-Hernández, Y.; Mogrovejo, A.; Pariona-Luque, R.; Añaños-Bedriñana, M.; Alvarez, A.; Peñaranda, C.; Pineda, F.; Ruiz, M.; et al. Transforming Construction Management in Peru: The Role of BIM in Innovation and Efficiency. Sage Open 2024, 14, 1–12. [Google Scholar] [CrossRef]
- Lin, W.Y.; Huang, Y.-H. Filtering of Irrelevant Clashes Detected by BIM Software Using a Hybrid Method of Rule-Based Reasoning and Supervised Machine Learning. Appl. Sci. 2019, 9, 5324. [Google Scholar] [CrossRef]
- Nawaz, A.; Su, X.; Nasir, I.M. BIM Adoption and Its Impact on Planning and Scheduling Influencing Mega Plan Projects- (CPEC-) Quantitative Approach. Complexity 2021, 2021, 8818296. [Google Scholar] [CrossRef]
- Mayouf, M.; Jones, J.; Elghaish, F.; Emam, H.; Ekanayake, E.M.A.C.; Ashayeri, I. Revolutionising the 4D BIM Process to Support Scheduling Requirements in Modular Construction. Sustainability 2024, 16, 476. [Google Scholar] [CrossRef]
- Abdelrahman, M.M.; Chong, A.; Miller, C. Personal Thermal Comfort Models Using Digital Twins: Preference Prediction with BIM-Extracted Spatial–Temporal Proximity Data from Build2Vec. Build. Environ. 2022, 207, 108532. [Google Scholar] [CrossRef]
- Arsiwala, A.; Elghaish, F.; Zoher, M. Digital Twin with Machine Learning for Predictive Monitoring of CO2 Equivalent from Existing Buildings. Energy Build. 2023, 284, 112851. [Google Scholar] [CrossRef]
- Lim, Y.T.; Yi, W.; Wang, H. Application of Machine Learning in Construction Productivity at Activity Level: A Critical Review. Appl. Sci. 2024, 14, 10605. [Google Scholar] [CrossRef]
- Jacobsen, E.L.; Teizer, J. Deep Learning in Construction: Review of Applications and Potential Avenues. J. Comput. Civil. Eng. 2022, 36, 03121001. [Google Scholar] [CrossRef]
- Keeney, R.L.; Raiffa, H. Decisions with Multiple Objectives; Cambridge University Press: Cambridge, UK, 1993; ISBN 9780521438834. [Google Scholar]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Networks. Commun. ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
- Tian, J.; Che, C. Automated Machine Learning: A Survey of Tools and Techniques. J. Ind. Eng. Appl. Sci. 2024, 2, 71–76. [Google Scholar] [CrossRef]
- Forcael, E.; Puentes, C.; García-Alvarado, R.; Opazo-Vega, A.; Soto-Muñoz, J.; Moroni, G. Profile Characterization of Building Information Modeling Users. Buildings 2023, 13, 60. [Google Scholar] [CrossRef]
- Patacas, J.; Dawood, N.; Kassem, M. BIM for Facilities Management: A Framework and a Common Data Environment Using Open Standards. Autom. Constr. 2020, 120, 103366. [Google Scholar] [CrossRef]
- Zhan, S.; Wichern, G.; Laughman, C.; Chong, A.; Chakrabarty, A. Calibrating Building Simulation Models Using Multi-Source Datasets and Meta-Learned Bayesian Optimization. Energy Build. 2022, 270, 112278. [Google Scholar] [CrossRef]
- Hosamo, H.H.; Svennevig, P.R.; Svidt, K.; Han, D.; Nielsen, H.K. A Digital Twin Predictive Maintenance Framework of Air Handling Units Based on Automatic Fault Detection and Diagnostics. Energy Build. 2022, 261, 111988. [Google Scholar] [CrossRef]
- Urbieta, M.; Urbieta, M.; Laborde, T.; Villarreal, G.; Rossi, G. Generating BIM Model from Structural and Architectural Plans Using Artificial Intelligence. J. Build. Eng. 2023, 78, 107672. [Google Scholar] [CrossRef]
- Salim, M.M.; Comivi, A.K.; Nurbek, T.; Park, H.; Park, J.H. A Blockchain-Enabled Secure Digital Twin Framework for Early Botnet Detection in IIoT Environment. Sensors 2022, 22, 6133. [Google Scholar] [CrossRef]
- Emunds, C.; Pauen, N.; Richter, V.; Frisch, J.; van Treeck, C. SpaRSE-BIM: Classification of IFC-Based Geometry via Sparse Convolutional Neural Networks. Adv. Eng. Inform. 2022, 53, 101641. [Google Scholar] [CrossRef]
- Piras, G.; Muzi, F.; Tiburcio, V.A. Digital Management Methodology for Building Production Optimization through Digital Twin and Artificial Intelligence Integration. Buildings 2024, 14, 2110. [Google Scholar] [CrossRef]
- Wu, S.; Hou, L.; Zhang, G.; Chen, H. Real-Time Mixed Reality-Based Visual Warning for Construction Workforce Safety. Autom. Constr. 2022, 139, 104252. [Google Scholar] [CrossRef]
- Ni, Z.; Zhang, C.; Karlsson, M.; Gong, S. A Study of Deep Learning-Based Multi-Horizon Building Energy Forecasting. Energy Build. 2024, 303, 113810. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).