1. Introduction
The building sector constitutes one of the most significant contributors to global energy consumption and greenhouse gas emissions, accounting for approximately 38% of total life-cycle carbon emissions worldwide and up to 50.9% in China [1,2,3]. Public buildings within this sector are of particular concern due to their complex energy systems, extended operating hours, and concentrated equipment loads [4,5]. Compared to residential buildings, the unit-area carbon emission intensity of public buildings is three to five times higher, establishing them as a critical focus for carbon reduction strategies.
Accurate carbon emission prediction plays a pivotal role in green building design; however, existing approaches face significant limitations. Traditional methods, such as life cycle assessment (LCA) and regression-based simplified models, rely heavily on high-dimensional or multi-stage data inputs [6], which are often unavailable during the early design stage [7]. International efforts, such as the European Union’s life-cycle database under the Energy Performance of Buildings Directive (EPBD) framework and ASHRAE’s simulation-based emission tools, have demonstrated progress in large-scale applications [8,9]. Concurrently, recent studies have explored the adoption of recurrent neural networks (e.g., LSTM and GRU). However, these methods generally require detailed and high-quality datasets that are difficult to obtain in practice, particularly during early-stage design or in grassroots construction projects. In China, although the “Building Carbon Emission Calculation Standard” (GB/T 51366-2019) [10] provides a regulatory framework, existing models such as Random Forest and Support Vector Regression still exhibit limited generalization ability under low-dimensional conditions [11].
Although the aforementioned high-dimensional methods provide valuable insights for detailed design stages, their applicability diminishes significantly during the critical early phases of architectural design [12]. During this phase, designers require rapid, simplified tools for comparative scenario analysis rather than exhaustive, data-intensive simulations. Consequently, the focus on low-dimensional prediction is not merely a concession to data scarcity but represents a deliberate methodological choice aligned with the practical constraints and decision-making needs of early-stage design [13]. This approach prioritizes expediency and accessibility, enabling architects to swiftly evaluate the carbon implications of fundamental massing and spatial decisions.
Several studies have aimed to support early-stage carbon prediction. Simplified regression models, for instance, provide computational efficiency by drawing on existing building stock data, yet they often overlook the complex, non-linear relationships between fundamental design parameters and carbon emissions. Likewise, rule-based methods and early benchmarks, while convenient for preliminary reference, offer limited flexibility and fail to reflect the nuanced interplay among a project’s specific area, height, and climatic conditions. These constraints, rooted in linear assumptions and weak generalization, point to a clear research need: an approach that preserves the ease of low-dimensional inputs while leveraging non-linear modeling to deliver accuracy close to that of more detailed, later-phase tools. This study responds to that need by introducing a lightweight multilayer perceptron (MLP) framework designed to balance these competing requirements.
To this end, we developed a carbon emission prediction model adapted to low-data scenarios in early design. The model is built around an MLP architecture that relies on three basic parameters commonly accessible at this stage: floor area, number of above-ground floors, and geographic region. We further constructed two composite variables through feature engineering, layers per unit area (LPA) and height-to-area ratio (HAR), which help quantify spatial compactness and vertical density. To strengthen non-linear representation and prevent overfitting, the network incorporates Swish activation functions, adaptive L2 regularization, and Dropout layers. Additionally, transfer learning is employed, using pre-training on large public datasets followed by fine-tuning on local samples, to improve model adaptability and robustness across varied contexts.
The contributions of this study are summarized as follows. First, it presents a practical application of a multilayer perceptron (MLP) network to carbon emission prediction under low-data conditions, effectively addressing the limitations of conventional linear regression methods. Second, it introduces two composite indicators, height-to-area ratio (HAR) and layers per unit area (LPA), which help quantify spatial compactness and vertical density, thus supporting early-stage low-carbon design decisions. Third, a user-friendly Python-based tool (using Python 3.9) is developed, allowing designers to obtain rapid carbon estimates with minimal input during schematic design. Together, this work highlights the potential of lightweight deep learning models to facilitate carbon-aware building design and supports the transition toward carbon neutrality in the construction sector.
2. Relevant Theories and Technical Route
2.1. Relevant Theories
2.1.1. Life Cycle Carbon Emission Theory
Building carbon emissions exhibit distinct stage-dependent characteristics, typically categorized into material production (Em), transportation (Et), construction (Ec), operation (Eo), and demolition (Ed) [14]. The total emissions can be expressed by the following equation:

$$E_{\mathrm{total}} = E_m + E_t + E_c + E_o + E_d$$
This life-cycle perspective (Figure 1) ensures that emissions from all structural and operational activities are accounted for [15,16,17]. However, during the early design stage, only low-dimensional descriptors—floor area, above-ground floor count, and geographic region—are typically available. Consequently, rather than estimating each stage separately, this study focuses on establishing a nonlinear mapping between these limited parameters and the aggregated Etotal, thereby embedding the implicit stage-level emission patterns within a neural network framework to facilitate rapid and practical prediction.
2.1.2. Multilayer Perceptron Theory
The Multi-Layer Perceptron (MLP) is a class of feed-forward artificial neural networks. As illustrated in Figure 2, its structure consists of an input layer, one or more hidden layers, and an output layer, making it one of the most fundamental and widely used deep learning models. Compared to traditional linear regression, the MLP can capture complex nonlinear relationships between input features through its nonlinear activation functions and multilayer structure. This capability makes it particularly suitable for modeling low-dimensional, small-sample datasets with high feature interactivity [18].
In the context of building carbon emission calculation, the multilayer perceptron demonstrates superior nonlinear modeling and feature interaction capabilities compared to traditional linear regression models. Building carbon emissions are influenced by the superposition of numerous factors—such as floor area, number of floors, and regional climate—which exhibit significant nonlinear interactions. These complex interactions are often inadequately captured by linear models [19]. In contrast, the MLP architecture, leveraging its multilayer structure and nonlinear activation functions, can automatically learn the underlying patterns from limited features and achieve high-precision fitting even with low-dimensional parameter inputs. Furthermore, the MLP offers the flexibility to incorporate manually constructed interaction variables (e.g., layers per unit area, height-to-area ratio) and exhibits strong structural extensibility and data adaptability. These properties make it especially suitable for the rapid prediction of building carbon emissions in scenarios where detailed data are unavailable during the early design stage.
2.2. Technical Route
This study innovatively applies a multilayer perceptron (MLP) neural network to the carbon emission prediction of public buildings. This approach provides a novel solution to the challenge of carbon emission estimation in low-dimensional, small-sample scenarios [20]. Leveraging its unique layered structure and nonlinear activation functions, the MLP effectively captures the complex interactions among limited parameters—such as building area, number of floors, and regional climate—thereby overcoming the expressive limitations inherent in traditional linear models.
In terms of methodological design, a lightweight three-layer MLP network (64-32-16 neurons) was constructed. The Swish activation function was adopted to enhance nonlinear modeling capability, and composite features—such as layers per unit area (LPA) and height-to-area ratio (HAR)—were innovatively introduced. These engineered features significantly enhance the information density of the low-dimensional input data. To address the challenge of limited sample size, a transfer learning strategy is employed. The model is first pre-trained on large-scale public datasets to learn a generalized feature representation and is subsequently fine-tuned on the local data distribution. This strategy enables the model to maintain excellent generalization performance with a dataset of only 150 samples [21].
As outlined in the technology roadmap (Figure 3), this research follows a complete iterative cycle: problem definition → feature engineering → model design → transfer learning → validation analysis → tool development. Finally, the model is encapsulated into a Python-based rapid calculation tool. This tool allows practitioners to obtain a reliable carbon emission estimate by inputting only three basic parameters: floor area, number of floors, and geographical region. It provides practical technological support for green building design and verifies the feasibility of applying the multilayer perceptron approach to carbon emission prediction in low-dimensional building scenarios.
3. Model Design and Methods
3.1. Input Feature Construction and Preprocessing
3.1.1. Data Source
The local dataset was compiled by selecting office buildings from journal publications and government disclosures within the past five years. For each building, floor height, area, geographical region, and total carbon emissions were collected and integrated into a structured dataset. This data collection approach ensures high levels of authenticity, accuracy, and comprehensiveness [22]. All selected office buildings feature reinforced concrete structures. Variation in their carbon emissions primarily stems from differences in height, area, geographical location, and spatial characteristics, which are influenced by building form and construction complexity [23,24]. The construction complexity varies significantly between low-rise and high-rise buildings. These inherent characteristics provide valuable insights for parameter design in this study.
According to the actual data collection, three types of core building parameters, as shown in Table 1 (Core Data Parameter Map), are selected as raw inputs:
3.1.2. Feature Enhancement and Interaction Variables
To enhance the model’s expressive capability, feature interaction design was implemented through the construction of the following derived variables:

$$\mathrm{LPA} = F/A, \qquad \mathrm{HAR} = F^2/A$$

where $A$ is the floor area and $F$ is the number of above-ground floors.
The selection of LPA and HAR as engineered features is grounded in a dual theoretical foundation addressing both physical drivers of building carbon emissions and mathematical limitations of low-dimensional data modeling [25,26,27]. This approach transforms basic early-stage design parameters into informative proxies characterizing building spatial configuration.
The primary rationale stems from the principles of Life Cycle Carbon Emission theory, which posits that a building’s total carbon footprint (Etotal) is an aggregate of emissions from material production, construction, operation, and demolition stages. However, during the early design phase, detailed data for modeling these stages individually are unavailable. The parameters of floor area (A) and number of floors (F) are accessible but insufficient on their own to capture the complex interplay between a building’s form and its life-cycle carbon intensity [28]. Herein lies the innovation of LPA and HAR. Rather than being arbitrary mathematical constructs, they serve as quantifiable proxies for fundamental architectural properties that directly influence emissions across the life cycle. The parameter LPA = F/A quantifies horizontal compactness. A high LPA value indicates a building with a greater concentration of floor area, which typically correlates with intensified energy use per unit area during operation (e.g., from centralized HVAC and elevator systems) but may also imply structural efficiency in material use. Conversely, the parameter HAR = F²/A captures vertical density or slenderness. A high HAR value signifies a tall, narrow building, which is associated with increased embodied carbon from the structural systems required to resist lateral loads, as well as altered operational energy profiles due to a higher surface-area-to-volume ratio. By incorporating LPA and HAR, the model is provided with condensed, physically meaningful indicators that embed critical aspects of the life cycle carbon emission structure, enabling a more nuanced prediction than is possible with A and F alone.
The second pillar of the rationale addresses a fundamental mathematical limitation of traditional regression models when applied to this domain. Standard linear models or simplified formulas often treat parameters like area (A) and floor count (F) as independent variables, operating under the assumption of linearity and additivity [29]. However, the impact of building area on carbon emissions is not constant; it is intrinsically modulated by the building’s height. This interaction effect is a quintessential nonlinear relationship that linear models fail to capture effectively.
The engineered features LPA and HAR are, in essence, predefined nonlinear interaction terms. The MLP neural network, while capable of learning such interactions implicitly, benefits significantly from being guided by these semantically meaningful transformations, especially in a low-dimensional, small-sample scenario. By explicitly providing LPA = F/A and HAR = F²/A, the model is relieved from the burden of discovering these specific functional forms from limited data. This reduces model complexity, accelerates convergence, and enhances generalization [30]. It effectively linearizes a core piece of the underlying nonlinear problem, allowing the MLP to focus on learning higher-order complexities. Thus, this feature engineering is not merely an enhancement but a mathematical prerequisite for achieving high predictive accuracy with a lightweight model when working with constrained input dimensions.
In summary, the theoretical basis for LPA and HAR is twofold: (1) they embed physically significant descriptors of building morphology that are intrinsically linked to lifecycle carbon emissions, and (2) they explicitly introduce critical nonlinear interactions between area and height, thereby overcoming a fundamental shortcoming of traditional linear modeling approaches and providing a robust input structure for the subsequent MLP network.
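To make this transformation concrete, the following minimal sketch derives the two engineered features from a tabular dataset; the column names (area, floors) are assumptions for illustration, not the paper’s actual schema.

```python
import pandas as pd

def add_engineered_features(df: pd.DataFrame) -> pd.DataFrame:
    """Append the two composite indicators described above."""
    out = df.copy()
    out["LPA"] = out["floors"] / out["area"]        # layers per unit area, F/A
    out["HAR"] = out["floors"] ** 2 / out["area"]   # height-to-area ratio, F^2/A
    return out
```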
Geographical climate coding: the “cold”, “mild”, and “hot summer/cold winter” climate zones are mapped to the numerical variables 0/1/2, which is convenient for modeling; the main source of this information is the city code, as shown in Table 2 (Climate coding table):
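A minimal illustration of this encoding step follows; the zone-to-integer assignment here mirrors the order listed above, but the authoritative mapping is the one given in Table 2.

```python
# Zone-to-code mapping assumed from the listed order "cold/mild/hot summer/cold winter" -> 0/1/2.
CLIMATE_CODE = {"cold": 0, "mild": 1, "hot summer/cold winter": 2}

def encode_climate(zone: str) -> int:
    """Map a climate-zone label to its numeric model input R."""
    return CLIMATE_CODE[zone]
```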
All input features are normalized before training:

$$x' = \frac{x - \mu}{\sigma}$$

where $x$ is the original data, $x'$ is the scaled data, $\mu$ is the sample mean, and $\sigma$ is the standard deviation.
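In practice, this Z-score scaling can be performed with scikit-learn’s StandardScaler, as in the sketch below (an assumed implementation; the paper does not name the library used for this step, and the toy rows are illustrative).

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

X_train = np.array([[12000.0, 6], [48000.0, 22], [30000.0, 15]])  # toy [A, F] rows
scaler = StandardScaler()                       # learns per-feature mean and std
X_train_scaled = scaler.fit_transform(X_train)  # fit on training data only
# At inference, reuse the training statistics to avoid leakage:
# X_new_scaled = scaler.transform(X_new)
```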
3.2. Model Structure Design
3.2.1. Network Structure Configuration
In this study, a lightweight multilayer perceptron (MLP) neural network is designed and implemented for predicting carbon emissions in low-dimensional building scenarios [31,32,33]. The model architecture comprises an input layer, three hidden layers, and an output layer. The input layer accepts five feature variables as inputs. The three hidden layers consist of fully connected neurons with sizes set to 64, 32, and 16, respectively, to facilitate the layer-by-layer extraction of higher-level feature representations.
The network architecture was determined using an empirical approach, guided by the specific constraints of the problem. Given the limited training sample size (N = 150), automated hyperparameter search techniques, such as grid search or Bayesian optimization, were deemed unsuitable due to risks of high computational cost and result instability. Consequently, a pragmatic strategy combining manual tuning with preliminary validation was adopted for architecture selection. This strategy primarily prioritizes ensuring the model’s generalization capability within low-dimensional, small-sample scenarios. The selected layered structure [64, 32, 16] aligns with common empirical configurations used in the deep learning community for tabular data, aiming to achieve effective feature learning while maintaining controllable model complexity. In terms of design principles, this architecture embodies the core objective of balancing expressive power with generalization performance. The initial layer, comprising 64 neurons, provides sufficient capacity to nonlinearly map the complex interactions among the input features: building area, number of floors, region, and the structural parameters LPA and HAR. Subsequent hidden layers, with 32 and 16 neurons, progressively abstract features and compress information. This progressively decreasing structure facilitates the extraction of higher-order representations pertinent to carbon emissions from the raw data. Simultaneously, the relatively lightweight three-layer design inherently acts as a form of regularization, proactively mitigating overfitting issues that are prone to occur with small datasets when model complexity is excessive. This design establishes a structural foundation that enhances the model’s robustness.
To enhance nonlinear modeling capability while ensuring stability during training on low-dimensional samples, the Swish activation function is employed in the hidden layers. The Swish function offers smoother gradients compared to the ReLU function, enabling more stable gradient propagation and feature extraction under limited data conditions. The output layer consists of a single neuron, which outputs the predicted value of total building carbon emissions (unit: tCO2) or carbon intensity per unit floor area (unit: tCO2/m2).
- 1. Input layer

The input feature variables are:

$$\mathbf{x} = [A, F, R, \mathrm{LPA}, \mathrm{HAR}]$$

where $A$ is the floor area, $F$ is the number of floors above ground, $R$ is the geographical climate code (numeric), $\mathrm{LPA} = F/A$, and $\mathrm{HAR} = F^2/A$.
- 2. Hidden layer structure (3 layers)

Let the weight matrix of the $l$-th layer be $W^{(l)}$, the bias vector be $b^{(l)}$, and the activation function be $f(\cdot)$ (Swish function). The dimensions of each layer are:

Input layer → Hidden layer 1: 5 → 64;
Hidden layer 1 → Hidden layer 2: 64 → 32;
Hidden layer 2 → Hidden layer 3: 32 → 16.

Activation function:

$$f(x) = x \cdot \sigma(\beta x)$$

where $\beta$ is a learnable scaling parameter and $\sigma(\cdot)$ is the sigmoid function.
- 3. Output layer

$$\hat{y} = W^{(4)} h^{(3)} + b^{(4)}$$

where $\hat{y}$ is the predicted carbon emissions (tCO2).
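The architecture above can be expressed compactly in Keras. The sketch below is a minimal, assumed implementation: the L2 coefficient is illustrative (the paper describes it as adaptive), while the layer sizes, Swish activations, and Dropout rate follow Sections 3.2.1 and 3.2.3.

```python
import tensorflow as tf
from tensorflow.keras import layers, regularizers

def build_mlp(l2_coef: float = 1e-4) -> tf.keras.Model:
    """5 inputs [A, F, R, LPA, HAR] -> 64/32/16 Swish hidden layers -> 1 output (tCO2)."""
    model = tf.keras.Sequential([
        layers.Dense(64, activation="swish", input_shape=(5,),
                     kernel_regularizer=regularizers.l2(l2_coef)),
        layers.Dropout(0.3),
        layers.Dense(32, activation="swish",
                     kernel_regularizer=regularizers.l2(l2_coef)),
        layers.Dropout(0.3),
        layers.Dense(16, activation="swish",
                     kernel_regularizer=regularizers.l2(l2_coef)),
        layers.Dropout(0.3),
        layers.Dense(1),   # predicted total carbon emissions
    ])
    return model
```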
3.2.2. Loss Function and Optimizer
During model training, the Huber loss function is selected as the objective function to enhance training robustness. The Huber loss integrates the advantages of the mean square error (MSE) and the mean absolute error (MAE). Specifically, it behaves similarly to MSE for small errors, ensuring differentiable gradient continuity, while for large errors it approximates MAE, which is less sensitive to outliers. This characteristic reduces the negative impact of extreme samples on model training. In this study, the threshold parameter δ for the Huber loss is set to 1500, a value chosen to align with the data magnitude and balance the penalty imposed on large errors. The threshold δ = 1500 was determined by considering the data statistics: the mean carbon emission is 98,756 tCO2, the standard deviation is 102,345 tCO2, and the data range is substantial (1474 to 665,233 tCO2). This value, approximately 1.5% of the data’s standard deviation, avoids excessive sensitivity to small errors while effectively controlling the influence of outliers. The input features are normalized using the Z-score method described in Section 3.1.2, and the output magnitude must be consistent with this scaling. The value δ = 1500 corresponds to a reasonable quantile within the normalized error distribution.
$$L_\delta(y, \hat{y}) = \begin{cases} \dfrac{1}{2}(y - \hat{y})^2, & |y - \hat{y}| \le \delta \\ \delta\,|y - \hat{y}| - \dfrac{1}{2}\delta^2, & |y - \hat{y}| > \delta \end{cases}$$

where $y$ is the actual value, $\hat{y}$ is the predicted value, and $\delta$ takes the value of 1500 (when $\delta$ < 1000 the model is sensitive to noise, $\delta$ > 2000 ignores important error information, and 1500 is the optimal compromise point).
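A corresponding training configuration might look as follows. Keras provides the Huber objective directly; the Adam optimizer and learning rate here are assumptions, as the paper defers the exact settings to Table 4.

```python
import tensorflow as tf

model = build_mlp()  # from the architecture sketch in Section 3.2.1
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3),  # assumed settings
    loss=tf.keras.losses.Huber(delta=1500.0),                # delta per the text above
    metrics=["mae"],
)
```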
3.2.3. Regularization and Dropout
- 1. L2 regularization: add a weight decay term to the loss function:

$$L_{\mathrm{total}} = L_\delta + \lambda \sum_{l} \lVert W^{(l)} \rVert_2^2$$

where $\lambda$ is the adaptive regularization coefficient (adjusted according to the importance of features).
- 2. Dropout: discard the neuron outputs of hidden layers 1–3 with probability p = 0.3 during training (no dropout is applied at inference).
3.2.4. Complete Training Model Expression
$$h^{(l)} = f\left(W^{(l)} h^{(l-1)} + b^{(l)}\right) \odot m^{(l)}$$

where ⊙ denotes element-by-element multiplication and $m^{(l)}$ is the Dropout mask vector (elements obey the Bernoulli distribution B(1, 0.7)).
3.3. Transfer Learning Strategy Design
3.3.1. Pre-Training Data Sources
To mitigate the challenges associated with training on small samples, this study employs a transfer learning strategy. The model is first pre-trained on a large-scale dataset compiled from publicly available building energy consumption and carbon emission case studies, following the methodological framework advocated by the China Association of Building Energy Efficiency (CABEE). Although this integrated dataset is not directly published by CABEE, its provenance is clearly documented. It primarily originates from the Donghe Building Carbon Emission Calculation Platform, which was jointly developed by Southeast University and China Construction Group. The data, calculated using the life cycle assessment (LCA) methodology, encompass over 10,000 samples covering various public building types across China’s major climate zones. The key characteristics of the pre-training dataset are summarized in Table 3.
This large-scale pre-training enables the model to learn robust, generalized feature representations that capture the relationship between basic building parameters and carbon emissions.
3.3.2. Fine-Tuning Method
Following pre-training, the model is fine-tuned on the smaller, specific local dataset (N = 150). A layer-wise fine-tuning strategy is adopted. The weights of the lower layers ($W^{(1)}, b^{(1)}, W^{(2)}, b^{(2)}$) are frozen, as these layers capture universal, low-level features (e.g., fundamental relationships between area, height, and emissions). Only the weights of the higher layers ($W^{(3)}, b^{(3)}, W^{(4)}, b^{(4)}$) are updated during fine-tuning. This approach allows the model to adapt its high-level reasoning to the specific distribution of the local data while preserving the general knowledge acquired during pre-training, effectively reducing overfitting and improving convergence stability.

Fine-tuning: freeze the first two layers’ weights ($W^{(1)}, b^{(1)}, W^{(2)}, b^{(2)}$) and optimize only the subsequent parameters:

$$\theta_{\mathrm{tune}} = \{W^{(3)}, b^{(3)}, W^{(4)}, b^{(4)}\}$$
During fine-tuning, the model progressively incorporates the data distribution of local samples into its higher-level parameters, while preserving the generalized features captured in the initial layers during pre-training. This "freeze lower layers, adjust higher layers" strategy offers two key benefits in low-dimensional settings: it substantially shortens training time by updating only a subset of parameters, and it alleviates overfitting and training instability, leading to faster and more reliable convergence. By combining pre-training on large datasets with targeted fine-tuning on local samples, the model retains robust feature extraction capabilities while adapting its final layers to domain-specific characteristics. In practice, this approach effectively addresses common small-data challenges—such as model instability, slow convergence, and training difficulty—and enhances the generalizability and practical utility of the proposed carbon emission prediction model, offering a viable path toward efficient building carbon estimation under low-dimensional constraints.
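A hedged sketch of this “freeze lower layers, adjust higher layers” step is shown below, reusing build_mlp from Section 3.2.1; the weight-loading path and the fine-tuning learning rate are illustrative assumptions.

```python
import tensorflow as tf

model = build_mlp()                      # same architecture as pre-training
# model.load_weights("pretrained.weights.h5")   # hypothetical path to pre-trained weights

dense = [l for l in model.layers if isinstance(l, tf.keras.layers.Dense)]
for layer in dense[:2]:                  # freeze W(1), b(1) and W(2), b(2)
    layer.trainable = False

model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),   # smaller LR for fine-tuning
              loss=tf.keras.losses.Huber(delta=1500.0))
# model.fit(X_local, y_local, epochs=200, validation_split=0.1111)
```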
3.4. Model Evaluation Methods and Parameters
To comprehensively evaluate the effectiveness and generalization capability of the proposed MLP model for building carbon emission prediction, a rigorous evaluation framework incorporating a multidimensional performance index system is implemented during the model training stage. The evaluation process encompasses three key aspects: the training strategy, the validation mechanism, and the error measurement methodology. This multi-faceted approach aims to ensure the stability, reliability, and interpretability of the model’s performance.
3.4.1. Training and Validation Mechanism
Given the limited local sample size of 150 instances, a 10-fold cross-validation (K = 10) strategy is adopted for model training and performance validation to enhance data utilization efficiency and evaluation robustness. The specific operational workflow is illustrated in Figure 4.
A nested data partitioning strategy, integrating 10-fold cross-validation with an 8:1:1 data split ratio, was meticulously implemented to ensure a robust and unbiased evaluation of the model’s performance on the limited sample data. This approach maximizes data utility and yields a reliable estimate of the model’s generalizability.
This mechanism operates through a two-tiered process. First, an outer 10-fold cross-validation loop is executed. The complete dataset of 150 samples is partitioned into 10 mutually exclusive subsets (or folds) of equal size (15 samples each) using the KFold (n_splits = 10) method. In each iteration, one unique fold is held out as the external test set for the final evaluation of the model’s generalization capability. The remaining nine folds (135 samples) constitute the interim training pool for that iteration. Subsequently, within each outer loop iteration, an internal 8:1:1 split is performed. This split is achieved automatically during model training by setting the validation_split parameter to 0.1111 in Keras’s fit() function. This setting instructs the training routine to reserve approximately 11.11% (1/9) of the 135 samples from the current iteration’s training pool to create an internal validation set. This internal validation set is used for real-time performance monitoring and for triggering the Early Stopping callback during training. The remaining 88.89% (8/9) of the pool serves as the actual training set for updating the model weights. The effective global data allocation is as follows: the training set comprises approximately 120 samples (80% of the total data), the internal validation set about 15 samples (10%), and the external test set about 15 samples (10%), achieving the intended global 8:1:1 split ratio. This method maximizes the utilization of limited sample information while effectively reducing the impact of chance data distribution on the model results, thereby enhancing the representativeness and stability of the assessment. The average error metrics obtained through cross-validation provide a more accurate reflection of the model’s performance on unseen data.
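The following sketch reproduces this two-tier mechanism (outer KFold loop plus Keras’s validation_split); it assumes build_mlp from Section 3.2.1 and numeric arrays X, y holding the 150 samples, with the epoch budget chosen illustratively.

```python
import numpy as np
import tensorflow as tf
from sklearn.model_selection import KFold

def evaluate_cv(X: np.ndarray, y: np.ndarray) -> float:
    """Outer 10-fold CV with an internal 1/9 validation split, as described above."""
    fold_mae = []
    for train_idx, test_idx in KFold(n_splits=10, shuffle=True, random_state=42).split(X):
        model = build_mlp()
        model.compile(optimizer="adam", loss=tf.keras.losses.Huber(delta=1500.0),
                      metrics=["mae"])
        early = tf.keras.callbacks.EarlyStopping(patience=30, restore_best_weights=True)
        model.fit(X[train_idx], y[train_idx],
                  validation_split=0.1111,   # 1/9 of the 135-sample pool (~15 samples)
                  epochs=500, callbacks=[early], verbose=0)
        _, mae = model.evaluate(X[test_idx], y[test_idx], verbose=0)
        fold_mae.append(mae)
    return float(np.mean(fold_mae))          # average MAE over the 10 outer folds
```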
3.4.2. Parameter Setting
The hyperparameters for the multilayer perceptron (MLP) model were selected using a pragmatic, manual tuning approach, guided by the dual constraints of small-sample learning (N = 150) and the strategic implementation of large-scale pre-training. This methodology prioritized stability, reproducibility, and efficient knowledge transfer, thereby avoiding computationally expensive automated searches (e.g., grid search or Bayesian optimization). The selection process was fundamentally shaped by the two-stage learning framework: initial pre-training on a large, diverse dataset followed by fine-tuning on the small local sample.
The parameter configurations for the multi-layer perceptron model are summarized in Table 4.
To assess the prediction accuracy and goodness of fit from multiple perspectives, three standard regression evaluation metrics are employed: the mean absolute error (MAE), the root mean square error (RMSE), and the coefficient of determination (R2). These metrics are defined as follows:

$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n} \left| y_i - \hat{y}_i \right|$$

$$\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2}$$

$$R^2 = 1 - \frac{\sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2}{\sum_{i=1}^{n} \left( y_i - \bar{y} \right)^2}$$

where $y_i$ is the actual value, $\hat{y}_i$ is the predicted value, and $\bar{y}$ is the average value.
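These three metrics can be computed directly with scikit-learn, as in the brief sketch below.

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

def regression_report(y_true: np.ndarray, y_pred: np.ndarray) -> dict:
    """Return the three evaluation metrics used in this study."""
    return {
        "MAE": mean_absolute_error(y_true, y_pred),
        "RMSE": float(np.sqrt(mean_squared_error(y_true, y_pred))),
        "R2": r2_score(y_true, y_pred),
    }
```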
3.4.3. Specific Approach and Optimization Strategy
To ensure robust and unbiased evaluation under conditions of limited sample size and inherent data variability, a comprehensive validation methodology was employed throughout the modeling process. All input features were standardized using Z-score normalization prior to training to mitigate scale-related bias. During cross-validation, model parameters and prediction errors were systematically recorded to analyze performance variations across data partitions. Evaluation metrics were computed strictly on held-out test sets, and the proposed MLP model was explicitly compared against baseline models, including linear regression, random forest, and support vector regression, to objectively demonstrate its predictive advantage. Furthermore, the statistical properties of prediction errors, such as mean error, standard deviation, and skewness, were quantitatively examined to assess model robustness across varying data conditions. This multi-faceted validation framework ensures a thorough assessment across three critical dimensions: predictive accuracy, operational stability, and practical relevance. The systematically gathered performance data also provides a solid empirical basis for subsequent comparative analysis and visual interpretation in the results and discussion sections.
4. Results
4.1. Dataset Construction and Outlier Handling
4.1.1. Dataset Construction and Division
To validate the effectiveness of the multilayer perceptron (MLP) model for building carbon emission prediction, a unified dataset was constructed by integrating the carbon emission sample data compiled in Section 3.1.1 with typical case studies disclosed by the China Association of Building Energy Efficiency (CABEE). The dataset comprises 150 records encompassing floor area, number of above-ground floors, geographic climate code, and carbon emissions. These data were standardized and feature-engineered to form a complete input matrix. To enhance generalization performance, the dataset was partitioned into training, validation, and test sets using an 8:1:1 ratio, ensuring objective and robust model evaluation. The distribution characteristics of the data features are summarized in Table 5. This partitioning strategy accounted for geographical distribution and building size balance to mitigate potential data bias.
4.1.2. Data Cleaning Procedures and Outlier Handling
To ensure model robustness and generalization capability, a systematic data cleaning process was implemented, focusing on outlier identification and handling. Initially, completeness checks were performed on the dataset, confirming that all 150 samples contained complete values for key parameters (area A, number of floors F, and total carbon emissions E). Subsequently, a combined approach utilizing statistical visualization and model-driven analysis was employed to identify data points potentially detrimental to model training. To better illustrate the distribution characteristics and dispersion of core parameters, box plots for each parameter were utilized (Figure 5).
As illustrated in Figure 5, the boxplots graphically depict the median, quartiles, and extremes of the data. Several data points for area (A) and total carbon emissions (E) fall beyond the whiskers (defined as 1.5 times the interquartile range) and reside far from the main distribution zone, classifying them as statistical outliers. This is an expected characteristic of real-world building data, which often includes extreme cases, such as very large public edifices.
However, statistical anomalies do not necessarily constitute harmful noise in the modeling process. To identify samples that pose significant challenges to prediction accuracy, an in-depth error interaction analysis was conducted. Following initial model training, the correlation between predicted values and absolute errors was examined.
Figure 6 reveals that the prediction errors for specific samples far exceed the average level. For instance, points a and b exhibit exceptionally high relative errors despite their low absolute carbon emissions, as the model’s absolute prediction errors appear disproportionately large relative to their own magnitude. This indicates the model struggles to accurately capture the carbon emission patterns of such small-scale buildings. Without intervention, the training process may become dominated by these high-leverage points.
Based on this analysis, a targeted handling strategy was formulated:
Identification and Marking: Samples such as points a and b, characterized by high relative error and low emission values, were clearly marked as high-impact outliers.
Handling Approach: To avoid bias from simple deletion and enhance adaptability to complex data distributions, an algorithmic augmentation strategy was adopted. Specifically, during final MLP model training, lower sample weights were applied to labeled high-impact outliers. This reduced their contribution to the loss function, enabling the model to focus on learning intrinsic patterns from the main data rather than fitting special cases.
Results Validation: After implementing this weighted training strategy, the final model achieved outstanding test set performance (MAE = 4160 tCO2, R2 = 0.966). Crucially, prediction stability for small-scale buildings improved while maintaining overall generalization capability, demonstrating that the strategy successfully balances respect for data diversity with robust model training.
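A minimal sketch of the down-weighting described under “Handling Approach” is given below; the weight value 0.3 and the outlier indices are illustrative assumptions, as the paper does not report the exact weights applied.

```python
import numpy as np

# Hypothetical indices of the flagged high-impact outliers (e.g., points a and b).
outlier_idx = np.array([17, 42])

weights = np.ones(len(y_train))          # y_train: local training targets
weights[outlier_idx] = 0.3               # assumed reduced weight for flagged samples

model.fit(X_train, y_train, sample_weight=weights,  # Keras scales each sample's loss term
          epochs=200, verbose=0)
```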
4.2. Model Training Process
Figure 7 illustrates the trend of the error percentage of the multilayer perceptron model throughout the training process. It can be observed that, as the number of training epochs increases, the error percentage decreases rapidly and gradually stabilizes, indicating effective model convergence.
To further monitor convergence behavior and potential overfitting, the training and validation loss curves are plotted and analyzed across the training epochs. As illustrated in Figure 8, both the training loss (blue solid line) and the validation loss (red dashed line) exhibit a consistent downward trend during initial stages before plateauing. The absence of significant divergence between the two curves, combined with the early stopping mechanism (patience = 30) restoring weights from the epoch with the lowest validation loss, demonstrates that the model achieved stable convergence without severe overfitting. This behavior validates the effectiveness of the regularization strategies (L2 and Dropout) and the selected network architecture in maintaining generalization performance on the limited dataset.
Additionally, as shown in Figure 9 (Actual vs. Predicted Carbon Emissions), the overall difference between actual and predicted carbon emissions for office buildings during training is small, indicating high prediction accuracy.
4.3. Comparative Analysis of Model Performance
4.3.1. Comparison of Model Performance Before and After Fine-Tuning
This section comprehensively compares the performance of the transfer learning approach with training from scratch using only the limited local dataset, while also discussing potential domain shift issues between the pre-training and fine-tuning datasets. The analysis aims to objectively evaluate how fine-tuning impacts model adaptation and generalization capabilities. The Multilayer Perceptron (MLP) model was evaluated under three key scenarios to isolate the effects of transfer learning:
Direct application of pre-trained model: Utilizing weights initialized from large-scale pre-training without fine-tuning.
Fine-tuned model: Following domain-specific adaptation on the local dataset.
Training from scratch on local data only: training the same architecture solely on the 150 local samples. Compared to training from scratch, the fine-tuned model achieved an average absolute error reduction of approximately 49.4% (4160 vs. 8223 tCO2), demonstrating the advantage of transfer learning in leveraging pre-trained knowledge.
Table 6 summarizes the comparative results quantifying the impact of fine-tuning.
The pre-training dataset, sourced from Donghe Software’s (Version V3.5) case repository, encompasses diverse building types (e.g., steel and masonry-concrete structures) across multiple climate zones, yielding a generalized but heterogeneous feature representation. In contrast, the fine-tuning set consists of only 150 reinforced concrete buildings from a specific region, representing a more homogeneous and specialized domain. This discrepancy induces both covariate and label shifts: the input feature distributions, such as structural properties and spatial configurations, differ from those in the pre-training corpus, and the resulting carbon emission profiles exhibit distinct characteristics. Although the pre-trained model demonstrates reasonable generalizability, its initial predictions on the target domain are inaccurate, yielding a high mean absolute error (MAE = 11,506 tCO2e). Fine-tuning addresses this domain gap by adapting the model’s higher-level parameters to the target data distribution, substantially improving predictive accuracy. The process exhibited stable convergence, as indicated by the synchronous decline in training and validation loss (Figure 8), confirming that adaptation was achieved without overfitting despite limited samples. This suggests that pre-training provided a robust foundational prior, which fine-tuning efficiently specialized for the reinforced concrete domain.
In summary, the performance gap between the pre-trained and fine-tuned models stems primarily from domain shift, arising from differences in building typology and climatic representation between the two datasets. The transfer learning strategy, particularly fine-tuning, effectively mitigates this issue by transferring knowledge from a broad, public dataset to a specialized local context. These results highlight the value of domain adaptation in settings with data distribution mismatch, though further validation across larger and more varied datasets remains necessary to generalize the findings. Ultimately, this approach provides a practical pathway to leverage large-scale public data while maintaining relevance in localized, data-constrained scenarios.
4.3.2. Performance Comparison Among Different Models
To comprehensively evaluate the predictive capabilities of the MLP model, we used the same dataset and selected the following three models as benchmarks for comparison:
Linear regression model (LR): a simplified carbon emission formula based on area and number of floors;
Random Forest Regression (RF): a tree-based ensemble model with some nonlinear fitting ability;
Support Vector Regression (SVR): representative of linear fitting in high-dimensional space.
The performance of each model on the test set is comparatively illustrated in Figure 10.
The results demonstrate that the proposed MLP model outperforms all baseline models across all evaluation metrics. Notably, it achieves a 54.7% reduction in Mean Absolute Error (MAE) compared to traditional Linear Regression and attains a coefficient of determination (R2) of 0.966, reflecting its strong nonlinear modeling capacity and generalization ability. The quantitative results are summarized in Table 7.
4.4. Validation of High-Rise Building Adaptation
To validate the model’s applicability to high-carbon-emission building scenarios, buildings exceeding 15 floors above ground were classified as a ‘high-rise group’ for specialized validation. The prediction results for low-rise and high-rise buildings are presented in Figure 11a and Figure 11b, respectively.
A stable fit is observed between predicted and actual values for high-rise buildings, with an R2 value of 0.957, which is higher than the R2 of 0.889 achieved for low-rise buildings. Comparison of Figure 11c,d reveals that the error percentage for high-rise buildings exhibits less fluctuation compared to low-rise buildings, indicating better model adaptation. Furthermore, comparison of Figure 11a,b indicates that the MLP model achieves an average error percentage of 8.1% for the high-rise group, significantly outperforming other models, such as the linear model, in predicting high-rise building emissions. This demonstrates the model’s capability to handle complex building structures through nonlinear enhanced feature combinations (e.g., HAR and LPA), confirming its suitability for estimating emissions from high-emission projects like office complexes and large public buildings.
4.5. Model Interpretability Analysis (SHAP) and Ablation Studies
To elucidate the model’s decision-making logic and the contribution of key variables, the SHAP (SHapley Additive exPlanations) value analysis method was employed to interpret the carbon emission predictions. The relative contribution of each input variable is illustrated in Figure 12.
The analysis indicates that building floor area and number of stories are the primary factors influencing carbon emissions. Geographic characteristics nonlinearly influence emissions by affecting energy consumption patterns and building material choices via climatic differences. The engineered variables, Height-to-Area Ratio (HAR) and Layers per Unit Area (LPA), enhance the model’s ability to characterize building carbon emissions in both planar and spatial dimensions. As shown in Figure 12, these two parameters collectively improve prediction accuracy by 5.8%. Particularly in high-rise scenarios (HAR > 15), prediction errors are controlled within 8.1%. For example, designers can potentially reduce carbon emissions from super-tall buildings by approximately 12% by lowering the HAR value from 20 to 15. This approach of quantifying spatial density into computable parameters overcomes limitations of traditional linear models, advancing carbon emission prediction from a simple area-to-story ratio relationship to a correlation incorporating spatial efficiency and carbon intensity. It thereby provides directly actionable quantitative metrics for low-carbon building design.
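For reference, a SHAP attribution of this kind can be produced as sketched below; KernelExplainer is one generic choice, used here as an assumption since the paper does not specify which explainer variant was applied, and the model/data names follow the earlier sketches.

```python
import shap

feature_names = ["A", "F", "R", "LPA", "HAR"]
background = shap.sample(X_train_scaled, 50)        # small background set for speed
explainer = shap.KernelExplainer(
    lambda x: model.predict(x, verbose=0).ravel(),  # wrap the trained MLP
    background,
)
shap_values = explainer.shap_values(X_test_scaled)
shap.summary_plot(shap_values, X_test_scaled, feature_names=feature_names)
```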
To quantitatively assess the individual importance of each input feature in the model and provide a robust theoretical basis for feature construction, we conducted a systematic ablation study. Three simplified models were established: a model containing only the original features, a model excluding geographic features, and a model integrating only area and layer count features while keeping other conditions constant. Their specific performance is shown in Table 8.
The ablation study quantitatively elucidates the distinct and complementary roles of each feature category in the model’s predictive framework. The most pronounced performance degradation is observed upon the removal of the geographical climate code (R), as evidenced by a substantial increase in MAE (+1650 tCO2) and a marked decrease in R2 (−0.054). This underscores the fundamental role of regional climate as a primary driver of building carbon emissions, primarily governing operational energy consumption patterns for heating and cooling. The removal of the engineered features, Layers per Unit Area (LPA) and Height-to-Area Ratio (HAR), results in a significant though comparatively smaller performance decline (ΔMAE = +760 tCO2, ΔR2 = −0.025). This confirms that these composite variables capture essential, non-redundant information pertaining to building spatial configuration and volumetric density, which are not fully encapsulated by the raw features of area (A) and number of floors (F) alone. The precipitous drop in performance when utilizing only A and F (ΔMAE = +2570 tCO2) highlights the synergistic effect of the feature set; the model’s high accuracy is contingent upon the confluence of basic parameters, geographical context, and morphological indicators.
4.6. Model Output Examples and Visualization
The trained multilayer perceptron model was further developed into a rapid calculation tool using Python to predict carbon emissions of public buildings during early project stages. In the deployment scenario, users can input parameters and obtain outputs as summarized in Table 9.
The interface of the rapid calculation tool is illustrated in Figure 13.
Users select the number of floors, area, and climate zone for the target building in the input panel on the left. Upon clicking ‘Calculate Carbon Emissions,’ the tool displays the corresponding predicted values in the output panel on the right, along with reference suggestions to support decision-making for designers during early building project stages.
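The tool’s core prediction call can be sketched as follows; the function name and the fitted scaler/model objects are hypothetical stand-ins for the tool’s internals, reusing the feature pipeline described in Section 3.1.

```python
def predict_emissions(area_m2: float, floors: int, climate_code: int) -> float:
    """Return predicted total carbon emissions (tCO2) from the three basic inputs."""
    lpa = floors / area_m2
    har = floors ** 2 / area_m2
    x = scaler.transform([[area_m2, floors, climate_code, lpa, har]])  # fitted StandardScaler
    return float(model.predict(x, verbose=0)[0, 0])                    # trained Keras MLP

# Example: a 20-storey, 25,000 m2 office in a cold region (code 0):
# predict_emissions(25000.0, 20, 0)
```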
5. Discussion
This study successfully developed a lightweight MLP model for predicting public building carbon emissions under the significant constraint of low-dimensional, early-stage design data. The model demonstrated superior performance compared to traditional linear and other benchmark models, achieving an MAE of 4160 tCO2 and an R2 of 0.966. The integration of feature-engineered variables (HAR, LPA) and a transfer learning strategy proved effective in enhancing nonlinear modeling capacity and mitigating overfitting on a small dataset (N = 150). Despite these promising results, several limitations must be acknowledged to objectively assess the model’s applicability and to guide future research.
- (1) Oversimplification of Climatic and Regional Representation
A primary limitation stems from the coarse classification of geographical and climatic influences. The model input relies on a simplified climate code (R: 0, 1, 2) that groups vast and diverse regions (e.g., "Cold region" encompassing both Beijing and Shenyang). This approach fails to capture critical intra-regional variations in microclimates, which significantly impact building energy consumption for heating and cooling. Impact on Accuracy: This simplification likely introduces a source of error. For instance, the heating demand and associated carbon emissions for a building of identical size and form would differ between a coastal city like Qingdao and a more continental city like Beijing, even though they share the same climate code in this model. The model’s predictive accuracy could be dampened in areas that are climatically transitional or atypical within their assigned zone. Path for Enhancement: Future work should incorporate more granular, continuous climatic parameters. Utilizing actual meteorological data, such as Heating Degree Days (HDD) and Cooling Degree Days (CDD), or higher-resolution climate zoning would allow the model to learn the nuanced relationship between local climate severity and operational carbon emissions more accurately.
- (2) Challenges in Model Generalizability and Scalability
The model’s performance, while robust on the tested dataset, raises valid concerns regarding its generalizability to broader contexts. This limitation is twofold: ① Geographical and Typological Transferability: The model was trained and validated primarily on a dataset of office buildings from specific Chinese cities. Its performance on other building types (e.g., hospitals, schools) or in entirely different geographical and regulatory contexts (e.g., European or North American building stocks) remains unproven. Building designs, construction standards, and operational patterns vary greatly across regions, and a model trained on one context may not translate effectively to another. ② Temporal Generalizability: The model is a snapshot based on current construction practices and energy systems. As the power grid decarbonizes and building energy efficiency standards evolve, the underlying relationship between building form and operational carbon emissions will change. A model trained on current data may become progressively less accurate without mechanisms for temporal adaptation.
To address these challenges, future research should prioritize external validation on diverse, international datasets. Furthermore, incorporating a mechanism for continuous learning or designing the model to be sensitive to dynamic parameters like the grid carbon intensity factor (as discussed next) would significantly improve its long-term utility and scalability.
- (3) Neglect of Spatial and Temporal Variations in Grid Carbon Factors
Perhaps the most significant limitation for a life-cycle perspective is the treatment of operational carbon emissions. The model implicitly assumes a static, average carbon emission factor for electricity consumption across a broad climate zone. In reality, the carbon intensity of the electrical grid (gCO2eq/kWh) exhibits substantial spatial heterogeneity (even within a single country) and significant temporal variation (by time of day and season). Impact on Prediction Validity: This assumption can lead to substantial inaccuracies. A building’s operational carbon footprint is not just a function of its energy use but also of when and where that energy is consumed. For example, an all-electric building using air conditioning during peak afternoon hours in a grid with high solar penetration will have a lower carbon footprint than the same building consuming the same amount of energy at night when the grid relies more on fossil fuels. Our model, in its current form, cannot capture this crucial dynamic. Towards a More Robust Approach: To enhance the physical realism and accuracy of predictions, future iterations of the model should integrate time-sensitive grid carbon factor data. This could involve using historical average data for different regional grids as a more refined input or, ideally, developing a model that can integrate with smart grid data for real-time or seasonal carbon accounting. This advancement would shift the prediction from a purely architectural form-based estimate to a more comprehensive operational carbon assessment.
- (4) Other Limitations and Future Research Directions
Beyond the core limitations above, other areas warrant attention. The model’s performance, while excellent for a low-dimensional scenario, is ultimately constrained by the limited feature set. Incorporating additional early-stage parameters, such as building shape factor or primary orientation, could further improve accuracy. Furthermore, the practical tool, while a valuable contribution, would benefit from a more user-friendly interface and integration with common architectural design software (e.g., as a plug-in for BIM platforms) to lower the barrier to adoption by practitioners. In conclusion, the proposed MLP model presents an effective and practical solution for a well-defined problem: rapid carbon estimation with minimal inputs. The limitations discussed here are not flaws but rather clear signposts for the next stages of research. By addressing the oversimplification of climate zones, rigorously testing generalizability, and integrating dynamic grid factors, subsequent models can build upon this foundation to achieve even greater accuracy, robustness, and practical impact on sustainable building design globally.
6. Conclusions
To address the challenge of limited parameter dimensions and small sample sizes in early-stage building carbon emission prediction, this study developed a lightweight modeling approach based on a multilayer perceptron (MLP) neural network. The model integrates feature engineering and interpretability analysis to achieve robust prediction performance. The main conclusions are as follows:
The proposed MLP model, trained on 150 samples, demonstrated superior performance with a mean absolute error of 4160 tCO2 and an R2 of 0.966 on the test set, reducing the prediction error by 54.7% compared to traditional linear regression.
The model showed good adaptability to high-rise buildings (>15 floors), maintaining a mean error below 8.1%, which indicates its robustness for large-volume and structurally complex building types.
SHAP analysis confirmed floor area (51.2%) as the dominant predictor, while the novel composite indicators, HAR and LPA, collectively enhanced accuracy by 5.8%, offering quantifiable metrics for guiding spatial design in low-carbon projects.
The research outcomes were implemented into a Python-based rapid calculation tool, providing practitioners with a practical means for quick carbon estimation during preliminary design stages with minimal data input.
Despite the promising results, this study has several limitations that warrant further investigation. Firstly, the model’s development relied on a dataset of 150 samples. While strategies like regularization were employed to mitigate overfitting, the generalizability of the model needs to be more robustly validated with larger and more diverse datasets encompassing a wider range of building types, construction methods, and operational patterns. Secondly, the current model primarily relies on geometric and location parameters. Its practical accuracy could be enhanced by incorporating future building design parameters, such as envelope thermal properties and planned energy system types, once they become available in later design stages. Regarding practical usability, the developed tool lowers the barrier to entry for early carbon assessment. However, its effective integration into real-world design workflows requires consideration. The tool’s value is highest in the very early phases (e.g., schematic design) for quick benchmarking and option comparison. For definitive calculations or certification, designers must still rely on more detailed, high-fidelity simulation tools in later stages. Future work should focus on expanding the database, exploring interoperability with BIM platforms, and validating the tool’s impact on actual design decision-making to fully realize its potential in facilitating low-carbon construction.