Machine Learning-Based Carbon Compliance Forecasting and Energy Performance Assessment in Commercial Buildings
Abstract
1. Introduction
2. Literature Review and Problem Statement
2.1. Literature Review
2.2. Machine Learning Models
2.2.1. Support Vector Machines (SVMs)
2.2.2. Gaussian Process Regression (GPR)
2.2.3. Ensemble Models (LSBoost)
2.3. Gaps in Existing Literature
3. Methodology
3.1. Data Collection and Preprocessing
3.2. Feature Selection and Principal Component Analysis
4. Results and Discussion
4.1. Historical Weather Data
4.2. Evaluating Machine Learning Models
4.3. Hyperparameter Tuning
4.3.1. Gaussian Process Regression Kernel Selection
4.3.2. LSBoost Hyperparameter Selection
- Maximum tree depth: governing the complexity of each decision tree. Shallow trees may result in underfitting, whereas deeper trees have the potential to overfit the data.
- Number of learners (estimators): dictates the total number of boosting iterations. Utilizing too few learners can lead to underfitting, while an excessive number can prolong training time and heighten the risk of overfitting.
- Learning rate: scales the impact of each learner’s contribution. A smaller learning rate can enhance generalization but may necessitate a greater number of learners.
4.3.3. SVM Hyperparameter Selection
4.4. The Cost of Inaction
4.5. Carbon Non-Compliance Fees
5. Conclusions and Limitations
5.1. Conclusions
5.2. Limitations
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Nomenclature
ANN | Artificial Neural Network |
ASHRAE | American Society of Heating, Refrigeration, and Air Conditioning Engineers |
BEPSs | Building Energy Performance Standards |
BIM | Building Information Model |
BPSs | Building Performance Standards |
Btu | British Thermal Units |
CBECS | Commercial Buildings Energy Consumption Survey |
CDD | Cooling Degree Day |
COP | Coefficient of Performance |
CSNA | Climate Solutions Now Act |
DOE | Department of Energy |
EEM | Energy Efficiency Measure |
EO | Executive Order |
EUI | Energy Use Intensity |
GB | Gradient Boosting |
GHG | Greenhouse Gas |
GJ | Gigajoules |
GMM | Gaussian Mixture Model |
GPR | Gaussian Process Regression |
GT | Gigatons |
HDD | Heating Degree Day |
HVAC | Heating, Ventilation, and Air Conditioning |
LED | Light Emitting Diode |
LPD | Lighting Power Density |
MAE | Mean Absolute Error |
MLR | Multiple Linear Regression |
MMmt | Million Metric Tons |
MRMR | Minimum Redundancy Maximum Relevance |
PCA | Principal Component Analysis |
RMSE | Root Mean Square Error |
SVM | Support Vector Machine |
TRNSYS | TRaNsient SYstems Simulation |
xAI | Explainable Artificial Intelligence |
ϕ(X) | maps X non-linearly to a high-dimensional feature space |
Lε | Loss function |
σ | White noise |
Z | Validation parameter |
C | Tree regression weak learners |
References
- IPCC. AR6 (2021), Summary for Policymakers. In Climate Change 2021: The Physical Science Basis; Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change; IPCC: Geneva, Switzerland, 2021. [Google Scholar]
- NPUC. Race to Net Zero: Carbon Neutral Goals by Country; NPUC: Ridgefield, WA, USA, 2021. [Google Scholar]
- T.I.A.S. Paris Agreement to the United Nations Framework Convention on Climate Change; No. 16–1104; T.I.A.S: Tilburg, The Netherlands, 2015. [Google Scholar]
- Tracker, C.A. Warming Projections Global Update; Climate Analytics and New Climate Institute: Berlin, Germany, 2021. [Google Scholar]
- Hoegh-Guldberg, O.; Jacob, D.; Taylor, M.; Bolaños, T.G.; Bindi, M.; Brown, S.; Camilloni, I.A.; Diedhiou, A.; Djalante, R.; Ebi, K.; et al. The human imperative of stabilizing global climate change at 1.5 °C. Science 2019, 365, eaaw6974. [Google Scholar] [CrossRef]
- Betts, R.A.; Alfieri, L.; Bradshaw, C.; Caesar, J.; Feyen, L.; Friedlingstein, P.; Gohar, L.; Koutroulis, A.; Lewis, K.; Morfopoulos, C.; et al. Changes in climate extremes, fresh water availability and vulnerability to food insecurity projected at 1.5 °C and 2 °C global warming with a higher-resolution global climate model. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2018, 376, 20160452. [Google Scholar] [CrossRef] [PubMed]
- Huovila, P. Buildings and Climate Change: Status, Challenges, and Opportunities; UNEP SBCI: Nairobi, Kenya, 2007. [Google Scholar]
- IEA. Tracking Buildings 2023; IEA: Paris, France, 2023. [Google Scholar]
- Allouhi, A.; El Fouih, Y.; Kousksou, T.; Jamil, A.; Zeraouli, Y.; Mourad, Y. Energy consumption and efficiency in buildings: Current status and future trends. J. Clean. Prod. 2015, 109, 118–130. [Google Scholar] [CrossRef]
- Frequently Asked Questions (faqs)—U.S. Energy Information Administration (EIA). Available online: https://www.eia.gov/tools/faqs/faq.php?id=86&t=1 (accessed on 24 October 2024).
- EIA. U.S. Energy Information Administration—EIA—Independent Statistics and Analysis. Available online: https://www.eia.gov/environment/emissions/carbon/ (accessed on 24 October 2024).
- Bell, M. Energy efficiency in existing buildings: The role of building regulations. In Proceedings of the Construction and Building Research (COBRA) Conference, Leeds, UK, 7–8 September 2004; pp. 7–8. [Google Scholar]
- European Commission. Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee of the Regions: Healthier, Safer, More Confident Citizens: A Health and Consumer Protection Strategy: Proposal for a Decision of the European Parliament and of the Council Establishing a Programme of Community Action in the Field of Health and Consumer Protection 2007–2013 (Text with EEA Relevance); Office for Official Publications of the European Communities: Luxembourg, 2005. [Google Scholar]
- World Economic Forum. For Net-Zero Cities, We Need to Retrofit Our Older Buildings. Here’s What’s Needed. 2022. Available online: https://www.weforum.org/stories/2022/11/net-zero-cities-retrofit-older-buildings-cop27/#:~:text=Around%2080%25%20of%20the%20buildings,retrofit%20them%20for%20energy%20efficiency (accessed on 24 October 2024).
- Aksoezen, M.; Daniel, M.; Hassler, U.; Kohler, N. Building age as an indicator for energy consumption. Energy Build. 2015, 87, 74–86. [Google Scholar] [CrossRef]
- California, S. Supporting California’s Goal to Achieve Carbon Neutrality by 2045. Carbon Neutrality by 2045—Office of Planning and Research. Available online: https://opr.ca.gov/climate/carbon-neutrality.html (accessed on 24 October 2024).
- GHG Pollution Reduction Roadmap 2.0. GHG Pollution Reduction Roadmap 2.0 | Colorado Energy Office. Available online: https://climate.colorado.gov/greenhouse-gas-roadmap (accessed on 24 October 2024).
- Climate Change. DOEE. Available online: https://doee.dc.gov/service/climate-change (accessed on 24 October 2024).
- Maryland Senate Bill 528. Climate Solutions Now Act of 2022. Available online: https://mgaleg.maryland.gov/2022RS/bills/sb/sb0528E.pdf (accessed on 24 October 2024).
- Greenhouse Gas Emissions Reduction. NYSERDA. Available online: https://www.nyserda.ny.gov/Impact-Greenhouse-Gas-Emissions-Reduction (accessed on 24 October 2024).
- Reducing Greenhouse Gas Emissions—Washington State Department of Ecology. Available online: https://ecology.wa.gov/air-climate/reducing-greenhouse-gas-emissions (accessed on 24 October 2024).
- Amasyali, K.; El-Gohary, N.M. A review of data-driven building energy consumption prediction studies. Renew. Sustain. Energy Rev. 2018, 81, 1192–1205. [Google Scholar] [CrossRef]
- EnergyPlus, National Renewable Energy Laboratory, Building Technologies Office, the US Department of Energy (DOE). Available online: https://energyplus.net/downloads (accessed on 24 October 2024).
- TRNSYS, Transient System Simulation Tool, Solar Energy Laboratory, University of Wisconsin-Madison. Available online: https://sel.me.wisc.edu/trnsys/user18-resources/index.html (accessed on 24 October 2024).
- Equest, The Quick Energy Simulation Tool, the US Department of Energy (DOE). Available online: http://doe2.com/equest/index.html (accessed on 24 October 2024).
- ESP-r, University of Strathciyde, Department of Mechanicl Engineering, Scotland, UK. Available online: https://www.strath.ac.uk/research/energysystemsresearchunit/applications/esp-r/ (accessed on 24 October 2024).
- DOE-2, DOE-2 Building Energy Use Modeling, the US Department of Energy (DOE). Available online: http://doe2.com/DOE2/index.html (accessed on 24 October 2024).
- Ecotect, Autodesck Ecotect Analysis. Available online: https://www.asidek.es/en/architecture-and-engineering/autodesk-ecotect-analysis/ (accessed on 24 October 2024).
- Trane Trace® 3D Plus. Trane Commercial HVAC. Available online: https://www.trane.com/commercial/north-america/us/en/products-systems/design-and-analysis-tools/trane-design-tools/trace-3d-plus.html (accessed on 24 October 2024).
- Reinhart, C.F.; Davila, C.C. Urban building energy modeling—A review of a nascent field. Build. Environ. 2016, 97, 196–202. [Google Scholar] [CrossRef]
- Esen, H.; Esen, M.; Ozsolak, O. Modelling and experimental performance analysis of solar-assisted ground source heat pump system. J. Exp. Theor. Artif. Intell. 2015, 29, 1–17. [Google Scholar] [CrossRef]
- Wei, Y.; Zhang, X.; Shi, Y.; Xia, L.; Pan, S.; Wu, J.; Han, M.; Zhao, X. A review of data-driven approaches for prediction and classification of building energy consumption. Renew. Sustain. Energy Rev. 2018, 82, 1027–1047. [Google Scholar] [CrossRef]
- Maryland Building Energy Performance Standards. Maryland Department of the Environment. Available online: https://mde.maryland.gov/programs/air/ClimateChange/Pages/BEPS.aspx (accessed on 24 October 2024).
- Chen, Y.; Guo, M.; Chen, Z.; Chen, Z.; Ji, Y. Physical energy and data-driven models in building energy prediction: A review. Energy Rep. 2022, 8, 2656–2671. [Google Scholar] [CrossRef]
- Ma, M.; Zhou, N.; Feng, W.; Yan, J. Challenges and opportunities in the global net-zero building sector. Cell Rep. Sustain. 2024, 1, 100154. [Google Scholar] [CrossRef]
- Yan, R.; Ma, M.; Zhou, N.; Feng, W.; Xiang, X.; Mao, C. Towards COP27: Decarbonization patterns of residential building in China and India. Appl. Energy 2023, 352, 122003. [Google Scholar] [CrossRef]
- Higgins, J.; Ramnarayan, A.; Family, R.; Ohadi, M. Analysis of Energy Efficiency Opportunities for a Public Transportation Maintenance Facility—A Case Study. Energies 2024, 17, 1907. [Google Scholar] [CrossRef]
- Neto, A.H.; Fiorelli, F.A.S. Comparison between detailed model simulation and artificial neural network for forecasting building energy consumption. Energy Build. 2008, 40, 2169–2176. [Google Scholar] [CrossRef]
- Kim, J.B.; Jeong, W.; Clayton, M.J.; Haberl, J.S.; Yan, W. Developing a physical BIM library for building thermal energy simulation. Autom. Constr. 2015, 50, 16–28. [Google Scholar] [CrossRef]
- Modelica, Modelica Language, Modelica Association, Simulation in Europe Basic Research Working Group (SiE-WG). Available online: https://simulationresearch.lbl.gov/modelica/ (accessed on 24 October 2024).
- Chen, Y.; Chen, Z.; Xu, P.; Li, W.; Sha, H.; Yang, Z.; Li, G.; Hu, C. Quantification of electricity flexibility in demand response: Office building case study. Energy 2019, 188, 116054. [Google Scholar] [CrossRef]
- Ramnarayan, A.; Sarmiento, A.; Gerami, A.; Ohadi, M. An energy consumption intensity ranking system for rapid energy efficiency evaluation of a cluster of commercial buildings. ASHRAE Trans. 2023, 129, 602–611. [Google Scholar]
- Ramnarayan, A.; Sarmiento, A.; Ohadi, M. A Multi-Criterion Ranking Approach for Energy Efficiency Ranking of a Cluster of Commercial Buildings. ASHRAE Trans. 2025, 131. [Google Scholar]
- Zargarzadeh, S.; Ramnarayan, A.; de Castro, F.; Ohadi, M. ML-Enabled Solar PV Electricity Generation Projection for a Large Academic Campus to Reduce Onsite CO2 Emissions. Energies 2024, 17, 6188. [Google Scholar] [CrossRef]
- Olu-Ajayi, R.; Alaka, H.; Sulaimon, I.; Sunmola, F.; Ajayi, S. Building energy consumption prediction for residential buildings using deep learning and other machine learning techniques. J. Build. Eng. 2022, 45, 103406. [Google Scholar] [CrossRef]
- Zhang, Y.; Teoh, B.K.; Wu, M.; Chen, J.; Zhang, L. Data-driven estimation of building energy consumption and GHG emissions using explainable artificial intelligence. Energy 2022, 262, 125468. [Google Scholar] [CrossRef]
- Chen, Y.; Xu, P.; Chu, Y.; Li, W.; Wu, Y.; Ni, L.; Bao, Y.; Wang, K. Short-term electrical load forecasting using the Support Vector Regression (SVR) model to calculate the demand response baseline for office buildings. Appl. Energy 2017, 195, 659–670. [Google Scholar] [CrossRef]
- Wang, Z.; Hong, T.; Piette, M.A. Building thermal load prediction through shallow machine learning and deep learning. Appl. Energy 2020, 263, 114683. [Google Scholar] [CrossRef]
- Yousaf, S.; Bradshaw, C.R.; Kamalapurkar, R.; San, O. Development and performance evaluation of empirical models compatible with building energy modeling engines for unitary equipment. Sci. Technol. Built Environ. 2025, 31, 533–550. [Google Scholar] [CrossRef]
- Yousaf, S.; Bradshaw, C.R.; Kamalapurkar, R.; San, O. A gray-box model for unitary air conditioners developed with symbolic regression. Int. J. Refrig. 2024, 168, 696–707. [Google Scholar] [CrossRef]
- Gassar, A.A.A.; Cha, S.H. Energy prediction techniques for large-scale buildings towards a sustainable built environment: A review. Energy Build. 2020, 224, 110238. [Google Scholar] [CrossRef]
- Dong, B.; Li, Z.; Rahman, S.M.; Vega, R. A hybrid model approach for forecasting future residential electricity consumption. Energy Build. 2016, 117, 341–351. [Google Scholar] [CrossRef]
- Xu, X.; Taylor, J.E.; Pisello, A.L.; Culligan, P.J. The impact of place-based affiliation networks on energy conservation: An holistic model that integrates the influence of buildings, residents and the neighborhood context. Energy Build. 2012, 55, 637–646. [Google Scholar] [CrossRef]
- Bakır, H.; Ağbulut, Ü.; Gürel, A.E.; Yıldız, G.; Güvenç, U.; Soudagar, M.E.M.; Hoang, A.T.; Deepanraj, B.; Saini, G.; Afzal, A. Forecasting of future greenhouse gas emission trajectory for India using energy and economic indexes with various metaheuristic algorithms. J. Clean. Prod. 2022, 360, 131946. [Google Scholar] [CrossRef]
- Sen, P.; Roy, M.; Pal, P. Application of ARIMA for forecasting energy consumption and GHG emission: A case study of an Indian pig iron manufacturing organization. Energy 2016, 116, 1031–1038. [Google Scholar] [CrossRef]
- Javed, S.A.; Cudjoe, D. A novel grey forecasting of greenhouse gas emissions from four industries of China and India. Sustain. Prod. Consum. 2022, 29, 777–790. [Google Scholar] [CrossRef]
- Tanaka, K. Review of policies and measures for energy efficiency in industry sector. Energy Policy 2011, 39, 6532–6550. [Google Scholar] [CrossRef]
- Chang, L.; Mohsin, M.; Hasnaoui, A.; Taghizadeh-Hesary, F. Exploring carbon dioxide emissions forecasting in China: A policy-oriented perspective using projection pursuit regression and machine learning models. Technol. Forecast. Soc. Change 2023, 197, 122872. [Google Scholar] [CrossRef]
- Nam, K.; Hwangbo, S.; Yoo, C. A deep learning-based forecasting model for renewable energy scenarios to guide sustainable energy policy: A case study of Korea. Renew. Sustain. Energy Rev. 2020, 122, 109725. [Google Scholar] [CrossRef]
- Anand, P.; Deb, C.; Yan, K.; Yang, J.; Cheong, D.; Sekhar, C. Occupancy-based energy consumption modelling using machine learning algorithms for institutional buildings. Energy Build. 2021, 252, 111478. [Google Scholar] [CrossRef]
- Veiga, R.; Veloso, A.; Melo, A.; Lamberts, R. Application of machine learning to estimate building energy use intensities. Energy Build. 2021, 249, 111219. [Google Scholar] [CrossRef]
- Egwim, C.N.; Alaka, H.; Egunjobi, O.O.; Gomes, A.; Mporas, I. Comparison of machine learning algorithms for evaluating building energy efficiency using big data analytics. J. Eng. Des. Technol. 2022, 22, 1325–1350. [Google Scholar] [CrossRef]
- Dong, B.; Cao, C.; Lee, S.E. Applying support vector machines to predict building energy consumption in tropical region. Energy Build. 2005, 37, 545–553. [Google Scholar] [CrossRef]
- Massana, J.; Pous, C.; Burgas, L.; Melendez, J.; Colomer, J. Short-term load forecasting in a non-residential building contrasting models and attributes. Energy Build. 2015, 92, 322–330. [Google Scholar] [CrossRef]
- Jain, R.K.; Smith, K.M.; Culligan, P.J.; Taylor, J.E. Forecasting energy consumption of multi-family residential buildings using support vector regression: Investigating the impact of temporal and spatial monitoring granularity on performance accuracy. Appl. Energy 2014, 123, 168–178. [Google Scholar] [CrossRef]
- Seyedzadeh, S.; Rahimian, F.P.; Glesk, I.; Roper, M. Machine learning for estimation of building energy consumption and performance: A review. Vis. Eng. 2018, 6, 5. [Google Scholar] [CrossRef]
- Hearst, M.A.; Dumais, S.T.; Osuna, E.; Platt, J.; Scholkopf, B. Support vector machines. IEEE Intell. Syst. Their Appl. 1998, 13, 18–28. [Google Scholar] [CrossRef]
- Zeng, A.; Ho, H.; Yu, Y. Prediction of building electricity usage using Gaussian Process Regression. J. Build. Eng. 2020, 28, 101054. [Google Scholar] [CrossRef]
- Rastogi, P.; Khan, M.E.; Andersen, M. Gaussian-process-based emulators for building performance simulation. In Proceedings of the BS 2017, San Francisco, CA, USA, 7–9 August 2017. [Google Scholar]
- Zhang, Y.; O’NEill, Z.; Dong, B.; Augenbroe, G. Comparisons of inverse modeling approaches for predicting building energy performance. Build. Environ. 2015, 86, 177–190. [Google Scholar] [CrossRef]
- Srivastav, A.; Tewari, A.; Dong, B. Baseline building energy modeling and localized uncertainty quantification using Gaussian mixture models. Energy Build. 2013, 65, 438–447. [Google Scholar] [CrossRef]
- Li, Z.; Han, Y.; Xu, P. Methods for benchmarking building energy consumption against its past or intended performance: An overview. Appl. Energy 2014, 124, 325–334. [Google Scholar] [CrossRef]
- Heo, Y.; Choudhary, R.; Augenbroe, G. Calibration of building energy models for retrofit analysis under uncertainty. Energy Build. 2012, 47, 550–560. [Google Scholar] [CrossRef]
- Heo, Y.; Zavala, V.M. Gaussian process modeling for measurement and verification of building energy savings. Energy Build. 2012, 53, 7–18. [Google Scholar] [CrossRef]
- Noh, H.Y.; Rajagopal, R. Data-driven forecasting algorithms for building energy consumption. In Sensors and Smart Structures Technologies for Civil, Mechanical, and Aerospace Systems 2013; SPIE: Bellingham, WA, USA, 2013; Volume 8692, pp. 221–228. [Google Scholar]
- Shi, J.Q.; Choi, T. Gaussian Process Regression Analysis for Functional Data; CRC Press: Boca Raton, FL, USA, 2011. [Google Scholar]
- Ahmad, T.; Chen, H. Nonlinear autoregressive and random forest approaches to forecasting electricity load for utility energy management systems. Sustain. Cities Soc. 2019, 45, 460–473. [Google Scholar] [CrossRef]
- Ke, J.; Qin, Y.; Wang, B.; Yang, S.; Wu, H.; Yang, H.; Zhao, X.; Guo, H. Data-Driven Predictive Control of Building Energy Consumption under the IoT Architecture. Wirel. Commun. Mob. Comput. 2020, 2020, 8849541. [Google Scholar] [CrossRef]
- Silva, J.; Praça, I.; Pinto, T.; Vale, Z. Energy consumption forecasting using ensemble learning algorithms. In Proceedings of the Distributed Computing and Artificial Intelligence, 16th International Conference, Special Sessions, Avila, Spain, 26–28 June 2019; Springer International Publishing: Cham, Switzerland, 2019; pp. 5–13. [Google Scholar]
- Wang, Z.; Srinivasan, R.; Wang, Y. Homogeneous ensemble model for building energy prediction: A case study using ensemble regression tree. In Proceedings of the 2016 ACEEE Summer Study on Energy Efficiency in Buildings, Pacific Grove, CA, USA, 21–26 August 2016; pp. 21–26. [Google Scholar]
- EnergyCAP. Energy Management & Monitoring Solutions. Available online: https://www.energycap.com/ (accessed on 24 October 2024).
- Zhao, Y.; Li, T.; Zhang, X.; Zhang, C. Artificial intelligence-based fault detection and diagnosis methods for building energy systems: Advantages, challenges and the future. Renew. Sustain. Energy Rev. 2019, 109, 85–101. [Google Scholar] [CrossRef]
- Little, R.J.; Rubin, D.B. Statistical Analysis with Missing Data; John Wiley & Sons: Hoboken, NJ, USA, 2019; Volume 793. [Google Scholar]
- Zheng, A.; Casari, A. Feature Engineering for Machine Learning: Principles and Techniques for Data Scientists; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2018. [Google Scholar]
- ANSI/ASHRAE Standard 62.12019; Ventilation for Acceptable Indoor Air Quality. American Society of Heating, Refrigerating, and Air-Conditioning Engineers, Inc.: Atlanta, GA, USA, 2019.
- Giannelos, S.; Bellizio, F.; Strbac, G.; Zhang, T. Machine learning approaches for predictions of CO2 emissions in the building sector. Electr. Power Syst. Res. 2024, 235, 110735. [Google Scholar] [CrossRef]
- Cabello-Solorzano, K.; de Araujo, I.O.; Peña, M.; Correia, L.; Tallón-Ballesteros, A.J. The impact of data normalization on the accuracy of machine learning algorithms: A comparative analysis. In Proceedings of the International Conference on Soft Computing Models in Industrial and Environmental Applications, Salamanca, Spain, 5–7 August 2023; Springer: Cham, Switzerland, 2023; pp. 344–353. [Google Scholar]
- Li, J.; Cheng, K.; Wang, S.; Morstatter, F.; Trevino, R.P.; Tang, J.; Liu, H. Feature selection: A data perspective. ACM Comput. Surv. (CSUR) 2017, 50, 1–45. [Google Scholar] [CrossRef]
- Wang, G.; Lauri, F.; El Hassani, A.H. Feature selection by mRMR method for heart disease diagnosis. IEEE Access 2022, 10, 100786–100796. [Google Scholar] [CrossRef]
- Li, B.Q.; Hu, L.L.; Chen, L.; Feng, K.Y.; Cai, Y.D.; Chou, K.C. Prediction of protein domain with mRMR feature selection and analysis. PLoS ONE 2012, 7, e39308. [Google Scholar] [CrossRef]
- Elssied, N.O.F.; Ibrahim, O.; Osman, A.H. A novel feature selection based on one-way anova f-test for e-mail spam classification. Res. J. Appl. Sci. Eng. Technol. 2014, 7, 625–638. [Google Scholar] [CrossRef]
- Dhanya, R.; Paul, I.R.; Akula, S.S.; Sivakumar, M.; Nair, J.J. F-test feature selection in Stacking ensemble model for breast cancer prediction. Procedia Comput. Sci. 2020, 171, 1561–1570. [Google Scholar] [CrossRef]
- Wang, X.; Tao, Y.; Zheng, K. Feature selection methods in the framework of mRMR. In Proceedings of the 2018 Eighth International Conference on Instrumentation & Measurement, Computer, Communication and Control (IMCCC), Harbin, China, 19–21 July 2018; IEEE: New York, NY, USA, 2018; pp. 1490–1495. [Google Scholar]
- Howley, T.; Madden, M.G.; O’Connell, M.L.; Ryder, A.G. The effect of principal component analysis on machine learning accuracy with high dimensional spectral data. In Proceedings of the International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge, UK, 25 December 2005; Springer: London, UK, 2005; pp. 209–222. [Google Scholar]
- Ozdemir, S.; Susarla, D. Feature Engineering Made Easy: Identify Unique Features from Your Dataset in Order to Build Powerful Machine Learning Systems; Packt Publishing Ltd.: Birmingham, UK, 2018. [Google Scholar]
- Alkaya, A.; Eker, İ. Variance sensitive adaptive threshold-based PCA method for fault detection with experimental application. ISA Trans. 2011, 50, 287–302. [Google Scholar] [CrossRef]
- Jolliffe, I.T. Choosing a subset of principal components or variables. In Principal Component Analysis; Springer: Berlin/Heidelberg, Germany, 2002; pp. 111–149. [Google Scholar]
- de Melo Battisti, F.; de Carvalho, T.B.A. Threshold feature selection PCA. In Proceedings of the Symposium on Knowledge Discovery, Mining and Learning (KDMiLe), Campinas, Brasil, 28 November–1 December 2022; SBC: Hong Kong, China, 2022; pp. 50–57. [Google Scholar]
- Holland, S.M. Principal Components Analysis (PCA); GA 30602-2501; Department of Geology, University of Georgia: Athens, Greece, 2008. [Google Scholar]
- Murphy, A.H. Probabilistic weather forecasting. In Probability, Statistics, and Decision Making in the Atmospheric Sciences; CRC Press: Boca Raton, FL, USA, 2019; pp. 337–377. [Google Scholar]
- Wu, J.; Chen, X.Y.; Zhang, H.; Xiong, L.D.; Lei, H.; Deng, S.H. Hyperparameter optimization for machine learning models based on Bayesian optimization. J. Electron. Sci. Technol. 2019, 17, 26–40. [Google Scholar]
- Rasmussen, C.E. Gaussian processes in machine learning. In Summer School on Machine Learning; Springer: Berlin/Heidelberg, Germany, 2003; pp. 63–71. [Google Scholar]
- Williams, C.K.; Rasmussen, C.E. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006; Volume 2, p. 4. [Google Scholar]
- Gorbach, N.S.; Bian, A.A.; Fischer, B.; Bauer, S.; Buhmann, J.M. Model selection for gaussian process regression. In Proceedings of the Pattern Recognition: 39th German Conference, GCPR 2017, Basel, Switzerland, 12–15 September 2017; Springer International Publishing: Cham, Switzerland, 2017; pp. 306–318. [Google Scholar]
- Abdessalem, A.B.; Dervilis, N.; Wagg, D.J.; Worden, K. Automatic kernel selection for gaussian processes regression with approximate bayesian computation and sequential monte carlo. Front. Built Environ. 2017, 3, 52. [Google Scholar] [CrossRef]
- Stuke, A.; Rinke, P.; Todorović, M. Efficient hyperparameter tuning for kernel ridge regression with Bayesian optimization. Mach. Learn. Sci. Technol. 2021, 2, 035022. [Google Scholar] [CrossRef]
- Malakouti, S.M.; Menhaj, M.B.; Suratgar, A.A. Applying Grid Search, Random Search, Bayesian Optimization, Genetic Algorithm, and Particle Swarm Optimization to fine-tune the hyperparameters of the ensemble of ML models enhances its predictive accuracy for mud loss. arXiv 2024. [Google Scholar] [CrossRef]
- Alibrahim, H.; Ludwig, S.A. Hyperparameter optimization: Comparing genetic algorithm against grid search and bayesian optimization. In Proceedings of the 2021 IEEE Congress on Evolutionary Computation (CEC), Krakow, Poland, 28 June–1 July 2021; IEEE: New York, NY, USA, 2021; pp. 1551–1559. [Google Scholar]
- Wang, X.; Hong, L.J.; Jiang, Z.; Shen, H. Gaussian process-based random search for continuous optimization via simulation. Oper. Res. 2025, 73, 385–407. [Google Scholar] [CrossRef]
- Palm, N.; Landerer, M.; Palm, H. Gaussian process regression based multi-objective bayesian optimization for power system design. Sustainability 2022, 14, 12777. [Google Scholar] [CrossRef]
- Heaton, M.J.; Datta, A.; Finley, A.O.; Furrer, R.; Guinness, J.; Guhaniyogi, R.; Gerber, F.; Gramacy, R.B.; Hammerling, D.; Katzfuss, M.; et al. A case study competition among methods for analyzing large spatial data. J. Agric. Biol. Environ. Stat. 2018, 24, 398–425. [Google Scholar] [CrossRef]
- Sharma, N.; Juneja, A. Combining of random forest estimates using LSboost for stock market index prediction. In Proceedings of the 2017 2nd International Conference for Convergence in Technology (I2CT), Mumbai, India, 7–9 April 2017; IEEE: New York, NY, USA, 2017; pp. 1199–1202. [Google Scholar]
- Osman, H.; Ghafari, M.; Nierstrasz, O. Hyperparameter optimization to improve bug prediction accuracy. In Proceedings of the 2017 IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), Klagenfurt, Austria, 21 February 2017; IEEE: New York, NY, USA, 2017; pp. 33–38. [Google Scholar]
- Sunkad, Z.A. Feature selection and hyperparameter optimization of SVM for human activity recognition. In Proceedings of the 2016 3rd International Conference on Soft Computing & Machine Intelligence (ISCMI), Dubai, United Arab Emirates, 23–25 November 2016; IEEE: New York, NY, USA, 2016; pp. 104–109. [Google Scholar]
- Mantovani, R.G.; Rossi, A.L.; Vanschoren, J.; Bischl, B.; De Carvalho, A.C. Effectiveness of random search in SVM hyper-parameter tuning. In Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July 2015; IEEE: New York, NY, USA, 2015; pp. 1–8. [Google Scholar]
- Kalita, D.J.; Singh, V.P.; Kumar, V. A survey on SVM hyper-parameters optimization techniques. In Proceedings of the Social Networking and Computational Intelligence: Proceedings of SCI-2018, Bhopal, India, 5–6 October 2018; Springer: Singapore, 2018; pp. 243–256. [Google Scholar]
- Governor Moore Signs Executive Order Doubling Maryland’s Energy Conservation Goal Through State Government Greenhouse Gas Reductions; Press Releases—News; Office of Governor Wes Moore: Annapolis, MD, USA, 2023.
- Kang, H.; An, J.; Kim, H.; Ji, C.; Hong, T.; Lee, S. Changes in energy consumption according to building use type under COVID-19 pandemic in South Korea. Renew. Sustain. Energy Rev. 2021, 148, 111294. [Google Scholar] [CrossRef]
- Mokhtari, R.; Jahangir, M.H. The effect of occupant distribution on energy consumption and COVID-19 infection in buildings: A case study of university building. Build. Environ. 2021, 190, 107561. [Google Scholar] [CrossRef]
- Cai, S.; Gou, Z. Impact of COVID-19 on the energy consumption of commercial buildings: A case study in Singapore. Energy Built Environ. 2024, 5, 364–373. [Google Scholar] [CrossRef]
- Sheet, E.F. Social Cost of Carbon; United States Environmental Protection Agency (EPA): Washington, DC, USA, 2013.
State | Emissions Reduction Goals |
---|---|
California [16] | 57% by 2030; net-zero by 2045 |
Colorado [17] | 26% by 2025; 50% by 2030; 90% by 2050 |
District of Columbia [18] | 45% by 2030; 95% by 2050 |
Maryland [19] | 60% by 2031; net-zero by 2045 |
New York [20] | 40% by 2030; 85% by 2050 |
Washington [21] | 58% by 2030; net-zero by 2050 |
Input Features | Energy Consumption (Model 1) | GHG Emission (Model 2) |
---|---|---|
Building Type | ✅ | ✅ |
Square Footage | ✅ | ✅ |
Age | ✅ | ✅ |
Maximum Occupancy based on ASHRAE Standard 62.1 Levels [85] | ✅ | ✅ |
Cooling Degree Days (2018–2040) | ✅ | ✅ |
Heating Degree Days (2018–2040) | ✅ | ✅ |
Utility Use% (2018–2040) | ✅ | ✅ |
Energy Consumption (2018–2040) | N/A | ✅ |
Model | Training Time (s) | R2 | NMAE (kWh) [kBtu] | NRMSE (kWh) [kBtu] |
---|---|---|---|---|
SVM | 121 | 0.82 | 0.227 [0.774] | 0.326 [1.112] |
GPR | 608 | 0.91 | 0.057 [0.194] | 0.108 [0.368] |
Ensemble (LSBoost) | 606 | 0.87 | 0.121 [0.413] | 0.201 [0.686] |
Model | Training Time (s) | R2 | NMAE (Metric Tons) | NRMSE (Metric Tons) |
---|---|---|---|---|
SVM | 113 | 0.93 | 0.092 | 0.124 |
GPR | 609 | 0.90 | 0.139 | 0.248 |
Ensemble (LSBoost) | 601 | 0.96 | 0.067 | 0.112 |
Kernel | Tuning Method | R2 | NRMSE (kWh) [kBtu] | Best Hyperparameters |
---|---|---|---|---|
RBF | Grid search | 0.82 | 0.257 [0.877] | Sigma = 0.0017 |
Matern 5/2 | Bayesian optimization | 0.91 | 0.108 [0.368] | Sigma = 0.0012 |
Rational quadratic | Random search | 0.89 | 0.117 [0.399] | Sigma = 0.0047 |
Exponential | Grid search | 0.86 | 0.201 [0.686] | Sigma = 0.0025 |
Number of Learners | Tuning Method | R2 | NRMSE (Metric Tons of CO2e) | Learning Rate |
---|---|---|---|---|
15 | Grid Search | 0.93 | 0.119 | 0.4642 |
18 | Bayesian Optimization | 0.96 | 0.112 | 0.3758 |
63 | Random Search | 0.89 | 0.291 | 0.0813 |
Kernel | Tuning Method | R2 | NRMSE (kWh) [kBtu] | Best Hyperparameters |
---|---|---|---|---|
RBF | Grid search | 0.75 | 0.412 [1.406] | C = 9, ε = 0.13 |
Linear | Random search | 0.82 | 0.326 [1.112] | C = 2, ε = 0.01 |
Polynomial | Bayesian optimization | 0.79 | 0.381 [1.299] | C = 4, ε = 0.05 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ramnarayan, A.; de Castro, F.; Sarmiento, A.; Ohadi, M. Machine Learning-Based Carbon Compliance Forecasting and Energy Performance Assessment in Commercial Buildings. Energies 2025, 18, 3906. https://doi.org/10.3390/en18153906
Ramnarayan A, de Castro F, Sarmiento A, Ohadi M. Machine Learning-Based Carbon Compliance Forecasting and Energy Performance Assessment in Commercial Buildings. Energies. 2025; 18(15):3906. https://doi.org/10.3390/en18153906
Chicago/Turabian StyleRamnarayan, Aditya, Felipe de Castro, Andres Sarmiento, and Michael Ohadi. 2025. "Machine Learning-Based Carbon Compliance Forecasting and Energy Performance Assessment in Commercial Buildings" Energies 18, no. 15: 3906. https://doi.org/10.3390/en18153906
APA StyleRamnarayan, A., de Castro, F., Sarmiento, A., & Ohadi, M. (2025). Machine Learning-Based Carbon Compliance Forecasting and Energy Performance Assessment in Commercial Buildings. Energies, 18(15), 3906. https://doi.org/10.3390/en18153906