Next-Level Energy Management in Manufacturing: Facility-Level Energy Digital Twin Framework Based on Machine Learning and Automated Data Collection

Vance, David; Jin, Mingzhou; Wenning, Thomas; Nimbalkar, Sachin; Price, Christopher

doi:10.3390/en18133242

by David Vance ¹, Mingzhou Jin ¹,*, Thomas Wenning ², Sachin Nimbalkar ² and Christopher Price ²

¹ Industrial and Systems Engineering Department, Institute for a Secure and Sustainable Environment, The University of Tennessee, Knoxville, TN 37919, USA

² Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA

* Author to whom correspondence should be addressed.

Energies 2025, 18(13), 3242; https://doi.org/10.3390/en18133242

Submission received: 17 March 2025 / Revised: 6 June 2025 / Accepted: 11 June 2025 / Published: 20 June 2025

Abstract

This research introduces an energy prediction framework at the facility level supported by automated data collection and machine learning models. It investigates whether reducing the prediction time scale allows for applying more complex machine learning techniques and if those techniques improve the prediction accuracy. The primary advantages of this framework lie in its automation of the energy prediction process and its provision of real-time energy data suitable for use in energy dashboards or digital twins. A sitewide dataset was created by combining 15 min energy and daily production data of five shops—assembly, battery, body (electric), body (gas), and paint—from a globally recognized electric vehicle manufacturer. Various machine learning models were evaluated on daily, weekly, and monthly datasets, including, in increasingly complex order: naïve, simple linear regression, net regularized generalized linear regression, principal component regression, k-nearest neighbor, random forest, and Bayesian regularized neural network. Compared to the current state-of-the-art energy consumption prediction for the industrial facility level, this research investigates more complex models and smaller time intervals for higher accuracy. The findings revealed that the more complex monthly models require a minimum of a year and a half of data to operate, while weekly models demand a year of data to achieve improved accuracy. Daily models can operate with only six months of data but exhibit poor performance due to reduced prediction accuracy of production. Key challenges identified include access to reliable, high-quality energy and production data and the initial demand for human labor.

Keywords: energy management; machine learning; digital twin

1. Introduction

The future potential for energy savings in manufacturing lies in implementing energy efficiency projects and developing innovative energy monitoring and management approaches [1]. The digital twin (DT) and Internet of Things (IoT) paradigms enable enhanced visibility, awareness, and improvements in energy efficiency through the use of connected intelligent sensors and meters. This improved real-time energy consumption data can significantly enhance the ability of Energy Management Systems (EnMSs) to achieve sustained energy savings [2]. With a robust energy data collection process, energy efficiency opportunities can be identified at each level of the manufacturing process. Current methods for increasing energy efficiency in discrete part manufacturing are distinguished based on unit, multi-machine, factory, multi-facility, and supply chain levels [3]. The effectiveness of incorporating energy management into any of these levels depends on knowing the present state of consumption, historical consumption, and the system’s responsiveness to changes. This study was guided by the need to establish a data-gathering framework that facilitates energy management to achieve savings at each level. Energy prediction is often directed by ISO 50001:2018, the industry standard for implementing EnMSs [4]. However, manufacturers frequently rely on naïve or simple linear regression models for prediction, if they engage in any at all. While several industrial energy management regression software packages are available, none appear to integrate machine learning models with daily data inputs to make predictions. For example, the U.S. Department of Energy (DOE) Energy Performance Indicator Tool [5] does not offer predictive capabilities, only accommodating monthly data inputs and employing simple regression analysis. 
Although the DOE VERIFI (beta) tool [6] can implement more advanced regression analysis and generate visualizations and reports, it was not designed for predictive functions and could not accept daily or weekly inputs at the time of this study.

Original Equipment Manufacturers (OEMs) face increasing pressure to illustrate the sustainability of each stage in their manufacturing processes. If an OEM fails to accurately track and predict emissions generated during various manufacturing stages across the supply chain, it risks incurring unnecessary energy expenses, facing costs related to carbon taxes, and suffering negative impacts on its corporate image.

Smart manufacturing (SM), also called Industry 4.0 or the fourth industrial revolution, includes advanced technologies that allow real-time information transparency, improved decision-making, or interconnection between diverse processes [7]. Typical SM technologies include robotics, the Industrial Internet of Things (IIOT), cloud computing, and, importantly for this research, DT, machine learning (ML), and big data analytics.

A DT is a virtual or digital counterpart to a physical system. The essential aspect of DT is that it is a uniform data model for describing manifold assets. Data and system integration, integration of cross-life-cycle data, and a service orientation are requirements for advancing to the DT level. A systematic review of machine learning applications for industrial energy efficiency by Narciso and Martins [8] showed that publications are increasing and are dominated by petrochemical production research. Only one of the forty-two papers reviewed predicts future energy consumption based on historical energy consumption, and it focuses on a specific process instead of the facility [9]. Recently, Moghadasi et al. [10] applied ML techniques to implement an energy management system according to ISO 50001:2018. Moghadasi’s method fulfills all the requirements of the ISO 50001 standard except facility-level energy prediction. The requirements it covers are identifying significant energy users, defining energy performance indicators, developing a baseline, analyzing current energy performance, quantifying energy conservation opportunities, determining energy targets, and defining the energy action plan. Moghadasi et al. [10] focused on highly detailed process parameters such as steam temperature and pressure, whereas this study investigates production and historical energy consumption figures as predictors of energy use. Process parameters do not allow prediction in the same manner as production data. Moghadasi’s framework could be helpful in conjunction with this research, but it does not fulfill the energy prediction requirement.

2. Literature Review

This literature review presents the research questions and current status of integrating Smart Manufacturing (SM) concepts, including Digital Twin (DT), Internet of Things (IoT), and big data analytics, with energy management and facility-level energy predictions.

DT, IoT, and advanced enterprise systems are essential for future energy management research and provide excellent innovation potential as real-time data becomes available [11]. Medojevic et al. [12] stated that “without energy management overall at the center of Industry 4.0, there is no Industry 4.0”, pointing to the fundamental role that energy availability and energy reliability play in manufacturing processes.

The IoT has enabled many new efficiency advantages for both productivity and energy. The considerable benefits of combining Industry 4.0 with energy management have not been realized because plant managers lack awareness of the intimate connection between the two themes, and software tools need to be upgraded [12]. Shrouf and Miragliotta [13] reviewed production management practices that are enhanced by IoT technology and found six sets of benefits that can be achieved, including (1) finding and reducing energy waste sources, (2) improving energy-aware production scheduling, (3) reducing energy bills through demand response and reducing energy purchasing cost, (4) efficient maintenance management, (5) improving environmental reputation by meeting customer expectations and obtaining environmental certification, and (6) supporting decentralization in decision-making at production level to increase energy efficiency. Although the potential benefits of combining Industry 4.0 with energy management are apparent, there has been insufficient research to integrate these concepts at the industrial plant level. Also, collecting, transforming, integrating, modeling, storing, securing, analyzing, and presenting big data sets present significant challenges [14].

Several reviews exist on the interaction between energy management and advanced data analytics. Sievers and Blank [15] reviewed the state of the art in data-driven residential and industrial energy management systems, including the system infrastructure’s design, wireless and wired communication protocols, real-time and historical data, forecasting algorithms, and objectives and optimization techniques. A detailed analysis of the research literature on energy management in manufacturing is presented in [16]. Six main lines of research in the field include (i) drivers and barriers, (ii) information and communication technologies, (iii) strategic paradigms, (iv) supporting tools and methods, (v) manufacturing process paradigms, and (vi) manufacturing performances in the trade-off. The number of publications grew consistently from 2009 to 2016, the last year covered by the study, and the authors expect it to continue increasing.

The DT paradigm has promoted sustainability in many sectors and applications. Pater and Stadnicka [17] reviewed twenty papers related to DT and sustainability to determine what problems are included within the two topics. They concluded that there are many areas where the potential of DT for sustainability has not been fully realized, which suggests combining DT with sustainability. The review by Pater and Stadnicka [17] showed that only one paper concerns energy monitoring and forecasting, and none are related to energy at the facility level.

2.1. Energy Management

Few energy prediction models have been identified for industrial factory-level applications. Ref. [18] provides a review of energy consumption forecasting models in the manufacturing industry. Out of 72 examined articles, only four have a system boundary of the entire factory. Most are related to the machine level. On further inspection of these four models, none predict facility energy consumption in a straightforward fashion, as proposed in this research. Ref. [19] proposes a generic statistical event simulation method for modeling the energy consumption behavior of machines. It is stated that this approach can be applied at the plant level, but how the approach could work is not demonstrated. Ref. [20] employs load bus data with a two-stage load estimator algorithm and state estimation theory to estimate the amount of electric power system expansion needed to serve an expanded load. The prediction is at the facility level, but no facility energy consumption prediction is presented. Ref. [21] describes an ML approach for 15 min forecasting of the electric load to reduce volatility in power generation introduced by renewables. This model shows how real-time energy data predictions can be made at the plant level, but it only forecasts 15 min into the future and is not easily implemented. Ref. [22] uses deep learning techniques for energy forecasting in a manufacturing area, but it focuses on building parameters such as workshop air temperature and humidity, does not include production as a predictor, and does not make a total facility prediction, including process equipment.

ML for commercial building energy management is much more widely adopted than it is for industrial facility energy management, so building energy management will be highlighted to demonstrate how industrial energy management could be improved. Chen et al. [23] reviewed the state of the art in interpretable machine learning for building energy management. They outlined the use cases for ML models and how to improve model interpretability. Detailed data acquisition platforms and typical applications exist in building energy management, including load prediction, fault detection and diagnosis, and occupancy behavior. Generalizing across industrial applications is more challenging due to the existence of varied processes and equipment, as well as the stochastic nature of manufacturing systems.

Not many data acquisition platforms have been proposed or implemented for the energy management of intelligent factories. Ref. [24] proposed a data acquisition platform for energy management in smart factories or buildings using openHAB, MQTT, and the Node Microcontroller Unit (NodeMCU). However, the focus is on improving communication security, and the proposed platform has not been implemented. To improve energy management in industrial factories, data acquisition platforms must be advanced.

2.2. Energy Digital Twin (DT)

The capability of a manufacturer to produce DT applications is a crucial goal for digital transformation, Industry 4.0, and SM [25]. The term “digital twin” was introduced by Michael Grieves in a 2002 product lifecycle management (PLM) presentation, although virtual replication of a product had already been realized in the aerospace industry [26]. A widely used definition given by Glaessgen and Stargel [27] reads: “DT is an integrated multi-physics, multi-scale, probabilistic simulation of a complex product and uses the best available physical models, sensor updates, etc., to mirror the life of its corresponding twin.” Another definition of DT in manufacturing is given by Garetti et al. [28]: “DT consists of a virtual representation of a production system that can run on different simulation disciplines characterized by the synchronization between the virtual and real system, thanks to sensed data, connected smart devices, mathematical models, and real-time data elaboration. The topical role within Industry 4.0 manufacturing systems is to exploit these features to forecast and optimize the behavior of the production system at each life cycle phase in real-time.”

In the next phase of DT research, several researchers have reviewed DT applications to determine a more specific definition of DT and present use cases. For example, ref. [29] categorically reviewed the DT in manufacturing applications to classify existing publications according to their type, level of integration, area of focus, and technology used. Before Kritzinger et al. [29], DT was viewed simply as a digital counterpart to a physical object, and the terms digital model, digital shadow, and DT were often used synonymously. Now, the terms can be distinguished by the level of data integration between the physical and digital counterparts. A digital model provides a digital representation of an existing or planned physical object without automated data exchange. A change in the state of the physical object has no direct effect on the digital object, and vice versa. The digital shadow refers to a one-way data flow between the state of a physical object and its corresponding digital representation. A change in the state of the physical object leads to a change in the digital object, but not vice versa. If, and only if, data flows are fully integrated into both directions between the physical and digital object, Kritzinger et al. [29] stated that you can accurately refer to it as a DT. The authors found that most papers used the term DT, while only 18% described a DT with bidirectional data transfer. Using the Kritzinger et al. [29] framework, we could not characterize a real-time energy information model as DT unless it had bidirectional data-transfer capability.
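The three integration levels described above can be written as a small decision rule. The sketch below is illustrative only; the function name and its two boolean arguments are hypothetical and are not taken from Kritzinger et al. [29].

```python
def integration_level(physical_to_digital_automated, digital_to_physical_automated):
    """Classify a digital counterpart by its automated data flows.

    Both arguments are hypothetical flags introduced for this sketch:
    - physical_to_digital_automated: state changes of the physical object
      update the digital object automatically.
    - digital_to_physical_automated: state changes of the digital object
      are pushed back to the physical object automatically.
    """
    if physical_to_digital_automated and digital_to_physical_automated:
        return "digital twin"      # fully integrated, bidirectional flow
    if physical_to_digital_automated:
        return "digital shadow"    # one-way flow, physical to digital
    # No automated exchange. A digital-to-physical-only flow is not covered
    # by the description above and is also returned as a model here.
    return "digital model"


# An energy model fed automatically by meters, with no write-back to the plant:
level = integration_level(True, False)
```

Under this rule, an energy information model that only receives automated meter data would be classed as a digital shadow, which is why the broader DT definitions discussed next matter for this study.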

Shao and Helu [30] characterized DT perspectives based on the proposed definition, the relevant viewpoint (product, process, or system), the fidelity of the digital representation (complete or partial), and the temporal integration (real-time or offline). An offline DT is one where real-time communication is not critical and a periodic connection is sufficient. The authors argue that the DT concept should depend on the context and viewpoint for the specific use case and propose three critical factors for assessing the scope and requirements of a DT: (1) Application, (2) Viewpoint, and (3) Context. A DT needs only to collect the data relevant to the use case of interest, rather than all available data. The Shao and Helu [30] framework would enable practitioners to consider an energy information model based on data in 15 min steps as a DT application without the real-time bidirectional data-transfer capability. ISO 23247, currently under development, will provide a generic DT manufacturing development framework that considers the context and viewpoint for specific use cases. Yu et al. [31] proposed the Energy Digital Twin (EDT) concept, aiming to manage and optimize site operations to minimize specific energy consumption through four attributes: (1) looks-like, (2) behaves-like, (3) connected-to, and (4) timescale. Yu et al. [31] classified EDT, subtly shifting from Kritzinger et al. [29] by introducing the term “Digital Manager” to describe the connected-to attribute and avoid the requirement for bidirectional transfer. The argument is that many EDT applications, such as plant evolution and retrofit, do not rely on two-way, real-time data communication but should still be counted under the broader DT class. Additionally, enabling communication from the DT back into the production environment to satisfy bidirectional data-transfer requirements may require some disruption to production, which is typically avoided in manufacturing. 
For this research, automated data collection and integration into the digital model is the goal for energy management DT and not bidirectional data transfer, and we will refer to the application as a DT according to the DT definition from Shao and Helu [30] and the EDT definition from Yu et al. [31].

The DT concept has been fully implemented and validated in several applications, though there is a lack of a proven plant-level energy management DT. Haag and Anderl [32] prove the DT concept with a physical twin of a bending beam test bench, a DT CAD model, an MQTT broker to connect the physical and DT, a web-based dashboard, and a finite element method (FEM) simulation. The results demonstrate that the DT concept can be applied to actual systems. A challenge for DT applications is that current CAx models are designed for use in product development and are not designed to live on as DTs, which adapt to the product characteristics throughout its entire operational phase. Additionally, traditional data collection and processing methods are inadequate.

Tao et al. [33] reviewed DT’s state-of-the-art research and development history and summarized its industrial applications. The most popular application area is prognostics and health management (PHM). Energy efficiency or energy monitoring was not explicitly identified. Assad et al. [34] demonstrated the implementation of a web-based DT (WDT) for improving sustainability in industrial cyber-physical systems, proving that the energy management DT is feasible at the machine level. The three steps are (a) accessing the control parameters influencing energy consumption, (b) logging the energy consumption data, and (c) producing predictions using a computational algorithm. An industrial case study in a battery assembly production line demonstrates the WDT architecture, which has WebGL, Node.js, OPC UA server, and PLC as its major components and uses WebSocket as the communication protocol. Our paper proposes an energy management DT framework for the plant level rather than the machine level.

Several notable frameworks have been proposed for implementing an energy management DT. Ref. [35] proposed a conceptual framework for energy management in various contexts based on overarching themes, including strategy/planning, implementation/operation, controlling, organization, and culture. The framework does not go in-depth into data collection or mention the DT concept, but it provides a comprehensive approach that can serve as a basis for development. Vikhorev et al. [36] proposed a framework for energy monitoring and management at the plant floor level, including standards for data exchange, online energy data analysis, performance measurement, and energy usage display. The Vikhorev framework focuses on the Manufacturing Execution System (MES) and complex event processing for providing real-time energy performance information. Data reduction is proposed by extracting key events from large and continuous data streams. The case study is implemented in a prototype information system in a machining line for a major European automotive manufacturer. The result is an evaluation of state recognition by a pattern machine to estimate the average time in different operation modes and the average energy used during idling. Zhang et al. [37] proposed a framework for equipment energy consumption management (EECM) with a DT shop floor consisting of physical equipment, virtual equipment, EECM services, and data, using a machine tool as an example. The potential applications of EECM with DT at shop floors are energy consumption monitoring, analysis, and optimization. They believe that future work in this area will involve DT shop floor case studies and modeling, the fusion of multiple data sources, and establishing an EECM system on the DT shop floor. Wei et al. [38] developed an IoT-based energy management platform for industrial facilities, which includes a use case for measuring all energy consumption within a facility. 
However, it does not consider multiple facilities, is not implemented in an industrial process, and focuses on demand responses. There are several energy DT applications for the shop floor, but there is a lack of a framework for implementing energy DT at the plant level.

2.3. Summary

The literature review and introduction highlight key advancements in energy management practices within the manufacturing sector, emphasizing the crucial role of data-driven approaches in improving energy efficiency. Energy management frameworks are evolving, particularly through the integration of big data analytics and IoT technologies, which facilitate real-time monitoring and analysis of energy consumption. The convergence of these technologies points toward the DT concept, which has been proposed and demonstrated in other fields but not for energy management at the facility level. Existing studies reveal a prevalent reliance on basic predictive models, such as naïve and simple linear regression, which often fail to utilize the full potential of more complex machine learning techniques. The review also highlights the limitations of current energy management software, specifically, their inability to process daily data inputs and utilize advanced predictive analytics. This research fills the need for a more robust energy prediction framework that combines automated data collection with advanced machine learning methodologies to drive effective energy management solutions.

Based on the above-identified research gaps, this research seeks to answer the following questions: (1) Is it feasible to perform energy predictions at the industrial facility level using weekly and daily energy data? (2) Can the accuracy of industrial facility-level energy predictions be improved by considering more advanced ML models such as Principal Component Regression (PCR), K-Nearest Neighbor (KNN), Random Forest (RF), and Bayesian Regularized Neural Network (BRNN)? (3) Given that naïve predictions already demonstrate strong performance, does this indicate that data from one year prior could be used as an additional predictor for enhanced accuracy in facility-level energy predictions? (4) What are the significant challenges associated with implementing an automated energy prediction framework at the facility level?

3. Methodology

3.1. Framework Overview

Figure 1 provides a high-level framework for predicting energy consumption. Program #1 integrates data into a usable format for ML models based on historical energy consumption, weather, and production data. The output of Program #1 is a formatted dataset, along with an initial variable analysis of each data source and scatter plots of energy versus production and energy versus temperature. Program #2 uses that formatted dataset to train ML models. The output of Program #2 is the ML models plus model information, including coefficients, best tuning parameters, and training and holdout results such as the RMSE and $R^{2}$ and their standard deviations. Program #3 uses the created ML models with planned production data and forecasted weather data to predict energy consumption. The output of Program #3 is a dataset of each model’s predictions and calculations of the RMSE and $R^{2}$ if the actual data is available.

This framework could be employed at different time scales and scopes, such as at the area or line level. In our case, we use 15 min energy consumption data and daily weather and production data at the facility level. The electric vehicle manufacturing case described in this paper includes data from five shops: Assembly, Battery, Body Shop (Electric), Body Shop (Gas), and Paint. Sitewide energy is expressed as Assembly + Battery + Body Shop (Electric) + Body Shop (Gas) + Paint. We received 15 min energy data from the utility provider at the end of every week, but this data could be automatically integrated into a communication architecture.
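As an illustration of how 15 min interval readings could be rolled up into the daily shop and sitewide totals described above, the following minimal sketch may help. The `(timestamp, shop, kwh)` record layout, the kWh unit, and the function name are assumptions made for this example; the data shown is synthetic and not from the case study.

```python
from collections import defaultdict
from datetime import datetime, timedelta

SHOPS = ["Assembly", "Battery", "Body Shop (Electric)", "Body Shop (Gas)", "Paint"]


def daily_sitewide_energy(readings):
    """Sum interval readings into daily totals per shop and sitewide.

    `readings` is an iterable of (timestamp, shop, kwh) tuples, a
    hypothetical layout chosen for this sketch.
    """
    daily = defaultdict(lambda: defaultdict(float))
    for timestamp, shop, kwh in readings:
        day = timestamp.date()
        daily[day][shop] += kwh
        daily[day]["Sitewide"] += kwh  # sitewide = sum of the five shops
    return {day: dict(totals) for day, totals in daily.items()}


# Synthetic data: one day of constant 15 min readings (96 intervals) per shop.
start = datetime(2024, 1, 1)
readings = [
    (start + timedelta(minutes=15 * i), shop, 10.0)
    for i in range(96)
    for shop in SHOPS
]
result = daily_sitewide_energy(readings)
```

The same grouping could be keyed by ISO week or calendar month to build the weekly and monthly datasets.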

Weather impacts manufacturing energy consumption, and weather data is available from many sources. Ref. [39] demonstrated that an online web services tool can automatically retrieve and preprocess precipitation data. An Application Programming Interface (API) like DegreeDays.net version 1.4 developed by BizEE Software could be implemented to automate this data retrieval [40].

Energy consumption heavily depends on production volumes. In this case, weekly production data is compiled into daily and monthly datasets for Assembly, Battery, Body Shops, Paint, and Sitewide. Production data will likely be the most challenging automated data collection implementation for this framework.

The ML model training can be performed in a free, open-source statistical environment such as R (version 4.5.1) [41]. In this environment, packages such as caret can be used to develop machine learning models quickly [42]. A thorough overview of statistical learning using R is provided by [43].

3.2. Prediction Models

This section provides explanations of the prediction models. We chose these well-known models because they can be roughly sorted from simple to complex, according to a review of ML models for predictive process monitoring [44]. Their parameters are explained in more detail in [45]. Chosen models include:

Naïve (Average)—A naïve model does not use sophisticated methods to make a prediction and is often used as a benchmark for testing ML models. An average naïve model takes the average of the training dataset and applies it to all future forecasts. If a model cannot achieve a lower root mean square error (RMSE) than the naïve model, it adds no predictive value over simply forecasting the historical average. The $R^{2}$ of an average naïve model is 0.
Naïve (Historical)—A historical naïve model takes data from one year prior and applies it to the future forecast. This is an industry practice that often improves upon the naïve average method.
Linear—A simple linear regression model involving only one variable, in this case, production. The equation for a line of best fit is $y = mx + b$, where $(x, y)$ represents any point that satisfies the equation. The $y$-intercept, $b$, is the $y$-value when $x = 0$. The slope, $m$, is the change in $y$ when $x$ increases by 1.
GLMNET (Net Regularized Generalized Linear Regression Model)—Considers all variables and gives a reasonable estimation of the significant predictors. It fits lasso and elastic-net models for linear, logistic, and multinomial regression using coordinate descent. It is extremely fast, exploits sparsity in the input matrix $x$, and allows a variety of predictions to be made from the fitted model. For alpha = 0, ridge regression is employed, which shrinks the coefficients of correlated predictors toward each other and never fully eliminates predictors. For alpha = 1, lasso regression tends to pick one predictor from a group of correlated predictors and discard the rest. For values between 0 and 1, the two methods are blended. A Generalized Linear Model (GLM) was also fitted, but the results were close to GLMNET, and we chose not to include them.
PCR (Principal Component Regression)—In PCR, principal component analysis is first performed on the original data, then dimension reduction is accomplished by selecting the number of principal components using cross-validation and test error, and finally, regression is conducted using the first $n$ dimension reduced principal components. PCR performs better than previous models on massive datasets and can accurately handle variables like “day of the week” and “month of the year.” Partial Least Squares Regression (PLSR) was also performed in this study, but the results were close to PCR, and we chose not to include them.
KNN (K-Nearest Neighbor)—A non-parametric, supervised learning model that uses proximity to make classifications or predictions about an individual data point. It is typically used as a classification algorithm for pattern recognition, working off the assumption that similar points can be found near one another. For a regression task such as energy prediction, the prediction is the average of the values of the nearest neighbors. It can sometimes perform better on large datasets than previous models if there is an understanding to be developed based on neighborhoods or groupings that simple linear regression cannot determine.
Random Forest (RF)—Random Forest is a popular ML algorithm that combines the output of decision trees to reach a result. It is popular due to its ease of use, flexibility, and ability to handle classification and regression problems.
Bayesian Regularized Neural Net (BRNN)—A neural network that incorporates posterior inference to reduce overfitting and can be trained based on just one parameter, the number of neurons.
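To make the simpler models in the list above concrete, the following minimal sketch implements the two naïve benchmarks, single-variable least squares, and a one-dimensional KNN regressor using only the Python standard library. It is not the study's implementation (which the text describes as R with caret); all function names and the numbers are illustrative assumptions.

```python
from statistics import mean


def naive_average(train_energy, n_periods):
    """Forecast the training mean for every future period."""
    avg = mean(train_energy)
    return [avg] * n_periods


def naive_historical(energy_by_period, period, lag=12):
    """Forecast the value observed `lag` periods earlier.

    With monthly data, lag=12 corresponds to one year prior.
    """
    return energy_by_period[period - lag]


def fit_linear(production, energy):
    """Ordinary least squares for y = m*x + b with one predictor."""
    x_bar, y_bar = mean(production), mean(energy)
    sxx = sum((x - x_bar) ** 2 for x in production)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(production, energy))
    m = sxy / sxx
    b = y_bar - m * x_bar
    return m, b


def knn_regress(train_x, train_y, query, k=3):
    """Average the energy of the k training points closest in production."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: abs(p[0] - query))[:k]
    return mean(y for _, y in nearest)


# Synthetic data: energy = 1000 + 2 * production (made-up numbers).
production = [100, 200, 300, 400]
energy = [1200, 1400, 1600, 1800]

m, b = fit_linear(production, energy)
linear_forecast = m * 250 + b
knn_forecast = knn_regress(production, energy, query=250, k=2)
average_forecast = naive_average(energy, n_periods=3)
```

On this perfectly linear toy data the linear and KNN forecasts at a production level of 250 coincide; on real plant data they would generally differ.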

3.3. Determining the Best Model

To determine the best model, we used the $R^{2}$ and RMSE metrics to compare them. $R^{2}$ is a “goodness-of-fit” statistical measure representing the proportion of the variance for a dependent variable explained by an independent variable in a regression model. The higher the $R^{2}$ value, the stronger the relationship between the dependent and independent variables. If the $R^{2}$ of a model is 0.5, then approximately half of the observed variation can be explained by the model’s inputs. What qualifies an $R^{2}$ value as “good” depends on context. Naïve (average) models have an $R^{2}$ value of 0 associated with them because their constant predictions explain none of the variance. The root mean square error (RMSE) is the standard deviation of the residuals (prediction errors). Residuals measure the distance between data points and the regression line. Essentially, the RMSE tells you how concentrated data is around the line of best fit. The lower the RMSE, the better a given model can “fit” a dataset.
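The two metrics can be sketched in a few lines. The example below assumes the variance-explained form of $R^{2}$, that is, $1 - SS_{res}/SS_{tot}$, which is one common definition and is not necessarily the one used in the study; the data is synthetic.

```python
from math import sqrt
from statistics import mean


def rmse(actual, predicted):
    """Root mean square error of the residuals."""
    return sqrt(mean((a - p) ** 2 for a, p in zip(actual, predicted)))


def r_squared(actual, predicted):
    """Variance-explained R^2: 1 - SS_res / SS_tot (assumed definition)."""
    a_bar = mean(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - a_bar) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


actual = [10.0, 12.0, 14.0, 16.0]

# A naive (average) model predicts the mean everywhere, so R^2 is 0.
mean_prediction = [mean(actual)] * len(actual)
naive_r2 = r_squared(actual, mean_prediction)
naive_rmse = rmse(actual, mean_prediction)

# A model that tracks the data more closely has lower RMSE and higher R^2.
close_prediction = [10.5, 11.5, 14.5, 15.5]
close_r2 = r_squared(actual, close_prediction)
close_rmse = rmse(actual, close_prediction)
```

Here the mean prediction is evaluated on the same data it was computed from, which is the situation in which its $R^{2}$ is exactly 0.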

Generally, the RMSE should be prioritized over $R^{2}$ in model selection because having a smaller error is preferable to having the right shape. This is apparent in several of the naïve historical models, where a high $R^{2}$ is calculated because the shape is correct, but a large RMSE exists because production has increased. In parameter selection, strict requirements for $R^{2}$ are appropriate, such that 0.85 or higher is required, but for model selection, lower $R^{2}$ can sometimes be appropriate if the RMSE is a small percentage of the total.
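
The “right shape, wrong level” case can be illustrated with a small sketch. It assumes that $R^{2}$ is computed as the squared correlation between predicted and actual values, which is one reading consistent with a historic model scoring a high $R^{2}$ alongside a large RMSE; the text does not confirm this. The function names and the energy values are hypothetical.

```python
import math

def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def r2_correlation(actual, predicted):
    """R^2 as the squared Pearson correlation (an assumption, see lead-in)."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov ** 2 / (var_a * var_p)

# Hypothetical energy values: this year follows last year's pattern exactly,
# but production growth has shifted every value up by 100 units
actual = [500.0, 520.0, 480.0, 510.0]
last_year = [a - 100.0 for a in actual]

shape_score = r2_correlation(actual, last_year)  # close to 1: the shape matches
error = rmse(actual, last_year)                  # 100: every prediction is off by the shift
```

A constant offset leaves the correlation untouched while adding directly to the RMSE, which is why the RMSE is the safer primary criterion when production levels change from year to year.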

A general rule of thumb is that, given the choice between two equally fitting models (i.e., all else being equal), it is advisable to select the simpler or more parsimonious model. This principle aligns with the parsimony criterion in model selection, also known as Occam’s razor, which suggests that simpler models should be preferred when the predictive accuracy of more complex alternatives does not significantly surpass theirs. In general, model parsimony is a function of the number of estimated parameters and other factors, including model configuration [46]. The models in Section 3.2 are organized in roughly increasing complexity, but a detailed parsimony analysis is outside the scope of this research. The results section will highlight the model in each case with the highest $R^{2}$ and lowest RMSE, but no specific model is selected as the “best” because that is context-based.

3.4. Experimental Processes

Initial variable testing was performed for dry bulb temperature, heating degree days, and cooling degree days. The decision to only include average dry bulb temperature was made to simplify the models. Heating and cooling degree days are helpful in many energy management processes, but here, when included as a predictor, it was unclear which variable would end up in the model parameters. Also, predicting heating and cooling degree days is not as straightforward as predicting dry bulb temperature. Dry bulb temperature is used as a predictor for weather in all models besides naïve and linear. From here on, the term “temperature” refers to the average dry bulb temperature.

Production and temperature are used in every model except the linear model, which only uses production, and the naïve historic and average models, which do not use either. Week of the year, month, and day of the week were included in PCR, KNN, RF, and BRNN models unless the program gave an error stating that the predictors were rank-deficient, meaning they were linearly dependent on other predictors and therefore did not contribute to the model’s accuracy. Time periods with a “B” signify that an additional predictor, “Energy Data from Last Year”, is added. Including this predictor reduces the dataset size because, for 2023, we did not have energy data from the previous year. A summary of the predictors used by each model is given in Table 1.

For the shop-level models, the predictors include production data, temperature, month, week of the year (for weekly and daily models), and day of the week (for daily models). For the sitewide models, the predictors include production data for each shop and sitewide, temperature, month, week of the year for daily and weekly models, and day of the week for daily models. Month and week of the year were almost always excluded, but the day of the week was included in most daily models. A summary of the predictors used by the shop area is given in Table 2.

The weekly and daily models were cross-validated ten times, while the monthly models had three cross-validations. Because the monthly models only have 12 data points per year, many complex monthly models struggle initially due to the smaller dataset. The RF would not function for any monthly sitewide models except in scenarios where we had more than two years of data. For this reason, it is not included in the monthly sitewide analysis. Because of the limited cross-validation, the data confidence for the monthly models is low. The data was randomly partitioned into 75% training and 25% holdout to ensure best practices, but holdout results are not included here.
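
The partitioning described above can be sketched as follows, assuming that “cross-validated ten times” and “three cross-validations” refer to k-fold cross-validation with k = 10 and k = 3. The function names, the random seed, and the row counts (two years of monthly and weekly data) are hypothetical.

```python
import random

def holdout_split(n_rows, train_fraction=0.75, seed=0):
    """Randomly partition row indices into training and holdout sets."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    cut = int(n_rows * train_fraction)
    return indices[:cut], indices[cut:]

def k_folds(indices, k):
    """Yield (training, validation) index lists; each fold validates once."""
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        training = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield training, validation

# Hypothetical sizes: 24 monthly rows and 104 weekly rows (two years each)
monthly_train, monthly_holdout = holdout_split(24)
weekly_train, weekly_holdout = holdout_split(104)

monthly_folds = list(k_folds(monthly_train, k=3))
weekly_folds = list(k_folds(weekly_train, k=10))
```

With two years of monthly data, each of the three folds trains on only 12 rows and validates on 6, which illustrates why the more complex monthly models struggle and why confidence in the monthly results is low.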

For the results labeled 2023 Q1, the models were trained on data from 2022 and then tested against the first quarter of 2023. For 2023 Q4 A, the training data is from 2022 and 2023 up to 1 September 2023, and the models were tested against the last quarter of 2023. For 2024 Q1 A, the training data is from 2022 and 2023, and the models were tested against the first quarter of 2024. For 2023 Q4 B and 2024 Q1 B, we included energy data from the previous year as a predictor. This reduced the size of the training dataset because we lacked data from 2021. For 2023 Q4 B, the training data is from 2023 up to 1 September 2023 and does not include any training data from 2022 because the “Energy Data from Last Year” predictor has no data from 2021. For 2024 Q1 B, the training data is from 2023, and the models were tested against the first quarter of 2024. The monthly and weekly models would not run when we first tested in 2022 Q4, so at least one year of data is needed to perform this analysis.
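
The effect of the “Energy Data from Last Year” predictor on the training set size can be sketched as follows. This is an illustration under assumed data structures, not the code used in the study: the record layout, the field names, the function `add_last_year_energy`, and the energy values are all hypothetical.

```python
def add_last_year_energy(rows):
    """Attach the same week's energy from the previous year.

    Rows with no matching record one year earlier are dropped, because the
    predictor would be missing for them.
    """
    energy_by_week = {(r["year"], r["week"]): r["energy_kwh"] for r in rows}
    enriched = []
    for r in rows:
        previous = energy_by_week.get((r["year"] - 1, r["week"]))
        if previous is not None:
            enriched.append({**r, "energy_last_year_kwh": previous})
    return enriched

# Hypothetical weekly records for 2022 and 2023 (values are illustrative only)
rows = [
    {"year": year, "week": week, "energy_kwh": 2_000_000.0 + 1_000.0 * week}
    for year in (2022, 2023)
    for week in range(1, 53)
]

variant_a = rows                        # "A": all rows, no lagged predictor
variant_b = add_last_year_energy(rows)  # "B": only rows that have a prior year
```

In this sketch the “B” variant keeps only the 2023 rows, since no 2021 records exist to supply the lagged value for 2022, so the training set is halved in exchange for one additional predictor.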

4. Results

This section presents the daily, weekly, and monthly prediction results for sitewide energy, along with the energy prediction performance for each shop.

4.1. Sitewide Energy Prediction

The sitewide energy metric is derived from the readings of six different shop meters. Some significant processes, equipment, and even whole shops were added between 2022 and 2023. Production and/or temperature are used in every model except naïve historic and average models. The accuracy of their predictions is included in Appendix A.

Table 3, Table 4 and Table 5 present the sitewide energy prediction results at the daily, weekly, and monthly frequencies. In each table, the unit for energy consumption prediction RMSE is kWh. The Historic 2023 Q1 training parameters do not exist because training data was not available to compare against from 2022. “Training” refers to the model’s accuracy against the training dataset, calculated through cross-validation. “Actual” signifies the prediction accuracy in comparison with real values that were collected after model training. “B” signifies that the “Energy Data from Last Year” predictor is included, while “A” signifies that it is not. Figure 2, Figure 3, Figure 4 and Figure 5 graphically summarize the results from Table 3, Table 4 and Table 5. In these figures, the Naïve models include historical and average, the Standard models include linear and GLM without the “Energy Data from Last Year” predictor, and the Proposed models include PCR, KNN, RF, and BRNN with and without the “Energy Data from Last Year” predictor.

Daily sitewide results are given in Table 3. The average sitewide daily energy consumption was 363,040 kWh, and the RMSE standard deviation was around 5000 kWh. Prediction accuracy was above 0.85 $R^{2}$ in 2023 Q1, less than 0.05 in 2023 Q4, and greater than 0.75 $R^{2}$ in 2024 Q1. This result also played out in the assembly and paint processes and is assumed to result from low production prediction accuracy for 2023 Q4. Adding the energy data from one year ago as a predictor, shown in 2023 Q4 B and 2024 Q1 B, improved the prediction accuracy. The ML models improved on the naïve results in 2023 Q1 and 2024 Q1 but not in the 2023 Q4 analyses. In these results tables, the yellow cell fill color indicates the lowest RMSE model when compared against actual data, and the orange cell fill color indicates the highest $R^{2}$ model.

The results from Table 3 are graphically summarized in Figure 2, which demonstrates the possible value of the proposed method and the poor performance in 2023 Q4.

Table 4 shows weekly sitewide results. The average weekly energy consumption was 2,355,600 kWh, and the standard deviation of the RMSE was around 90,000 kWh. Prediction accuracy was not as bad in 2023 Q4 as in other models. As in previous models, the RMSE decreased from 2023 Q1 to 2023 Q4 and increased from 2023 Q4 to 2024 Q1, but not as severely. Comparing 2023 Q4 A with 2023 Q4 B shows that prediction accuracy did not improve based on the actual or training results from adding the energy data from the previous year’s predictor. The RMSE of the Historic model in 2023 Q1 was very good and could not be matched, but the ML models improved upon the linear model during that period. The ML models improved on the naïve models in all other time periods. In 2024 Q1 A and B, the ML models beat the linear model only once. The sitewide model uses more predictors because it includes production data from each shop. This seems to translate to improved prediction accuracy, especially in training. In 2023 Q1, the prediction accuracy was several standard deviations better than the naïve and linear models. The sitewide weekly ML models could be used for annual predictions, such as those required for ISO 50001. These results show that the naïve and linear models can have large and unpredictable errors.

In order to demonstrate the value of the Proposed method, Figure 3 highlights the best performing weekly models according to time period and their model type. The existing commercial and research model does not allow for weekly time period predictions, but we consider linear and GLM models as the Standard for comparison purposes.

Sitewide monthly model results are given in Table 5. This is the time scale at which the industry typically performs yearly predictions for ISO 50001 certification. The average sitewide monthly energy consumption was 11,012,000 kWh for 2023, and the standard deviation of the RMSE was 500,000 kWh. For the ML models, prediction accuracy compared to actual results decreased from 2023 Q1 to 2023 Q4 and increased from 2023 Q4 to 2024 Q1. Prediction accuracy was not as terrible in 2023 Q4 compared to other time periods, with an $R^{2}$ above 0.75 and an RMSE less than 15% of the average monthly energy consumption. The ML models improved on the naïve models in every period besides 2023 Q1, and on the linear model in every period besides 2024 Q1 A. The RF model ran in 2024 Q1 A for the first time in a monthly model, but it was the worst-performing model, and the results are not included. RF is not a good candidate for initial ML testing in monthly models because it takes more than two years of available data to perform. The BRNN did perform in 2024 Q1 B because it had a higher dimensionality with all the individual shop production data plus the “energy from last year” predictor, and it had the best prediction by the RMSE.

The RMSE values of the best-performing monthly models from Table 5 are given in Figure 4. The purpose of including this comparison is to highlight the value of the proposed method. The best method depends on context, but this shows that the process of comparing different models leads to the best result. In 2024 Q1, the Proposed method more than halved the standard error compared to Standard and Naïve.

4.2. Shop-Level Energy Prediction

The best prediction models vary across prediction frequencies and shops. Table 6 summarizes the most accurate models and their $R^{2}$ for each shop and prediction frequency. The “best model” is chosen based on the lowest RMSE against the actual data because this indicates the model with the lowest expected error. It might be the case that another model besides the one we chose as the “best model” had a higher $R^{2}$. Battery and Body (Electric) shops had just come online in 2023 Q1, so there are no predictions. Models with a (B) indicate the “Energy Data from Last Year” predictor was included. The monthly models in 2023 Q4 did not work with the “Energy Data from Last Year” predictor because there was not enough training data. Out of the 48 best models, 4 were Historic, 1 was Average, 10 were Linear, 5 were GLMNET, 6 were PCR, 5 were RF, 7 were KNN, and 10 were BRNN. The strongest evidence that complex models can outperform simple models is in Sitewide 2024 Q1 where the BRNN model performed two standard deviations better than the linear or naïve ones. Historical models only performed well when there was not much data available, and energy ended up being very similar to the year before, even though production was predicted to change. Of the 30 models where the “Energy Data from Last Year” predictor could be included, there were 17 times where the best model included the predictor. The monthly models had the highest $R^{2}$, but there were instances such as Paint 2023 Q4 and Sitewide 2023 Q4 where the weekly models performed better than the monthly.

5. Discussion

5.1. Framework

Figure 5 shows an ideal application of the framework proposed in Figure 1. In this case, internal data, such as production history and production predictions, is automatically acquired, organized, and stored within a unified namespace (UNS), allowing access to data anywhere it is needed through a common data interface. David et al. [47] present the unified namespace architecture concept for integrating business data. Outside data, such as weather, is automatically acquired through HTTP/API.

Figure 5. Ideal situation of the energy prediction framework.

With good production data, the framework can be applied at various levels within the plant. For example, it could be applied at the area, line, or cell level, given that all equipment has good data. In these scenarios, temperature may or may not be a significant predictor, but others may reveal themselves through further research.

5.2. Challenges

Accurate production data and production predictions constitute a significant challenge. At one point, production data was provided in a different format with values that differed from those previously received. We determined that the reason was that one dataset was from the production line, and the other was derived. Production data must be consistently formatted, reliable, and of high quality. Also, production prediction accuracy was much better at the beginning of the year than in the last quarter. This result suggests that energy managers should exercise caution when predicting production.

Energy data quality and reliability were two more challenges in implementing this research. There were several occasions when data was unavailable when expected, or there were small gaps that had to be filled by the researchers. The utility fixed these gaps quickly, but gaps of this kind could prevent a real-time energy digital twin from functioning. The manufacturer’s capability to receive automated energy data weekly and view it in real-time on a shop level was excellent and bodes well for the future of energy data in manufacturing. However, it needs work to be more reliable.

Another challenge was the amount of human labor involved. This research aimed to reduce the human input needed in energy prediction. However, automating the data retrieval, ML model training, and prediction resulting in a facility-level EDT requires many human hours. The following list details the tasks that need to be automated for the automated energy prediction framework:

Data transmission and retrieval
Data formatting
ML model training and retraining
Energy consumption predictions

One reason for the large amount of human labor was that we tested several ML models. Research is needed to determine which ML model will work best for your application. In our case, we could not find any study that had implemented an ML model to predict energy consumption in the industrial sector at the plant level. The authors knew from personal experience that manufacturers use historic or average naïve models in many cases and sometimes use basic linear regression with production or multiple linear regression with production and temperature, but we had not seen any cases of using a more complex ML model. In the future, the RF will be our first candidate for removal from this process, especially for the monthly models, though it seems to perform well for the daily paint models. The next candidate would be GLM because the GLMNET is similar and improved upon the GLM in almost every model. PCR and PLSR would be our next candidates for removal from the process because they were not the best performing.

The amount of machine learning knowledge required could be challenging for real-world applications. The practitioner needs a fundamental understanding of automated data retrieval, machine learning model selection, tuning parameters, and what the results mean for energy management.

5.3. Lessons Learned

In some situations, historic and average models perform well, mainly when the system is steady, no new processes or equipment are added, and production is stable. Table 5 shows a case where historic and average models have high and low prediction errors. The problem is that there is no confidence in your prediction. Manufacturers that rely on historic and average models for predictions typically do not follow good data science practices, such as maintaining a holdout dataset or testing variables. The historic and average models are sometimes necessary for a quick estimate but can easily be improved. Manufacturers might view energy consumption prediction for energy management as unimportant and want to make the quickest estimate. The authors urge the industry to consider this a chance to improve SM capability and inform predictions.

Daily models benefit from abundant data and performed well in 2023 Q1, with $R^{2}$ above 0.7. However, their performance decreased significantly in 2023 Q4 to the point of having $R^{2}$ approaching zero. This is likely because production prediction accuracy decreases with a more extended forecast period. Also, daily datasets were derived from the weekly data provided, and if there was a shutdown or missed production target, that production was shifted to the next day. To make a good prediction, you should first have good data to input. The daily model’s application in practice will not be the same as in this research, over months or years, but rather testing for a specific scenario of a day or week. Still, the daily model results show that more flexible ML models are available with more data. One strength of the daily models is that the more complex models can run with less than a year of data.

The BRNN has high complexity and promising initial results. It works with smaller datasets than the RF or KNN. We chose the BRNN model because, compared to other neural net models, it requires less tuning. It worked well with the sitewide models with many predictors but was also the best-performing of several shop models. The KNN also had promising initial results in many of the models.

6. Conclusions

This paper proposes an energy prediction framework supported by automated data collection and machine learning (ML) models. The framework was developed at the industrial facility level, with much less existing facility energy prediction research than in the commercial sector. The novelty of this framework lies in its combination of data collection and integration for smaller-time-period energy predictions, as well as its automation of a process outlined in the ISO 50001 standard. The framework’s ability to provide near-real-time and accurate energy predictions can help manufacturers identify energy-saving opportunities, gain a deeper understanding of their processes, and make informed, data-driven decisions.

The proposed framework and research advance the state of the art in energy management. It enables manufacturers to move beyond the limitations of traditional energy management processes, which rely on inconsistent human input and do not perform energy variable analysis, and move into smart manufacturing capabilities, such as Energy Digital Twin (EDT), through automated data collection and integration into a digital model. The current state-of-the-art energy prediction research does not consider artificial intelligence prediction models or smaller time-period predictions. This study investigated more sophisticated models, including principal component regression (PCR), K-Nearest Neighbor (KNN), Random Forest (RF), and Bayesian Regularized Neural Net (BRNN). The framework’s simplicity and adaptability allow it to be applied to various manufacturing processes and industries. A requirement of the framework is accurate and available production data and predictions. The framework was tested by performing energy consumption predictions in a large automotive original equipment manufacturer using daily, weekly, and monthly datasets for specific automotive shops and sitewide.

The results proved that ML models could be applied to industrial facility-level energy predictions at weekly and daily time scales. Complex models, such as random forest and principal component regression, were sometimes, but not always, more accurate than simpler models, such as naïve approximation and linear regression. Naïve models, which have no predictor and assume that what happened in a previous time period will happen again, do not provide confidence and should be avoided as the sole prediction method, even though they sometimes provide accurate estimates. Instead, variable analysis should be performed to identify relevant predictors and to understand what drives energy consumption. The research tested “meta” variables such as the day of the week, week of the year, month, and energy data from the previous year and found that including them often increased prediction accuracy. The major challenges identified include collecting reliable, high-quality energy and production data and the required ML expertise.

For manufacturers, the effort is better spent improving data quality and reliability, performing variable analysis, and automating the data collection framework than optimizing model selection. For future research, the authors recommend investigating different artificial intelligence methods in larger datasets, implementing the automated data collection framework, developing use cases besides ISO 50001 certification, and testing other predictors besides production, dry bulb temperature, week of the year, month, and day of the week. A sensitivity analysis of temperature prediction should also be considered. The authors used the historical average for temperature to make predictions, but this does not consider extreme weather, so different prediction sets could be developed for extreme weather.

Author Contributions

Conceptualization, D.V. and M.J.; methodology, D.V. and M.J.; validation, D.V.; formal analysis, D.V.; investigation, D.V.; resources, M.J.; data curation, D.V.; writing—original draft preparation, D.V.; writing—review and editing, M.J., T.W., and C.P.; supervision, M.J. and S.N.; project administration, M.J.; funding acquisition, T.W. and S.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the U.S. Department of Energy through Oak Ridge National Lab, project number PR12643.

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Production and Temperature Prediction

Appendix A.1. Production Prediction

Table A1 gives the production prediction’s $R^{2}$, which measures how accurate the partner manufacturer’s forecasts for production were. The projections were conducted at the beginning of each year, so the forecast lead time was longer for the 2023 Q4 predictions than for 2023 Q1 and 2024 Q1. Monthly and weekly production estimates were much more accurate than daily, which makes sense because the daily forecasts were derived. Daily production is more volatile due to many factors, such as maintenance, shift change, and weather. In addition, the company might miss daily goals but would make up for them through overtime on the weekend to still meet weekly and monthly goals.

Table A1. Summary of production prediction accuracy.

Shop	Period	2023 Q1 $R^{2}$	2023 Q4 $R^{2}$	2024 Q1 $R^{2}$
Assembly	Daily	0.92	0.03	0.91
	Weekly	0.88	0.52	0.98
	Monthly	1.00	0.04	1.00
Battery	Daily	0.96	0.07	0.69
	Weekly	0.88	0.08	0.88
	Monthly	1.00	0.00	0.94
Body (Electric)	Daily	0.80	0.06	0.60
	Weekly	0.76	0.28	0.52
	Monthly	1.00	0.02	0.74
Body (Gas)	Daily	0.88	0.03	0.85
	Weekly	0.74	0.32	0.97
	Monthly	1.00	0.00	1.00
Paint	Daily	0.91	0.02	0.90
	Weekly	0.88	0.52	0.97
	Monthly	1.00	0.05	1.00
Sitewide	Daily	0.92	0.03	0.91
	Weekly	0.86	0.53	0.96
	Monthly	1.00	0.06	0.99

Appendix A.2. Temperature Prediction

Table A2 gives the temperature prediction’s $R^{2}$, which measures how accurate our forecasts for temperature were. We used the historical averages from NCEI U.S. Daily Climate Normals (2006–2020) closest to the partner company’s location to forecast future temperatures [48]. For monthly data, the $R^{2}$ values are high enough to use for predictions comfortably. However, there are low $R^{2}$ values for some daily and weekly predictions, so it is up to the data scientists running the models to determine whether these forecasts are appropriate. This study assumes that the temperature prediction accuracy did not significantly affect energy consumption prediction accuracy because the RMSE is small, and the temperature predictor importance was less than 20%.

Table A2. Summary of temperature prediction accuracy.

Period	2023 Q1 $R^{2}$	2023 Q4 $R^{2}$	2024 Q1 $R^{2}$
Daily	0.26	0.83	0.42
Weekly	0.50	0.92	0.62
Monthly	0.92	0.99	0.86

Appendix B. Assembly and Painting Shop Energy Prediction

The assembly process is more hand-tool-driven than robotic. No significant equipment changes occurred between 2022 and 2023, but production increased, particularly on the electric vehicle line. The daily results of the assembly process are given in Table A3. The average daily energy consumption was 58,600 kWh, and the standard deviation of the RMSE was ~1000 kWh. Prediction accuracy decreased from 2023 Q1 to 2023 Q4 and improved for 2024 Q1. Prediction accuracy decreased when comparing Daily 2023 Q4 and Daily 2024 Q1 A and B, where B uses the energy data from one year ago as a predictor. The ML models were able to improve on the naïve models but could not improve on the linear model in 2023 Q1 or 2024 Q1.

Table A4 shows the weekly results of the assembly process. The average weekly energy consumption of the assembly shop was 408,000 kWh in 2023, and the standard deviation of the RMSE was about 13,000 kWh. Prediction accuracy decreased from 2023 Q1 to 2023 Q4, though not as severely as in Table A1 for daily prediction, and then improved for 2024 Q1. Adding the data from one year prior as a predictor in 2023 Q4 improved the training performance but had mixed results when used for prediction. The ML models improved on the naïve and linear models in 2023 Q1 and 2023 Q4 but not in 2024 Q1.

Table A3. Results of daily assembly shop model training.

Time Period	Parameter	Model Type
Time Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	RF	BRNN
2023 Q1	RMSE (Training)		15,312	7657	4733	4775	9920	11,266	4534
	$R^{2}$ (Training)		0.00	0.76	0.91	0.91	0.73	0.89	0.91
	RMSE (Actual)	16,080	13,191	7300	7627	7679	10,659	9617	7834
	$R^{2}$ (Actual)	0.73	0.00	0.83	0.85	0.85	0.79	0.86	0.83
2023 Q4 A	RMSE (Training)	17,409	15,396	7965	5783	5756	7260	5734	6220
	$R^{2}$ (Training)	0.24	0.00	0.73	0.86	0.86	0.78	0.87	0.83
	RMSE (Actual)	16,436	19,472	24,411	18,633	18,608	14,538	18,757	23,882
	$R^{2}$ (Actual)	0.39	0.00	0.00	0.09	0.09	0.36	0.08	0.00
2023 Q4 B	RMSE (Training)	17,409	15,396	7965	6001	4780	6298	4267	3897
	$R^{2}$ (Training)	0.24	0.00	0.73	0.84	0.90	0.80	0.93	0.93
	RMSE (Actual)	16,436	19,472	24,411	21,822	21,271	13,972	18,950	23,251
	$R^{2}$ (Actual)	0.39	0.00	0.00	0.02	0.01	0.40	0.08	0.01
2024 Q1 A	RMSE (Training)	16,988	15,908	9575	6740	10,356	13,893	8471	6568
	$R^{2}$ (Training)	0.32	0.00	0.63	0.83	0.61	0.40	0.84	0.84
	RMSE (Actual)	20,120	16,804	8321	10,044	13,692	19,672	14,192	9985
	$R^{2}$ (Actual)	0.03	0.00	0.77	0.81	0.64	0.61	0.78	0.81
2024 Q1 B	RMSE (Training)	16,988	15,908	9575	5302	5263	7573	4659	4923
	$R^{2}$ (Training)	0.32	0.00	0.63	0.89	0.89	0.80	0.92	0.91
	RMSE (Actual)	20,120	16,804	8321	8441	8341	12,438	9249	9355
	$R^{2}$ (Actual)	0.03	0.00	0.77	0.81	0.81	0.70	0.85	0.84

Table A4. Results of weekly assembly shop model training.

Time Period	Parameter	Model Type
Time Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	RF	BRNN
2023 Q1	RMSE (Training)		52,182	35,366	33,472	30,211	26,820	32,923	24,567
	$R^{2}$ (Training)		0.00	0.66	0.70	0.66	0.74	0.75	0.67
	RMSE (Actual)	75,220	34,297	31,968	21,609	27,232	34,451	31,692	36,957
	$R^{2}$ (Actual)	0.18	0.00	0.35	0.42	0.45	0.12	0.28	0.09
2023 Q4 A	RMSE (Training)	78,096	56,150	38,932	32,244	33,718	29,433	31,986	31,164
	$R^{2}$ (Training)	0.21	0.00	0.45	0.69	0.64	0.73	0.73	0.71
	RMSE (Actual)	75,803	103,924	83,733	70,474	68,693	73,895	70,567	67,713
	$R^{2}$ (Actual)	0.67	0.00	0.23	0.70	0.72	0.63	0.78	0.79
2023 Q4 B	RMSE (Training)	78,096	56,150	38,932	29,435	29,515	28,280	36,526	24,926
	$R^{2}$ (Training)	0.21	0.00	0.45	0.93	0.93	0.89	0.42	0.72
	RMSE (Actual)	75,803	103,924	83,733	61,223	57,416	74,220	83,150	49,462
	$R^{2}$ (Actual)	0.67	0.00	0.23	0.72	0.73	0.66	0.61	0.78
2024 Q1 A	RMSE (Training)	77,134	66,623	57,635	33,170	32,774	30,512	34,024	32,100
	$R^{2}$ (Training)	0.42	0.00	0.42	0.71	0.71	0.80	0.75	0.76
	RMSE (Actual)	86,288	88,004	36,587	68,900	70,096	61,070	65,864	50,578
	$R^{2}$ (Actual)	0.38	0.00	0.83	0.75	0.75	0.79	0.74	0.84
2024 Q1 B	RMSE (Training)	77,134	66,623	57,635	30,910	30,910	32,955	37,381	25,074
	$R^{2}$ (Training)	0.42	0.00	0.42	0.79	0.79	0.79	0.75	0.85
	RMSE (Actual)	86,288	88,004	36,587	67,905	74,861	61,521	73,195	81,244
	$R^{2}$ (Actual)	0.38	0.00	0.83	0.74	0.74	0.80	0.59	0.84

Table A5 gives the monthly results of the assembly process. The average monthly energy consumption of the assembly shop was 1,750,000 kWh, and the standard deviation of the RMSE was about 100,000 kWh. Prediction accuracy decreased from 2023 Q1 to 2023 Q4 and increased to a high level in 2024 Q1. “Energy Data from Last Year” was not added as a predictor to 2023 Q4 because there was not enough training data, but it was added for 2024 Q1 B, which explains why the KNN and BRNN models did not work for 2024 Q1 B. The RF model did not work in any of the time periods. KNN became available with more than one year of data. ML models significantly improved accuracy compared to naïve and linear in 2023 Q1, insignificantly improved the RMSE in 2023 Q4, and did not improve against linear in 2024 Q1.

Table A5. Results of monthly assembly shop model training.

Time Period	Parameter	Model Type
Time Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	BRNN
2023 Q1	RMSE (Training)		197,900	240,919	227,904	199,246
	$R^{2}$ (Training)		0.00	0.65	0.78	0.71
	RMSE (Actual)	309,741	326,770	141,556	56,614	55,116
	$R^{2}$ (Actual)	0.18	0.00	0.80	0.75	0.76
2023 Q4	RMSE (Training)	245,553	113,581	160,298	118,651	113,108	95,892	112,953
	$R^{2}$ (Training)	0.85	0.00	0.77	0.71	0.73	0.81	0.49
	RMSE (Actual)	342,804	398,250	374,923	338,428	333,659	313,087	347,613
	$R^{2}$ (Actual)	1.00	0.00	0.15	0.67	0.61	0.65	0.71
2024 Q1 A	RMSE (Training)	263,020	205,093	237,222	121,327	118,610	141,539	149,992
	$R^{2}$ (Training)	0.60	0.00	0.99	0.76	0.81	0.63	0.64
	RMSE (Actual)	217,806	151,258	59,200	198,204	210,728	249,189	206,641
	$R^{2}$ (Actual)	0.25	0.00	0.95	0.93	0.91	0.88	0.93
2024 Q1 B	RMSE (Training)	263,020	205,093	237,222	204,262	198,608
	$R^{2}$ (Training)	0.60	0.00	0.99	1.00	0.99
	RMSE (Actual)	217,806	151,258	59,200	108,648	100,846
	$R^{2}$ (Actual)	0.25	0.00	0.95	0.94	0.94

The painting process is almost entirely robotic. No significant process changes occurred between 2022 and 2023, but production increased, and robotics were added. Table A6 shows the daily prediction results for the paint process. The average daily energy consumption of the paint shop was 137,000 kWh in 2023, and the standard deviation of the RMSE was about 3500 kWh. Prediction accuracy fell significantly from 2023 Q1 to 2023 Q4 but improved in 2024 Q1. The most likely cause is the larger time period between production predictions. Adding the energy data from one year ago as a predictor had mixed results when comparing Daily 2023 Q4 A with Daily 2023 Q4 B. The ML models improved on the naïve and linear in most models, but it is unclear which model will be best in advance. Comparing 2023 Q4 A and B and 2024 Q1 A and B, in this case, it is better to have fewer data points and add the “Energy Data from Last Year” predictor.

Table A6. Results of daily paint shop model training.

Time Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	RF	BRNN
2023 Q1	RMSE (Training)		44,368	19,378	20,837	20,384	30,314	16,229	17,829
	$R^{2}$ (Training)		0.00	0.80	0.77	0.78	0.71	0.87	0.82
	RMSE (Actual)	48,017	38,083	18,475	17,080	17,804	23,335	15,754	17,468
	$R^{2}$ (Actual)	0.63	0.00	0.86	0.86	0.86	0.86	0.90	0.89
2023 Q4 A	RMSE (Training)	48,959	42,474	19,001	18,980	18,812	15,731	22,087	15,588
	$R^{2}$ (Training)	0.10	0.00	0.81	0.80	0.80	0.86	0.83	0.87
	RMSE (Actual)	44,075	42,849	60,281	54,436	54,818	59,459	41,851	58,686
	$R^{2}$ (Actual)	0.37	0.00	0.00	0.01	0.01	0.02	0.11	0.00
2023 Q4 B	RMSE (Training)	48,959	42,474	19,001	16,465	16,398	21,848	13,160	15,536
	$R^{2}$ (Training)	0.10	0.00	0.81	0.82	0.82	0.65	0.88	0.83
	RMSE (Actual)	44,075	42,849	60,281	58,745	59,753	33,828	49,966	55,902
	$R^{2}$ (Actual)	0.37	0.00	0.00	0.00	0.00	0.34	0.03	0.00
2024 Q1 A	RMSE (Training)	46,950	38,894	20,466	21,258	21,216	28,645	17,084	17,585
	$R^{2}$ (Training)	0.20	0.00	0.71	0.77	0.77	0.59	0.85	0.84
	RMSE (Actual)	51,195	45,512	34,246	52,924	52,962	42,894	41,670	40,496
	$R^{2}$ (Actual)	0.04	0.00	0.50	0.26	0.26	0.24	0.30	0.32
2024 Q1 B	RMSE (Training)	46,950	38,894	20,466	18,438	18,247	20,213	14,835	14,810
	$R^{2}$ (Training)	0.20	0.00	0.71	0.76	0.76	0.77	0.84	0.84
	RMSE (Actual)	51,195	45,512	34,246	35,670	35,744	38,636	30,729	33,414
	$R^{2}$ (Actual)	0.04	0.00	0.50	0.50	0.54	0.39	0.54	0.49
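
The A versus B comparison for Table A6 can be tallied mechanically. The sketch below is only an illustration: it copies the RMSE (Actual) values in kWh from Table A6 for the five models that can use the extra predictor, and the helper name `improved_models` is hypothetical. It lists the models whose actual-period RMSE dropped when the “Energy Data from Last Year” predictor was added.

```python
# RMSE (Actual) in kWh, copied from Table A6 (daily paint shop models).
# "A" = without the "Energy Data from Last Year" predictor, "B" = with it.
paint_daily_rmse = {
    "2023 Q4": {
        "A": {"GLMNET": 54436, "PCR": 54818, "KNN": 59459, "RF": 41851, "BRNN": 58686},
        "B": {"GLMNET": 58745, "PCR": 59753, "KNN": 33828, "RF": 49966, "BRNN": 55902},
    },
    "2024 Q1": {
        "A": {"GLMNET": 52924, "PCR": 52962, "KNN": 42894, "RF": 41670, "BRNN": 40496},
        "B": {"GLMNET": 35670, "PCR": 35744, "KNN": 38636, "RF": 30729, "BRNN": 33414},
    },
}


def improved_models(period):
    """Return the models whose RMSE (Actual) is lower in variant B than in variant A."""
    a = paint_daily_rmse[period]["A"]
    b = paint_daily_rmse[period]["B"]
    return sorted(model for model in a if b[model] < a[model])


for period in paint_daily_rmse:
    print(period, improved_models(period))
```

With these values, only KNN and BRNN improve in 2023 Q4, whereas all five models improve in 2024 Q1, which matches the mixed picture described for 2023 Q4.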

Table A7 shows the weekly results of the paint process. The average weekly energy consumption in the paint shop was 956,000 kWh, and the standard deviation of the RMSE was about 100,000 kWh. The prediction accuracy was best in 2023 Q1 according to the RMSE, decreased in 2023 Q4, and stayed about the same in 2024 Q1, though $R^{2}$ improved. The “energy data from the previous year” predictor decreased the prediction accuracy on training and actual results. The ML models nearly always improved on the naïve models, and some of the ML models improved on linear models.

Table A7. Results of weekly paint shop model training.

Time Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	RF	BRNN
2023 Q1	RMSE (Training)		146,524	62,475	61,884	63,412	82,614	82,291	64,279
	$R^{2}$ (Training)		0.00	0.85	0.84	0.84	0.75	0.80	0.84
	RMSE (Actual)	180,142	343,888	79,420	70,357	78,674	66,431	66,089	63,193
	$R^{2}$ (Actual)	0.12	0.00	0.23	0.24	0.23	0.25	0.07	0.14
2023 Q4 A	RMSE (Training)	209,886	154,557	61,519	62,394	70,766	73,959	81,432	58,131
	$R^{2}$ (Training)	0.01	0.00	0.81	0.81	0.78	0.75	0.73	0.82
	RMSE (Actual)	189,218	228,054	153,305	151,124	151,124	146,692	159,584	138,733
	$R^{2}$ (Actual)	0.63	0.00	0.50	0.51	0.51	0.64	0.52	0.62
2023 Q4 B	RMSE (Training)	209,886	154,557	61,519	61,665	61,694	71,290	91,609	68,477
	$R^{2}$ (Training)	0.01	0.00	0.81	0.89	0.89	0.73	0.44	0.74
	RMSE (Actual)	189,218	228,054	153,305	176,921	176,916	211,926	181,527	152,407
	$R^{2}$ (Actual)	0.63	0.00	0.50	0.32	0.32	0.02	0.41	0.51
2024 Q1 A	RMSE (Training)	201,401	169,952	93,298	51,997	52,582	62,606	62,235	52,162
	$R^{2}$ (Training)	0.17	0.00	0.88	0.82	0.82	0.78	0.80	0.83
	RMSE (Actual)	242,688	261,967	137,637	156,351	165,697	125,986	139,460	111,233
	$R^{2}$ (Actual)	0.41	0.00	0.81	0.66	0.67	0.76	0.69	0.82
2024 Q1 B	RMSE (Training)	201,401	169,952	93,298	85,256	86,346	108,465	95,284	87,986
	$R^{2}$ (Training)	0.17	0.00	0.88	0.82	0.85	0.72	0.78	0.86
	RMSE (Actual)	242,688	261,967	137,637	130,600	132,931	183,532	168,898	126,133
	$R^{2}$ (Actual)	0.41	0.00	0.81	0.78	0.78	0.78	0.74	0.89

The monthly paint process model results are given in Table A8. Prediction accuracy was fair in 2023 Q1, decreased significantly in 2023 Q4, and then was at the highest levels seen in this research for 2024 Q1. Because the historical naïve model fits the actual results’ shape but consistently predicts too low, the $R^{2}$ of the historical naïve model is better than other models, but the RMSE is not. The results show that it is possible to improve on linear and naïve predictions by using ML models.
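
The divergence between $R^{2}$ and RMSE for a forecast that has the right shape but is consistently too low can be reproduced with a small sketch. It assumes that $R^{2}$ is the squared Pearson correlation between predicted and actual values; this is an assumption about the metric, although it would be consistent with the constant Average model reporting 0.00 throughout the tables. The energy values and function names are hypothetical.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equally long sequences."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


def r2_corr(actual, predicted):
    """Squared Pearson correlation; defined here as 0.0 when either series is constant."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_a == 0 or var_p == 0:
        return 0.0
    return cov ** 2 / (var_a * var_p)


# Hypothetical monthly energy values in kWh (not taken from the paper).
actual = [3_900_000.0, 4_100_000.0, 4_300_000.0]
biased = [a - 500_000.0 for a in actual]          # right shape, consistently too low
closer = [4_000_000.0, 4_000_000.0, 4_350_000.0]  # smaller errors, weaker shape match

print(rmse(actual, biased), r2_corr(actual, biased))
print(rmse(actual, closer), r2_corr(actual, closer))
```

In this sketch the biased forecast has the higher squared correlation but also the larger RMSE, so the two metrics rank the forecasts differently, which is why both are reported.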

Table A8. Results of monthly paint shop model training.

Time Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	BRNN
2023 Q1	RMSE (Training)		411,711	87,117	80,919	86,222
	$R^{2}$ (Training)		0.00	0.97	0.97	0.97
	RMSE (Actual)	746,943	555,007	133,179	230,830	179,902
	$R^{2}$ (Actual)	0.71	0.00	0.73	0.68	0.69
2023 Q4	RMSE (Training)	661,834	382,636	118,913	131,181	126,773	134,763	122,933
	$R^{2}$ (Training)	0.00	0.00	0.93	0.95	0.95	0.79	0.94
	RMSE (Actual)	760,446	607,307	505,900	453,618	454,308	402,379	436,012
	$R^{2}$ (Actual)	0.15	0.00	0.03	0.01	0.01	0.34	0.03
2024 Q1 A	RMSE (Training)	704,602	368,219	222,891	90,046	90,729	218,414	134,467
	$R^{2}$ (Training)	0.01	0.00	0.97	0.98	0.98	0.65	0.93
	RMSE (Actual)	543,199	608,067	311,394	571,267	576,672	251,302	500,319
	$R^{2}$ (Actual)	0.29	0.00	1.00	0.65	0.65	0.80	0.71
2024 Q1 B	RMSE (Training)	704,602	368,219	222,891	328,567	342,848
	$R^{2}$ (Training)	0.01	0.00	0.97	0.97	0.97
	RMSE (Actual)	543,199	608,067	311,394	152,201	132,343
	$R^{2}$ (Actual)	0.29	0.00	1.00	1.00	1.00


Figure 1. Framework for energy consumption prediction of a manufacturing facility.

Figure 2. RMSE of best performing daily sitewide models by model type.

Figure 3. RMSE of best performing sitewide weekly models by model type.

Figure 4. RMSE of the best performing sitewide monthly models by model type.

Table 1. Summary of predictors by model type.

Model	Predictors
Historic	A naïve model, meaning no predictors are used. The output is historical energy consumption.
Average	Another naïve model. The output is the average of historical energy consumption.
Linear	Production
GLMNET	Production, Temperature
PCR	Production, Temperature, Day of the Week, Week of the Year, Month
KNN	Production, Temperature, Day of the Week, Week of the Year, Month
RF	Production, Temperature, Day of the Week, Week of the Year, Month
BRNN	Production, Temperature, Day of the Week, Week of the Year, Month
B	An additional predictor, “Energy Data from Last Year”, is included.
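
The two naïve baselines in Table 1 can be sketched in a few lines. This is an illustration under stated assumptions: “historical energy consumption” is read as the energy recorded in the same period one year earlier, the function names are hypothetical, and the numbers are made up.

```python
def historic_forecast(same_periods_last_year):
    """'Historic' baseline (assumed reading): reuse last year's energy for the same periods."""
    return list(same_periods_last_year)


def average_forecast(history, horizon):
    """'Average' baseline: repeat the mean of past energy for every forecast period."""
    mean = sum(history) / len(history)
    return [mean] * horizon


# Hypothetical daily energy values in kWh.
last_year_same_days = [130_000.0, 140_000.0, 150_000.0]
all_history = [120_000.0, 130_000.0, 140_000.0, 150_000.0]

print(historic_forecast(last_year_same_days))
print(average_forecast(all_history, horizon=3))
```

Because the Average baseline returns a constant, it cannot track any variation in the actual data, which fits the 0.00 $R^{2}$ reported for it in the results tables.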

Table 2. Summary of predictors by plant area.

Plant Area	Time Period	Predictors
Assembly	Daily	Assembly Production, Temperature, Day of the Week, Week of the Year, Month
	Weekly	Assembly Production, Temperature, Week of the Year
	Monthly	Assembly Production, Temperature, Month
Battery	Daily	Battery Production, Temperature, Day of the Week, Week of the Year, Month
	Weekly	Battery Production, Temperature, Week of the Year
	Monthly	Battery Production, Temperature, Month
Body (Electric)	Daily	Body (Electric) Production, Temperature, Day of the Week, Week of the Year, Month
	Weekly	Body (Electric) Production, Temperature, Week of the Year
	Monthly	Body (Electric) Production, Temperature, Month
Body (Gas)	Daily	Body (Gas) Production, Temperature, Day of the Week, Week of the Year, Month
	Weekly	Body (Gas) Production, Temperature, Week of the Year
	Monthly	Body (Gas) Production, Temperature, Month
Paint	Daily	Paint Production, Temperature, Day of the Week, Week of the Year, Month
	Weekly	Paint Production, Temperature, Week of the Year
	Monthly	Paint Production, Temperature, Month
Sitewide	Daily	Assembly Production, Battery Production, Body (Electric) Production, Body (Gas) Production, Paint Production, Temperature, Day of the Week, Week of the Year, Month
	Weekly	Assembly Production, Battery Production, Body (Electric) Production, Body (Gas) Production, Paint Production, Temperature, Week of the Year
	Monthly	Assembly Production, Battery Production, Body (Electric) Production, Body (Gas) Production, Paint Production, Temperature, Month
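
The calendar predictors and the daily, weekly, and monthly time scales in Table 2 can be derived from dated energy totals roughly as follows. This is a minimal sketch rather than the authors' pipeline: it assumes ISO week numbering, and the function names and energy values are hypothetical.

```python
from collections import defaultdict
from datetime import date, timedelta


def calendar_predictors(d):
    """Day of the Week (1 = Monday), Week of the Year (ISO), and Month for a date."""
    _, iso_week, iso_weekday = d.isocalendar()
    return {"day_of_week": iso_weekday, "week_of_year": iso_week, "month": d.month}


def aggregate(daily, key):
    """Sum daily energy (kWh) into coarser periods identified by key(date)."""
    totals = defaultdict(float)
    for d, kwh in daily.items():
        totals[key(d)] += kwh
    return dict(totals)


# Hypothetical daily totals in kWh: 14 days starting Monday, 2 January 2023.
start = date(2023, 1, 2)
daily = {start + timedelta(days=i): 100.0 + i for i in range(14)}

weekly = aggregate(daily, lambda d: tuple(d.isocalendar())[:2])  # (ISO year, ISO week)
monthly = aggregate(daily, lambda d: (d.year, d.month))

print(calendar_predictors(start))
print(weekly)
print(monthly)
```

Each coarser time scale keeps only the calendar predictors that still vary at that resolution, which is why Day of the Week appears only in the daily rows of Table 2.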

Table 3. Results of daily sitewide model training.

Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	RF	BRNN
2023 Q1	RMSE (Training)		105,629	54,291	34,989	33,881	45,848	58,329	33,456
	$R^{2}$ (Training)		0.00	0.73	0.89	0.90	0.88	0.88	0.91
	RMSE (Actual)	78,852	74,088	181,577	54,026	53,378	33,078	49,524	31,823
	$R^{2}$ (Actual)	0.70	0.00	0.87	0.92	0.92	0.89	0.92	0.87
2023 Q4 A	RMSE (Training)	113,657	103,478	54,356	34,990	34,845	32,749	30,351	27,950
	$R^{2}$ (Training)	0.19	0.00	0.73	0.89	0.89	0.90	0.92	0.93
	RMSE (Actual)	112,459	124,464	160,524	132,719	135,631	135,333	140,233	143,182
	$R^{2}$ (Actual)	0.44	0.00	0.00	0.03	0.02	0.01	0.01	0.00
2023 Q4 B	RMSE (Training)	113,657	101,958	54,356	40,698	40,274	36,105	28,857	26,677
	$R^{2}$ (Training)	0.19	0.00	0.73	0.84	0.84	0.87	0.91	0.92
	RMSE (Actual)	112,459	120,878	160,524	129,048	124,096	131,786	142,825	127,605
	$R^{2}$ (Actual)	0.44	0.00	0.00	0.03	0.04	0.03	0.00	0.02
2024 Q1 A	RMSE (Training)	113,028	103,345	68,079	42,906	42,834	37,580	30,844	31,530
	$R^{2}$ (Training)	0.29	0.00	0.56	0.85	0.85	0.88	0.92	0.92
	RMSE (Actual)	114,349	107,524	58,084	57,842	60,113	78,928	67,846	58,269
	$R^{2}$ (Actual)	0.05	0.00	0.74	0.78	0.76	0.65	0.79	0.79
2024 Q1 B	RMSE (Training)	113,028	103,345	68,079	39,529	35,924	40,251	31,939	33,243
	$R^{2}$ (Training)	0.29	0.00	0.56	0.85	0.87	0.86	0.91	0.89
	RMSE (Actual)	114,349	107,524	58,084	52,544	46,197	57,928	57,711	56,881
	$R^{2}$ (Actual)	0.05	0.00	0.74	0.78	0.81	0.78	0.79	0.81

The Historic 2023 Q1 training parameters (blank cells here, shaded black in the original table) do not exist because no training data was available to compare against from 2022. “B” signifies that the “Energy Data from Last Year” predictor is included, while “A” signifies that it is not included.

Table 4. Results of weekly sitewide training.

Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	RF	BRNN
2023 Q1	RMSE (Training)		456,963	297,607	186,355	183,373	160,581	205,657	107,720
	$R^{2}$ (Training)		0.00	0.73	0.84	0.85	0.79	0.89	0.95
	RMSE (Actual)	230,610	1,806,313	1,321,370	252,767	534,254	391,638	290,422	350,465
	$R^{2}$ (Actual)	0.40	0.00	0.45	0.41	0.45	0.23	0.17	0.27
2023 Q4 A	RMSE (Training)	507,147	406,336	293,128	160,703	160,628	150,152	193,717	95,710
	$R^{2}$ (Training)	0.18	0.00	0.55	0.88	0.87	0.91	0.83	0.96
	RMSE (Actual)	564,164	697,198	553,789	384,559	384,182	403,828	461,169	333,189
	$R^{2}$ (Actual)	0.67	0.00	0.21	0.75	0.74	0.73	0.63	0.77
2023 Q4 B	RMSE (Training)	507,147	406,336	293,128	235,608	230,918	242,368	296,876	244,444
	$R^{2}$ (Training)	0.18	0.00	0.55	0.96	0.96	0.99	0.25	0.53
	RMSE (Actual)	564,164	697,198	553,789	385,571	367,732	540,490	578,331	321,153
	$R^{2}$ (Actual)	0.67	0.00	0.21	0.72	0.75	0.59	0.54	0.78
2024 Q1 A	RMSE (Training)	532,016	474,094	419,285	157,103	158,933	150,871	173,645	110,520
	$R^{2}$ (Training)	0.37	0.00	0.39	0.90	0.90	0.91	0.90	0.95
	RMSE (Actual)	500,572	609,475	251,523	280,070	279,656	222,876	273,626	262,489
	$R^{2}$ (Actual)	0.44	0.00	0.85	0.77	0.77	0.83	0.82	0.83
2024 Q1 B	RMSE (Training)	532,016	474,094	419,285	201,078	198,221	263,295	262,053	184,889
	$R^{2}$ (Training)	0.37	0.00	0.39	0.87	0.88	0.87	0.81	0.95
	RMSE (Actual)	500,572	609,475	251,523	322,663	363,583	304,783	464,141	424,085
	$R^{2}$ (Actual)	0.44	0.00	0.85	0.79	0.79	0.76	0.57	0.85

“B” signifies that the “Energy Data from Last Year” predictor is included, while “A” signifies that it is not included. The Historic 2023 Q1 training parameters (blank cells here, shaded black in the original table) do not exist because no training data was available to compare against from 2022.

Table 5. Results of monthly sitewide model training.

Period	Parameter	Historic	Average	Linear	GLMNET	PCR	KNN	BRNN
2023 Q1	RMSE (Training)		1,747,3113	1,227,409	682,160	737,764
	$R^{2}$ (Training)		0.00	0.66	0.91	0.92
	RMSE (Actual)	748,792	1,445,674	4,219,015	3,263,449	1,159,504
	$R^{2}$ (Actual)	0.89	0.00	0.82	0.12	0.72
2023 Q4	RMSE (Training)	1,706,915	1,378,826	1,431,324	607,102	558,804	1,026,088	1,286,387
	$R^{2}$ (Training)	0.58	0.00	0.38	0.97	0.97	0.64	0.69
	RMSE (Actual)	2,288,114	2,544,555	2,272,790	1,388,637	1,302,062	1,346,754	1,212,710
	$R^{2}$ (Actual)	0.68	0.00	0.15	0.77	0.79	0.86	0.80
2024 Q1 A	RMSE (Training)	1,970,030	1,533,310	1,757,608	585,057	606,529	1,021,876	567,022
	$R^{2}$ (Training)	0.51	0.00	0.98	0.96	0.94	0.74	0.95
	RMSE (Actual)	1,067,821	1,625,809	630,998	680,560	966,446	1,009,443	758,261
	$R^{2}$ (Actual)	0.51	0.00	1.00	0.96	0.95	0.71	0.97
2024 Q1 B	RMSE (Training)	1,970,030	1,533,310	1,757,608	1,024,396	1,301,112		1,339,096
	$R^{2}$ (Training)	0.51	0.00	0.98	0.97	0.99		0.97
	RMSE (Actual)	1,067,821	1,625,809	630,998	1,250,344	428,949		291,403
	$R^{2}$ (Actual)	0.51	0.00	1.00	0.99	0.99		0.96

Models with black backgrounds in the original table (blank cells here), including the KNN and BRNN 2023 Q1 models, do not exist because of insufficient data. In 2023 Q1, with data collection starting in January 2022, there are only 12 monthly data points. The KNN model in 2024 Q1 B also does not work due to insufficient data. Adding the “Energy Data from Last Year” predictor eliminates a year of data because we do not have data from a year prior.
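
The loss of a year of data when the “Energy Data from Last Year” predictor is added can be illustrated with a short sketch. It assumes a monthly series keyed by (year, month) that starts in January 2022, as stated above. The energy values, the 27-month length, and the function name are hypothetical.

```python
def add_last_year_predictor(monthly_energy):
    """Keep only the months that have a value 12 months earlier and attach it as a predictor.

    monthly_energy maps (year, month) to energy in kWh.
    """
    rows = []
    for (year, month), kwh in sorted(monthly_energy.items()):
        prior = monthly_energy.get((year - 1, month))
        if prior is not None:
            rows.append({"period": (year, month), "energy": kwh, "energy_last_year": prior})
    return rows


# Hypothetical series: 27 months from January 2022 to March 2024.
series = {}
for i in range(27):
    series[(2022 + i // 12, i % 12 + 1)] = 1000.0 + i

rows = add_last_year_predictor(series)
print(len(series), "months available,", len(rows), "usable with the lagged predictor")
```

In this sketch the first 12 months have no value from a year earlier, so only 15 of the 27 rows remain, which shows how a monthly B variant can run out of data for a model such as KNN.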

Table 6. Summary of the best models for each shop.

Plant Area	Period	2023 Q1		2023 Q4		2024 Q1
Plant Area	Period	Best Model	$R^{2}$	Best Model	$R^{2}$	Best Model	$R^{2}$
Assembly	Daily	Linear	0.83	KNN (B)	0.40	Linear	0.77
	Weekly	GLMNET	0.42	BRNN (B)	0.78	Linear	0.83
	Monthly	PCR	0.76	Historical	0.72	Linear	0.95
Battery	Daily			KNN (B)	0.49	Linear	0.42
	Weekly			GLMNET (B)	0.72	RF (B)	0.58
	Monthly			BRNN	0.78	Average	0
Body (Electric)	Daily			KNN (B)	0.23	RF (B)	0.78
	Weekly			BRNN (A)	0.77	RF (A)	0.81
	Monthly			BRNN	0.85	GLMNET (B)	0.86
Body (Gas)	Daily	PCR	0.93	KNN (B)	0.46	GLMNET (A)	0.79
	Weekly	Linear	0.59	BRNN (B)	0.69	Linear	0.81
	Monthly	Linear	0.96	KNN	0.66	PCR (B)	0.98
Paint	Daily	RF	0.90	KNN (B)	0.34	RF (B)	0.54
	Weekly	BRNN	0.14	BRNN (A)	0.62	BRNN (A)	0.82
	Monthly	Linear	0.73	KNN	0.34	PCR (B)	1.00
Sitewide	Daily	PCR	0.92	Historical	0.44	PCR (B)	0.80
	Weekly	Historical	0.40	BRNN (B)	0.78	Linear	0.85
	Monthly	Historical	0.89	GLMNET	0.75	BRNN (B)	0.96

Battery and Body (Electric) came online in 2023 Q1, so there is no result for 2023 Q1. “B” signifies that the “Energy Data from Last Year” predictor is included, while “A” signifies that it is not included.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
