1. Introduction
Manufacturing is a central driver of the global economy and is responsible for innovation, employment, and the production of essential goods. However, the industry is undergoing rapid transformation due to increased global competition, shorter product life cycles, customized demands, and digitalization under Industry 4.0. This shift requires manufacturers to achieve high levels of agility, efficiency, and quality. At the same time, they face mounting environmental pressures, including climate change, resource depletion, and pollution. As major energy consumers and waste generators, manufacturers must now address the dual challenge of improving operational efficiency while advancing environmental sustainability to remain competitive, comply with regulations, and support global sustainability goals.
The challenges of operational efficiency in manufacturing are complex and long-standing. Production systems, particularly complex ones like tube manufacturing that involve sequential processes across multiple machines and lines, are often plagued by inefficiencies. These manifest themselves as bottlenecks that restrict overall production, excessive cycle times that delay order fulfillment, significant nonproductive idle time for machinery and labor, suboptimal resource allocation, high rates of defects requiring rework or scrap, and inflexible scheduling that struggles to adapt to dynamic changes in demand or unexpected disruptions [
1]. Traditional improvement methodologies, such as Lean Manufacturing and Six Sigma, have provided valuable frameworks for waste reduction and process standardization. However, their effectiveness can be limited in highly dynamic and data-rich environments where real-time visibility and adaptive control are crucial. Identifying the root causes of inefficiencies often requires deep, data-driven insights into the actual execution of processes, which may deviate significantly from idealized models.
Concurrently, the imperative for environmental sustainability introduces another layer of complexity. Manufacturing processes are inherently resource-intensive. Key areas of environmental concern include high energy consumption, particularly in processes such as extrusion, heating, curing, and machining, which often rely heavily on fossil fuels, contributing significantly to greenhouse gas (GHG) emissions [
2]. Water use, especially for cooling, cleaning, and processing, can be substantial, placing demands on local water resources [
3]. The consumption of raw materials and the subsequent generation of waste, including process scrap, defective products, and packaging waste, contribute to the depletion of resources and the burden on landfills [
4]. Accurately measuring, monitoring, and managing these environmental impacts in intricate production chains is a formidable task. Furthermore, decisions aimed at improving one aspect of sustainability (e.g., switching to a less energy-intensive material) might have unforeseen consequences on other environmental metrics or even operational performance.
Historically, manufacturing teams have addressed operational efficiency and environmental performance separately, using different tools and KPIs. Operations teams might track metrics such as overall equipment effectiveness (OEE) and throughput, while environment, health, and safety (EHS) teams focus on energy use and emissions, often with delayed periodic data. This separation overlooks how closely connected these goals truly are. For instance, speeding up machines might boost output but also spike energy use or increase defects. Conversely, recycling efforts could slow production or increase costs. Without a unified approach, well-intended changes risk causing inefficiencies or damaging sustainability goals [
5].
The advent of Industry 4.0 technologies offers a way to overcome these limitations. The proliferation of sensors, the Internet of Things (IoT), Cyber-Physical Systems (CPS), cloud computing, and big data analytics provides manufacturers with the unprecedented ability to collect vast amounts of granular real-time data from the factory floor [
6]. These data encompass not only machine states, production counts, and cycle times but also energy consumption, temperature profiles, vibration patterns, and other parameters relevant to both operational and environmental performance. Using these data effectively is the key to unlocking the holistic optimization potential.
Despite advances in individual technologies, there is still a major gap: few solutions truly integrate real-time Process Mining with dynamic LCA metrics and feed this into a Multi-Objective Optimization engine for both efficiency and sustainability. Most current methods rely on static data, treat goals separately, or lack real-world validation.
To address this gap, this study introduces and validates a comprehensive, cloud-based, data-driven framework designed to simultaneously improve both operational efficiency and environmental sustainability in manufacturing. The core innovation lies in the synergistic integration of three key components: (1) real-time Process Mining to discover actual workflows and operational KPIs; (2) dynamic Life Cycle Assessment (LCA) to quantify environmental impacts at the instance level using live sensor data; and (3) Multi-Objective Optimization (MOO) to identify Pareto-optimal operating configurations that balance these conflicting goals. The framework is implemented on a scalable cloud infrastructure and incorporates the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) to support decision-making.
The novelty of this work lies in its integrated, data-driven approach that unifies operational and environmental optimization in real-time. Validated in a real-world tube manufacturing facility, the framework demonstrates how manufacturers can make smarter, more balanced decisions to simultaneously enhance productivity and environmental stewardship.
This paper is structured as follows.
Section 2 reviews related studies in process mining, LCA, and multi-objective optimization in manufacturing, further refining the identified research gap.
Section 3 details the proposed integrated framework and the mathematical formulation of the multi-objective optimization model.
Section 4 describes the real-world case study, presents the data analysis, and discusses the optimization results, including Pareto front analysis, TOPSIS ranking, and sensitivity analysis. Finally,
Section 5 concludes the paper, summarizes the key findings and contributions, and discusses potential avenues for future research.
2. Related Work and Theoretical Background
2.1. Process Mining for Operational Excellence in Manufacturing
Process Mining has established itself as a pivotal data-driven discipline to understand and improve real-world processes based on event data readily available in modern manufacturing information systems [
6]. Its core strength lies in moving beyond static, idealized process models to reveal how operations actually unfold. Key applications in manufacturing include automated process discovery, where algorithms like Inductive Miner or Heuristics Miner can automatically generate process models from event logs, providing unprecedented visibility into complex production flows, including undocumented variations and deviations. Performance diagnosis enables the detailed analysis of operational Key Performance Indicators (KPIs). Studies have successfully used it to pinpoint bottlenecks that cause delays [
7], measure and optimize cycle times [
8,
9], quantify idle times of resources [
10], and analyze throughput variations across different process paths. Conformance checking compares event logs against normative process models to identify deviations, skipped activities, or incorrect sequences that can negatively impact quality or efficiency [
11], helping to ensure compliance and pinpoint areas that need corrective action.
Limitation: Despite its power in operational analysis, conventional process mining focuses mainly on the efficiency, time, and cost dimensions derived directly from event logs. Although some research explores resource utilization, the explicit and dynamic integration of environmental KPIs (such as real-time energy consumption per case, emissions per activity, or waste generated during specific process instances) directly into the process mining analysis loop is often lacking. The focus remains largely operational, missing the opportunity to directly correlate process flow characteristics with their immediate environmental consequences using the same granular event-level data.
2.2. Life Cycle Assessment (LCA) for Evaluating Environmental Burdens
Life Cycle Assessment (LCA) is the internationally recognized standard for evaluating environmental impacts associated with a product, process, or service throughout its entire life cycle from raw material extraction to manufacturing, use, and end-of-life disposal [
12]. In manufacturing, LCA is crucial to identify environmental hotspots within the production chain, such as stages or activities that contribute the most significantly to greenhouse gas emissions, energy demand, water consumption, resource depletion, or waste generation [
13,
14]. It also supports eco-design by guiding product and process design decisions toward more sustainable alternatives, considering material selection, energy efficiency measures, and waste minimization strategies [
4]. It provides quantitative metrics for environmental performance, essential for reporting, benchmarking, and tracking progress towards sustainability goals [
2,
3].
Limitation: Traditional LCA methodologies often rely on static aggregated data, frequently obtained from generic databases or averaged historical plant performance. This approach presents significant limitations for real-time operational management. First, static LCA typically provides a high-level view and struggles to capture the environmental impact variations associated with specific production orders, machine settings, or real-time process deviations. Second, LCA studies are often conducted retrospectively, making them unsuitable for immediate feedback or dynamic control adjustments on the factory floor. Third, linking static LCA models directly with the dynamic event data captured by manufacturing execution systems (MES) or process mining tools is non-trivial. There is a disconnect between the operational event stream and the environmental assessment framework.
2.3. Efforts Towards Integrating Operational and Sustainability Perspectives
Recognizing the limitations of single approaches, researchers have begun exploring ways to integrate operational analysis with sustainability considerations. Sustainable Process Mining is an area where some conceptual work and early studies have aimed to incorporate environmental indicators into process analysis [
15]. This might involve annotating process models with aggregated environmental data or developing sustainability-related conformance checks. However, these initiatives often remain in the early stages, without dynamic data integration or robust optimization features. Another path is to extend Lean tools like Value Stream Mapping (VSM). Originally used to identify operational waste, VSM has been adapted to ‘sustainable VSM’ (SVSM) by including environmental metrics such as energy use and material waste along with time and cost [
16]. Although helpful for strategic decisions, these methods are typically manual and snapshot-based, lacking the automation and continuous discovery capabilities of process mining. Discrete-event simulation models have also been used to evaluate operational and environmental outcomes under different scenarios. Though useful for “what-if” exploration, these models may not perfectly match real-time factory conditions and do not leverage data in the same live, automatic way process mining does.
Limitation: Despite these innovations, many integration efforts lack real-time feedback, depend on manually collected or aggregated environmental metrics, cannot easily handle multiple conflicting goals, or do not use live factory data from sensors. What is often missing is a seamless connection from real-time data gathering to integrated analysis and decision-ready recommendations.
2.4. Multi-Objective Optimization (MOO) in Sustainable Manufacturing
Balancing conflicting goals is essential in sustainable manufacturing, which is where Multi-Objective Optimization (MOO) techniques excel. Various MOO approaches, including genetic algorithms such as NSGA-II, particle swarm optimization, and other metaheuristics, have been used to optimize trade-offs. For example, some research has aimed to simultaneously minimize cost and energy use while ensuring quality by adjusting machining parameters or scheduling [
5,
17]. Other studies have focused on creating production schedules that reduce both delays and emissions [
18,
19]. MOO has helped optimize supply chain design by choosing the best locations, transport methods, and inventory strategies while accounting for both cost and environmental impact [
20].
Limitation: While MOO offers tools to manage trade-offs, its success is highly dependent on up-to-date, high-quality input data. Many existing applications rely on estimates or historical averages instead of real-time insights. Environmental goals are often based on broad models or static calculations rather than live sensor data tied to current process conditions. Moreover, many implementations lack an automatic loop to bring fresh data into the optimization engine on a continuous basis.
2.5. Research Gap
Based on this review, it is clear that although process mining, LCA, and MOO have each advanced significantly on their own, there is a serious gap in combining them in a dynamic and effective way. What is often missing is a seamless connection from real-time data collection to integrated analysis and decision-ready recommendations, potentially using advanced AI techniques for tasks such as unsupervised detection of anomalies in resource consumption, such as electricity use [
21], to provide more proactive insights. Specifically, what is missing is a robust framework that uses real-time data from process mining to monitor actual performance, calculates environmental impacts using live sensor data, and feeds both into an MOO engine such as NSGA-II. This would enable the generation of Pareto-optimal solutions that weigh sustainability and efficiency in real time. The approach should run on scalable infrastructure like cloud systems, include tools like TOPSIS to turn results into decisions, and be thoroughly tested in a real-world factory setting. The framework proposed in this study addresses all these needs, offering a dynamic and data-driven way to improve both productivity and environmental responsibility in modern manufacturing.
3. Materials and Methods
This section details the architecture and operational steps of the proposed cloud-based, data-driven framework for integrated optimization of operational efficiency and environmental sustainability. The framework combines the Integrated Process Mining and Sustainability Monitoring Framework (IPSMF) for data collection and analysis with the Multi-Objective Optimization Model for Operational and Environmental Performance (MOOM-OEP) for decision support.
3.1. Framework Architecture and Cloud Enablement
The overall architecture, depicted conceptually in
Figure 1, establishes a systematic flow from real-time data acquisition to optimized parameter recommendations. This figure illustrates the conceptual framework proposed in this study, which leverages a cloud computing infrastructure as a foundational element, recognizing the trend toward cloud manufacturing and the need for advanced data analytics in modern systems [
22,
23], including specific applications of machine learning in manufacturing process modeling and optimization. The cloud platform is crucial for the following:
Data Ingestion and Storage: Handling high-speed, high-volume data streams from numerous sensors and manufacturing systems (ERP, MES) across potentially many production lines [
24,
25].
Scalable Computation: Providing on-demand computational resources required for demanding tasks such as process discovery algorithms (e.g., Inductive Miner), real-time LCA calculations, and complex multi-objective optimization solvers (NSGA-II) [
23,
26]. Moreover, the efficient utilization and optimization of these cloud resources themselves are critical aspects, often addressed through exploratory data analysis and machine learning techniques [
27].
Integration and Centralization: Acting as a central hub to integrate diverse data sources and make processed information and optimization results accessible to relevant stakeholders and downstream control systems.
Real-time Analytics: Facilitating the timely processing and analysis needed to derive actionable insights from live data streams, moving beyond traditional batch processing.
The methodology follows a structured sequence of steps, illustrated in
Figure 2, ensuring a systematic approach from data collection to actionable insights.
3.2. Step 1: Data Acquisition and Monitoring
The process begins with the continuous collection of relevant data from the manufacturing environment (S1, S2). This involves the following:
Event Log Generation: Capturing timestamped events from MES or machine controllers, detailing activities performed, the associated case identifier (e.g., production order ID) and the resource/machine involved. This forms the basis for process mining, represented abstractly as follows:
$L = \{\sigma_1, \sigma_2, \ldots, \sigma_{|L|}\}, \qquad \sigma = \langle e_1, e_2, \ldots, e_{|\sigma|} \rangle$
where $L$ is the event log, $\sigma$ is a trace (case), and $e_1, \ldots, e_{|\sigma|}$ are events within the trace.
Sensor Data Collection: Gathering real-time readings from sensors (e.g., power meters, temperature sensors).
Contextual Data Integration: Accessing relevant data from ERP or planning systems (e.g., schedules, material specifications, emission factors).
3.3. Step 2: Process Mining for Operational Insight
The collected event logs (L) are fed into the process mining module (S3, S4, S5) [6]: process models are automatically discovered from the logs (S3), the models are analyzed to identify inefficiencies such as bottlenecks and excessive idle times (S4), and operational constraints are extracted for use in the optimization model (S5).
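For illustration, a minimal sketch of this step using the open-source pm4py library is given below; the file name and the CaseID/Activity/Timestamp column names (mirroring the schema of Table 1) are assumptions made for the example rather than the actual project artifacts.

```python
# Minimal sketch of the process-discovery step (S3-S5) with the open-source
# pm4py library. File name and column names are illustrative assumptions.
import pandas as pd
import pm4py

# Step 1 output: event log with CaseID, Activity, Timestamp, Resource columns
log_df = pd.read_csv("tl209_event_log.csv", parse_dates=["Timestamp"])
log_df = pm4py.format_dataframe(
    log_df, case_id="CaseID", activity_key="Activity", timestamp_key="Timestamp"
)

# S3: automated process discovery with the Inductive Miner
process_tree = pm4py.discover_process_tree_inductive(log_df)
net, im, fm = pm4py.convert_to_petri_net(process_tree)

# S4/S5: simple performance diagnostics, e.g., directly-follows transition
# frequencies and case durations used to flag bottlenecks and constraints
dfg, start_acts, end_acts = pm4py.discover_dfg(log_df)
durations = pm4py.get_all_case_durations(log_df)
print(sorted(dfg.items(), key=lambda kv: -kv[1])[:5])  # most frequent transitions
print(f"mean case duration [s]: {sum(durations) / len(durations):.0f}")
```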
3.4. Step 3: Dynamic Life Cycle Assessment (LCA) Inventory
The framework performs a dynamic Life Cycle Assessment (LCA) inventory analysis using real-time data to quantify environmental burdens at the process-instance level (S6, S7). This step moves beyond traditional static LCA by linking live data to environmental impacts, forming a basis for subsequent optimization. The analysis, which informs visualizations like the LCA overview presented in Figure 5, includes the following:
Energy Consumption (EC): Calculated by integrating power readings $P(t)$ over the operational time interval $[t_0, t_1]$:
$EC = \int_{t_0}^{t_1} P(t)\,dt$
Carbon Emissions (CE): Estimated using activity levels (e.g., the energy $EC_a$ consumed by activity $a$) and emission factors $EF_a$ [2,14,29]:
$CE = \sum_{a} EC_a \cdot EF_a$
Waste Generation (WG): Quantified from detected defects $N_{def}$ and conversion factors $w_{def}$ (average mass per rejected unit):
$WG = N_{def} \cdot w_{def}$
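To make the calculation concrete, the following sketch shows one way these three inventory quantities could be derived per production case from logged power readings and defect counts; the column names, grid emission factor, and unit weight are illustrative assumptions rather than the facility's actual values.

```python
# Sketch of the dynamic LCA inventory step (S6, S7): per-case energy, emission,
# and waste figures from logged sensor data. Column names, the grid emission
# factor, and the unit weight are illustrative assumptions.
import pandas as pd

EMISSION_FACTOR_KG_PER_KWH = 0.5   # assumed regional grid factor
UNIT_WEIGHT_KG = 0.05              # assumed average weight per rejected tube

def lca_inventory(power_log: pd.DataFrame, defects: pd.DataFrame) -> pd.DataFrame:
    """power_log: CaseID, Timestamp, Power_kW; defects: CaseID, DefectCount."""
    power_log = power_log.sort_values(["CaseID", "Timestamp"])
    # Energy per case: discrete integration of power over time (kWh)
    power_log["dt_h"] = (
        power_log.groupby("CaseID")["Timestamp"].diff().dt.total_seconds() / 3600.0
    )
    power_log["energy_kwh"] = power_log["Power_kW"] * power_log["dt_h"].fillna(0)
    ec = power_log.groupby("CaseID")["energy_kwh"].sum().rename("EC_kWh")

    inventory = ec.to_frame()
    inventory["CE_kgCO2"] = inventory["EC_kWh"] * EMISSION_FACTOR_KG_PER_KWH
    wg = defects.set_index("CaseID")["DefectCount"] * UNIT_WEIGHT_KG
    inventory["WG_kg"] = wg.reindex(inventory.index).fillna(0)
    return inventory
```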
3.5. Step 4: MOOM-OEP Formulation
The Multi-Objective Optimization Model (MOOM-OEP) uses insights from Steps 2 and 3 to define and solve the optimization problem (S8, S9) [
30,
31].
3.5.1. Decision Variables
$t_{ijk}$: Continuous variable representing the processing time (in seconds) for activity k on machine j in line i. This variable is a key determinant of cycle time and throughput.
$y_{ijk}$: Binary variable ($y_{ijk} \in \{0,1\}$) for the assignment of activity k to machine j in line i. This is used for routing decisions where alternatives exist.
$r_{ijk}$: Continuous variable representing the allocation of a divisible resource (e.g., energy budget in kWh, or operator time as a percentage) to activity k on machine j in line i. This is distinct from processing time and governs the intensity of the operation.
$c_{ij}$: Cloud computational load generated by machine j in line i.
3.5.2. Objective Functions
The goal is to find solutions x (configurations of decision variables) that [5,19]:
Maximize Operational Efficiency ($f_1$):
$f_1(x) = \sum_{i=1}^{n} \sum_{j=1}^{m} \sum_{k=1}^{p} \left[ w_{TP}\, TP_{ijk}(x) - w_{CT}\, CT_{ijk}(x) - w_{IT}\, IT_{ijk}(x) - w_{DR}\, DR_{ijk}(x) \right]$
Minimize Environmental Impact ($f_2$):
$f_2(x) = \sum_{i=1}^{n} \sum_{j=1}^{m} \sum_{k=1}^{p} \left[ w_{EC}\, EC_{ijk}(x) + w_{CE}\, CE_{ijk}(x) + w_{WG}\, WG_{ijk}(x) \right]$
where n is the number of lines, m the number of machines, and p the number of activities. The performance metrics for the specific activity under configuration x are the throughput $TP_{ijk}(x)$, cycle time $CT_{ijk}(x)$, idle time $IT_{ijk}(x)$, defect rate $DR_{ijk}(x)$, energy consumption $EC_{ijk}(x)$, carbon emissions $CE_{ijk}(x)$, and waste generation $WG_{ijk}(x)$. The weighting factors $w_{(\cdot)}$ reflect the relative importance and were normalized to 1.0 for the main optimization run, while the TOPSIS sensitivity analysis explored different priority scenarios.
3.5.3. Constraints
The optimization is subject to the following:
Operational Constraints [
32]:
Environmental Constraints:
Resource Allocation Constraints:
Activity Sequencing Constraints:
Cloud Capacity Constraints:
where the right-hand side denotes the total cloud computing capacity available. The effective management and optimization of such cloud capacity to ensure efficient resource utilization is an important consideration, often tackled with data-driven approaches and machine learning [27].
Binary Decision Constraints:
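For illustration, representative forms of these constraint categories, written with the decision variables defined in Section 3.5.1 and the bounds later summarized in Table 3, are sketched below; the exact model is given by Equations (9)–(19), so these expressions are indicative rather than verbatim.

```latex
% Illustrative constraint forms (a sketch; the exact model is Eqs. (9)-(19)).
% s_{ijk} denotes the start time of activity k on machine j in line i and is
% introduced here only for the sequencing illustration.
\begin{align}
  & U_{ij}(x) \le 0.95                               && \text{machine utilization cap} \\
  & TP_{ijk}(x) \ge 94 \ \text{units/day}            && \text{minimum throughput} \\
  & EC_{ij}(x) \le 600 \ \text{kWh/day}, \quad
    CE_{ij}(x) \le 300 \ \text{kg CO}_2\text{/day}   && \text{environmental limits} \\
  & \textstyle\sum_{k} r_{ijk} \le R_{ij}            && \text{resource availability} \\
  & s_{ij,k+1} \ge s_{ijk} + t_{ijk}                 && \text{activity sequencing} \\
  & \textstyle\sum_{i,j} c_{ij} \le C^{\mathrm{cloud}}_{\max} && \text{cloud capacity} \\
  & y_{ijk} \in \{0,1\}                              && \text{binary assignment}
\end{align}
```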
3.6. Step 5: Multi-Objective Optimization Using NSGA-II
The MOOM-OEP problem (maximize $f_1$, minimize $f_2$, subject to constraints (9)–(19)) is solved using the Non-Dominated Sorting Genetic Algorithm II (NSGA-II) (S10–S14) [32]. Key hyperparameters for the NSGA-II implementation included a population size of 100, a crossover probability of 0.9, a mutation probability of 0.1, and termination after 200 generations.
Initialization: Generate an initial population $P_0$ of size N.
Evaluation: Calculate $f_1(x)$ and $f_2(x)$ for all $x \in P_t$.
Non-Dominated Sorting: Rank solutions into fronts $F_1, F_2, \ldots$ based on Pareto dominance. A solution $x_1$ dominates $x_2$ if $f_l(x_1) \geq f_l(x_2)$ for every objective $l$ and $f_l(x_1) > f_l(x_2)$ for at least one objective. For consistency with the Pareto dominance check, which typically assumes maximization for all objectives, the environmental impact (a minimization objective) is reformulated as $f_2'(x) = -f_2(x)$.
Crowding Distance Calculation: Compute distance $d_i$ for solution $i$ within its front $F_k$ to maintain diversity:
$d_i = \sum_{l=1}^{M} \dfrac{f_l(i+1) - f_l(i-1)}{f_l^{\max} - f_l^{\min}}$
where $M$ is the number of objectives, solutions $i-1$ and $i+1$ are neighbors of $i$ in the list sorted for objective $l$, and $f_l^{\max}$, $f_l^{\min}$ are max/min values for objective $l$.
Selection, Crossover, Mutation: Use tournament selection (based on rank and $d_i$), simulated binary crossover (SBX), and polynomial mutation to create an offspring population $Q_t$.
Population Update: Combine $R_t = P_t \cup Q_t$. Select the best N solutions from $R_t$ based on non-domination rank and crowding distance to form $P_{t+1}$.
Termination: Stop after a fixed number of generations. The final non-dominated set is the Pareto-optimal front $PF^*$:
$PF^* = \{\, x \in P_{\mathrm{final}} \mid \nexists\, x' \in P_{\mathrm{final}} \text{ such that } x' \text{ dominates } x \,\}$
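A minimal sketch of this solution procedure, using the open-source pymoo implementation of NSGA-II with the hyperparameters listed above, is shown below; the surrogate objective functions and variable bounds are placeholders standing in for the full MOOM-OEP evaluation.

```python
# Minimal NSGA-II setup mirroring the stated hyperparameters (population 100,
# SBX crossover p = 0.9, polynomial mutation p = 0.1, 200 generations) with the
# open-source pymoo library. Objective functions and bounds are placeholders.
import numpy as np
from pymoo.core.problem import ElementwiseProblem
from pymoo.algorithms.moo.nsga2 import NSGA2
from pymoo.operators.crossover.sbx import SBX
from pymoo.operators.mutation.pm import PM
from pymoo.optimize import minimize

def efficiency(x):
    # placeholder surrogate: shorter processing times -> higher efficiency (%)
    return 100.0 - 0.01 * float(np.sum(x))

def environmental(x):
    # placeholder surrogate: faster processing -> higher emissions (kg CO2/day)
    return 2200.0 - 0.15 * float(np.sum(x))

class MoomOep(ElementwiseProblem):
    def __init__(self):
        # e.g., x = processing times (s) of the four bottleneck stations,
        # bounded by the cycle-time window later reported in Table 3
        super().__init__(n_var=4, n_obj=2, xl=564.0, xu=960.0)

    def _evaluate(self, x, out, *args, **kwargs):
        f1 = efficiency(x)      # operational efficiency (to be maximized)
        f2 = environmental(x)   # environmental impact (to be minimized)
        out["F"] = [-f1, f2]    # pymoo minimizes, so negate f1

algorithm = NSGA2(pop_size=100, crossover=SBX(prob=0.9), mutation=PM(prob=0.1))
result = minimize(MoomOep(), algorithm, ("n_gen", 200), seed=1, verbose=False)
pareto_front = result.F         # objective values of the non-dominated set
```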
3.7. Step 6: Solution Ranking Using TOPSIS
The Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) ranks the solutions in $PF^*$ (S15–S18) [23].
Normalize Decision Matrix: Normalize the objective values ($f_1$, $f_2$) for the solutions in $PF^*$.
Determine Ideal/Negative-Ideal Solutions: Using normalized values $r_{ij}$, where i is the solution, and j is the objective index:
$A^{+} = \{A_1^{+}, A_2^{+}\}, \qquad A^{-} = \{A_1^{-}, A_2^{-}\}$
where $A_j^{+}$ is the best value for objective j, and $A_j^{-}$ is the worst.
Calculate Separation Measures: Compute Euclidean distances for solution i:
$S_i^{+} = \sqrt{\sum_{j} (r_{ij} - A_j^{+})^2}, \qquad S_i^{-} = \sqrt{\sum_{j} (r_{ij} - A_j^{-})^2}$
Compute Relative Closeness:
$C_i = \dfrac{S_i^{-}}{S_i^{+} + S_i^{-}}$
where $0 \le C_i \le 1$.
Rank Solutions: Rank solutions in descending order of $C_i$.
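A compact sketch of these four steps, written with NumPy, is given below; the weight vector and benefit/cost flags correspond to the two objectives ($f_1$ maximize, $f_2$ minimize) and are exposed as parameters because the sensitivity analysis in Section 4 varies them.

```python
# Compact TOPSIS ranking of the Pareto set (S15-S18) with NumPy.
import numpy as np

def topsis_rank(F: np.ndarray, weights=(0.5, 0.5), benefit=(True, False)) -> np.ndarray:
    """F: (n_solutions, n_objectives) matrix of raw objective values.
    Returns relative-closeness scores C_i in [0, 1]; higher is better."""
    w = np.asarray(weights, dtype=float)
    # S15: vector normalization of the decision matrix, then weighting
    R = F / np.linalg.norm(F, axis=0)
    V = R * w
    # S16: ideal (A+) and negative-ideal (A-) values per objective
    A_pos = np.where(benefit, V.max(axis=0), V.min(axis=0))
    A_neg = np.where(benefit, V.min(axis=0), V.max(axis=0))
    # S17: Euclidean separation measures
    S_pos = np.sqrt(((V - A_pos) ** 2).sum(axis=1))
    S_neg = np.sqrt(((V - A_neg) ** 2).sum(axis=1))
    # S18: relative closeness to the ideal solution
    return S_neg / (S_pos + S_neg)

# Example: rank solutions by descending closeness, as in Table 4
# scores = topsis_rank(pareto_front); order = np.argsort(-scores)
```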
3.8. Step 7: Implementation and Feedback
The selected optimal configuration (often the top-ranked by TOPSIS or chosen based on priorities) is implemented (S19). Continuous monitoring (Steps 1–3) provides feedback for iterative refinement and adaptation (S20, S21) [
33,
34].
3.9. Process Response Measurement and Uncertainty
Process response metrics were measured as follows:
Operational KPIs: Cycle Time (CT) and Idle Time (IT) were calculated directly from the timestamped event log by measuring the duration between start and end events for each activity and the time between the end of one activity and the start of the next. Throughput (TP) was calculated as the number of completed cases per day (a brief computational sketch of these calculations is given below).
Environmental KPIs: Energy Consumption (EC) was measured by dedicated power meters on each machine, with data logged in kWh. Carbon Emissions (CE) were derived from EC using standard regional emission factors. Waste Generation (WG) was quantified based on the count of rejected products at inspection points, multiplied by the average weight per unit.
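As an illustration of the operational-KPI computations, the following sketch derives CT, IT, and TP from a timestamped event log with pandas; the column names (CaseID, Activity, StartTime, EndTime) are assumptions about the log schema rather than the actual field names.

```python
# Sketch of the operational-KPI calculations described above; the column names
# are assumptions about the event-log schema.
import pandas as pd

def operational_kpis(events: pd.DataFrame) -> dict:
    events = events.sort_values(["CaseID", "StartTime"])
    # Cycle Time: duration between start and end of each activity (minutes)
    cycle_time = (events["EndTime"] - events["StartTime"]).dt.total_seconds() / 60.0
    # Idle Time: gap between the end of one activity and the start of the next
    next_start = events.groupby("CaseID")["StartTime"].shift(-1)
    idle_time = (next_start - events["EndTime"]).dt.total_seconds().dropna() / 60.0
    # Throughput: completed cases per day
    completed = events.groupby("CaseID")["EndTime"].max()
    throughput = completed.dt.floor("D").value_counts().mean()
    return {
        "avg_cycle_time_min": cycle_time.mean(),
        "avg_idle_time_min": idle_time.mean(),
        "avg_throughput_per_day": throughput,
    }
```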
Regarding uncertainty, the primary source lies in sensor precision and data logging granularity. To mitigate this, we worked with data from recently calibrated sensors as per the facility’s maintenance schedule. A data cleaning and pre-processing pipeline was implemented to handle missing values and filter out obvious outliers before analysis. The results presented are, therefore, based on the highest quality data available, though we acknowledge that a degree of measurement uncertainty inherent to any industrial sensing system remains.
3.10. Case Study
This case study focuses on a state-of-the-art tube manufacturing facility that operates ten online production lines and four offline machines to meet the diverse requirements of the cosmetic, dental care, and decoration sectors. Each production area is specialized to meet specific customer needs. The cosmetics sector includes lines such as TL201–TL207, which are dedicated to producing high-quality tubes for beauty products. The dental sector, comprising the lines TL205, TL206, TL209, and TL210, specializes in creating packaging customized for oral care products. Meanwhile, the decoration sector focuses on enhancing the appearance of semi-finished products using silk screening and hot stamping techniques. Although production technology is largely uniform across most lines, the dental sector integrates additional specialized processes for its unique product requirements. Among these, the TL209 line is chosen as a representative model for this study, providing an insight into the operational workflows and optimization strategies.
The manufacturing process follows a structured sequence of interconnected stages, each supported by advanced machinery and quality control mechanisms. The process begins with laminate strips being processed in the tube production machine, where they are cut, welded, and coated with a plastic layer for durability. These semi-finished tubes are then calibrated and cooled using a vacuum module to ensure structural integrity and precise dimensions. Following this, the cutter and retractor module inspects the tubes for any defects, utilizing eddy current technology to measure tube diameters and verify the quality of the aluminum layers. Any detected anomalies are flagged for further analysis.
Subsequently, the head and capping stage mold the tube heads using molten plastic and securely attach the caps. Rigorous quality checks are performed at this stage to ensure proper alignment and sealing. The process then progresses to the printing and varnishing stages, where up to eight colors are applied using UV-cured inks, followed by a varnish layer for added protection. Synchronization between the printing and varnishing machines is critical to maintain consistent quality across the production batch. The final stage involves the automated packaging of finished tubes in cartons or pallets according to customer specifications.
Real-time monitoring plays a crucial role in maintaining production efficiency and quality. Sensors placed strategically along the production line collect data on key parameters, including tube counts, diameters, cap alignment, sealing, and printing quality. Defective tubes are identified and automatically diverted for further inspection. The TL209 line is particularly notable for its advanced quality control features, which include storage devices to ensure continuous operation, multiple-stage eddy current checks to maintain material integrity, and UV curing systems to allow rapid drying without compromising production speed.
By examining the TL209 line, this case study highlights how real-time data and advanced technological integration can optimize both operational efficiency and sustainability. It also demonstrates how specialized processes across dental sector lines like TL205, TL206, and TL210 contribute to meeting specific product demands. The insights derived from this study emphasize the importance of leveraging modern manufacturing technologies to achieve a balance between high productivity and environmental responsibility, setting a benchmark for similar facilities.
3.11. Production Flow and Sensor Placement
The production process begins with the tube production machine, where laminate strips are transformed into semi-finished tubes. This stage includes cutting, high-frequency welding, and coating the tubes with a plastic layer. These tubes then pass through a calibration and cooling phase. Sensors S1 (optical counter), S2 (diameter gauge), and S3 (eddy current sensor for aluminum layer integrity) are placed at this stage to monitor tube formation and detect anomalies in size or structure.
After this, the tubes move to the head and capping unit, where tube heads are formed using molten plastic and caps are attached. Sensors S4 (vision system) and S5 (torque sensor) track cap alignment and sealing quality. The subsequent stage is printing and varnishing, where designs are applied using UV-dried inks, and a varnish layer is added for protection. Sensors S6 through S8 (vision systems for color registration and defect detection) ensure that the final appearance satisfies quality standards. Energy consumption for each major unit (Extruder, Header, Capper, Printing) is monitored by dedicated power meters integrated into the MES. The final step involves packaging, where the tubes are automatically sorted, oriented, and placed in cartons or pallets according to customer specifications. While confidentiality agreements prevent the publication of equipment photographs,
Figure 3 and
Figure 4 provide schematic diagrams of representative production lines.
The Life Cycle Assessment (LCA) for this facility is summarized in
Figure 5. It evaluates the environmental and operational impacts of the production process, from the sourcing of raw materials to the final product. The process begins with materials like Polyfoil
® (a combination of plastic and aluminum for barrier properties), polyethylene (PE) for flexible tubes, and coextruded materials for durability. These materials ensure quality and suitability for a variety of applications.
Energy consumption is a major component, and electricity drives key operations like extrusion, welding, and UV printing. Renewable energy, such as solar panels, reduces the dependency on fossil fuels. Water is mainly used for cooling in extrusion and calibration processes, and centralized treatment plants have significantly reduced overall water usage.
The manufacturing process includes tube production (laminate handling, welding, extrusion), quality control (calibration and inspection), heading and capping (attaching tube heads and caps), printing and varnishing (for aesthetics and durability), and packaging (organizing and boxing). Each stage ensures precision and efficiency in production.
Waste management focuses on recycling, with 85% of production waste being reused and 15% thermally recovered. Packaging waste is minimized by using materials with a high recycled content. Environmental impacts, including CO2 emissions, have been reduced through energy-efficient systems like pellet heating, and low-solvent paints are used to improve resource efficiency.
4. Results and Discussion
This section presents the results obtained from applying the proposed integrated framework to the real-world tube manufacturing case study. The findings validate the framework’s capability to analyze performance, identify trade-offs, and provide optimized solutions balancing operational efficiency and environmental sustainability, contextualized against relevant prior research.
4.1. Validation Approach
The effectiveness of the framework was validated using historical and real-time data collected from sensors and MES logs within the tube manufacturing facility described in
Section 3. The data encompassed operational parameters (cycle times, throughput, idle times, defect rates) and environmental metrics (energy consumption, derived emissions, waste generation), providing a rich dataset for analysis and optimization.
The process mining data transformation converted raw sensor data (32,257 rows of hourly snapshots across 123 columns) into actionable optimization inputs. These raw data were transformed into a granular event log suitable for process mining (S1). An event was generated each time a machine’s status sensor changed its state (e.g., from ‘idle’ to ‘running’). For continuous sensors like power meters, an event was recorded when the reading crossed a pre-defined operational threshold. This state-change detection approach converted the time-series data into a structured log of 391,127 discrete events (S2), each containing a CaseID, Activity, Timestamp, Resource, and Event Type.
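A simplified sketch of this state-change detection is given below; the status column, power threshold, and identifier columns are illustrative assumptions rather than the actual sensor schema.

```python
# Illustrative state-change detection (S1, S2): hourly snapshot rows are turned
# into discrete events whenever a status column changes value or a continuous
# reading crosses a threshold. Column names and the threshold are assumptions.
import pandas as pd

POWER_THRESHOLD_KW = 5.0  # assumed operational threshold for the power meter

def snapshots_to_events(snapshots: pd.DataFrame) -> pd.DataFrame:
    snapshots = snapshots.sort_values("Timestamp")
    events = []
    # Discrete status sensors: emit an event on every state change
    changed = snapshots["MachineStatus"].ne(snapshots["MachineStatus"].shift())
    for _, row in snapshots[changed].iterrows():
        events.append({
            "CaseID": row["OrderID"], "Activity": row["Station"],
            "Timestamp": row["Timestamp"], "Resource": row["Machine"],
            "EventType": f"status:{row['MachineStatus']}",
        })
    # Continuous sensors: emit an event when the reading crosses the threshold
    above = snapshots["Power_kW"] > POWER_THRESHOLD_KW
    crossed = above.ne(above.shift()) & above.shift().notna()
    for _, row in snapshots[crossed].iterrows():
        events.append({
            "CaseID": row["OrderID"], "Activity": row["Station"],
            "Timestamp": row["Timestamp"], "Resource": row["Machine"],
            "EventType": "power_threshold_crossing",
        })
    return pd.DataFrame(events).sort_values("Timestamp").reset_index(drop=True)
```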
Table 1 presents the first 10 events from this comprehensive event log, illustrating the granular nature of the collected data.
4.2. Baseline Performance Analysis and Bottleneck Identification (Pre-Optimization)
Before implementing optimization strategies, a baseline analysis was conducted using the integrated Process Mining and LCA monitoring capabilities of the framework (IPSMF).
4.2.1. Downtime Analysis
Figure 6 illustrates the monthly downtime trends. Monthly-aggregated downtime reveals a highly skewed distribution: the Production Line persistently accounts for the largest share of lost productive time, confirming its role as the system’s critical bottleneck [
7]. The Large Printing Unit follows as the second-largest source but displays sharper peaks, hinting at sporadic, event-driven stoppages that could be mitigated through targeted maintenance. Mid-tier assets (Packing Unit, Extruder, Capping Machine, and Packaging Machine) cluster far below the top two, indicating routine wear and tear rather than systemic failure. The Small Printing Unit shows minimal downtime, suggesting either lower utilization or better control. This profile mirrors observations in other discrete-manufacturing studies, where a single high-capacity line and one specialized machine dominate downtime while secondary equipment exerts only incremental influence [
8]. Consequently, dedicating resources first to the Production Line and then to stabilizing the Large Printing Unit should deliver the greatest uptime gains.
4.2.2. Process Discovery and Workflow Visualization
The process discovery analysis using Inductive Miner revealed the actual manufacturing workflow across nine active stations (201–209). The automatically generated process model (
Figure 7), a direct result of applying process discovery algorithms (S3), identifies four critical bottleneck stations (Stations 202, 203, 204, and 205), marked in red in the visualization. This analysis of the model allows for the identification of inefficiencies (S4) and constraints (S5). The process flow shows distinct transition patterns:
High-frequency transitions (>4000 events): Station 202 → 203 (5592), Station 203 → 204 (4474), Station 204 → 205 (4646)
Medium-frequency transitions (2000–4000 events): Station 205 → 206 (3408), Station 206 → 207 (3276), Station 207 → Process End (6379)
Critical bottleneck transitions: Process Start → 202 (4755), indicating initial process constraints
4.2.3. Duration by Activities
The analysis of station durations revealed significant variations in operational hours across the manufacturing line (
Figure 8). The four bottleneck stations (202–205) showed the highest operational durations, each exceeding 9900 h:
Station 202: 9906 h
Station 203: 9922 h
Station 204: 9970 h
Station 205: 10,003 h
In contrast, non-bottleneck stations (206–209) operated for significantly fewer hours, with Station 209 showing the lowest duration at 3680 h. The average duration across all stations was 9094.1 h.
4.2.4. Cycle Time and Throughput Analysis
The cycle time and throughput revealed significant performance disparities between stations (
Figure 9):
Cycle Time Range: 9.4 to 16.0 min per event
Best Performer: Station 207 (9.4 min/event)
Worst Performer: Station 209 (16.0 min/event)
System Average: 11.7 min/event
The throughput analysis showed:
Highest Throughput: Station 207 (159 events/day)
Lowest Throughput: Station 209 (94 events/day)
Average Throughput: 126.3 events/day
System Efficiency: 87%
In particular, bottleneck stations (202–205) demonstrated moderate cycle times (10.6–13.6 min/event) but maintained relatively consistent throughput rates (108–137 events/day), suggesting that their bottleneck status is primarily due to high processing volumes rather than inefficient operations.
4.3. Station Performance Analysis
The comprehensive analysis of station performance confirmed the identification of bottlenecks and revealed operational patterns:
4.3.1. Event Processing Distribution
Event processing was unevenly distributed across stations:
Station 207: Highest event volume (63,798 events)
Station 208: Second highest (60,281 events)
Station 203: Among bottlenecks, processed 55,921 events
Station 204: Lowest among active stations (44,746 events)
4.3.2. Bottleneck Characteristics
The four identified bottleneck stations (202–205) share common characteristics:
Extended operational durations (>9900 h each)
Moderate cycle times (10.6–13.6 min/event)
Critical position in the process flow (early to mid-process)
High transition counts with neighboring stations
These findings suggest that bottlenecks arise from their sequential positioning and the cumulative effect of processing delays rather than the inefficiency of the individual station.
4.4. Process Flow Efficiency
The process flow analysis revealed:
Primary Flow Path: Start → 202 → 203 → 204 → 205 → 206 → 207 → End
Alternative Paths: Including cross-transitions (e.g., 202 → 205: 729 events)
Rework Loops: Evidence of backward transitions (e.g., 205 → 204: 3445 events)
System Throughput: 391,127 total events processed
The presence of significant backward transitions, particularly from Station 205 to 204, indicates quality control issues requiring rework, contributing to the bottleneck formation in these stations.
Quantitative Baseline and Bottleneck Confirmation
The quantitative baseline metrics (
Table 2) confirm the bottlenecks suggested by the qualitative analyses. The relatively long cycle times and high idle times for the Printing (Station 204) and Packaging (Station 205) units are typical constraints found at the end of the primary bottleneck segment. The integration of operational data with dynamically calculated environmental metrics (EC, CE, WG), corresponding to the execution of the Life Cycle Assessment inventory (S6) and calculation of utilization rates (S7), provides a richer baseline than traditional operational analysis alone, setting the stage for multi-objective optimization.
The comprehensive analysis of the tube manufacturing process revealed significant operational insights throughout the production line. The system processed a total of 391,127 events during the observation period, with an average cycle time of 11.7 min per event and an average throughput of 126.3 events per day, achieving an overall system efficiency of 87%. The process discovery analysis identified four critical bottleneck stations (Stations 202–205) out of nine active stations, with these bottlenecks characterized by extended operational durations exceeding 9900 h each.
Station 207 emerged as the best performer with a cycle time of 9.4 min per event and the highest throughput of 159 events per day, while Station 209 showed the poorest performance with a cycle time of 16.0 min per event and the lowest throughput of 94 events per day. The bottleneck stations demonstrated moderate cycle times ranging from 10.6 to 13.4 min per event but maintained relatively consistent throughput rates between 108 and 137 events per day, suggesting their bottleneck status arose from high processing volumes rather than operational inefficiency.
The process flow analysis revealed a primary production path from Start → 202 → 203 → 204 → 205 → 206 → 207 → End, with significant backward transitions indicating rework loops, particularly from Station 205 to 204 (3445 events). High-frequency transitions occurred between sequential stations, with the 202 → 203 transition recording 5592 events, while alternative paths and cross-transitions (such as 202 → 205 with 729 events) demonstrated the flexibility of the production system. The average defective rate across the system was 2.2%, with bottleneck stations showing higher rates (2.3–3.5%) compared to non-bottleneck stations (1.5–1.9%).
Energy consumption and environmental metrics showed a direct correlation with production volumes and cycle times. The total daily energy consumption of the system reached 4015 kWh, resulting in 2007.5 kg of CO2 emissions and 146.5 kg of waste generation. Stations with higher throughput, such as Station 207, demonstrated better energy efficiency per unit produced despite higher absolute consumption values. The inspection station (Station 209) showed the lowest throughput, but maintained moderate defective rates at 2.2%, indicating its critical role in quality control despite apparent inefficiency in processing speed.
The operational and environmental constraints in
Table 3 establish the boundary conditions for the optimization of the tube-making process. These constraints are derived from actual production data, with operational limits that protect equipment (a maximum utilization of 95%) while maintaining minimum viable throughput (94 units/day). The environmental constraints reflect the worst-case scenarios observed, restricting energy consumption to 600 kWh/day and emissions to 300 kg CO2/day per machine. Process constraints maintain system efficiency at an 87% minimum while permitting cycle times between 564 and 960 s, based on actual station performance ranges. Resource allocation provides sufficient staff (four operators per shift) with built-in capacity buffers (15%) for bottleneck stations, while data management constraints ensure real-time monitoring capability with 250 ms latency and 1 Hz sensor sampling rates.
4.5. Multi-Objective Optimization Results: Efficiency vs. Sustainability Trade-Off
The MOOM-OEP was solved with the Non-Dominated Sorting Genetic Algorithm II (NSGA-II) using the following hyperparameters: population size = 100, crossover probability = 0.9, mutation probability = 0.1, and a termination criterion of 200 generations. With these settings, the optimization proceeded as follows: the objectives were defined (S8) and the objective functions formulated (S9); NSGA-II then initialized the population (S10), performed non-dominated sorting to rank solutions (S11), computed crowding distances to preserve diversity (S12), applied simulated binary crossover and polynomial mutation (S13), and iterated until convergence, ultimately producing the Pareto-optimal front evaluated in stage S14 (
Figure 10).
Trade-Off Analysis
The resulting Pareto front reveals the characteristic trade-off between maximizing operational efficiency (f1) and minimizing environmental impact (f2) in the tube-making process. Starting from a baseline efficiency of 87.2% with 2007.5 kg CO2/day emissions, optimization identified solutions ranging up to 95.2% efficiency. The convex shape indicates an initial region (87–91%) where significant environmental improvements can be achieved with minimal efficiency sacrifice, followed by a steep region (91–94%) where marginal efficiency gains require proportionally smaller environmental compromises, and finally a plateau region (94–95%) where further efficiency improvements lead to increased emissions due to resource intensification.
4.6. Solution Ranking and Selection Using TOPSIS
The TOPSIS method ranked Pareto-optimal solutions based on actual process constraints and performance metrics (
Table 4). This involved normalizing the solutions (S15), identifying the ideal and negative-ideal solutions (S16), computing separation measures (S17), and finally ranking the solutions based on their relative closeness to the ideal solution (S18).
Ranking Insights
The TOPSIS ranking reveals that solution P9 (92.3% efficiency, 1758.2 kg CO2/day) achieves the optimal balance between operational efficiency and environmental impact. This solution represents a 5.1% efficiency improvement and a 12.4% emission reduction compared to the baseline. The top-ranked solutions (P8-P11) cluster around 91–93% efficiency with emissions between 1731 and 1776 kg CO2/day, indicating a sweet spot where significant improvements in both objectives are achievable. Solutions beyond P14 show diminishing returns, where marginal efficiency gains are offset by an increased environmental impact.
4.7. Sensitivity Analysis of Rankings
The sensitivity analysis (
Figure 11) examined how different weighting scenarios affect the ranking of the solutions.
Impact of Priorities
The sensitivity analysis reveals significant shifts in solution rankings based on decision-maker priorities:
Balanced Approach (50% efficiency, 50% sustainability): Solution P9 ranks first, offering 92.3% efficiency with 1758.2 kg CO2/day emissions.
Efficiency Focus (70% efficiency, 30% sustainability): Solution P10 becomes optimal, achieving 92.7% efficiency at the cost of slightly higher emissions (1743.4 kg CO2/day).
Sustainability Focus (30% efficiency, 70% sustainability): Solution P7 ranks highest, prioritizing lower emissions (1796.5 kg CO2/day) while maintaining 91.3% efficiency.
The analysis demonstrates that solutions P7–P10 consistently rank among the top choices in different weighting scenarios, suggesting their robustness as compromise solutions. Extreme solutions (P1–P3 and P16–P18) show the highest sensitivity to weight changes, making them less suitable for scenarios where priorities may change.
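For completeness, the re-weighting described above corresponds to re-running the TOPSIS closeness calculation with different weight vectors, as in the short sketch below, which reuses the hypothetical topsis_rank helper from Section 3.7 and a placeholder pareto_front array holding the 18 objective vectors.

```python
# Re-ranking the Pareto set under the three weighting scenarios discussed above,
# reusing the hypothetical topsis_rank helper sketched in Section 3.7.
import numpy as np

# pareto_front: (18, 2) array of (efficiency, emissions) values - placeholder
scenarios = {"balanced": (0.5, 0.5), "efficiency": (0.7, 0.3), "sustainability": (0.3, 0.7)}
for name, w in scenarios.items():
    scores = topsis_rank(pareto_front, weights=w, benefit=(True, False))
    print(f"{name}: top-ranked solution index = {int(np.argmax(scores))}")
```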
4.8. Discussion Summary
The integrated framework successfully optimized the tube making process, identifying 18 Pareto optimal solutions that improve the baseline performance (87.2% efficiency, 2007.5 kg CO2/day). The optimal balanced solution (P9) achieves the following:
5.1% improvement in operational efficiency (87.2% → 92.3%)
12.4% reduction in carbon emissions (2007.5 → 1758.2 kg CO2/day)
Maintained throughput above minimum requirements (>94 units/day)
Reduced bottleneck impact through workload redistribution
The practical value of the framework is demonstrated by the following:
Real-time integration of process mining with environmental metrics
Identification of viable trade-off solutions for different strategic priorities
Quantitative guidance for bottleneck mitigation (Stations 202–205)
Actionable recommendations for reducing rework loops (205 → 204)
These results validate the framework’s capability to navigate the complex trade-offs between operational efficiency and environmental sustainability in real-world manufacturing systems, providing decision makers with data-driven insights for sustainable process improvement.
5. Conclusions
This research addressed the critical imperative for modern manufacturers to simultaneously improve operational efficiency and environmental sustainability, moving beyond traditional, often siloed, improvement efforts. We presented and validated a comprehensive, cloud-based, data-driven optimization framework that uniquely integrates real-time Process Mining with dynamic Life Cycle Assessment (LCA) and advanced Multi-Objective Optimization (MOO) techniques, specifically NSGA-II and TOPSIS. The application of this framework to a complex, real-world tube manufacturing facility processing 391,127 events across nine active stations provided significant insight and demonstrated its practical effectiveness. Through the integrated analysis of live operational event logs and environmental sensor data, the framework successfully:
Identified and quantified four critical bottleneck stations (202–205) within the production flow, accounting for extended operational durations exceeding 9900 h each and cycle times ranging from 10.6 to 13.4 min per event.
Dynamically assessed key environmental performance indicators, revealing baseline metrics of 4015 kWh daily energy consumption, 2007.5 kg CO2/day emissions, and 146.5 kg/day waste generation across the manufacturing system.
Explicitly mapped the inherent trade-offs between maximizing operational efficiency (87.2–95.2%) and minimizing environmental impact (1715–2007 kg CO2/day) through an optimal Pareto front of 18 solutions.
Generated optimized operational configurations that offer quantifiable improvements, with the optimal balanced solution (P9) achieving an efficiency of 92.3% and 1758.2 kg CO2/day emissions representing a 5.1% efficiency gain and a 12.4% reduction in emissions from baseline.
Provided a structured decision support mechanism (TOPSIS) that ranked solutions based on varying organizational priorities, demonstrating the adaptability of the framework through sensitivity analysis across efficiency-focused, sustainability-focused, and balanced scenarios.
The primary contribution of this work lies in the development and successful empirical validation of this holistic and dynamic integration methodology. By processing 32,257 hourly sensor measurements across 123 parameters, the framework bridges a significant gap in the literature and practice by combining the diagnostic power of real-time Process Mining with the evaluative capabilities of dynamic LCA. The system successfully identified process anomalies (1547 total) and their correlation with downtimes (r = 0.426), and fed this unified intelligence into a robust MOO engine that generated actionable optimization strategies.
The findings underscore substantial potential for manufacturers to achieve synergistic gains. The framework identified specific improvement opportunities, including redistributing workload from bottleneck stations, implementing predictive maintenance based on anomaly patterns, and reducing rework loops (205 → 204 transition) by 50%. These optimizations demonstrate how informed, data-driven decision making can simultaneously improve resource utilization, reduce cycle times from 960 to 564 s, and minimize environmental impact.
While validated in tube manufacturing, the proposed framework is designed to be adaptable. Its principles can be applied to other discrete or batch-based manufacturing sectors, such as pharmaceuticals, electronics assembly, or food and beverage packaging. The core methodology of integrating Process Mining, dynamic LCA, and MOO remains the same; only the specific operational KPIs, sensor data streams, and LCA impact models would need to be customized for the target industry. This demonstrates the framework’s potential as a general-purpose tool for sustainable manufacturing optimization.
Several limitations remain, pointing to avenues for future research. The framework’s effectiveness depends on the quality, availability, and granularity of sensor and event data, so deployment in data-scarce environments would be challenging. In addition, real-time execution of the MOO engine is computationally intensive and demands substantial cloud resources. This study also focused on a specific set of operational and environmental KPIs; a more comprehensive analysis could include other factors like water usage, social metrics, or detailed supply chain impacts. Future research could focus on extending the framework by incorporating machine learning for predictive analytics, particularly leveraging the observed 1–3 h lag between sensor anomalies and significant downtimes. The successful correlation between sensor anomalies and operational performance (predictive precision 76%) suggests the potential for advanced predictive maintenance strategies. For instance, future work could explore the use of deep learning on temporal data for more robust characterization and prediction, building on techniques from recent studies.
In conclusion, this research establishes a robust, scalable, and adaptable model for sustainable manufacturing, validated through real-world application that achieves both operational efficiency improvements (up to 5.1%) and environmental impact reductions (up to 12.4%). The framework provides a powerful toolkit to help industries navigate the dual challenges of the 21st century, demonstrating that operational excellence and environmental responsibility are not mutually exclusive but can be achieved synergistically through integrated, data-driven optimization.