3.1. Reliability Block Diagrams
In the field of reliability engineering, the configuration of components within a system significantly influences the overall system reliability. One of the most fundamental configurations is the series connection, where components are arranged such that the failure of any single component results in the failure of the entire system. This configuration is commonly encountered in many engineering systems, particularly in power transmission, aerospace systems, and manufacturing processes [38].
In a series system, the reliability of the entire system is the product of the reliabilities of its individual components. The series model assumes statistical independence between component failures and no repair during operation. While simplistic, this assumption provides a foundational understanding of system behavior and is widely used in early-stage design and risk assessments [39].
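As a minimal sketch of the series rule (the component reliabilities below are illustrative, not values from the paper, and the function name is ours), the product R = ∏ Rᵢ can be computed as:

```python
# Series system: every component must survive, so reliabilities multiply.
# The example reliabilities below are illustrative, not values from the paper.

def series_reliability(reliabilities):
    """Reliability of a series system of independent, non-repairable components."""
    r = 1.0
    for r_i in reliabilities:
        r *= r_i
    return r

# Three hypothetical components: the product is below every individual value
print(series_reliability([0.99, 0.98, 0.97]))  # ≈ 0.9411
```

Note that the system reliability is lower than that of any single component, which is the analytical root of the single-point-of-failure concern discussed next.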
However, the inherent vulnerability of series systems—where a single point of failure leads to total system failure—has prompted the development of more robust configurations and the inclusion of redundancy in practical applications [40]. Despite this, series models remain critically important for understanding baseline system reliability and for evaluating the impact of component quality on overall performance.
Parallel system configurations are a key strategy in reliability engineering for enhancing system robustness and minimizing the risk of total system failure. In contrast to series systems, parallel systems are designed so that the failure of one or more components does not necessarily lead to overall system failure, as long as at least one component remains functional. This architectural approach is widely applied in critical systems where uninterrupted operation is essential, such as in aerospace, nuclear power plants, and data centers [38,39].
In a parallel configuration, system reliability increases with the number of redundant components. As a result, parallel arrangements are often used to introduce fault tolerance and ensure high availability, especially in mission-critical applications [40].
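A complementary sketch for the parallel rule (illustrative values; the function name is ours): the system fails only when every redundant unit fails, so R = 1 − ∏(1 − Rᵢ).

```python
# Parallel (active redundancy): the system fails only if ALL units fail.
# Example values are illustrative, not taken from the paper.

def parallel_reliability(reliabilities):
    """Reliability of a parallel system of independent components."""
    unreliability = 1.0
    for r_i in reliabilities:
        unreliability *= (1.0 - r_i)
    return 1.0 - unreliability

# Each added redundant unit multiplies the remaining unreliability by (1 - R)
print(parallel_reliability([0.90, 0.90]))        # ≈ 0.99
print(parallel_reliability([0.90, 0.90, 0.90]))  # ≈ 0.999
```

This illustrates why redundancy is so effective: two mediocre 0.90 units already outperform a single 0.98 component.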
Parallel systems may be implemented as active redundancy—where all units operate simultaneously—or as standby redundancy, where backup units are activated only upon failure of the primary ones. Each approach has distinct implications for system reliability and maintenance strategy [41]. Moreover, advanced reliability modeling often incorporates non-identical component reliabilities, dependencies, and repair policies, allowing for more accurate assessments of system performance in real-world environments [42].
To construct the RBD of the dual-axis solar tracking prototype, the system components must first be categorized based on their role within the overall architecture. Based on its functionality, the solar tracking system can be divided into two primary subsystems: the control subsystem and the data transfer subsystem, as depicted in Figure 1. The control subsystem is responsible for executing key operations such as the homing function, the flat position function, and the optimal sunlight positioning function. In contrast, the data transfer subsystem handles the acquisition of sensor data and its transmission to the virtual IoT platform for remote monitoring and analysis. This classification, presented in Table 1, allows for a structured representation of how each component contributes to the system’s functionality and reliability. The failure rates of each element are expressed in Failures Per Million Hours (FPMH) according to the international standard presented in [43].
The Arduino Mega 2560 MCU (Arduino S.r.l., Monza, Italy) is commonly housed in a plastic enclosure when installed indoors, but can also be exposed in outdoor setups. Failure modes primarily include corrosion of contacts and degraded solder joints due to humidity and condensation. In outdoor use, UV radiation and temperature cycling can cause plastic warping and micro-cracking of solder joints, significantly increasing the failure rate. The TB6560 Motor Driver, typically packaged as a heatsinked PCB-mounted module, handles stepper motor control. Its primary vulnerability is overheating, especially when ventilation is poor. Outdoor conditions such as rain ingress and high humidity can lead to short circuits or corrosion of exposed terminals, drastically reducing reliability.
The Tongling 5 V module, enclosed in a semi-sealed plastic case, is an electromechanical relay that can suffer from arcing, contact pitting, and coil degradation. High outdoor failure rates are due to moisture penetration, which leads to corrosion or even coil failure.
The Weidmuller 24 V Relay is an industrial-grade relay with relatively better sealing. Nonetheless, oxidation of contacts and thermal fatigue due to outdoor temperature fluctuations can cause operational failure over time, especially in less-protected outdoor installations.
The Astrosyn Stepper Motor is usually mounted without full environmental sealing. Dust ingress and water exposure can lead to bearing failure or internal corrosion. Over time, these stressors result in increased resistance or stalling. The Superior Electric Slo-Syn represents another stepper motor variant, vulnerable to similar issues as the Astrosyn motor. Wind-blown particles, thermal cycling, and humidity can reduce the insulation resistance, potentially causing shorts or excessive wear in the gear mechanism.
Among the sensor components is the TEMT6000 module, a light sensor exposed to ambient light and UV radiation. In outdoor conditions, degradation of the lens material and solder fatigue are common, and moisture ingress can cause measurement drift or total failure.
Another sensor component, the ACS172, is an analog current sensor in a plastic DIP or SOIC package. While relatively robust, long-term outdoor use may cause degradation of the epoxy encapsulant, leading to pin corrosion or erratic readings due to electromagnetic interference amplified by weather changes. The ML8511 is a UV sensor that is itself sensitive to environmental damage: outdoor use exposes it to the very UV radiation it measures, which can paradoxically degrade the sensor. PCB corrosion and encapsulation failure are also concerns under high humidity and rain.
The BH1750, packaged in a small IC form, is a digital light sensor that can degrade in performance due to condensation, lens fogging, or PCB corrosion. Even indoors, fluctuating humidity can cause internal oxidation over time. The DHT22 is a digital temperature and humidity sensor that is notoriously sensitive to condensation. In outdoor setups, if not well-sealed, it suffers from accuracy drift, rust on pins, and eventual sensor failure due to exposure to rain or frost.
Mechanical encoders are vulnerable to dust and moisture. Outdoor conditions may lead to rusting of mechanical parts and misreading due to signal bounce or degradation of optical elements in some variants. On the other hand, limit switches are typically mechanical and enclosed in plastic or metal housings. However, they can still fail from moisture ingress, leading to contact corrosion or mechanical jamming due to dirt buildup.
The SIM800L V2 module, with its compact design, is sensitive to temperature extremes and condensation. Failures can result from corrosion of antenna connectors, solder fatigue on small pins, and Electrostatic Discharge (ESD) events during storms. The Solar Charge Controller is critical for battery health and is often mounted near the panels. Outdoor usage may lead to degradation of connectors, internal MOSFET failure due to heat, or board-level corrosion, especially if the housing is not IP-rated. The LM2596 buck converter, when exposed to outdoor elements, may fail due to overheating or corrosion of its inductor or capacitors; electrolytic capacitors are particularly prone to drying out in heat or swelling due to moisture. Finally, the Varta 12 V·44 Ah battery is a sealed lead-acid battery that performs well under moderate conditions but suffers under high temperatures, which accelerate electrolyte evaporation and plate degradation. Cold temperatures can reduce capacity and cause internal pressure buildup, leading to case rupture in extreme cases.
Based on the above-mentioned system components and their respective failure rates, the RBD of the dual-axis solar tracker is illustrated in Figure 2.
The first step in computing the reliability of the entire system is to convert the failure rates from FPMH to failures per hour (λ in 1/h). The subsystem λ values are calculated for both indoor and outdoor conditions for comparison purposes, in order to determine whether the solar tracker’s subsystem components can withstand the relevant stress factors and weather conditions. As depicted in Figure 2, the reliability values are calculated independently for the control circuits, motors, sensors, and power supply components. For this calculation, only the midpoint value of each failure rate interval is considered.
For the control circuits, which encompass the first four components, the associated failure rate exhibits a significant difference based on the operating environment. Specifically, the indoor failure rate is λcontrol, indoor = 2.35 × 10⁻⁶ failures/h, whereas the outdoor rate is substantially higher at λcontrol, outdoor = 8 × 10⁻⁶ failures/h. The reliability of the control subsystem, denoted as R(t), was subsequently computed using the exponential reliability function R(t) = e^(−λt) for a time interval of t = 1000 h. The resulting values are R(1000)control, indoor ≈ 0.9977 for indoor usage, and R(1000)control, outdoor ≈ 0.9920 for outdoor usage.
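These figures can be reproduced with the exponential model and the FPMH-to-1/h conversion; the following is a verification sketch using the rates quoted above (the helper name is ours):

```python
import math

def reliability_from_fpmh(fpmh, hours):
    """Exponential reliability R(t) = exp(-λ·t), with λ supplied in FPMH."""
    lam = fpmh / 1e6  # failures per million hours -> failures per hour
    return math.exp(-lam * hours)

# Control-circuit midpoint rates from the text: 2.35 FPMH indoor, 8 FPMH outdoor
print(round(reliability_from_fpmh(2.35, 1000), 4))  # ≈ 0.9977
print(round(reliability_from_fpmh(8.0, 1000), 4))   # ≈ 0.992
```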
The automation circuits, encompassing the following four components, exhibit distinct failure rates based on the operational environment: λautomation, indoor = 1.975 × 10⁻⁷ failures/h and λautomation, outdoor = 5.965 × 10⁻⁶ failures/h. Consequently, the reliability of the automation and gear subsystem after t = 1000 h is R(1000)automation, indoor ≈ 0.9998, and R(1000)automation, outdoor ≈ 0.9941 for indoor and outdoor usage, respectively.
The RBD for the data transfer subsystem is structured as a combination of series (GSM module) and parallel configurations (sensors). This reflects the system’s dependency on multiple components for successful data transmission. Specifically, sensor data will fail to be transmitted if (a) all sensors simultaneously fail to capture environmental data, or (b) the GSM/GPRS module becomes faulty, preventing communication with the ThingSpeak server. This configuration, illustrated in Figure 2, highlights critical points of failure that must be addressed to ensure the reliability of the data transfer process. Therefore, the reliability of the following five components (9 through 13), connected in a parallel configuration, under indoor and outdoor conditions for a time interval of t = 1000 h is extremely high, computed as approximately R(1000)data transfer, indoor, outdoor ≈ 1.0 (specifically, 1 − 5.46 × 10⁻²⁰). In reliability engineering, a value this close to unity indicates a system where the probability of all components failing simultaneously is negligible.
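The near-unity figure follows from multiplying five already-small failure probabilities. A sketch with hypothetical per-sensor rates (the actual Table 1 values are not reproduced here) illustrates the effect:

```python
import math

def parallel_unreliability(lams, t):
    """Probability that ALL parallel units have failed by time t (exponential model)."""
    q = 1.0
    for lam in lams:
        q *= 1.0 - math.exp(-lam * t)
    return q

# Five hypothetical sensor failure rates in failures/h (illustrative only)
sensor_lams = [5e-6, 8e-6, 1e-5, 2e-5, 1.2e-5]
q = parallel_unreliability(sensor_lams, 1000)
print(q)        # a product of five small terms: vanishingly small
print(1.0 - q)  # subsystem reliability, numerically indistinguishable from 1.0
```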
Regarding the GSM/GPRS module, comprising the next two components in a series connection, the computed lambda values are λdatatransfer, indoor = 6.25 × 10⁻⁷ failures/h for indoor usage and λdatatransfer, outdoor = 2.10 × 10⁻⁶ failures/h for outdoor usage. Substituting these values for t = 1000 h results in reliability values of R(1000)datatransfer, indoor ≈ 0.999375 and R(1000)datatransfer, outdoor ≈ 0.9979.
Concerning the last three components, which are responsible for the power supply unit, the corresponding lambda values are λpowersupply, indoor = 7.7 × 10⁻⁷ failures/h and λpowersupply, outdoor = 1.8 × 10⁻⁶ failures/h. The associated reliability values, computed for t = 1000 h, are R(1000)powersupply, indoor ≈ 0.99923 and R(1000)powersupply, outdoor ≈ 0.9982.
Finally, the reliability of the entire system (components 1 through 18) over a time interval of t = 1000 h is approximately R(1000)system, indoor ≈ 0.9961 for indoor usage and R(1000)system, outdoor ≈ 0.9823 for outdoor usage.
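Since the subsystems are themselves in series, these system figures can be cross-checked by summing the subsystem λ values (the parallel sensor block contributes effectively zero) and applying the exponential model once more. A verification sketch, with the rates copied from the text:

```python
import math

# Midpoint subsystem failure rates from the text, in failures/h:
# control, automation, parallel sensor block (≈ 0), GSM data transfer, power supply
indoor_lams  = [2.35e-6, 1.975e-7, 0.0, 6.25e-7, 7.7e-7]
outdoor_lams = [8.0e-6, 5.965e-6, 0.0, 2.10e-6, 1.8e-6]

def system_reliability(lams, t=1000):
    """Series combination: the equivalent failure rate is the sum of the λ values."""
    return math.exp(-sum(lams) * t)

print(round(system_reliability(indoor_lams), 4))   # ≈ 0.9961
print(round(system_reliability(outdoor_lams), 4))  # ≈ 0.9823
```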
The reliability of the entire system is highly sensitive to the environmental transition from indoor to outdoor conditions. While the total system reliability only decreased by approximately 1.38% over 1000 h, this small change is underpinned by a 351.1% increase in the system’s equivalent failure rate. This difference is largely driven by the extreme sensitivity of electromechanical components (such as the Anemometer) and the cumulative, additive effect of increased failure rates within the system’s series architecture. Consequently, long-term operational success for the outdoor application is critically dependent on focused design efforts, such as isolating the most sensitive components (e.g., through robust enclosures) or implementing redundancy, as demonstrated by the resilient parallel subsystem.
3.2. Fault Tree Analysis
FTA is a systematic, deductive methodology employed to evaluate the reliability and safety of complex systems. Originally developed in 1962 by H.A. Watson at Bell Laboratories for the U.S. Air Force’s Minuteman ICBM program, FTA has since become a cornerstone in reliability engineering across various high-risk industries, including aerospace, nuclear energy, and chemical processing [44].
The essence of FTA lies in constructing a graphical representation—a fault tree—that maps the logical relationships between system failures and their root causes. This tree begins with a “top event,” representing the undesired system failure, and branches downward through intermediate events to basic events, which are the fundamental causes of failure. Logical gates such as AND and OR are used to depict how these events combine to lead to the top event [45,46].
FTA serves both qualitative and quantitative purposes. Qualitatively, it helps identify minimal cut sets—the smallest combinations of basic events that can cause the top event—thereby highlighting critical vulnerabilities within the system. Quantitatively, it allows for the calculation of the probability of the top event occurring, based on the probabilities of the basic events and the logical structure of the fault tree [20].
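For independent basic events, the two gate types reduce to simple probability rules: an AND gate multiplies the event probabilities, while an OR gate combines them as 1 − ∏(1 − pᵢ). A small sketch (the basic-event probabilities here are hypothetical, not values from the solar tracker analysis):

```python
# Gate rules for independent basic events in a quantitative fault tree.

def and_gate(probs):
    """All inputs must occur: P = Π p_i."""
    p = 1.0
    for p_i in probs:
        p *= p_i
    return p

def or_gate(probs):
    """At least one input occurs: P = 1 - Π (1 - p_i)."""
    q = 1.0
    for p_i in probs:
        q *= 1.0 - p_i
    return 1.0 - q

# Hypothetical basic-event probabilities over a mission interval
battery_failure    = or_gate([0.002, 0.001])    # internal fault OR controller fault
conversion_failure = or_gate([0.0015, 0.0005])  # converter fault OR downstream overload
top_event          = or_gate([battery_failure, conversion_failure])
print(top_event)  # ≈ 0.005 for these inputs
```

Nesting these calls mirrors the tree structure: each gate's output feeds the gate above it, up to the top event.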
The versatility of FTA makes it applicable in various domains. For instance, in the energy sector, it aids in assessing the reliability of power systems and identifying potential points of failure. In the context of control automation, FTA is instrumental in analyzing complex systems like nuclear plants and water distribution networks, where it helps in designing robust systems by identifying and mitigating potential faults during the design phase [47].
FTA can also be utilized to classify system failures based on their severity. In the context of the solar tracking system, three distinct levels of failure criticality can be identified: (a) critical—failures that cause complete system shutdown or pose safety risks, (b) less critical (malfunction)—faults that impair performance but do not halt operation entirely, and (c) non-critical—minor faults with negligible impact on system functionality. To illustrate each level of severity, an individual FTA will be constructed for each corresponding failure scenario.
A critical failure in the solar tracking system typically results from an unexpected power supply outage occurring during execution cycles. This type of failure can lead to a complete system shutdown, interrupting all ongoing operations. The corresponding FTA illustrating this scenario is presented in Figure 3. The FTA diagram systematically depicts the sequence of failures that can lead to a power supply outage within the solar tracking system. At the top of the tree, the undesired event—power supply outage—is broken down into two main contributory paths: battery failure and power conversion failure. The battery failure branch includes internal faults in the Varta 12 V 44 Ah battery as well as malfunctions in the solar charge controller, both of which are influenced by underlying stressors such as long-term usage, thermal cycling, and moisture ingress. On the other hand, the power conversion failure branch considers the malfunction of the LM2596 converter and the overloading of downstream components, including the SIM800L GSM module and relay circuitry.
These failures may arise independently or in combination, as represented by OR logic gates. The analysis emphasizes how environmental and operational stress factors at the component level can propagate upward through the system architecture, ultimately resulting in a complete power disruption.
A less critical failure scenario in the solar tracking system is tracking misalignment. This event occurs when the solar panel is no longer accurately oriented toward the sun, resulting in reduced energy harvest. It does not completely disable the system but significantly lowers performance. The corresponding FTA is illustrated in Figure 4.
This FTA illustrates a secondary-level failure scenario in the solar tracking system, focusing on tracking misalignment as the top event. The misalignment may arise from one of three primary causes: sensor failure, actuator malfunction, or control signal error. Sensor failure is further traced to faults in components such as the TEMT6000 and ML8511 modules, influenced by environmental aging and UV-induced degradation. Actuator malfunction is attributed to the failure of the stepper motor, often caused by wind-blown dust or corrosion. Control signal errors originate from faults in the rotary encoder and misreadings by the Arduino, with mechanical debris and noise acting as the triggering factors. The hierarchical structure captures how less-critical, yet impactful, faults can reduce system efficiency without leading to a total power outage.
The FTA in Figure 5 models a non-critical failure scenario in the solar tracking system, namely a data communication failure, which impacts remote monitoring and data logging functionalities. The top-level event is decomposed into three main contributing branches: GSM module fault, signal loss or network issues, and microcontroller communication error. The GSM module fault centers on the SIM800L V2, which may fail due to electrical defects or environmental stressors such as humidity and corrosion. Signal loss arises from weak cellular reception, electromagnetic interference, or antenna malfunction, all of which can interrupt data transmission. Microcontroller-related errors are linked to the Arduino Mega 2560, including UART protocol issues and command parsing faults, often caused by firmware glitches or transient electrical noise. These contributing factors are modeled using OR gates, highlighting that the failure of any single component or condition can lead to communication loss without affecting energy generation.
Additionally, the FTA incorporates AND gates to emphasize that a data communication failure also arises when all sensor modules fail simultaneously. In such a case, the system is unable to collect any environmental data, making it impossible to transmit information to the ThingSpeak server.
In summary, FTA is a vital tool in reliability engineering, offering a structured approach to identifying and mitigating potential system failures. Its ability to provide both a visual representation of failure pathways and quantitative risk assessments makes it indispensable in the design and analysis of complex, safety-critical systems.
3.3. Failure Mode and Effects Analysis
FMEA is a structured, inductive methodology utilized in reliability engineering to proactively identify and mitigate potential failure modes within systems, products, or processes. The FMEA process involves a systematic examination of components and subsystems to determine how they might fail (failure modes), the causes of these failures, and the potential effects on system performance. Each identified failure mode is assessed based on three criteria: Severity (S), which measures the impact of the failure; Occurrence (O), which estimates the likelihood of the failure; and Detection (D), which evaluates the probability of detecting the failure before its effects manifest. These factors are multiplied to calculate a Risk Priority Number (RPN), guiding engineers in prioritizing corrective actions [48]. An FMEA is conducted on the solar tracking prototype, as presented in Figure 6. This analysis provides a detailed overview of the most common potential failure modes within the system, their root causes, and the corresponding impact on system performance and overall operation.
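A sketch of the RPN calculation and the resulting prioritization; the failure modes and S/O/D ratings below are hypothetical placeholders, not the ratings from the actual FMEA of the prototype:

```python
# RPN = Severity × Occurrence × Detection, each rated on a 1-10 scale.
# The failure modes and ratings below are illustrative placeholders.

def rpn(severity, occurrence, detection):
    for v in (severity, occurrence, detection):
        if not 1 <= v <= 10:
            raise ValueError("S, O, and D are rated on a 1-10 scale")
    return severity * occurrence * detection

failure_modes = {
    "LDR degradation (UV, moisture)": (6, 7, 4),
    "SIM800L communication loss":     (4, 5, 3),
    "Stepper motor stall":            (8, 3, 5),
}

# Rank failure modes by RPN, highest priority first
for name, (s, o, d) in sorted(failure_modes.items(),
                              key=lambda kv: rpn(*kv[1]), reverse=True):
    print(f"{name}: RPN = {rpn(s, o, d)}")
```

The ranking, not the absolute RPN value, is what drives the allocation of corrective actions.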
Most of the solar tracker’s electrical components, listed in Table 1, are housed within a metallic enclosure that offers additional protection against environmental stressors. However, several sensors essential for automating the control subsystem and enabling remote monitoring within the data transfer subsystem are mounted externally, leaving them directly exposed to varying environmental conditions.
As shown in Figure 6, one such component is the Light Dependent Resistor (LDR), which is used to measure light distribution across different corners of the PV panel. According to the first layer, if an LDR in a solar tracking system is directly exposed to environmental stressors like sunlight, moisture, dust, and temperature swings, several potential failure root causes arise. First, prolonged exposure to UV rays can chemically degrade the plastic encapsulation and even the sensing material itself (typically cadmium sulfide in many LDRs), resulting in shifted sensitivity or permanent loss of responsiveness to light over time. A second root cause is moisture, which can oxidize the metallic contacts and the sensitive material inside the LDR, leading to increased resistance, intermittent operation, or complete failure. A third root cause is repeated heating (from the sun) and cooling (at night) throughout the summer months, which creates thermal expansion and contraction; this produces microcracks in the internal structure, causing fatigue or delamination of internal layers. A fourth root cause is contamination due to dust and pollution, which results in incorrect readings and delayed or inaccurate tracking. A fifth root cause is mechanical damage due to rain, hail, and wind-borne particles, which usually leads to physical destruction or altered optical properties.
The second layer (bottom-up approach) in Figure 6 represents the failure mode, which most commonly involves the malfunction or complete failure of an LDR. Given that four LDRs are utilized to orient the solar panel toward the Sun, four individual points of potential failure can be identified. Two of these failure points are associated with the West–East rotation along the horizontal axis, while the other two are linked to the North–South rotation along the vertical axis.
For the azimuth rotation, if the West LDR, shown in layer 3, becomes faulty, the Microcontroller Unit (MCU) will behave as described in layer 4. A malfunctioning LDR typically leads to a sudden decrease in resistance, causing the Arduino Mega MCU to continuously register a low voltage value on the A0 analog input. Normally, a damaged photoresistor is interpreted as shading on one side of the PV panel, prompting the solar tracking system to rotate the payload until the West–East sensors read equal values. However, with the West LDR malfunctioning, the system continuously detects an imbalance, causing the solar tracker to keep moving the PV panel until it reaches the sunset position, marked by the maximum horizontal limit switch, where it ultimately becomes stuck, as depicted in layer 5 of Figure 6. Similarly, if the East LDR malfunctions, the MCU will continuously receive a low voltage reading on input A1. This will cause the solar tracking system to rotate in the opposite direction, ultimately becoming stuck at the homing (sunrise) position, triggered by the activation of the lower horizontal limit switch.
For the elevation rotation, the MCU’s behavior mirrors the previous scenarios. If the North LDR malfunctions, the Arduino Mega board will consistently detect a low voltage value on input A2, prompting the solar tracking system to search for the optimal position by moving the PV panel upward. As a result, the solar tracker will eventually become stuck in the flat position, triggered by the activation of the maximum vertical limit switch. If the South LDR malfunctions, the MCU will continuously read a low voltage value on pin A3. This will cause the solar tracking system to remain stuck at the initial homing position, marked by the activation of the lower vertical limit switch, as shown in layer 5.
The FTA diagram for the previously described failure scenario involving all four LDRs is presented in Figure 7. The FTA highlights critical failure pathways that lead to the system becoming immobilized in specific operational states—sunset, homing, or flat. Immobilization at the sunset position occurs when a low voltage is detected at sensor A0 concurrently with a failure of the West Light Dependent Resistor (LDR). Similarly, the system may remain stuck in the homing position due to either a combination of low voltage at A1 and a faulty East LDR, or low voltage at A3 paired with a malfunctioning South LDR. In the case of the flat position, the system becomes fixed when low voltage at A2 is accompanied by failure of the North LDR. These individual fault conditions collectively define a broader category of LDR malfunction, which can be traced to either environmental stressors or degradation from age and usage. Environmental stress is further broken down into contributing factors such as ultraviolet radiation, high humidity, thermal cycling, dust and pollution, and mechanical damage—any of which can independently impose detrimental effects on the LDR sensors.