3.1. Model Validity
To assess the validity of the Air-STORM monitor internal temperature prediction model, we compare predicted internal temperatures with experimental results. Monitor temperatures were simulated using co-located solar radiation and meteorology data, and temperatures were averaged to a 15 min resolution to address autocorrelation and correct temporal dependencies.
Performance metrics were calculated in
Table 1 to evaluate the predictive accuracy of the model. A strong linear agreement was seen for all the enclosure types and sizes tested, with each coefficient of determination (R
2) over 0.87, suggesting the Air-STORM model captured most observed temperature variations, and corresponding to a Pearson correlation coefficient (r) over 0.96 in each case. The root mean squared error (RMSE) ranged from 1.4 °C for the PVC enclosure to 3.5 °C for the aluminum enclosure. The simulated PVC enclosure internal temperatures had the best correlations and smallest error compared to observed temperatures. This is likely due to increased mixing from this enclosure being a PurpleAir monitor with an open-end, whereby PVC endcaps are used to house the sensor components. However, it also demonstrates the ability of the Air-STORM tool to accurately simulate temperatures for sensors of non-rectangular geometries.
Model error was found to increase with the conductivity of the enclosure material. The sensitivity of both the error and mean bias of predictive performance with material conductivity is related to the computational complexity of highly conductive materials. Conductive materials exhibit higher heat transfer rates and require smaller FDM time steps and step sizes to accurately capture thermal gradients (SI Equation (S8)). The simulated aluminum enclosure had the poorest correlation and highest error with observed temperatures, likely due to the increased complexity of modeling highly thermally conductive enclosures. It also exhibited the largest mean bias, with the Air-STORM tool underestimating temperatures by 1.89 °C. The bias between simulated and observed temperatures within the aluminum enclosure demonstrates a limitation of this model. The Air-STORM tool assumes no conduction radially between enclosure sides, which is a poorer assumption for more conductive materials and smaller enclosures. However, mean biases and RMSE between observed and predicted temperatures for the three plastic enclosures were under 1.1 °C and 2.7 °C, respectively, suggesting the Air-STORM tool, can accurately predict internal temperatures for monitors with a variety of plastic enclosure materials.
Figure 2 (left) shows a scatterplot of observed vs. simulated internal temperatures for the four enclosures of various sizes and shapes tested. Simulated temperatures exhibited the highest bias compared to observed temperatures during periods with the most variable solar radiation. In the absence of solar radiation, the internal monitor temperature typically approaches ambient temperatures because there is no temperature differential between the enclosure and outside air. For example, at night, simulated temperatures, ambient temperatures, and observed temperatures are nearly identical (
Figure 2 (right)). Though model validation only extended for one month of continuous data collection, this period exhibited highly transient average diurnal DNI and ambient temperature profiles, in addition to day-to-day variation (
SI Figures S20 and S21), making it a challenging environment for Air-STORM’s numerical heat transfer simulation, and a fitting validation period.
The model error is significantly smaller than the error if predicting internal sampler temperatures was conducted using ambient outdoor air temperatures (
Figure 2 (right)). In particular, predicting the maximum daily temperature, which is most relevant for deploying sensors, using just ambient outdoor air temperatures results in mean errors and biases exceeding 10 °C. However, the daily maximum internal temperatures were accurately predicted for plastic enclosures with under 3 °C of mean absolute error (MAE), and under 1.2 °C MB. An uncertainty or error of 3 °C is unlikely to substantially impact decisions about monitor placement, as it is always advised to allow for some margin of safety when considering the operational limits of an instrument. This error is also smaller than the uncertainties related to internal heat generation (
SI Figure S16), emissivity of enclosure materials (
SI Figure S17), and probability of shading such as cloud cover (
SI Figure S18).
Although more computational resources could be dedicated during analysis to reduce model error, this would increase computational time. Differences in solar shading due to imperfect collocation also contribute to the overall error. The validation enclosures and the reference solar sensor were spaced about 1.5 m apart, leading to slight differences in incident solar radiation (
Figure 1). For example, instances where only the reference sensor or the sampler is shaded by partial obstructions can create slight discrepancies between modeled predictions and actual measurements. Although these discrepancies are real, they are minor in the context of predicting probable failures, as the analysis focuses on long-term performance over extended periods.
The accuracy of the solar charging model was similarly evaluated by comparing predicted charges with observed charges of a sampler operating on a battery with a solar panel co-located with the reference incident solar sensor (
SI Figures S5 and S6). The simulated charge of the 120 Wh battery closely matched observations, with a mean bias of −1.63 Wh, root mean square error of 3.76 Wh, mean absolute error of 2.84 Wh, and coefficient of determination of 0.93. The worst model performance occurred when the solar panel was partially shaded. The 80 mm
2 reference sensor would often measure either full sunlight or full shade, whereas the larger solar panel (0.18 m
2) would be in partial sunlight. Despite these minor discrepancies, the overall trends are closely aligned. It should be noted that solar panel performance was evaluated to establish a charging efficiency of 16% before running the simulations.
Although model accuracy is important, Air-STORM’s value lies in its ability to assess the feasibility of running a sampler on solar power without exceeding maximum temperature thresholds over extended periods. Uncertainty exists in both model predictions and the meteorological information obtained from the POWER dataset. However, modeling with years of historical data allows users to predict when and how often failure will occur in a typical year. Combining this information with general engineering standards and safety factors enables the design and iteration of sampling systems for specific deployment needs.
3.2. Sensitivity Analysis
The results of the surface-temperature vs. air-temperature experiment shown in
Figure 3 demonstrate that the internal air temperature, enclosure wall temperature, and battery surface temperature were nearly identical throughout the measurement period. The temperature differences among the three locations were consistently below 0.25 °C, even during peak solar loading. This supports the assumption used in the Air-STORM model that the internal air temperature equilibrates rapidly with enclosure surfaces and battery components.
To put this in perspective, the thermal equilibration time of air is governed primarily by the enclosure’s internal volume and the surface area available for heat exchange. In small sensor enclosures, the air has low thermal inertia (e.g., for 5 L of air, the heat capacity is ~6 J/K), and conduction and radiation from the surrounding surfaces rapidly bring it to thermal equilibrium. Based on standard convective heat transfer coefficients (~5–10 W/m2·K) and enclosure wall areas, the thermal response time of air in such enclosures is on the order of seconds to minutes.
Internal air temperatures only meaningfully diverge from surface temperatures to a degree such that spatial gradients become significant in large enclosures with poor air circulation. Since most air pollution enclosures are relatively small, the simplified approach remains valid. More complex modeling of air volume and internal convection would not appreciably improve predictive performance under the use cases examined here.
A sensitivity analysis was conducted to assess the influence of other modeling assumptions and input parameters on the predicted temperature and power outcomes. Key factors evaluated included the number of nodes used in the FDM calculations, the impact of external influences such as fan and wind speeds, and modeling reradiation from a solar shield. Additional considerations included averaging approaches for internal surface temperatures, assumed starting temperature, internal heat generation, and material emissivity. Among these, internal heat generation and material emissivity were found to have the most substantial impact on model predictions. Both parameters can be determined with relatively high accuracy for a given sampler, reducing uncertainty in real-world applications. Full details of the sensitivity analysis, including specific parameter variations and their effects, are provided in the
Supplemental Information.
3.3. Model Demonstration
Designing a plan for a successful solar-powered air quality monitoring system must account for the specific environmental conditions of each location where monitors will be placed. Solar potential and the resulting internal monitor temperatures vary greatly by region and can lead to high failure rates in certain areas. Accurate and predictive modeling is essential to anticipate these challenges and minimize failures during sampler deployment. In some cases, extreme heat and intense solar radiation may render samplers unsuitable for specific locations, even if shading is used. To illustrate the variability of air quality monitor failures, a typical low-cost solar-powered sampler setup was modeled for 2023 at a 2° × 2° resolution across all non-Antarctica land areas. While the maps presented reflect simulations for a single year, the model can analyze data over any desired period, providing valuable insights for long-term planning.
Monitors without solar shading experienced failure during >5% of the total modeled duration in many regions, including South America, southern North America, the Sahara, southern Africa, the Arabian Peninsula, South Asia, Southeast Asia, and Australia (
Figure 4). In these areas, the frequency of days with temperature exceedances was also high. Some locations surpassed the maximum rated internal temperatures on over 60% of days in 2023. Even failure rates as low as a few percent can significantly impact long-term monitoring. Temperature-based failures are more likely to occur during the day, leading to under-sampling in the afternoon. This can introduce data bias and affect the representativeness of the samples, especially if emission sources follow strong diurnal patterns. Examples include traffic emissions, evening cooking activities, and pollution changes driven by atmospheric mixing layer variations due to temperature shifts. For samplers that require manual restarting to resume operation, remote locations with difficult access are of particular concern, as a single failure could result in weeks or months of lost data.
Equipping monitors with a solar shield, such as a solar panel above the sampler, substantially reduced temperature-based failure rates by limiting incident solar radiation to the enclosures. The area-weighted average percentage of time spent above 45 °C for the global domain was 2.6% with no shade, 1.0% with solar shielding, and 0.7% with full shade. The average percentage of days with temperatures simulated above 45 °C at least once was 11.8% with no shade, 4.7% with solar shielding, and 3.2% with full shade. Solar shields were especially effective in the Americas, eastern Australia, sub-Saharan Africa, and Southeast Asia, reducing failure rates in some areas from over 10% of the total time to below 5%. Area-weighted averages are used because due to the latitude–longitude gridding of the domain of the Mercator projection, grid-cells at higher latitudes have smaller areas than tropical grid-cells.
Predictions from the Air-STORM model indicate that much of the world would likely experience temperature exceedance failures at least once during the modeled year, even if failures were infrequent. 69.2% of areas with no shading, 43.4% of areas with solar shielding, and 35.6% of areas with full shading were simulated to have temperatures surpass 45 °C, presenting a substantial fraction of potential sites for sampler deployments. The 25.8% reduction in temperature failures with solar shielding highlights the efficacy of solar shields to mitigate temperature threshold exceedances. Full shading was shown to further reduce high-temperature-based failures, but the incremental benefits of full shading in comparison to the use of only a solar shield were found to be minor in most regions. These results underscore the significant impact of solar radiation on internal temperatures.
In some regions, temperature concerns exist even if solar radiation is fully removed. For example, monitor temperatures in regions such as Saharan Africa, the Arabian Peninsula, the Persian Gulf, and Western Australia were still modeled exceed 45 °C on more than 30% of days and 10% of the total time with full shading. In these areas, monitors would require active cooling systems or specially rated batteries and power systems to operate reliably under such extreme conditions.
Sampling campaigns’ plans must also account for low temperatures. Sensor networks can encounter battery performance issues when internal temperatures drop below the minimum operating limits. Consumer-grade lithium-ion batteries cannot be safely charged at sub-zero temperatures due to the formation of metallic lithium on the anode, which can degrade battery performance and pose safety risks [
24]. To ensure reliable operation, it is often recommended that batteries are only charged at temperatures above freezing, which can limit when solar-powered systems can be used in some regions.
Figure 5 shows simulated failure rates due to battery freezing for the same hypothetical monitor enclosure used in overheating scenarios. In this case, shade would be detrimental to sampling success, as solar radiation can help prevent batteries from reaching sub-zero temperatures. As before, two failure modes are considered: the percentage of time the battery is at sub-zero temperatures and the percentage of days during which batteries reach sub-zero temperatures at least once. The two modes of simulated failure rates for temperatures below zero are much more similar than when considering overheating. Peak solar potential windows are narrow, leading monitors in warmer climates to overheat for short periods, whereas if monitors reach minimum temperature thresholds, they typically dip below for extended hours of the day.
Failure rates of battery temperatures dropping below zero were generally higher at higher latitudes, with topographical exceptions in locations at altitude, which are typically much cooler than at latitudinally adjacent locations at sea level. Greenland was modeled to have internal battery temperatures below 0 °C year-round, with northern territories of North America and Eurasia reaching similar failure rates. Failure rates were elevated at high altitudes, including in the Andes Mountains, Swiss Alps, Rocky Mountains, Caucus Mountains, Tibetan Plateau, and the Tian Shan ranges.
For each mode of failure and shade case, 46.3% of areas experienced sub-zero battery temperatures at least once during the simulated year. Notably, solar shielding had minimal effects on the average percentage of days spent below zero for the global domain (no shade: 18.0%, solar shield: 18.1%, full shade 18.1%) while it only had slightly higher impacts on the average percent of time spent freezing (no shade: 13.8%, solar shield: 14.2%, full shade 14.4%). Such a small effect suggests that solar shielding is more significant as a solution to mitigate battery overheating than as a contributor to battery freezing. Additionally, monitor failure rates were sensitive to heat generation from the circuitry and enclosure insulation. To size a heater, users could simply model higher heat generation rates, which would increase battery temperatures, and to model insulation, users can provide the thickness and thermal properties of the insulation rather than the enclosure, as the insulation will dominate heat transfer behavior.
Similarly, solar charging was modeled globally to identify high-risk regions for solar-powered air quality monitors and demonstrate the utility of Air-STORM. A solar-powered system consisting of a 0.5 m2 panel with 18% efficiency, a 300 Wh battery with 80% storage efficiency, and a panel at 0 degrees tilt was used for all modeling scenarios. A single configuration was used for all regions to demonstrate the striking variabilities in power failure rate risk for a given, completely unoptimized system. As in previous sections, two failure modes were examined: the percentage of time the battery was discharged and the percentage of days during which the battery was discharged at least once. While a general trend of reduced solar power efficacy farther from the equator was observed, other non-intuitive factors, such as monsoon and hurricane seasons, coastal meteorological phenomena, and anthropogenic air pollution, also substantially impacted failure rates over time.
The highest failure rates were observed in the northernmost latitudes of North America and Eurasia due to reduced solar potentials, where batteries were modeled to be empty over 60% of the time and on 70% of days (
Figure 6). These areas are also influenced by having optimal solar elevation angles far from zero, further reducing solar power. Similarly, high failure rates were noted at Cape Horn in South America, the southernmost region studied. The two failure modes evaluated exhibited similar failure rates. Solar potential typically peaks for only a few hours a day, leading monitors to overheat for shorter periods while under the highest solar radiation before they cool down. Conversely, discharged batteries of solar modules will remain empty for hours or even days while solar potentials are low until periods of high solar radiation recharge batteries and can meet sampler demands.
In addition to regions farthest from the equator, tropical regions, including central South America (excluding the Andes), west-central Africa, India, and Southeast Asia, were found to have power concerns on over 20% of days, although for much smaller percentages of the time. These regions benefit from high clear-sky DNI throughout the year, but incident solar radiation is highly inconsistent due to factors such as high humidity and frequent cloud cover, especially during monsoon and hurricane seasons. A notable hotspot in battery failure rates was identified in East China, likely resulting from high levels of air pollution, which can significantly reduce incident solar radiation through scattering and absorbing radiation [
25,
26,
27,
28].
Power and temperature failures were modeled separately, but in practice, those deploying monitors are likely most concerned with the overall risk of failure. When combining the temperature-related and solar-energy battery failure rates, it becomes evident that at least one of these issues will be of concern when deploying air quality monitors with solar energy systems, regardless of the monitoring location. In some tropical, warm-climate regions, there is a risk of failure due to both temperature exceedances and insufficient solar charging. New Delhi India was selected as a case study due to the significant challenges for sensor deployment from both temperature and solar-charging concerns. Here, we analyze annual and diurnal failure profiles to investigate potential design solutions.
3.4. New Delhi Case Study
The heat map in
Figure 7 illustrates monthly and diurnal failure rates predicted for a standard monitor, identical to those previously discussed, over a 10-year period (2014–2023). Color intensity represents overall failure rates, while color pigment differentiates failure causes: orange indicates temperature exceedances, and teal represents battery depletion due to insufficient solar charging. Temperature-related failures were concentrated in the months of April to June, when high ambient temperatures and intense solar radiation were likely to cause temperature concerns. Despite high ambient outdoor air temperatures, these failures were less frequent during the summer months of July and August due to reduced solar radiation during the monsoon season. Diurnally, temperature failures occurred almost exclusively during peak daylight hours (10:00–16:00), when peak outdoor temperatures coincided with maximum direct normal irradiance (DNI). Isolated temperature and solar failure heatmaps are shown in
SI Figures S2 and S3.
As expected, insufficient solar potential for battery charging was concentrated in November, December, and January due to reduced daylight hours and lower solar radiation. However, failures of similar frequency were also observed during July, August, and parts of September, coinciding with the monsoon season. During monsoon months, extensive cloud cover and high humidity drastically reduce the amount of solar radiation available for battery charging. This finding is counterintuitive because summer is conventionally associated with high solar potential. Solar failures were most frequent at night and in the early morning, when the power drawn from the sampler depleted the energy stored during daylight hours. However, seasonal variations in solar potential had a stronger influence on failure rates than diurnal patterns, emphasizing the importance of considering annual trends when designing solar systems.
Regions such as New Delhi present significant challenges for air quality monitoring due to the dual risks of high temperatures and limited solar potential, which persist year-round. Only three months: February, March, and October, were predicted to have average total failure rates below 10% at each hour of the day. Highlighting these risks does not suggest that sampling cannot be successfully conducted in these areas, but emphasizes the need to tailor solutions to the specific regions where deployments are planned. These failure patterns highlight the importance of optimized system designs to address both temperature- and solar-related challenges. Predictive models using historical weather data can uncover counterintuitive failure trends and guide the optimization of monitoring systems to prevent data losses during critical seasons and times of interest.
In addition to the New Delhi case study, an illustrative analysis was also conducted for Montreal, Canada, a region with vastly different temperature conditions (
SI Figure S3). While both solar potential and temperature remain key concerns, the primary failure risk in this case is from prolonged exposure to extreme cold rather than excessive heat. This contrasts with the challenges observed in New Delhi, where overheating and seasonal reductions in solar radiation were the dominant issues. Despite these differences between New Delhi and Montreal, the model identified the periods of the year with the highest failure risk in both locations, demonstrating its adaptability across diverse environments. The mitigation strategies required for these two cases differ significantly, highlighting the value of predictive modeling in informing location-specific deployment decisions. Details of the Montreal case study are provided in the
Supplemental Information.
Informed by the model developed in this study, monitoring campaigns can implement targeted measures to maximize efficiency and ensure reliable sampling. Such strategies include shading sensors in warm climates, sizing panels and batteries to balance performance and cost, optimizing panel orientation to maximize solar potential during high-priority seasons and times for data collection, and heating or insulating monitors in cold locations.
3.5. Model Limitations
While the Air-STORM tool can provide valuable insight into the feasibility and challenges of solar-powered air pollution monitoring, several limitations should be acknowledged. First, the model relies on historical meteorological data available at hourly resolution and limited spatial resolutions. In contrast, the validation of the model was conducted using 5 min resolution experimental data. The coarser data used for predictive modeling does not account for short-term fluctuations in solar radiation. This discrepancy can introduce uncertainties, particularly in regions with highly variable cloud cover or rapid weather shifts. Additionally, while the tool accounts for major thermal and power constraints, other environmental factors, such as dust accumulation on solar panels, humidity effects on electronics, and potential sensor degradation, are not explicitly modeled. FDM modeling was chosen for its small computational expense to enable multiple, quick simulations of multi-year periods. However, the accuracy of the assumptions associated with 1D FDM modeling decreases when simulating smaller monitors and higher thermal conductivities. In the Air-STORM application, shading from obstructions is currently modeled to occur at fixed times, but shading times will change over the course of the year, introducing slight bias into shading approximations. Battery aging is not explicitly modeled; however, users are encouraged to assume a conservative, end-of-life battery storage efficiency when designing deployments. Lithium-ion batteries can experience a 10–30% reduction in capacity over 2–5 years, depending on use conditions, especially under sustained high temperatures or deep discharge cycles [
29]. Incorporating this degradation into sizing decisions can help maintain reliability throughout the device’s operational lifespan. Lastly, the model demonstration section of this study used a typical low-power example air sampler; however, the model is structured to allow modeling a wide range of device powers. A key consideration when running higher-power devices is accurately estimating the proportion of power input converted to heat, as this relationship changes with scale. To model a higher-powered monitor in the Air-STORM tool, the internal heat generation would likely increase (
SI Figure S16), and the battery and solar panel would need to be sized to accommodate the larger power consumption (
SI Figure S22). Despite these limitations, the Air-STORM tool remains a useful resource for planning solar-powered air monitoring systems and can provide valuable insights into expected performance and failure risks.
The temperature bounds of 0 °C and 45 °C used in this study are based on commonly recommended operating limits for lithium-ion batteries and electronic components; however, these should not be interpreted as absolute cutoffs. Exceeding these thresholds may not immediately lead to system failure, as battery degradation and electronic performance are influenced by both temperature duration and cumulative exposure rather than instantaneous peaks. These limits are also likely somewhat conservative to account for potential safety margins and variations in manufacturer specifications. Recognizing that different applications may tolerate higher or lower risks, the Air-STORM tool allows users to adjust these temperature thresholds based on their specific deployment needs and risk tolerance, providing flexibility to accommodate a range of operating conditions.
The POWER data is available at 1° × 1° spatial resolution, which could affect the precision of the model in regions with high sub-grid heterogeneity. For cases requiring higher spatial resolution, the model could be adapted to use datasets such as MERRA-2 [
30] (0.5° × 0.625° resolution) or ERA5 [
31] (0.25° × 0.25° resolution), which offer higher spatial precision. However, POWER was chosen for this study as it is specifically tailored for solar energy applications, providing pre-processed data that reduces the computational burden when running the Air-STORM model. In special cases, the higher precision data or the collection of site-specific radiation data may be justifiable, but these will typically be special use cases.
Although the current implementation of Air-STORM uses rectangular enclosures, the model’s thermal predictions are primarily governed by surface area, material properties, and external environmental forcing, not geometry. For compact, sealed enclosures, internal temperatures are largely uniform due to low air heat capacity and dominant conductive and radiative exchange with external surfaces. Comparative calculations (
Table S4) show that multiple enclosure shapes (e.g., boxes, cylinders, spheres) with similar surface area-to-volume ratios exhibit nearly identical thermal response characteristics. Although internal radiation between components was not explicitly modeled, measured temperature gradients within the enclosures were found to be small, resulting in negligible radiative redistribution. For the FDM framework used by Air-STORM, geometry would primarily influence mesh layout rather than net energy balance. As a result, enclosure shape would have minimal impact on predicted overheating, and no impact on battery depletion. A numerical comparison of enclosures of similar sizes is presented in the SI. Future versions of Air-STORM may allow user-defined shapes through equivalent surface area representations.