Building a Regional Platform for Monitoring Air Quality

Stoyanov, Stanimir Nedyalkov; Belichev, Boyan Lyubomirov; Tabakova-Komsalova, Veneta Veselinova; Todorov, Yordan Georgiev; Golev, Angel Atanasov; Maglizhanov, Georgi Kostadinov; Stoyanov, Ivan Stanimirov; Stoyanova-Doycheva, Asya Georgieva

doi:10.3390/fi18020078

Open AccessArticle

Building a Regional Platform for Monitoring Air Quality

by

Stanimir Nedyalkov Stoyanov

^1,2,*

,

Boyan Lyubomirov Belichev

¹,

Veneta Veselinova Tabakova-Komsalova

^1,2,*

,

Yordan Georgiev Todorov

^1,2,

Angel Atanasov Golev

^1,2,

Georgi Kostadinov Maglizhanov

¹,

Ivan Stanimirov Stoyanov

³

and

Asya Georgieva Stoyanova-Doycheva

¹

Faculty of Mathematics and Informatics, University of Plovdiv “Paisii Hilendarski”, 4027 Plovdiv, Bulgaria

²

Centre of Excellence in Informatics and Information and Communication Technologies Sofia, acad. G. Bonchev St., Block 2, 1113 Sofia, Bulgaria

³

Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, 1113 Sofia, Bulgaria

^*

Authors to whom correspondence should be addressed.

Future Internet 2026, 18(2), 78; https://doi.org/10.3390/fi18020078 (registering DOI)

Submission received: 14 December 2025 / Revised: 20 January 2026 / Accepted: 23 January 2026 / Published: 2 February 2026

(This article belongs to the Special Issue Intelligent Agents and Their Application)

Download

Browse Figures

Versions Notes

Abstract

This paper presents PLAM (Plovdiv Air Monitoring)—a regional multi-agent platform for air quality monitoring, semantic reasoning, and forecasting. The platform uses a hybrid architecture that combines two types of intelligent agents: classic BDI (Belief-Desire-Intention) agents for complex, goal-oriented behavior and planning, and ReAct agents based on large language models (LLM) for quick response, analysis, and interaction with users. The system integrates data from heterogeneous sources, including local IoT sensor networks and public external services, enriching it with a specialized OWL ontology of environmental norms. Based on this data, the platform performs comparative analysis, detection of anomalies and inconsistencies between measurements, as well as predictions using machine learning models. The results are visualized and presented to users via a web interface and mobile application, including personalized alerts and recommendations. The architecture demonstrates essential properties of an intelligent agent such as autonomy, proactivity, reactivity, and social capabilities. The implementation and testing in the city of Plovdiv demonstrate the system’s ability to provide a more objective and comprehensive assessment of air quality, revealing significant differences between measurements from different institutions. The platform offers a modular and adaptive design, making it applicable to other regions, and outlines future development directions, such as creating a specialized small language model and expanding sensor capabilities.

Keywords:

multi-agent systems; BDI agents; air quality monitoring; semantic ontology; event-driven architecture; machine learning; pollution forecasting; Random Forest; intelligent environmental systems; IoT; Chain of Thought (CoT)

Graphical Abstract

1. Introduction

Air pollution remains a serious problem worldwide. Assessing and managing the negative impact on the health of people living in urban areas, as well as the impact on the environment in these areas, requires a comprehensive and combined study of all factors contributing to undesirably high levels of pollution. Information is needed on the sources of pollution and the types of activities that cause it, on meteorological factors that determine the transport and dispersion of pollutants, and on the chemical transformations that pollutants undergo. Air pollution is a complex problem requiring in-depth and multifaceted research and joint efforts by various groups of experts and stakeholders.

Air pollution has been and remains a serious problem for Plovdiv as well. The various measures taken so far, specifically for the area, remain ineffective. This is largely due to difficulties in collecting, analyzing, and summarizing the data obtained. To make the right decisions to combat air pollution, objective information on air quality is needed first and foremost. There are various measuring devices and sources of information on air quality in the Plovdiv area. Their data usually differs, so one of the challenges is to determine the objective data on air quality.

To solve the problems related to air pollution in the Plovdiv area, we are developing an agent-orientated platform that effectively combines the existing Internet infrastructure with intelligent components. We expect this to open new opportunities such as a natural user interface, intelligent and context-aware systems that take into account the physical environment, and automated air monitoring. The platform can generate predictive analyses, including personalized ones. Based on the analysis of the large volume of information, we expect to be able to offer new business models and services aimed at addressing the problem. We also hope that the platform will improve the connectivity of measuring devices on a regional scale.

In the study, we encountered difficulties and challenges, such as: differences and missing parts in the data collected from measuring devices and multiple information sources. Also challenging are the meteorological conditions and complex chemical processes that require analysis and prediction models. The integration and operation of stakeholders and the many technical systems create major management and technical challenges.

In our previous article [1], we presented the first version of an air monitoring platform in Plovdiv, called ACreM. In this article, we present the second, further developed and improved version of ACreM. To emphasize its regional character, the second version was named PLAM (Plovdiv Air Monitor).

2. State of the Art

This section provides a comprehensive overview of the current state of technology and research in the field of air quality monitoring. The analysis covers both established platforms and methods, as well as new trends driven by Future Internet technologies, the Internet of Things (IoT), artificial intelligence (AI), and multi-agent systems (MAS).

2.1. Established Monitoring Systems and Regulatory Frameworks

Global air monitoring is based on classical statistical models and ground-based measurement networks. The European DELTA v7.0 platform [2], developed as part of the FAIRMODE initiative [3], provides tools for model evaluation and benchmarking of emission inventories. In parallel, the European Air Quality Index [4] provides access to real-time data from over 4000 stations, combining measurements with predictive models for health recommendations. A study by [5] demonstrates how “in-depth validation” based on extensive spatial data can significantly improve model performance. The World Health Organization (WHO) plays a key role in developing global standards [6], with its reports highlighting that 99% of the world’s population breathes air that exceeds recommended values [7].

In the field of integrated commercial solutions, there is a trend towards hybrid approaches. Platforms such as SafetyCulture [8], Aeroqual [9], and Envirosuite [10] combine IoT sensors with machine learning to predict pollution in industrial areas. With a view to processing, scalability, and security, future research is focused on overcoming the limitations of integrating IoT and data science approaches [11]. Companies such as Kaiterra [12] and ERA Environmental [13] integrate LSTM (Long Short-Term Memory) networks to analyze spatiotemporal dependencies [14], while ENVEA [15] focuses on validating data from low-cost sensors—an issue described in [16].

To forecast transboundary pollutants, the Copernicus Atmosphere Monitoring Service (CAMS) [17] combines satellite observations (e.g., Sentinel-5P) with the ECMWF Ensemble Forecast model [18], achieving a 15% reduction in the Root Mean Square (RMS) error for ozone compared to autonomous ground-based systems. However, the analysis in [19] notes the insufficient spatial resolution (>10 km) of the model for accurate urban diagnostics.

2.2. The Rise of Smart Platforms and the IoT

Significant progress has been made in the development of intelligent platforms that intensively integrate technologies from the Future Internet. The European initiative FIWARE stands out in this regard—an open platform with open-source code, offering standardized APIs and components for managing contextual information in real time [20]. It is used in air quality management in cities such as Helsinki, Vienna, and Porto.

Advances in sensor technology and IoT infrastructure are another key aspect. A 2025 systematic review analyzes over 140 systems combining low-cost sensors, cloud infrastructures, and AI algorithms, and highlights the trend towards open architectures [21]. In addition, studies such as [22] focus on new generations of smart sensors that are more energy efficient and suitable for deployment in dense urban networks. Studies such as [23] focus on improving the accuracy of low-cost sensors through statistical and machine learning calibration methods in real urban environments. Other approaches, such as [24], consider the integration of edge computing, which enables local data processing and reduces latency.

2.3. Applications for Artificial Intelligence and Multi-Agent Systems

The artificial intelligence revolution in this field has been particularly pronounced over the last five years, with scientific literature documenting the application of self-encoding models with a focus on predicting PM_2.5/PM₁₀ with an accuracy of >90% [25]; hybrid architectures (CNN-BiLSTM), improving CO₂ forecasts by 30% [26]; multiscale fusion systems for combining satellite data with sensor networks [27].

However, a critical review [28] points out the risks of overfitting (over-adjustment to data) in models based exclusively on data without physical validation.

A persistent trend in scientific research is the application of an agent-oriented approach to building complex, distributed, and adaptive systems. This approach is valued for the properties of agents—autonomy, proactivity, reactivity, and social behavior. With the advancement of large language models (LLMs), a new direction has emerged, focused on their integration as a “reasoning engine” within multi-agent systems (MAS LLMs). Despite promising improved cognitive abilities, there are criticisms regarding the compliance of these implementations with the fundamental principles of classical MAS theory [29]. This inconsistency highlights the need for hybrid approaches that combine established paradigms such as the “Belief-Desire-Intention” (BDI) architecture with the capabilities of LLM.

In the context of regional monitoring, especially for Bulgaria, developments such as the platform presented in [1] are relevant, which integrates data from various sources, detects anomalies, and uses a hybrid model involving BDI and AI agents. Similar studies, such as the autonomous monitoring system at the University of Ruse, Bulgaria, demonstrate the implementation of local IoT systems [30]. Regional initiatives such as LIFE IP CLEAN AIR, COMPAIR, and AIRQUEST emphasize the role of citizen participation, the use of low-cost sensors, and open visualization platforms.

2.4. Key Challenges and Prospects

An analysis of the current situation reveals several key challenges:

Data validity: A comparative analysis [31] reveals discrepancies of up to 35% between different methods of measuring PM_2.5, which compromises compliance with standards.
Spatial limitations: Traditional methods are ineffective in “blind spots” between stations [32]. Mobile sensor platforms and gradient boosting models offer a solution [33].
Multivariate uncertainty: The influence of meteorological variables requires adaptive approaches, such as platforms based on recurrent neural networks [34].

Current research [35,36] focuses on:

Explainable AI (XAI) for model interpretability.
Assimilation of data from heterogeneous sources.
Edge computing for real-time processing, as highlighted in [37] in a systematic review.

2.5. The State of Research

Air quality monitoring is undergoing a transformation driven by technological innovations and heightened public concern. Integrating traditional and novel approaches, combining different data sources, and applying advanced analytical methods will be key to developing more effective strategies. This work builds on these recent advances by proposing a hybrid multi-agent platform that integrates classical BDI agents with LLM-based AI agents. This hybrid approach aims to combine the proactivity and structural strictness of the BDI paradigm with the powerful reasoning capabilities of LLM, addressing both the need for objective data and the efficient use of resources in a regional context.

3. General Characteristics and Architecture of the PLAM Platform

The PLAM (PLovdiv Air Monitoring) platform was developed as a multi-agent system with a high degree of autonomy, adaptability, and interconnectivity between individual agents. Conceptually, it is designed as a research platform in which agents do not simply execute commands, but engage in logical interaction, knowledge exchange, and decision-making in the context of dynamic air quality data. Taking into account various factors, as well as the main air pollutants (particulate matter, nitrogen dioxide, sulfur dioxide, volatile organic compounds, polycyclic aromatic hydrocarbons, ozone), PLAM is designed to perform the following functions with optimal use of available computing resources:

Monitoring air quality based on data from various information sources.
Identification, localization, and analysis of contradictions, inconsistencies, and anomalies.
Establishing a specific “profile” of the Plovdiv area—urban air pollution has a complex structure, including natural background, regional background (formed by sources located outside the city but affecting the air quality in the city), urban background (concentrations of various pollutants caused by industrial facilities in the city itself), traffic, and other local sources. Establishing the specific profile of the city will help find more effective solutions to improve air quality.
Assisting in the establishment of secondary standards for the Plovdiv area—the secondary effect of pollution considers the impact of pollution on crops, buildings, and facilities. This is a significant issue, as the Plovdiv area is the center of vegetable production in Bulgaria.

In general, the platform can be characterized as agent-oriented, hybrid, and regional. The agents in the platform are autonomous, proactive, reactive, social, and environmentally aware software components. These properties make them extremely useful for achieving the goals we have set ourselves with the implementation of PLAM. For automated air monitoring, it is necessary to distinguish between the measuring devices and sensor networks as an environment separate from the analytical part. Autonomy and proactive behavior allow agents to self-activate when exceeding standards that users (citizens) have no way of knowing (i.e., they do not stand idly by and wait for people to tell them what to do). Agents are always active and ready to respond to changes in the environment that they can perceive. The assessment of newly received measurements of air pollutant values (i.e., changes in the agents’ environment) does not depend on specific users (who cannot know about them). The agents themselves must act depending on changes in their environment. They make decisions, invoke services, and generate results continuously and asynchronously, without the need for user intervention. Whenever an agent monitoring deviations from the norm receives a new measurement, it analyzes it based on the permissible limits and flags any discrepancies, then decides whether to generate a warning without anyone asking it to do so; it simply reacts to the event “receiving new measurements.” Agents are also reactive. As such, they can perceive and affect their environment, as well as fulfill user requests. As social components, agents can interact with each other to solve common tasks or achieve common goals, as demonstrated in the article.

The platform is hybrid because it uses two types of agents, which we have named BDI agents and ReAct (LLM-based) agents. While BDI agents focus on complex, goal-oriented behavior, reactive agents are optimized for quick response to specific events. They operate on a “stimulus-response” principle, continuously monitoring incoming messages and responding immediately upon receipt. In the system, they perform roles such as notifications and rapid processing of user requests. Agreeing with the conclusions in [37], we also believe that ReAct agents do not have typical agent-oriented behavior, but are rather reactive components, often relying on simplified, LLM-centric architectures. However, much of the literature on LLM-based agents appropriates the terminology without committing to the core principles of the agent-oriented paradigm. There are critical discrepancies between the theory of the agent-oriented paradigm and the current implementations of ReAct architecture. On the one hand, with hybridity, we aim to reduce the effects of this discrepancy and provide a unified understanding and interpretation of the mental properties of the agents participating in the platform. On the other hand, we want to take advantage of the benefits that both types of architectures provide. Thus, in our architecture, BDI agents are primarily those that make decisions and plan the necessary actions. ReAct agents are usually more operational components that have the advantage of being relatively easy to use as an LLM reasoning engine, as well as effectively integrating and accessing various tools and services, which is essential for our platform. BDI agents have runtime support, i.e., the development environment used provides a corresponding BDI interpreter. To facilitate and unify the interaction between the two types of agents, we have developed our own BDI interpreter, which ReAct agents can use if necessary.

Most of the platforms presented in the review are global in nature, reporting on air quality over wide areas. There are good reasons for this—air pollution is transboundary. However, determining the objective air quality is difficult due to a number of local factors that affect air quality. Our goal is to build an objective “profile” of air quality in Plovdiv. In terms of the language model used, we are developing a small language model that can be fully deployed on our server configuration and trained with information specific to the Plovdiv area. Our goal is for our platform to use computing resources efficiently and sparingly. However, the platform can also be adapted for other areas (as demonstrated later in the article).

The overall architecture of the platform is shown in Figure 1. Users can interact with the platform via a mobile device or its website. The first and important step towards complex reasoning in generative models was through a method called “chain of thought” (CoT). CoT aims for the generative model to first “think” rather than directly answering the question without any reasoning [38]. In the PLAM platform, CoT is developed as a hybrid structure incorporating the BDI and ReAct (or AI) agents presented above. We hope that combining the positive aspects of both architectures will have a synergistic effect.

A characteristic feature of agents is that they operate in an environment with which they can interact. In PLAM, we distinguish between structured and unstructured environments. The structured environment includes a specialized ontology (ontology is presented in more detail in our previous article for this journal [1]), storing basic knowledge for air monitoring, and a relational database in which data from university sensor network measurements are recorded. The unstructured environment consists of external information sources that publish air measurement data from various interested institutions.

The lowest level of the platform is formed by IoT nodes. For this study, an integrated environmental monitoring system comprising three proprietary sensor networks located in various spatial and functional zones was employed to cover a wide range of atmospheric and microclimatic conditions. The networks are centralized in town area of Plovdiv, at a Research Institute of Vegetable Crops ‘Maritsa’ (RIVC), located on the outskirts of the city as well as at a Plant Genetic Resources Institute (PGRI), which is approximately 20 km from Plovdiv. The sensor infrastructure at PGRI comprises a sensor set consisting of about 7–8 sensors designed for outdoor condition monitoring. At RIVC a total of eight sensor clusters has been installed. Each sensor cluster encompasses between 5 and 10 different sensors measuring a variety of environmental parameters, which are relevant for the growth and development of vegetable crops. This configuration provides high spatial and parametric resolution of data in an agroecological context.

The urban sensor network in Plovdiv consists of 7 sensor groups, each equipped with 8 sensors covering basic atmospheric and ecological parameters. Additionally, fixed remote measuring systems are installed in the urban area, including a spectral instrument type DOAS (Differential Optical Absorption Spectroscopy) and a LIDAR system for obtaining vertical atmospheric aerosol profiles. These tools expand the analysis capabilities through the application of remote sensing methods. Regarding data collection methods, the majority of measurements are done by in situ sensors that provide continuous and locally representative data. DOAS and LIDAR systems are classified as remote sensing methods and contribute to gathering integral and profile information of the composition and structure of the atmosphere. Data transmission from all sensor networks is performed via the POST mechanism, which ensures reliable and standardized communication between measurement devices and the central data storage and processing system. It is implemented on standard TCP/IP protocols. This automated transmission supports structured data exchange formats (e.g., JSON or XML), which ensures compatibility, traceability and allows for subsequent processing and archiving.

The combination of various types of sensors, spatially distributed networks, and a hybrid measurement approach (in situ and remote) provides a solid basis for complex environmental analysis and facilitates scientific research in the fields of atmospheric physics, agroecology, and natural resource management.

The three sensor networks consist of cheap PM_2.5/PM₁₀ nodes, covering a 35 km² area in the vicinity of Plovdiv. Data is collected at 5 min intervals and transmitted every 30 min through secure channels to the central platform via MQTT. The data are processed at a speed of 1.2 million records per month with an end-to-end latency of less than 4 s. The data received from the three sensor networks is stored in a single relational database.

4. PLAM-CoT

The PLAM-CoT architecture combines deliberative (implemented by JADEX environment [39] and given on a gray background in the diagram) and reactive agents (implemented by Atomic Agents [40] and LangChain [41] and given on a black background in the diagram) into a harmonious system. The BDIAgent class provides an environment for complex planning and long-term goal execution, while the ReactiveAgent class ensures immediate response to significant events. This hybrid design allows the system to address strategic tasks and operational requirements simultaneously. The CoT architecture is built on the following four levels (Figure 2): interaction with users, management and coordination, operational level, interfaces with the environment.

Management and Communication Level. The agent-based paradigm lies at the foundation of the architecture, where each module performs specific responsibilities within the overall platform. The agents operate in an environment managed by a Message Broker, which serves as a central communication intermediary coordinating the flow of messages, requests, and events. MessageBroker is the core of the communication infrastructure. It facilitates asynchronous message exchange among the agents in the system, ensuring independence and eliminating dependencies between individual components. This makes the system easy to extend, as new agents can be added without risk of disrupting existing dependencies. The applied model resembles “publish–subscribe” architectures used in large distributed systems but is strongly adapted to the cognitive environment in which the agents operate, focusing on logical interactions among them. Such a structure facilitates distributed processing and the easy addition of new agents without requiring changes to the core logic. This modularity is typical of cognitive architectures of the BDI (Belief–Desire–Intention) type, where the system implements a reasoning process for decision-making based on beliefs, desires, and intentions. The DataSenderAgent serves as a communication bridge, sending data to the deliberative CorrespondentAgent. The agent has a BDI function, with the primary task of formatting internal knowledge—represented as data within ACL messages—and sending it to the CorrespondentAgent. This ensures the possibility of sharing data with the deliberative part of the platform. The CorrespondentAgent, on the other hand, acts as the facilitator and coordinator for the platform’s reasoning processes. It is the point of contact for external queries and orchestrates the fulfillment of those queries by delegating to the appropriate agents. The CorrespondentAgent’s beliefs include the current set of pending user requests and a map of partial results received from other agents. Its desires/goals typically center on providing a comprehensive answer about the air quality status for a given city and pollutant, which often requires combining information from multiple knowledge sources. The CorrespondentAgent’s plans are designed to achieve these goals: upon receiving a request it simultaneously dispatches sub-requests to the OntologyAgent and DatabaseAgent. This parallel invocation is a key advantage of the agent approach—by fetching normative data and live sensor data concurrently, the overall response is faster. The CorrespondentAgent then waits for both replies (or error notifications) to arrive, aggregates the results and sends the combined information back to the requester. If one of the sub-results fails, the CorrespondentAgent can decide how to handle it—for instance, it may still return the available part (with a warning that one component is missing), thus exhibiting flexibility in partial fulfillment. Internally, this agent can be seen as executing a simple workflow. It logs interactions in its belief base so that responses can be correlated with requests. The CorrespondentAgent essentially abstracts the rest of the multi-agent system as a unified “service” to outside clients, hiding the complexity of coordination behind a single interface.

The platform is implemented as an autonomous, continuously operating multi-agent system with an asynchronous MessageBroker architecture, designed for long-term monitoring and analysis of environmental data. To assess its practical applicability and engineering reliability, a systematic measurement of key performance indicators was conducted under clearly defined conditions and an experimental protocol.

End-to-end latency is defined as the time interval between the moment new data enters the system (successful retrieval from a sensor or external API and registration in the database) and the moment the processing result becomes available to the end user via a dashboard or notification. This includes coordination between agents through the MessageBroker, internal logical processing, and, where applicable, a single LLM request. All measurements were performed on a standard server configuration with 4 CPU cores, 8 GB RAM, and stable network connectivity, using a fixed software version and identical agent settings.

With a standard monitoring interval of 30 min and LLM analysis enabled, the average end-to-end latency is 2.8 s, while the 95th percentile (P95) of the latency distribution remains below 4.1 s. The use of the P95 percentile allows evaluation of worst-case yet still typical scenarios and is standard practice in real-time system assessment. In configurations without LLM analysis, the average latency is below 1.2 s, indicating that the primary contribution to delay comes from the external LLM service rather than from the MessageBroker layer or agent coordination.

LLM analysis is implemented through a single request with a constrained and fixed prompt and a maximum length of 1500 tokens, ensuring a balance between analytical depth and predictable response time. Caching of immutable context components (norms, structural instructions) is employed, further reducing latency and variability. This makes it possible to achieve an average end-to-end latency below 3 s even with the LLM component enabled.

The throughput of the MessageBroker was measured using synthetic load with concurrently active agents. In a configuration with 20 agents, each generating an average of 5 messages per second, the system stably processes over 100 messages per second without loss or degradation of latency. These values significantly exceed realistic operational conditions, providing sufficient headroom for peak loads. Under simulated short-term spikes of up to 500 messages per second, a temporary increase in queue sizes is observed, but without system failure or loss of functionality. Latency increases smoothly, confirming the effectiveness of the back-pressure mechanism.

The limitation of queues to 100 messages per agent is an intentional design choice. When a queue is full, additional messages are deliberately dropped, with the event logged and interpreted by the sending agent as an overload signal. This prevents uncontrolled message accumulation and cascading system degradation, treating message loss as controlled and observable behavior rather than as a hidden failure.

System reliability was evaluated through continuous operation for over 30 days. A critical failure is defined as a state in which the MessageBroker or central coordination ceases to function, and data processing becomes impossible. No such failures were recorded during the observation period. Individual agents may temporarily fail due to external causes (e.g., API unavailability), but this does not affect the rest of the system. The recovery time after an agent error is defined as the interval until the next successful monitoring cycle, without requiring a platform restart.

The results were obtained through repeated trials under identical conditions, analyzing averages and distributions rather than single measurements. This ensures the robustness of the observed metrics. The selected test parameters, such as a 30 min monitoring interval and 20 agents, reflect a scenario for a medium-sized city and represent a balanced compromise between operational frequency and resource efficiency.

Operational Level. The next processing level is represented by the PM25ComparisonAgent and AnalysisAgent, which implement comparative and analytical functions in the system. Their operation is based on a cognitive approach: they not only compute statistics but also interpret results, extract patterns from the data, and formulate hypotheses, interacting with large language models (LLMs) for this purpose. This gives the system a better understanding of the meaning of the data and allows it to perform more complex analyses of environmental information. The agents function as independent experts who autonomously manage their actions by maintaining their own internal state and activation criteria. They determine for themselves when to act, based on time intervals and available data. The PM25ComparisonAgent and AnalysisAgent respond immediately to environmental changes, not simply waiting for events but proactively checking for them and planning their activities in advance. The agents work closely together, with the PM25ComparisonAgent providing data comparisons that the AnalysisAgent uses for deeper analysis performed by LLM. The PollutantWeatherAgent and NotificationAgent represent reactive units within the hybrid PLAM architecture. The PollutantWeatherAgent acts as a reactive analytical agent functioning as an interpreter of sensor data. It receives raw CSV data from external sources—specifically AQICN and IQAir, which relate solely to atmospheric conditions for each individual day—and performs a full processing cycle, from validation and parsing of the data to calculating metrics and statistical dependencies. At the core of its operation lies the concept of empirical cognition: the agent does not generate its own intentions but reacts to changes in input data, which then activate higher-order cognitive agents in the system. An important aspect of the PollutantWeatherAgent’s work is the use of an LLM. After processing all data, the agent passes it to the language model, which identifies causal relationships between meteorological factors and air pollution. The agent thus acts as a mediator between the empirical and cognitive layers of the architecture. The ForecastAgent extracts data from the AQICN and IQAir websites at a preset interval (default 30 min) and generates an alert when PM_2.5 or PM₁₀ levels are elevated. The agent integrates machine learning approaches (Random Forest Regressor), temporal analysis and data visualization, as well as automatic email notifications.

Environment Interface Level. The DataFetcherAgent provides the incoming data stream by combining inputs from different sources—on the one hand, scraping mechanisms, and on the other, external API services. The agent serves as the “sensory input” of the system, ensuring the continuous updating of empirical data. The agent independently decides when to retrieve data without human intervention, managing its goals and priorities and automatically recovering from errors so that it can continue functioning. Its reactivity and proactivity appear in its rapid response when current data is missing, by initiating preliminary actions to obtain such data, as well as planning and performing extraordinary actions when urgent information is required. The DataFetcherAgent communicates with the rest of the agents in the system using ACL messages. An example of such communication is the “inform” message it sends to the PM25ComparisonAgent, which then uses the received data to compare pollutant measurements for PM_2.5 from different sources.

To ensure reliable storage and access to information from measurements, forecasts, comparisons, and analytical results, PLAM uses a database. It functions as the system’s memory, on which agents build their beliefs and from which they draw information for decision-making. Through the creation of aggregation functions and statistical extraction, the system achieves a well-structured information architecture. The DatabaseAgent serves as the bridge to the live and historical observation data stored in the relational database. Its beliefs include the endpoint or connection parameters for the data API and placeholders for query parameters like city and pollutant. The DatabaseAgent’s primary goal is to fetch the latest measurement for a given pollutant in a given city. Upon receiving a request, the DatabaseAgent formulates an HTTP query to the data service and sends it out. Its intention then is to parse the JSON response (or any returned data structure), extract the relevant fields (e.g., value and timestamp), and deliver that as an ACL message back to the requester. If the query yields no data (e.g., the database has no recent entry for that pollutant/city) or if a timeout/error occurs, the DatabaseAgent replies with a failure message to indicate the observation could not be obtained. Under normal operation, this agent effectively wraps the database with an agent interface, translating a high-level question into the necessary data operations. This design allows us to change the underlying data source or API without affecting other agents—only the DatabaseAgent’s implementation would need updating if, say, we moved from a local DB to a cloud data warehouse. Additionally, the DatabaseAgent could perform on-the-fly data post-processing: for instance, if the request was for an average over a period, it could compute that after retrieving raw data. In our current scenario, it focuses on current values to complement the OntologyAgent’s normative output.

The OntologyAgent encapsulates access to the domain knowledge and regulatory rules encoded in the system’s OWL ontology (presented in the previous article [1]. Its beliefs include the loaded ontology model (an in-memory representation of the OWL file) and relevant identifiers or mappings for classes/properties representing cities, pollutants, and limit values. The ontology itself contains individuals for each city and pollutant, with data properties defining threshold values. The OntologyAgent’s desire is to provide the “official” permissible level (or other semantic info) for any pollutant–city combination requested. When it receives a request (from CorrespondentAgent) specifying a city and pollutant, its intention/plan is to query the ontology via a reasoner or OWL API for the corresponding limit value. If the ontology contains the information, the agent retrieves it and formats it into a reply message. The OntologyAgent demonstrates how incorporating a semantic component allows the platform to reason about meaning rather than just data.

User Interaction Level. The NotificationAgent, in turn, is a reactive communication agent focused on synchronization among the system’s other agents. Beyond functioning as a notification component, it also acts as a “social mediator” within the agency, receiving, interpreting, and forwarding messages between agents. It monitors the proper completion of each process and informs the user interface accordingly. Its key characteristic is rapid response to events, ensuring that every system state is communicated clearly and promptly. Through a continuous cycle of monitoring and event checking, the NotificationAgent ensures the connection between the system and the user. The ChatAgent and ForecastLLMAgent extend the system toward interactivity, enabling communication with the user through natural language. They function as cognitive mediators that contextualize the system’s knowledge and present it to the user in an accessible way. The agents act based on user requests, demonstrating high sensitivity to external stimuli and an ability for immediate response. Although highly reactive, the ChatAgent does not merely wait for queries—it proactively gathers information with the ultimate goal of delivering more accurate responses to the user. The ForecastLLMAgent generates its forecasts using predefined templates and evaluation norms. Social capability is strongly expressed in both agents: aside from communicating with the user, they interact actively with other components of the system through ACL messages to obtain data, which they then structure into a clear and comprehensible form for the client.

In the PLAM platform, LLM-based agents are used as an interpretative and explanatory analytical layer rather than as a primary source of numerical estimates or as an autonomous predictive model. All quantitative data, including atmospheric pollutant concentrations, meteorological parameters, and background radiation levels, are obtained exclusively from deterministic sources—the local sensor network and external data providers—and are supplied to the language model in a structured, unaltered, and semantically annotated form. The LLM does not perform statistical aggregation, interpolation, or extrapolation of raw measurements; instead, it solely synthesizes a textual explanation of already available numerical values and their interpretation with respect to clearly defined normative reference thresholds.

The analytical process is fully specified by a fixed command, which serves as a formalized analytical protocol. This command is immutable within the system configuration. The command used has the following structure Figure 3.

The prompt explicitly prohibits free generation, the introduction of external knowledge, and hidden assumptions, thereby minimizing the risk of hallucinations and conceptual drift. The formal reasoning sequence is predefined, which enables comparability of results over time and across different experimental configurations.

To ensure result stability, the DeepSeek language model is used with fixed parameters that are part of the experimental protocol. During all analyses, the generation temperature is set to 0.1, which practically eliminates stochastic variability in the output. The top-p parameter is fixed at 0.9, preserving linguistic fluency without allowing semantic deviation. The maximum response length is limited to a predefined 1500 tokens, sufficient for a complete analytical report but insufficient for uncontrolled expansion. A fixed version of the model (a specific checkpoint) is used, ensuring that results are not affected by future updates to the architecture or training process.

With respect to reliability and error management, the platform applies a multi-layer control mechanism. At the input level, all data are validated for presence, type, and physical plausibility, with missing or inconsistent values explicitly marked and passed to the LLM as such. Within the analysis, the model is instructed to treat these cases as sources of uncertainty rather than as opportunities for interpolation. At the output level, LLM results are treated as interpretative text rather than machine-executable decisions. They are stored together with the input context, timestamps, and the model settings used, ensuring full traceability and auditability.

Potential hallucinations are addressed through a combination of preventive and reactive mechanisms. Preventively, the prompt and parameter settings strongly constrain the model’s generative freedom. Reactively, the system enables cross-checking between the textual analysis and deterministic calculations of normative states performed outside the LLM. In the event of discrepancies between numerical evaluations and textual interpretation, the result is flagged for additional review and is not used for automated notifications or visualizations without human intervention.

The role of LLM-based analysis is analogous to that of an expert interpreter who formalizes and communicates the outputs of the measurement and analytical infrastructure without replacing the primary data. This positioning allows LLM-dependent results to be methodologically defensible.

We demonstrate the principal interaction between some of the agents in one session with two sequencing diagrams. In the first Sequence diagram (Figure 4), the autonomous data-processing cycle and the independent operation of the multi-agent system are presented. The DataFetcher agent retrieves data and stores it in the database. The system then simultaneously notifies the PM25ComparisonAgent and the AnalysisAgent about the available data. The PM25ComparisonAgent compares PM_2.5 values from different sources and generates alerts when critical deviations occur. The AnalysisAgent extracts enriched data and sends it to LLM for analysis using a chain-of-thought methodology. The results are presented to the user and stored in the database. The cycle demonstrates full autonomy—each stage triggers the next without human involvement. The system integrates heterogeneous data sources, applies specialized algorithms, and uses artificial intelligence to perform continuous monitoring.

The second sequence diagram (Figure 5) illustrates the chat function using contextual memory, which demonstrates how the ChatAgent coordinates information from multiple sources to produce accurate and complete responses. When a user request is received, the agent simultaneously retrieves current sensor data through the DataFetcherAgent, comparative analyses from the PM25ComparisonAgent, and semantic interpretations from the AnalysisAgent. After processing all contextual elements, the system generates an enriched query for the LLM. The personalized response produced by the LLM is a synthesis of multidimensional data, transforming a simple user request into a dialogue with full awareness of the current air conditions and pollution levels. The presented mechanism implements contextual intelligence by integrating dispersed information into coherent communication.

The hybrid architecture of the PLAM platform implements a clearly delineated yet tightly coordinated model of cooperation between two distinct agent construction paradigms: the classical BDI agent (Belief–Desire–Intention) and an LLM-based ReAct agent. This cooperation is organized to leverage the strengths of each paradigm while simultaneously minimizing their limitations.

The BDI agent performs the role of a central cognitive coordinator and is responsible for the long-term, goal-oriented behavior of the system. It maintains an explicit representation of the system’s internal state through beliefs, which are updated upon the arrival of new data or events. Based on these beliefs, the agent activates desires, which reflect both periodic tasks (e.g., regular analyses) and externally initiated requirements (e.g., manually triggered analyses). The selection of intentions and the creation of plans are carried out through a priority-based evaluation of active desires, ensuring predictability and enabling controlled behavior.

The LLM-based ReAct agent, in contrast, is designed as a reactive component with minimal internal state and no long-term planning. Its function is the rapid interpretation of incoming requests, the performance of contextual analysis via a language model, and the generation of immediate responses, predictions, and explanations oriented toward the end user. The ReAct agent does not make autonomous strategic decisions and does not manage the global behavior of the system; instead, it acts as a specialized cognitive “coprocessor” activated on demand.

Cooperation between the two types of agents is implemented entirely through asynchronous, message-based communication implemented via a central broker.

An important architectural decision is that the LLM agent does not directly modify the beliefs, desires, or intentions of the BDI agent. The results of the LLM analysis are treated as informational messages that may be used by the BDI layer but do not bypass its evaluation and planning mechanism. In this case, the LLM agent generates and sends an inform performative according to the ACL specification, which the BDI agent receives and processes. In this way uncontrolled influence of the language model on the system logic and significantly limits the risk of hallucinations or logically inconsistent actions is prevented.

The BDI agent guarantees consistency, robustness, and control over the lifecycle of the analysis, while the ReAct agent optimizes latency and interaction quality by providing high-quality responses. This asymmetry reflects the philosophy of the platform: the language model is a powerful analytical tool, but not an autonomous decision-making entity.

The hybrid cooperation model in PLAM can be formalized as a hierarchical cognitive architecture in which the BDI agent acts rationally and embodies the system’s intentions, while the LLM-based ReAct agent functions as a highly efficient, context-sensitive analyzer and human-facing interface.

5. Demonstration Example

We would like to demonstrate the use of the platform in real conditions with one example. One of the first tasks that the platform has to solve is to compare data obtained from different sources for greater objectivity. It turns out that this is not a trivial task for the Plovdiv region. In most cases, analyses are based on partial data, and it was intuitively assumed that there were discrepancies in the measurements of the various institutions involved. The use of the platform shows that it can contribute to the objectivity of information by using the full set of measurements and preparing an objective comparative analysis. For example, in the first version of the platform [1], we took the data from our university sensor network as a reference (i.e., we compared the deviations in other systems against it). However, subsequent research and analysis with the platform showed that this (somewhat intuitive) understanding was wrong. So, in the current version, we changed our decision and now accept as reference the data obtained from the Open-Meteo, IQAir, and AQICN systems.

A sample from a general analysis of the air quality, generated based on the current data, is presented in Figure 6. The AnalysisAgent uses the LLM deepseek-v3.2-exp to produce a comprehensive assessment of the conditions. Due to space limitations, the model’s response shown in the figure is a shortened version of the full analysis, displaying only the first two elements of the structured LLM report.

Significant discrepancies in measurements from different sources are documented by the PM25ComparisonAgent, which generates a comparative table (Figure 7) after extracting the necessary data. European standards extracted from the specialized ontology are used as reference threshold values.

A summary overview of the measurements from two sources (ours and external) and their deviations from the norms is presented in Figure 8. The pollutant whose instantaneous value is close to the thresholds regulated in European norms (extracted from ontology) is shown in red.

The results of the platform’s work reveal some interesting details that manually prepared analyses were unable to establish. For example, poor meteorological conditions, such as high humidity, as shown in Figure 9, bring measurements from different sources closer together (the first half of the diagram). Such results provide ideas for different considerations in new directions and at the same time reconfirm the need to use this type of assessment and analysis tools.

The PollutantWeatherAgent and ForecastAgent work with the same data and reach similar conclusions. Figure 10 shows an analysis generated by the platform after PollutantWeatherAgent calls the services of LLM deepseek-v3.2-exp. This action results in an analysis that links atmospheric conditions to pollutant levels for the relevant period.

The second task we are tackling with the platform is the preparation of statistics and analysis on air quality in the Plovdiv area. In Figure 11, the platform’s working data is shown in a clear tabular layout. The table “Data from the station in Plovdiv (meter.ac)” illustrates the sensor readings collected through the monitoring sources of Plovdiv University “Paisii Hilendarski”, while the table “Air quality data in Plovdiv (Open-Meteo)” provides key pollution indicators obtained through the Open-Meteo API. Due to space constraints, the figure includes only a portion of the full dataset. Complete information remains accessible to users through the Streamlit interface [42], where they can explore it interactively with flexible navigation and filtering options.

Figure 12 shows a graph that visualizes the data collected for PM_2.5 pollutant from the sensors of Plovdiv University “Paisii Hilendarski” and from an external source via API (Open-Meteo). The differences between the various data sources are clearly distinguishable, with the difference measured for the pollutant by the university station, located in the Gagarin district of the city, and the data we receive from the external source of information, marked with a dotted line, being particularly striking. The data from the university station is significantly lower than that from the external source.

Another function of the PLAM platform is to make predictions. We will demonstrate its capabilities in this regard with two examples. Using ForecastLLMAgent, the user can request a pollution forecast for the next 24 h, and the agent generates it using LLM deepseek-v3.2-exp (Figure 13). When applying MAE/RMSE validation in Random Forest Regressor, the selected result shows that the MAE is approximately in the range of 3–7 μg/m³ for PM_2.5 at a 24 h forecast horizon, with a performance improvement of 18–22% over the baseline resilience-based approach. As part of the evaluation and interpretation of the results, thresholds are extracted and used from a domain-specific ontology, allowing for semantic enrichment of the predictions and more accurate classification of pollution conditions with respect to regulatory and health criteria.

When interacting with ChatAgent, the user can chat with PLAM, with the conversation limited to the topic of air pollution. In this particular case, the customer asks whether they should open their windows today, and the model generates its response based on the specific data available at that moment in order to best answer the user’s query (Figure 14).

To verify the correctness of its operation, the PLAM platform logs actions taken by agents and communication modules in the system. Some of the logs can be seen on Figure 15.

Although the PLAM platform is regional and developed primarily for use in the Plovdiv area, it can be adapted for other regions. Initial experiments conducted for the regions of Sofia, Varna, Burgas, and Ruse (Figure 16) show that it can be adapted relatively easily, using public data from air quality measurements for the respective region. The difference is that for the Plovdiv region, we use additional data stored in the structured environment of the agents, which gives greater depth to the analysis. For Varna, for example, 36 exceedances of the PM₁₀ standards were recorded in the first three months of 2025 (Figure 17a), and by mid-September 2025, the exceedances were 40 (Figure 17b).

In contrast to ACreM, the following key aspects are introduced in PLAM: hybrid multi-agent architecture, evaluation across different sources, quantitative validation of forecasting, and integration of the LLM from the perspective of reproducibility. All of these elements turn the local experimental setup of the former ACreM a region-centric air quality monitoring platform PLAM.

Table 1 systematically summarizes the differences between ACreM and PLAM and clearly shows that PLAM is not an iteration of the software, but a fully expanded platform with a new system scope, architectural components, and measurable engineering characteristics. While ACreM was developed as a local experimental system focused on ontology-assisted BDI reasoning, PLAM introduces:

regionally scalable architecture;
hybrid agent model (BDI + LLM-based ReAct);
multi-source, automated comparative profiling;
quantitatively validated KPIs (latency, data volume, prediction errors).
Thus, the novelty of PLAM is simultaneously architectural, methodological, and engineer-measurable.

6. Conclusions and Future Work

This paper presents a new version of a platform supporting air monitoring in the Plovdiv region. The core of the platform is a dedicated air monitoring Chain of Thought. Monitoring and analysis of air quality is demonstrated through various examples. The initial results from the platform’s operation make it possible to seek opportunities for creating a more effective infrastructure for the urban system for objective measurement of air pollutants. Analyses on our platform have identified significant differences in the measurements. Manual analysis examines individual cases and covers a small amount of data, which means that the conclusions and findings are incomplete, partial, and rather intuitive. In contrast to such analyses, those performed using the platform are complete, much more objective, and solidly substantiated. Using our platform, the first conclusion we can draw is that the general perception that there is a discrepancy in the measurements of different institutions is confirmed.

The observed discrepancies arise mainly from differences in the type and calibration of the sensors, their spatial representation, processing algorithms and temporal alignment. This is due to the fact that we do not have access to the calibration of the external sources. The external sources are considered reference for comparison, given the fact that they all have their own calibration procedures and cover a wider area. Also, unlike our sources, the external ones are certified and provide quality control procedures. It is necessary to note that PLAM is transparent about uncertainties and discrepancies and does not hide them. Objectivity in this work is determined by the operational process of source independence and consistent comparison over time.

In the future, it would be appropriate to consider the proposed platform together with existing architectural and methodological solutions for air quality monitoring and analysis. As an illustration, the inclusion of FIWARE in air quality monitoring systems is discussed in detail in [43], presenting an IoT-based solution that integrates Cloudino and FIWARE components for sensor data collection and management. This work can be taken as a direct basis for subsequent actions, as it shows a generic platform implementation focused primarily on data collection and visualization. Therefore, the added value of our architecture, which goes beyond simply extending standard FIWARE AQM systems through semantic interoperability, integration of predictive models, and a deeper analytical layer, should be emphasized.

Furthermore, for “objective profiling” of pollution and source-related analysis, it is appropriate to use modern remote sensing methods with high spatial-temporal resolution. The article [44] reveals a hyperspectral method for determining the hourly distribution of trace gases with a horizontal resolution of 100 m, along with accurate identification of emission sources. All this means that, in addition to conventional ground stations and public APIs, remote sensing and optical methods can significantly deepen the data ecosystem used by platforms such as PLAM, thereby extending their functionalities to determination, source identification, and attribution.

A future direction for the platform’s development is to improve our sensor networks with the ability to measure new parameters that affect air quality. We are also continuing our experiments with the aim of improving the platform’s components.

One way in which PLAM can be further developed is by creating a domain-specific language model (LLM) trained on the vocabulary, standards, ontologies, and real-world cases in the field of air quality monitoring and analysis. Such a model can act as an intelligent mediator between users and the platform, thus allowing the use of natural language for queries, obtaining self-sufficient summaries of analytical results, assisting in their interpretation, and even creating context-aware explanations (e.g., with regard to regulatory thresholds, seasonal fluctuations, meteorological conditions, and possible sources of emissions).

Our first step was to investigate and test various options for creating a specialized small language model that we could call our own. A domain-specific language model has the potential to enhance semantic interoperability by efficiently mapping and aligning the concepts of different data schemas and standards without human intervention. Moreover, by combining textual knowledge (documentation, methodologies, expert rules) and numerical measurements obtained from ground sensors and remote sensing, such a tool can also help detect anomalies. Essentially, for our research, these models represent a very effective solution for merging extremely niche, scarce information on air quality and monitoring. The small language model can be easily fine-tuned for air monitoring tasks, and at the same time, it will enhance the platform’s performance and accuracy. PLAM is intended to mainly address regional problems. Hence, we think that the use of SLM trained with region-specific datasets is a rational move. On one side, it would give us the possibility to optimize the use of our servers and also make the platform less dependent on external resources and information sources. On the other side, the SLM platform fits IoT level since it offers a combination of efficiency, low consumption of computing resources and electricity, quick response, and can provide even better security. Consequently, implementing a domain-specific language model not only aims at making the platform smarter and more functional, but it also contributes to its application in real institutional and scientific scenarios that are usually resource-constrained but can be demanding regarding autonomy and reliability.

Moreover, a recent study [45] claims that the two factors of very rapid development along with the diversity of IoT ecosystems, on top of the fact that they have limited computing power, significantly raise the risk of cybersecurity threats. Thus, this is another reason for implementing lightweight ML/AI-based security mechanisms in IoT platforms. A detailed investigation [46] of the IoT security area confirms the shift in the focus of new technologies, especially machine learning, anomaly detection methods, and lightweight solutions, to the core of robust IoT applications, particularly when there are real-world constraints. Concurrently, a recent paper [47] points out that IoT security models, in turn, are increasingly victim to adversarial machine learning attacks, which can lead to a breach of the trust of ML-based decision-making in critical IoT environments. Hence, a strengthened defense against adversarial manipulation should be considered as one of the main features of IoT security architectures, along with the existing ones such as efficiency and scalability.

The feature of security and resilience in the development of the PLAM platform should probably be given substantial consideration. These two aspects are not only among the hottest topics in research at present, but they are also areas that are changing very rapidly. As the complexity of IoT-based architectures, distributed sensor networks and multi-agent systems increases, there will be a great rise in the probability of attacks on the integrity or availability of data and processes for malicious purposes. Under these conditions, ML-based intrusion detection systems (IDSs) as well as the newest domain adaptation methods can provide capable and effective assistance in detection even at large, weakly labeled IoT scenarios where the transfer of semantic knowledge is done between domains [48]. In addition, generative AI and large language models are increasingly considered as components of the potential next-generation IoT security toolkit; therefore, they might offer solutions for threat detection, automation, and adaptive resilience strategies [45]. On the other hand, the expansion of ML-based security mechanisms paves the way for more new vulnerabilities to be exploited, since through adversarial attacks, major IoT security frameworks such as intrusion detection systems (IDS), malware detection systems (MDS), and device identification systems (DIS) have been consistently penetrated [47]. This thus reiterates the importance of having robust and lightweight defensive mechanisms that are capable of ensuring trustworthy operation even in adversarial conditions in the real world.

At the same time, guaranteeing that the operation is secure requires more visibility and the ability to detect devices operating in the different parts of IoT infrastructures. Previously, large language models were believed possible to be used for real-world IoT device identification by treating diverse network metadata as a language modeling task, and they surprisingly achieved high accuracy as well as robustness even when the data was noisy, incomplete, or under adversarial scenarios [49].

The specific feature is quite relevant to PLAM, whereby it is indispensable at the device level to have trust, accountability, and anomaly detection in order to guarantee the integrity of the monitoring data streams. Meanwhile, recent reviews [50] highlight that future IoT security solutions should be scalable, adaptive, and capable of addressing rapidly evolving threats, where emerging paradigms such as edge computing, federated learning, and explainable AI offer promising, strengthening resilience and compliance. The approaches are very important for monitoring infrastructures with limited resources, where security mechanisms should still be efficient without causing too much computational overhead.

PLAM’s future development path should be that of a platform being such a platform: rich, smart, and secure; seamlessly providing semantic interoperability; having multimodal data sources (ground-based, remote sensing, and hyperspectral); having state-of-the-art analytical and predictive models; and having built-in cybersecurity and resilience features. A comprehensive effort in such a way will change the platform from merely a monitoring tool to an academic and operational environment of a trustworthy institution for the study and management of air quality in real, dynamic, and possibly even hostile situations.

In this case, robustness withstanding not only traditional cyber threats but also adversarial manipulations of ML-based components should be taken as an unconditional prerequisite for the operational reliability of real deployments [47,50].

Author Contributions

Conceptualization, S.N.S. and A.A.G.; methodology, V.V.T.-K. and A.G.S.-D.; software, B.L.B., Y.G.T., G.K.M. and I.S.S.; validation, V.V.T.-K. and A.G.S.-D.; formal analysis, S.N.S.; investigation, S.N.S., A.A.G., V.V.T.-K., A.G.S.-D., B.L.B., Y.G.T., G.K.M. and I.S.S.; resources, A.A.G.; data curation, B.L.B., G.K.M. and I.S.S.; writing—original draft preparation, S.N.S., V.V.T.-K., B.L.B., G.K.M. and Y.G.T.; writing—review and editing, A.G.S.-D. and A.A.G.; visualization, B.L.B. and V.V.T.-K.; supervision, S.N.S.; project administration, A.A.G. All authors have read and agreed to the published version of the manuscript.

Funding

The work was partially supported by the Centre of Excellence in Informatics and ICT under the Grant No BG16RFPR002-1.014-0018, financed by the Research, Innovation and Digitalization for Smart Transformation Programme 2021–2027 and co-financed by the European Union.

Data Availability Statement

The original data presented in the study are openly available at https://meter.ac/gs/nodes/html/current.html (accessed on 20 January 2026); https://open-meteo.com/ (accessed on 20 January 2026); https://aqicn.org/here/ (accessed on 20 January 2026).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PLAM	Plovdiv Air Monitoring
ACreM	Air Credible Monitoring
IoT	Internet of Things
AI	Artificial Intelligence
MAS	Multi-agent systems
WHO	World Health Organization
LLMs	Large language Models
BDI	Belief-Desire-Intention
XAI	Explainable AI
CoT	Chain of Thought
ACL	Agent Communication Language
CSV	“comma-separated values”, a plain text file format used to store tabular data, with each line representing a data record and commas separating the values in each record

References

Stoyanov, S.; Doychev, E.; Stoyanova-Doycheva, A.; Tabakova-Komsalova, V.; Stoyanov, I.; Nedelchev, I. A Regional Multi-Agent Air Monitoring Platform. Future Internet 2025, 17, 112. [Google Scholar] [CrossRef]
An Official Website of the European Union. Air Quality Modeling: Background. Available online: https://aqm.jrc.ec.europa.eu/Section/Assessment/Background (accessed on 21 October 2025).
An Official Website of the European Union. FAIRMODE. Available online: https://fairmode.jrc.ec.europa.eu/ (accessed on 25 October 2025).
European Environment Agency. European Air Quality Index App Now Available in All EU Languages. Available online: https://www.eea.europa.eu/en/newsroom/news/european-air-quality-index-app (accessed on 21 October 2025).
Hooyberghs, H.; De Craemer, S.; Lefebvre, W.; Vranckx, S.; Maiheu, B.; Trimpeneers, E.; Vanpoucke, C.; Janssen, S.; Meysman, F.J.R.; Fierens, F. Validation and optimization of the ATMO-Street air quality model chain by means of a large-scale citizen-science dataset. Atmos. Environ. 2022, 272, 118946. [Google Scholar] [CrossRef]
WHO. Available online: https://www.who.int/teams/environment-climate-change-and-health/air-quality-energy-and-health (accessed on 24 October 2025).
WHO Global Air Quality Guidelines: Particulate Matter (PM_2.5 and PM₁₀), Ozone, Nitrogen Dioxide, Sulfur Dioxide and Carbon Monoxide. Available online: https://www.who.int/publications/i/item/9789240034228 (accessed on 21 October 2025).
SafetyCulture. Available online: https://safetyculture.com/app/air-quality-monitoring-software/ (accessed on 21 October 2025).
Real-Time Air Quality Monitoring Solution. Available online: https://www.aeroqual.com/ (accessed on 23 October 2025).
Real-Time Impact on the Community and the Planet. Available online: https://envirosuite.com/ (accessed on 23 October 2025).
Ullah, I.; Adhikari, D.; Su, X.; Palmieri, F.; Wu, C.; Choi, C. Integration of data science with the intelligent IoT (IIoT): Current challenges and future perspectives. Digit. Commun. Netw. 2025, 11, 280–298. [Google Scholar] [CrossRef]
KAITERRA. Air Quality Monitors Created by Industry Expert. Available online: https://www.kaiterra.com/ (accessed on 21 October 2025).
ERA ENVIRONMENTAL. Air Emissions Management Software. Available online: https://www.era-environmental.com/solutions/environmental/air (accessed on 25 October 2025).
Malashin, I.; Tynchenko, V.; Gantimurov, A.; Nelyub, V.; Borodulin, A. Applications of Long Short-Term Memory (LSTM) Networks in Polymeric Sciences: A Review. Polymers 2024, 16, 2607. [Google Scholar] [CrossRef]
ENVEA. Available online: https://www.envea.global/ (accessed on 25 October 2025).
Schneider, F.D.; Fichtmueller, D.; Gossner, M.M.; Güntsch, A.; Jochum, M.; König-Ries, B.; Le Provost, G.; Manning, P.; Ostrowski, A. Towards an ecological trait-data standard. Br. Ecol. Soc. Methods Ecol. 2019, 10, 2006–2019. [Google Scholar] [CrossRef]
ATMOSPHERE MONITORING SERVICE, ECMWF as Part of the Copernicus Programme, Air Quality. Available online: https://atmosphere.copernicus.eu/air-quality (accessed on 25 October 2025).
ECMWF. Ensemble Forecasting. Available online: https://www.ecmwf.int/en/elibrary/75394-ensemble-forecasting (accessed on 24 October 2025).
Sokhi, R.S.; Moussiopoulos, N.; Baklanov, A.; Bartzis, J.; Coll, I.; Finardi, S.; Friedrich, R.; Geels, C.; Grönholm, T.; Halenka, T.; et al. Advances in air quality research—Current and emerging challenges. Atmos. Chem. Phys. 2022, 22, 4615–4703. [Google Scholar] [CrossRef]
FIWARE. Smart Cities & Quality of Life. FIWARE Foundation. ECMWF. 2023. Available online: https://www.fiware.org/category/smartcities/ (accessed on 24 October 2025).
Garcia, A.; Saez, Y.; Harris, I.; Huang, X.; Collado, E. Advancements in air quality monitoring: A systematic review of IoT-based air quality monitoring and AI technologies. Artif. Intell. Rev. 2025, 58, 275. [Google Scholar] [CrossRef]
Shahid, S.; Brown, D.J.; Wright, P.; Khasawneh, A.M.; Taylor, B.; Kaiwartya, O. Innovations in Air Quality Monitoring: Sensors, IoT and Future Research. Sensors 2025, 25, 2070. [Google Scholar] [CrossRef]
Johnson, T.; Woodward, K. Enviro-IoT: Calibrating low-cost environmental sensors in urban settings. arXiv 2025, arXiv:2502.07596. [Google Scholar] [CrossRef]
Wiese, P.; Kartsch, V.; Guermandi, M.; Benini, L. A Multi-Modal IoT Node for Energy-Efficient Environmental Monitoring with Edge AI Processing. In Proceedings of the 2025 IEEE International Conference on Omni-layer Intelligent Systems (COINS), Madison, WI, USA, 4–6 August 2025; pp. 1–7. [Google Scholar] [CrossRef]
Mengara Mengara, A.G.; Park, E.; Jang, J.; Yoo, Y. Attention-Based Distributed Deep Learning Model for Air Quality Forecasting. Sustainability 2022, 14, 3269. [Google Scholar] [CrossRef]
Soares, P.H.; Monteiro, J.P.; Gaioto, F.J.; Ogiboski, L.; Andrade, C.M.G. Use of Association Algorithms in Air Quality Monitoring. Atmosphere 2023, 14, 648. [Google Scholar] [CrossRef]
Zhang, J.; Xia, W. Prediction of PM_2.5 Concentration on the Basis of Multi-Time Scale Fusion. Processes 2022, 10, 171. [Google Scholar] [CrossRef]
Rahman, M.M.; Joha, M.I.; Nazim, M.S.; Jang, Y.M. Enhancing IoT-Based Environmental Monitoring and Power Forecasting: A Comparative Analysis of AI Models for Real-Time Applications. Appl. Sci. 2024, 14, 11970. [Google Scholar] [CrossRef]
La Malfa, E.; La Malfa, G.; Marro, S.; Zhang, J.M.; Black, E.; Luck, M.; Torr, P.; Wooldridge, M. Large Language Models Miss the Multi-Agent Mark. arXiv 2025, arXiv:2505.21298. [Google Scholar] [CrossRef]
Kozłowski, M.; Asenov, A.; Pencheva, V.; Bęczkowska, S.A.; Czerepicki, A.; Zysk, Z. Autonomous System for Air Quality Monitoring on the Campus of the University of Ruse: Implementation and Statistical Analysis. Sustainability 2025, 17, 6260. [Google Scholar] [CrossRef]
Khan, T.R.; Emerson, Z.I.; Mentz, K.H. Evaluation of Fine Particulate Matter (PM_2.5) Concentrations Measured by Collocated Federal Reference Method and Federal Equivalent Method Monitors in the U.S. Atmosphere 2024, 15, 978. [Google Scholar] [CrossRef]
Nycz, B.; Pietrucha-Urbanik, K. Advancements in Air Quality Monitoring: The Role of Drone Technology. Proceedings 2024, 105, 19. [Google Scholar] [CrossRef]
Yang, J.; Tian, Y.; Wu, C.H. Air Quality Prediction and Ranking Assessment Based on Bootstrap-XGBoost Algorithm and Ordinal Classification Models. Atmosphere 2024, 15, 925. [Google Scholar] [CrossRef]
Rescio, G.; Manni, A.; Caroppo, A.; Carluccio, A.M.; Siciliano, P.; Leone, A. Multi-Sensor Platform for Predictive Air Quality Monitoring. Sensors 2023, 23, 5139. [Google Scholar] [CrossRef]
Kang, G.K.; Gao, J.Z.; Chiao, S.; Lu, S.; Xie, G. Air Quality Prediction: Big Data and Machine Learning Approaches. Int. J. Environ. Sci. Dev. 2018, 9, 8–16. [Google Scholar] [CrossRef]
Mahajan, S.; Kumar, P. Evaluation of low-cost sensors for quantitative personal exposure monitoring. Sustain. Cities Soc. 2020, 57, 102076. [Google Scholar] [CrossRef]
Minlah, M.K.; Zhang, X.; Ganyoh, P.N.; Bibi, A. When the last tree dies, the last man dies: Do forests hold the key to survival in Ghana? A critical analysis using the bootstrap rolling-window Granger causality test approach. Environ. Sci. Pollut. Res. Int. 2023, 30, 45740–45749. [Google Scholar] [CrossRef] [PubMed]
Wei, J.; Wang, X.; Schuurmans, D.; Bosma, M.; Ichter, B.; Xia, F.; Chi, E.; Le, Q.; Zhou, D. Chain-of-thought prompting elicits reasoning in large language models. In Proceedings of the 36th International Conference on Neural Information Processing Systems 35, New Orleans, LA, USA, 28 November–9 December 2022; Curran Associates Inc.: Red Hook, NY, USA, 2022; pp. 24824–24837. [Google Scholar] [CrossRef]
JADEX. Available online: https://www.activecomponents.org/ (accessed on 12 October 2025).
Atomic Agents. Available online: https://bestaiagents.ai/agent/atomic-agents (accessed on 3 December 2025).
LangChain. Available online: https://www.langchain.com/ (accessed on 5 December 2025).
A Faster Way to Build and Share Data Apps. Available online: https://streamlit.io/ (accessed on 8 December 2025).
Baca Gómez, Y.R.; Estrada Esquivel, H.; Martínez Rebollar, A.; Villanueva Vásquez, D. A Novel Air Quality Monitoring Unit Using Cloudino and FIWARE Technologies. Math. Comput. Appl. 2019, 24, 15. [Google Scholar] [CrossRef]
Lu, C.; Li, Q.; Xing, C.; Hu, Q.; Tan, W.; Lin, H.; Lin, J.; Zhang, Z.; Chang, B.; Liu, C. A Novel Hyperspectral Remote Sensing Technique with Hour-Hectometer Level Horizontal Distribution of Trace Gases: To Accurate Identify Emission Sources. J. Remote Sens. 2023, 3, 0098. [Google Scholar] [CrossRef]
Alwahedi, F.; Aldhaheri, A.; Ferrag, M.A.; Battah, A.; Tihanyi, N. Machine learning techniques for IoT security: Current research and future vision with generative AI and large language models. Internet Things Cyber-Phys. Syst. 2024, 4, 167–185. [Google Scholar] [CrossRef]
Sebestyen, H.; Popescu, D.E.; Zmaranda, R.D. A Literature Review on Security in the Internet of Things: Identifying and Analysing Critical Categories. Computers 2025, 14, 61. [Google Scholar] [CrossRef]
Khazane, H.; Ridouani, M.; Salahdine, F.; Kaabouch, N. A Holistic Review of Machine Learning Adversarial Attacks in IoT Networks. Future Internet 2024, 16, 32. [Google Scholar] [CrossRef]
WWu, J.; Wang, Y.; Xie, B.; Li, S.; Dai, H.; Ye, K.; Xu, C. Joint Semantic Transfer Network for IoT Intrusion Detection. IEEE Internet Things J. 2022, 10, 3368–3383. [Google Scholar] [CrossRef]
Rameen, M.; Ahmed, T.; Peddinti, S.T.; Huang, D.Y. Large Language Models for Real-World IoT Device Identification. arXiv 2025, arXiv:2510.13817. [Google Scholar] [CrossRef]
Alfahaid, A.; Alalwany, E.; Almars, A.M.; Alharbi, F.; Atlam, E.; Mahgoub, I. Machine Learning-Based Security Solutions for IoT Networks: A Comprehensive Survey. Sensors 2025, 25, 3341. [Google Scholar] [CrossRef]

Figure 1. Architecture of the PLAM platform.

Figure 2. General diagram of PLAM-CoT.

Figure 3. Fixed command.

Figure 4. Sequence diagram of a data-processing cycle.

Figure 5. Sequencing diagram of chat function using contextual memory.

Figure 6. Sample from a general analysis of the air quality.

Figure 7. Example of PM2.5 measurement discrepancy.

Figure 8. Table of deviations.

Figure 9. Matches between values from different sources at high humidity.

Figure 10. Analysis generated by the platform (segment).

Figure 11. Example of platform’s working data.

Figure 12. Graph visualization of the data collected for PM_2.5 pollutant.

Figure 13. Forecast for the next 24 h.

Figure 14. Dialogue between user and platform.

Figure 15. Excerpt from the PLAM protocol.

Figure 16. Map of Bulgaria with marked experimental regions—Sofia, Varna, Burgas, and Ruse.

Figure 17. Adaptation of the platform for the region of Varna.

Table 1. Comparison between ACreM and PLAM platforms.

Aspect	ACreM (First Version)	PLAM (This Work)	Improvement/Novelty
System scope	Local experimental platform	Regional, extensible platform	Support for multi-city deployment
Agent architecture	Classical BDI agents only	Hybrid BDI + LLM-based ReAct agents	Combines deliberative planning with fast reactive reasoning
Reasoning paradigm	Rule-based and BDI planning	Hybrid Chain-of-Thought (BDI + LLM)	Improved interpretability and flexibility
Data sources	Proprietary sensor network	Proprietary sensors + Open-Meteo, IQAir, AQICN	Heterogeneous data fusion
Cross-source comparison	Implicit, manual reference selection	Automated, agent-driven comparison	Continuous, objective discrepancy detection
Reference handling	Single intuitive reference source	Multi-source comparative reference	Uncertainty-aware objectivity
Forecasting capability	Limited/exploratory	Integrated Random Forest forecasting (24 h)	Quantitatively evaluated prediction module
Forecast validation	Not reported	MAE, RMSE, baseline comparison	Reproducible evaluation protocol
LLM integration	Not available	Structured LLM agents with templates	Natural language explanations and analysis
LLM reproducibility controls	Not applicable	Fixed temperature, prompt templates, post-validation	Reduced hallucination risk
Ontology usage	Static semantic layer	Active ontology-driven reasoning	Dynamic thresholding and alerts
User interaction	Dashboard only	Dashboard + conversational chat	Personalized explanations and recommendations
Automation level	Semi-automated workflows	Fully autonomous event-driven operation	Reduced need for human intervention
System latency	Not measured	<12 s end-to-end	Engineering-level validation
Scalability evidence	Single city (Plovdiv)	Pilot tests in 5 cities	Demonstrated regional portability
Data volume handling	Not specified	~1.2 M records/month	Quantified system capacity

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Stoyanov, S.N.; Belichev, B.L.; Tabakova-Komsalova, V.V.; Todorov, Y.G.; Golev, A.A.; Maglizhanov, G.K.; Stoyanov, I.S.; Stoyanova-Doycheva, A.G. Building a Regional Platform for Monitoring Air Quality. Future Internet 2026, 18, 78. https://doi.org/10.3390/fi18020078

AMA Style

Stoyanov SN, Belichev BL, Tabakova-Komsalova VV, Todorov YG, Golev AA, Maglizhanov GK, Stoyanov IS, Stoyanova-Doycheva AG. Building a Regional Platform for Monitoring Air Quality. Future Internet. 2026; 18(2):78. https://doi.org/10.3390/fi18020078

Chicago/Turabian Style

Stoyanov, Stanimir Nedyalkov, Boyan Lyubomirov Belichev, Veneta Veselinova Tabakova-Komsalova, Yordan Georgiev Todorov, Angel Atanasov Golev, Georgi Kostadinov Maglizhanov, Ivan Stanimirov Stoyanov, and Asya Georgieva Stoyanova-Doycheva. 2026. "Building a Regional Platform for Monitoring Air Quality" Future Internet 18, no. 2: 78. https://doi.org/10.3390/fi18020078

APA Style

Stoyanov, S. N., Belichev, B. L., Tabakova-Komsalova, V. V., Todorov, Y. G., Golev, A. A., Maglizhanov, G. K., Stoyanov, I. S., & Stoyanova-Doycheva, A. G. (2026). Building a Regional Platform for Monitoring Air Quality. Future Internet, 18(2), 78. https://doi.org/10.3390/fi18020078

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Building a Regional Platform for Monitoring Air Quality

Abstract

1. Introduction

2. State of the Art

2.1. Established Monitoring Systems and Regulatory Frameworks

2.2. The Rise of Smart Platforms and the IoT

2.3. Applications for Artificial Intelligence and Multi-Agent Systems

2.4. Key Challenges and Prospects

2.5. The State of Research

3. General Characteristics and Architecture of the PLAM Platform

4. PLAM-CoT

5. Demonstration Example

6. Conclusions and Future Work

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI