Advanced Reinforcement Learning in Internet of Things Networks

A special issue of Mathematics (ISSN 2227-7390). This special issue belongs to the section "E1: Mathematics and Computer Science".

Deadline for manuscript submissions: 10 November 2025 | Viewed by 541

Special Issue Editors


E-Mail Website
Guest Editor
Research Centre for Computational Science and Mathematical Modelling, University of Warwick, Coventry, UK
Interests: machine learning; reinforcement learning; internet of things; communication
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
1. School of Computing, Electronics and Mathematics, Coventry University, Coventry, UK
2. Associate, Faculty Research Centre for Data Science, Coventry University, Coventry, UK
Interests: channel measurements and modeling; (wearable) internet of things; vehicular communications; mmwave communications; future cities
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

The proliferation of the Internet of Things (IoT) has revolutionised how devices, systems, and people interact. IoT networks are increasingly complex, dynamic, and heterogeneous, posing significant challenges in resource allocation, energy efficiency, security, and scalability. Reinforcement learning (RL), particularly deep RL and multi-agent RL, has emerged as a powerful tool to address these challenges, enabling intelligent decision making and adaptive control in IoT ecosystems.

This Special Issue seeks to gather innovative research and cutting-edge developments in applying RL techniques to optimise IoT networks. We invite contributions addressing theoretical advancements, novel algorithms, system architectures, and practical applications of RL in the context of IoT. Submissions focusing on interdisciplinary approaches and real-world deployments are especially encouraged.

Topics of Interest

The topics of interest include, but are not limited to, the following:

  • Energy efficiency: RL-based techniques for energy harvesting, power management, and extending IoT device lifetimes;
  • Network optimisation: Dynamic resource allocation, routing, and bandwidth optimisation using RL;
  • Multi-agent systems: Collaborative and competitive RL for IoT devices in distributed and decentralised networks;
  • Security and privacy: RL-driven solutions for intrusion detection, secure communication, and data protection in IoT networks;
  • Edge Computing: RL for optimising computation offloading and workload distribution in resource-constrained environments;
  • 5G and beyond: Reinforcement learning for IoT integration with 5G/6G networks, network slicing, and ultra-low-latency communications;
  • Autonomous IoT management: RL approaches for self-healing, self-configuring, and self-optimising IoT networks;
  • Interdisciplinary research: RL applications combining IoT with smart cities, smart healthcare, smart manufacturing, and other domains;
  • Scalability and adaptability: RL algorithms designed for large-scale, highly dynamic IoT environments;
  • Benchmarking and evaluation: Real-world case studies, datasets, and performance benchmarks for RL in IoT systems.;
  • Physical layer: RL-based strategies for channel optimisation, interference management, and localization accuracy in IoT networks;
  • Responsible AI in critical infrastructure networks: Infrastructure (e.g., healthcare, autonomous vehicles, space systems) investigating the ethical and legal implications of AI-driven systems in sectors, with an emphasis on the need for trustworthy AI;
  • Resilient ML in critical infrastructure networks (e.g., healthcare, autonomous vehicles, space systems): Enhancing the resilience of critical infrastructure through the integration of AI and edge computing, aiming to improve system robustness and responsiveness;
  • AI safety and robustness Exploring safety aspects of AI, including the development of methods for AI systems to behave as intended and be robust against adversarial attacks.

Dr. Saurav Sthapit
Dr. Seong Ki Yoo
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Mathematics is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • reinforcement learning (RL)
  • machine learning (ML)
  • internet of things (IoT)
  • resilient machine learning
  • physical layer
  • non-terrestrial network
  • optimization
  • 5G and 6G
  • vehicular network

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (1 paper)

Order results
Result details
Select all
Export citation of selected articles as:

Research

20 pages, 1685 KB  
Article
Small Language Model-Guided Quantile Temporal Difference Learning for Improved IoT Application Placement in Fog Computing
by Bhargavi Krishnamurthy and Sajjan G. Shiva
Mathematics 2025, 13(17), 2768; https://doi.org/10.3390/math13172768 - 28 Aug 2025
Viewed by 310
Abstract
The global market for fog computing is expected to reach USD 6385 million by 2032. Modern enterprises rely on fog computing since it offers computational resources at edge devices through decentralized computation mechanisms. One of the crucial components of fog computing is the [...] Read more.
The global market for fog computing is expected to reach USD 6385 million by 2032. Modern enterprises rely on fog computing since it offers computational resources at edge devices through decentralized computation mechanisms. One of the crucial components of fog computing is the proper placement of applications on fog nodes (edge devices, Internet of Things (IoT)) for servicing. Large-scale, geographically distributed fog networks and heterogeneity of fog nodes make application placement a challenging task. Quantile Temporal Difference Learning (QTDL) is a promising distributed form of a reinforcement learning algorithm. It is superior compared to traditional reinforcement learning as it learns the act of prediction based on the full distribution of returns. QTDL is enriched by a small language model (SLM), which results in low inference latency, reduced costs of operation, and also enhanced rates of learning. The SLM, being a lightweight model, has policy-shaping capability, which makes it an ideal choice for the resource-constrained environment of edge devices. The data-driven quantiles of temporal difference learning are blended with the informed heuristics of the SLM to prevent quantile loss and over- or underestimation of the policies. In this paper, a novel SLM-guided QTDL framework is proposed to perform task scheduling among fog nodes. The proposed framework is implemented using the iFogSim simulator by considering both certain and uncertain fog computing environments. Further, the results obtained are validated using expected value analysis. The performance of the proposed framework is found to be satisfactory with respect of the following performance metrics: energy consumption, makespan time violations, budget violations, and load imbalance ratio. Full article
(This article belongs to the Special Issue Advanced Reinforcement Learning in Internet of Things Networks)
Show Figures

Figure 1

Back to TopTop