Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q
Abstract
:1. Introduction
1.1. Related Works
1.2. Contribution and Structure
- The algorithm can make the source node find the optimal path to transmit to the destination node, which has the advantages of fast convergence speed and improved communication efficiency.
- This paper proposes an anti-jamming algorithm based on Dyna-Q. This is the first use of Dyna-Q in anti-jamming communication.
2. System Model and Problem Formulation
2.1. System Model
- In a wireless communication network, there are communication nodes. The wireless communication links are non-interfered links. The communication node set is defined as , among which is the th node. In particular, is the source node, and is the destination node, where “node ” and “node ” refer to the th node. If the energy of all nodes is large enough, there are next hop nodes for the current node to select. The location set of all communication nodes is denoted as . The adjacent node set of the current node can be judged by the location set of all communication nodes [24]. In practice, all nodes have limited energy. The current node can only select neighboring nodes. Within an area, there is a jammer that continuously interferes with the surrounding area at constant power , as shown in Figure 1.
- When each node transmits information, the channel bandwidth is . Each packet transmitted from the source node to the target node has a length of bits. Each communication node receives all bits of the whole packet before forwarding it to the subsequent nodes. Assuming that all nodes have limited energy, the current node can only select neighboring nodes.
- Location information and link status between nodes are shared through a common control link. However, the specific position and jamming range of jammers are unknown to communication nodes.
- The jammer executes precise jamming aimed at tracking frequency and communication signal time alignment, which will lead to effective jamming in both the time domain and frequency domain. This jamming effect is equivalent to full-frequency blocking jamming. Additive White Gaussian noise exists in all channels, and its power spectral density is .
- It is assumed that when a communication node that is not jammed is selected, namely, , the modulation mode of the current communication node is , and the transmission is carried out at the full rate . If a communication node that is partially disturbed is selected, namely, if , the modulation mode of the current communication node is , and the transmission is carried out at the half-rate . If a communication node that is seriously disturbed is selected, namely, if , the transmission rate is 0, and the nodes cannot communicate with each other. Here, , , and are the threshold values of , which gradually decrease.
2.2. Problem Formulation
3. An Anti-Jamming Algorithm Based on Dyna-Q
Algorithm 1. Anti-jamming algorithm for wireless communication network |
. |
Fordo |
(1) In the current environment state , the transmitter performs the last decision action , or the initial action to select the next communication node; |
(2) After taking the action , the payoff r and the state at the next moment can be obtained. |
(3) According to Equation (7), the Q function is updated. |
(4) The latest quad is used to update . |
(5) n cycles are performed: |
① Random quads are selected from ; |
② The Q function is updated according to Equation (7). (6) The receiver gets the transmitter action at the next moment according to the following criteria and sends it back to the transmitter: |
end for |
4. Simulation Results and Analysis
4.1. Parameter Settings
- MAB model optimal anti-jamming path selection algorithm: By using the UCBI algorithm to select the arm, in the first K round experiment, all available communication nodes were tested once for the same communication node, and the results of each experiment are independent identically distributed. In subsequent experiments, the selected nodes were determined by the average income of each communication node in the previous K round experiment and the number of times each communication node was selected.
- Classic Q-learning: The communication node constantly interacts with the environment and selects the communication node by the next hop through feedback and return.
4.2. Analysis of Simulation
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Li, G.Y.; Xu, Z.; Xiong, C.; Yang, C.; Zhang, S.; Chen, Y.; Xu, S. Energy-efficient wireless communications: Tutorial, survey, and open issues. IEEE Wirel. Commun. 2011, 18, 28–35. [Google Scholar] [CrossRef]
- Klaus, W.; Puttnam, B.J.; Luis, R.S.; Sakaguchi, J.; Mendinueta, J.D.; Awaji, Y.; Wada, N. Advanced space division multiplexing technologies for optical networks [Invited]. J. Opt. Commun. Netw. 2017, 9, C1–C11. [Google Scholar] [CrossRef]
- Lin, J.; Tian, B.; Wu, J.; He, J. Spectrum Resource Trading and Radio Management Data Sharing Based on Blockchain. In Proceedings of the 2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education (ICISCAE), Dalian, China, 27–29 September 2020; pp. 83–87. [Google Scholar]
- Zou, Y.; Zhu, J.; Wang, X.; Hanzo, L. A Survey on Wireless Security: Technical Challenges, Recent Advances, and Future Trends. Proc. IEEE 2016, 104, 1727–1765. [Google Scholar] [CrossRef] [Green Version]
- Sun, Z. Hop Number and Relay Nodes Optimization in Clustering Routing Algorithms. Minicomput. Syst. 2019, 40, 1299–1305. [Google Scholar]
- Haoyu, Z.; Yan, H.; Xinxing, X.; Nengling, T. Optimal Configuration of Integrated Energy System Considering Energy Transmission Paths. In Proceedings of the 2021 4th International Conference on Energy, Electrical and Power Engineering (CEEPE), Chongqing, China, 23–25 April 2021; pp. 1119–1124. [Google Scholar]
- Miao, J. Multi-hop routing algorithm based on genetic algorithm and K-means clustering algorithm used in WSN. Mod. Electron. Technol. 2021, 44, 42–48. [Google Scholar]
- Marti, S.; Giuli, T.J.; Lai, K.; Baker, M. Mitigating Routing Misbehavior in Mobile ad Hoc Networks. 2000. Available online: https://www.cs.cmu.edu/~srini/15-829/readings/marti-giuli-lai-baker-mitigating-routing-misbehavior.pdf (accessed on 15 June 2022).
- Lu, B.; Pooch, U.W. Cooperative security-enforcement routing in mobile ad hoc networks. In Proceedings of the 4th International Workshop on Mobile and Wireless Communications Network 2002, Stockholm, Sweden, 9–11 September 2002; pp. 157–161. [Google Scholar]
- Bhatnagar, M.R.; Mallik, R.K.; Tirkkonen, O. Performance Evaluation of Best-Path Selection in a Multihop Decode-and-Forward Cooperative System. IEEE Trans. Veh. Technol. 2016, 65, 2722–2728. [Google Scholar] [CrossRef]
- Yang, W. Analysis and improvement of MDCE routing algorithm. Comput. Appl. Softw. 2021, 38, 105–110. [Google Scholar]
- Lin, M. Research on Anti-Jamming Routing Algorithm for ISM Band Multi-Channel Wireless Sensor Networks; Nanjing University of Technology: Nanjing, China, 2021. [Google Scholar]
- Khan, R.; Jayakody, D.N.K. Full Duplex Component—Forward Cooperative Communication for a Secure Wireless Communication System. Electronics 2020, 9, 2102. [Google Scholar] [CrossRef]
- Do, D.-T.; Van Nguyen, M.-S.; Hoang, T.-A.; Voznak, M. NOMA-assisted multiple ac cess scheme for IoT deployment: Relay selection model and secrecy performance improvement. Sensors 2019, 19, 736. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Wang, Y.T. Research on Anti-Jamming Method of Wireless Communication Based on Reinforcement Learning; Army Engineering University: Nanjing, China, 2019. [Google Scholar]
- Abunada, A.H.; Osman, A.Y.; Khandakar, A.; Chowdhury, M.E.H.; Khattab, T.; Touati, F. Design and Implementation of a RF Based Anti-Drone System. In Proceedings of the 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies, Doha, Qatar, 2–5 February 2020; pp. 42–45. [Google Scholar]
- Lu, Z.; Song, J.; Zheng, C.; Xu, W.; Wang, X. Generalized State Space Average-value Model of MAB Based Power Electrical Transformer. In Proceedings of the 2021 5th International Conference on Power and Energy Engineering (ICPEE), Xiamen, China, 2–4 December 2021; pp. 46–52. [Google Scholar]
- Chen, M.; Poor, H.V.; Saad, W.; Cui, S. Convergence Time Minimization of Federated Learning over Wireless Networks. In Proceedings of the ICC 2020–2020 IEEE International Conference on Communications (ICC), Dublin, Ireland, 7–11 June 2020; pp. 1–6. [Google Scholar]
- Almeida, P.J.; Lieb, J. Complete j-MDP Convolutional Codes. IEEE Trans. Inf. Theory 2020, 12, 7348–7359. [Google Scholar] [CrossRef]
- Xie, G.; Sun, L.; Wen, T.; Hei, X.; Qian, F. Adaptive Transition Probability Matrix-Based Parallel IMM Algorithm. IEEE Trans. Syst. Man Cybern. Syst. 2021, 51, 2980–2989. [Google Scholar] [CrossRef]
- Richard, S.S.; Andrew, G.B. Reinforcement Learning; Electronic Industry Press: Beijing, China, 2021. [Google Scholar]
- Su, W.; Tao, J.; Pei, Y.; You, X.; Xiao, L.; Cheng, E. Reinforcement Learning Based Efficient Underwater Image Communication. IEEE Commun. Lett. 2021, 25, 883–886. [Google Scholar] [CrossRef]
- Zhou, Y. Fundamentals of Information Theory; Beijing University of Aeronautics and Astronautics Press: Beijing, China, 2019. [Google Scholar]
- Sun, J. A Rapid Deeployment Method for Wireless Emergency Communication Relay Nodes in Complex Environment. Astrom. Tech. 2022, 42, 45–50. [Google Scholar]
- Vlassis, N. A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence; Morgan and Claypool Publishers: Williston, VT, USA, 2007. [Google Scholar]
Parameters | Value |
---|---|
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, G.; Li, Y.; Niu, Y.; Zhou, Q. Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q. Electronics 2022, 11, 2397. https://doi.org/10.3390/electronics11152397
Zhang G, Li Y, Niu Y, Zhou Q. Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q. Electronics. 2022; 11(15):2397. https://doi.org/10.3390/electronics11152397
Chicago/Turabian StyleZhang, Guoliang, Yonggui Li, Yingtao Niu, and Quan Zhou. 2022. "Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q" Electronics 11, no. 15: 2397. https://doi.org/10.3390/electronics11152397
APA StyleZhang, G., Li, Y., Niu, Y., & Zhou, Q. (2022). Anti-Jamming Path Selection Method in a Wireless Communication Network Based on Dyna-Q. Electronics, 11(15), 2397. https://doi.org/10.3390/electronics11152397