Abstract
Due to their inherent openness, wireless sensor networks (WSNs) are vulnerable to eavesdropping attacks. To address the issue of secure Internet Key Exchange (IKE) in WSNs that lack reliable third parties such as a CA/PKI (Certificate Authority/Public Key Infrastructure), a novel key synchronization method named NDPCS-KS is proposed in this paper. First, through an initial negotiation process, both ends of the main channel generate the same initial key seeds from the Channel State Information (CSI). Negotiation keys and a negative database (NDB) are then generated synchronously at the two ends from the initial key seeds. In a second negotiation, the NDB is used to filter the negotiation keys and obtain the keys for encryption. NDPCS-KS reduces the risk of information leakage, since the keys are never transmitted directly over the network, and eavesdroppers cannot acquire the initial key seeds because their eavesdropping channels are physically isolated from the main channels. Furthermore, because reversing an NDB is NP-hard, even if an attacker obtains the NDB, deducing the initial key seeds is computationally infeasible. It is therefore exceedingly difficult for attackers to generate legitimate encryption keys without the NDB or the initial key seeds. Moreover, a lightweight anti-replay and identity verification mechanism is designed to deal with replay and forgery attacks. Experimental results show that NDPCS-KS incurs less time overhead and produces keys with stronger randomness than other methods, and it can effectively counter replay, forgery, and tampering attacks.
1. Introduction
The Internet of Things (IoT) has been widely applied in people’s daily lives. It finds applications in both civilian domains, such as e-commerce, smart homes, smart cities, and healthcare [,,,,], and military fields, such as military reconnaissance []. Among the enormous number of IoT devices, wireless sensor networks (WSNs) are commonly utilized for their low cost, reliability, scalability, and ease of deployment. Since WSNs may be deployed in untrustworthy environments, they are highly susceptible to network attacks, and secure data transmission in WSNs has therefore attracted considerable attention from researchers []. Secure transmission in WSNs involves ensuring data confidentiality, integrity, and reliability. Researchers have proposed various methods to secure data transmission in WSNs, including traditional encryption algorithms, lightweight block cipher algorithms, and physical layer security []. The communication key remains the most crucial security parameter in all these methods. However, traditional Internet Key Exchange (IKE) typically requires the pre-distribution of keys or relies on centralized key servers, such as a CA/PKI (Certificate Authority/Public Key Infrastructure). In WSNs, many sensor nodes dynamically join or leave the network, and the network topology constantly changes. A centralized key server may become a bottleneck or a target for attackers, leading to a single point of failure, so relying on a CA/PKI for key distribution in WSNs is challenging. Although pre-distributing fixed keys avoids the risk of key leakage during transmission, this approach lacks flexibility and security: using the same key for data encryption over a long period increases the risk of key compromise. Furthermore, key generation algorithms based on computational hardness pose efficiency and cost issues for WSNs with limited computing capabilities. Thus, there is a need for a novel key synchronization method suitable for WSNs, allowing sensor nodes to autonomously and securely synchronize communication keys without exchanging critical negotiation information over the open network.
Consequently, this paper proposes a novel Negative Database and Physical Channel State Key Synchronization method (NDPCS-KS) for WSNs. NDPCS-KS exploits the physical isolation of channels between sensor nodes to synchronously generate initial key seeds at both ends of a communication link from the Channel State Information (CSI) in the initial negotiation phase. Based on the initial key seeds, negotiation keys and negative databases (NDBs) are generated. The NDBs are then used to conduct a second negotiation, which filters the negotiation keys to obtain the final communication keys. The entire negotiation process, comprising both the initial negotiation and the second negotiation, is referred to as dual negotiation.
Additionally, a lightweight anti-replay and identity verification mechanism is introduced to prevent replay attacks and counter forgery and tampering. NDPCS-KS operates without reliable third parties such as a CA/PKI; it uses hash tokens maintained among sensor nodes to prevent replay and computes hash values to verify identities, ensuring the integrity and reliability of data transmission. The main contributions of this paper are outlined as follows:
- Avoidance of Confidential Information Leakage: NDPCS-KS avoids reliance on Certificate Authorities (CAs) or Public Key Infrastructure (PKI), eliminating the need for pre-distributed keys or dependence on centralized key servers. It dynamically generates keys based on the current state of the physical channels, ensuring no confidential information is transmitted over the network in the key exchange process, preventing the leakage of secrets.
- Secure Key Exchange Mechanism: By leveraging the isolation of the physical channels, NDPCS-KS prevents attackers from eavesdropping on the main channel’s state information to obtain the initial key seeds. For the first time, the NDB is employed in the IKE process of WSNs to generate encryption keys. The irreversible nature of the NDB prevents attackers from generating legitimate keys.
- Resistance to Attacks: NDPCS-KS resists replay attacks, identity forgery, and data packet tampering, ensuring data integrity and reliability. Compared to traditional key exchange mechanisms, it offers higher security and flexibility.
- Extensive Experimental Validation: Extensive experiments have been conducted to compare the effectiveness and feasibility of NDPCS-KS with other methods, demonstrating its effective performance and capability to resist various attacks.
The organization of this paper is as follows: Section 2 introduces related work, the current state of research on key generation in WSNs, the current state of research on key negotiation and quantization, and the concept and research status of the NDB; Section 3 introduces the system model; Section 4 introduces the key technologies and algorithms; Section 5 analyzes the security of NDPCS-KS; Section 6 validates the effectiveness and security of NDPCS-KS through experiments; Section 7 concludes the paper.
2. Related Work
2.1. Current State of Research on Key Generation in WSNs
For the security of the IKE process in WSNs, researchers have initially focused on pre-distribution key schemes. The basic idea involves distributing shared keys between nodes and base stations before network deployment. Dai et al. [] proposed a matrix-based key pre-distribution scheme for secure communication in WSNs, utilizing the Lower–Upper matrix method for efficient key establishment. This method ensures reliable connectivity and enhances resilience against node capture by adopting polynomial-based key pre-distribution. Kuang [] improved upon the basic random key pre-distribution scheme by enhancing network connectivity and security performance. The scheme introduces a “neighbor” node expansion protocol to protect isolated nodes and establish secure paths, thus increasing the global connectivity and security of the network. Yin et al. [] proposed a multipath key establishment mechanism based on secret sharing to counteract attacks in WSNs. This mechanism can be integrated with other pre-distribution schemes to protect the network from various attacks, including stop-forwarding attacks and key leakage through intermediate nodes. Pre-distribution key methods face issues with poor flexibility and scalability. In larger networks, pre-distribution schemes can lead to key distribution and management becoming bottlenecks and increase the system’s vulnerability.
Researchers have proposed dynamic key distribution methods to address the limitations of pre-distribution schemes. These methods dynamically generate temporary keys, thus overcoming the flaws of static pre-allocated key schemes. Shu et al. [] introduced a dynamic nonlinear key distribution method to reduce the number of keys shared between nodes. They utilize a seed key to encrypt and distribute keys, deriving data keys from the distributed keys and using the data keys for data encryption. They achieved confidential communication in WSNs through dynamic nonlinear key distribution with minimal shared seed keys. Chen et al. [] developed a dynamic key management scheme that uses a shared seed key, node identifiers, packet counts, and random numbers to generate dynamic distribution keys. It generates a distribution key in a single communication packet and uses it only once. Chen also proposed key management principles for generating distribution keys and introduced a confidentiality algorithm. It showed that the dynamic key management scheme is secure and feasible, enhancing network security as keys can be immediately replaced after data transmission, reducing the risk of key leakage. However, dynamic key distribution methods also face challenges, such as the frequency of key updates, distribution efficiency, and ensuring timely key updates without adding excessive communication load. Yousefpoor and Barati [] proposed an intelligent dynamic key management system designed for WSNs, employing fuzzy logic for path key generation and adding new nodes to the network. The system adopts a hierarchical network approach and leverages pre-allocation and post-deployment mechanisms for key distribution. The simulation results of this system indicated that compared with other key management systems, it is more efficient regarding communication load, storage space, and energy consumption.
In response to the limitations of dynamic key distribution methods, recent studies have proposed key generation methods based on wireless channels. These methods utilize the physical layer characteristics of wireless communications, such as signal strength or the randomness of channel responses, to generate keys. The advantage of this approach is that it does not rely on traditional key distribution protocols but instead exploits the inherent properties of the wireless channel itself. This enhances the security of key generation, as the isolation features of the physical channels make it difficult for external attackers to eavesdrop on the channel state. Xiong et al. [] introduced a new device-to-device key generation strategy involving multiple randomnesses and a cooperative communication mechanism assisted by relay nodes. It allows Alice and Bob to acquire correlated information and generate keys by alternately transmitting random signals during each coherent period, leveraging the randomness and reciprocity of the wireless propagation channel. Altun et al. [] present a secret key generation method for WSNs by using physical layer features. The method addresses latency and bandwidth issues in dense networks by exploiting the superposition property of wireless channels. It also provides security against eavesdroppers by using simultaneous transmissions in the same time and frequency block, ensuring scalability and secure communication in highly populated networks. Wei and Saha [] proposed KNEW, which is a novel method that extracts implicit features from reciprocal channels to derive keys with high consistency. KNEW simultaneously trains two neural networks to map each other’s channel estimation into different domains, namely latent spaces, which remain inaccessible to adversaries. The model also minimizes the distance between the latent spaces generated by the two trusted nodes, improving the key inconsistency rate. Rangarajan [] proposed a sensor node security enhancement method in WSNs based on an improved Diffie–Hellman key exchange, enhancing security by generating hashes for the payload transmitted over the network; thus, it can defend against man-in-the-middle attacks. Moara-Nkwe et al. [] propose a physical layer secure communication scheme tailored for static and low-latency industrial IoT environments. The key contributions include the introduction of a multi-antenna controller and single-antenna sensor communication model, analysis of passive eavesdropping and active attack threats, and the development of a Random Pilots (RPs)-based key generation method. The scheme enhances security and transmission efficiency through self-interference-assisted one-time pad encryption, effectively addressing the challenges posed by stable channel conditions over time. Ji et al. [] present a new physical layer secure key generation and refreshment scheme designed for WSNs. The process involves generating initial keys using pilot signals and channel estimation, which is followed by error correction and privacy amplification to ensure key security. The scheme introduces a key refreshment mechanism that incorporates information from all previously generated keys, forming a hash key chain to enhance long-term security. Experimental results validate the scheme’s effectiveness and security under various channel conditions, offering a practical solution for secure key management in WSNs.
Aldaghri et al. [] proposed a new method for generating secret keys in static environments by introducing randomness to address the issue of ultra-low key generation rates in traditional methods. This method, called “induced randomness”, allows two parties (Alice and Bob) to utilize the unique characteristics of the wireless channel to generate high-rate secret keys. Alice and Bob independently generate local random values and combine them with the inherent randomness of the wireless channel to create highly correlated samples for key generation. This method can be applied in both direct communication and relay-assisted communication scenarios. The induced randomness method achieves secure and efficient key generation without complex hardware or high mobility. These protocols also apply in dynamic environments, provided the wireless channel remains unchanged during each random value exchange. Chen et al. [] proposed a physical layer key generation scheme for WLAN MIMO systems based on feature fusion autoencoder (FFAEncoder) to address the issue of high key disagreement rate (KDR). This method extracts amplitude and phase features from CSI separately, fuses them through multiplication operations in a neural network and uses an autoencoder to extract standard features. Evaluations on multiple real-world scenario datasets show that the mean squared error (MSE) and mean absolute error (MAE) of the codewords between the transmitter and receiver are lower than those of current models, demonstrating better key generation performance. Chen et al. [] proposed a Physical Layer Key Generation (PLKG) scheme based on Bidirectional Convergence Feature Learning (BCFL), which processes the obtained CSI data through deep learning to enable legitimate communication devices to acquire highly correlated channel characteristics during channel estimation. The BCFL scheme has advantages in execution time and Key Generation Rate (KGR), and it enhances the utilization efficiency of channel characteristics during the feature quantization stage through a multi-level quantization method. Wunder et al. [] proposed the Full Duplex–Bisparse Blind Deconvolution (FD–BBD) scheme, which uses full duplex communication and bisparse blind deconvolution technology to achieve secure and efficient key generation in Wireless Personal Area Networks (WPANs). This method leverages the reciprocity of the wireless channel, rather than the channel’s entropy, to generate secret keys, making it suitable for various wireless environments, including those with high noise and signal fading. Through full duplex communication, legitimate parties can agree on a common secret key without channel probing. The current key distribution methods are limited mainly by their complex key management, computational overhead, and memory overhead, making them difficult to implement.
2.2. Introduction on Key Negotiation and Quantization
The application of key negotiation quantization technology in WSNs has attracted widespread attention. The primary goal is to optimize the key negotiation process under limited energy and bandwidth conditions, ensuring network security and improving communication efficiency. Due to the resource constraints of WSN nodes, traditional key negotiation protocols are not always applicable. Quantization technology plays a critical role in reducing communication overhead and energy consumption. In recent years, with the widespread application of WSNs, the use of quantization technology in key negotiation has further developed. Cao et al. [] proposed an innovative key generation method that skips the traditional privacy amplification step and relies on multi-bit quantization and sample position coordination techniques. This method not only increases the key generation rate but also significantly enhances the security and consistency of the key through sample decorrelation. Hua [] introduced the Generalized Channel Probing (GCP) and Generalized Preprocessing (GPP) technologies, which not only optimize channel capacity in non-reciprocal channel environments but also improve key generation efficiency in multi-antenna systems, breaking the dependency on channel reciprocity for key generation. Li et al. [] addressed the issues of non-reciprocity and high correlation in OFDM channels by proposing a preprocessing mechanism based on Principal Component Analysis (PCA). Through the optimized preprocessing of channel measurement data, the performance of quantization and information reconciliation was significantly improved, ensuring a higher key generation rate and randomness. Wang et al. [] introduced randomness into the quantization process by employing Hadamard matrices, significantly reducing the correlation of quantized bits and lowering the risk of malicious inference during public information exchange, achieving breakthroughs in key generation security. Li et al. [] further refined channel preprocessing techniques by using PCA to eliminate the autocorrelation of channel samples, which not only improved the key generation rate and consistency but also ensured that the generated keys possessed high randomness and security, significantly optimizing subsequent information reconciliation and privacy amplification processes.
2.3. Introduction and Current State of Research on Negative Databases
A negative database is a data representation scheme inspired by artificial immune systems; it is distinct from a traditional positive database in that it stores the complement of the information in the positive database. Given that negative databases are fully equivalent to a SAT (Boolean satisfiability) problem [,], and since SAT has been proven to be NP-hard [,], retrieving the original data from a negative database is exceedingly difficult.
Existing algorithms for generating NDBs primarily include the clause distribution control algorithm [], the 1-hidden algorithm [], the 2-hidden algorithm [], the q-hidden algorithm [], the hybrid algorithm [], the p-hidden algorithm [], and the K-hidden algorithm [].
Negative databases typically consist of binary strings. Suppose the positive database DB is composed of n-bit binary strings, U is the universe of n-bit binary strings, and U-DB is the complement of DB. The NDB stores a compressed form of the contents in U-DB. An example of a simple NDB is shown in Table 1, where * represents a wildcard that can denote either 0 or 1.
Table 1.
Schematic diagram of an NDB.
Existing NDB generation algorithms are primarily based on binary data. This paper proposes a real-valued NDB generation method for synchronous key generation, moving beyond conventional binary NDB generation. In NDPCS-KS, generating communication keys involves one-way hash functions and a dual negotiation based on the NDB. Given the unidirectional nature and collision resistance of one-way hash functions and the NP-hardness of reversing the NDB [,], the essential information for communication key agreement is well protected. Even if an attacker obtains an NDB from a session (data transmission cycle), deducing the initial key seeds that generated the NDB is computationally infeasible, ensuring data security. Meanwhile, it is hard for attackers to generate legitimate communication keys, since the second negotiation requires both the NDB and the negotiation key, neither of which the attackers possess.
3. System Model
Figure 1 illustrates the overall framework of NDPCS-KS. First, the two communication nodes generate an initial key through the initial negotiation process. Specifically, both parties probe their CSI via the main channel. After quantization, the data are divided into groups, and the grouped quantized information is encoded using Hamming codes and then hashed. The hashed Hamming-encoded information is exchanged over the main channel, and both parties locally compare the results. If discrepancies are detected, indicating an error, error correction is performed by flipping the erroneous bits based on the parity check matrix calculated from the Hamming codes. Through this process, all groups undergo verification and error correction, ensuring that both parties ultimately generate the same initial key seed.
Figure 1.
System model.
Next, using the initial key seed, an initial hash state is generated. This state is iteratively updated to generate the baseline point elements of the NDB and the negotiation key. The negotiation key contains a series of negotiation values. For each baseline point, a circle with a fixed radius R is drawn, forming the NDB.
Finally, the communication key is generated through the second negotiation process. First, the density of the negotiation values around each baseline point is calculated, and based on this density, the radius R is dynamically increased or decreased to obtain a new filtering radius, making the final filtering more random. After the radius has been adjusted, each negotiation value in the negotiation key is checked against the NDB. Specifically, if a negotiation value falls within the circle defined by the adjusted radius around a baseline point, the corresponding key bit is set to 1; if it does not fall within any circle, the corresponding key bit is set to 0. This process yields a communication key with the same length as the negotiation key.
Other important definitions are shown in Table 2.
Table 2.
Basic definitions of the system.
4. Key Technologies and Algorithms
4.1. Network Initialization
In the initialization phase of the sensor network, each node (sensor) sends pilot signals (heartbeat packets) to its directly connected neighboring nodes along the network topology. Then, these nodes broadcast response signals along the same topological paths, allowing each node in the network to obtain the initial CSI, specifically the RSSI (Received Signal Strength Indicator), of its directly connected neighboring nodes. RSSI is chosen as the CSI value due to its ease of measurement and robustness in dynamic environments. Although RSSI may have lower entropy compared to other CSI values such as phase information, it is stable and can effectively reflect signal strength variations caused by network topology changes. Moreover, due to the isolation characteristics of the physical channels, it is difficult for attackers to detect the RSSI values of the main channels through their eavesdropping channels.
Based on the RSSI value, the sensors can generate corresponding initial key seeds through initial negotiation with neighboring nodes. The initial negotiation is a process in which both parties negotiate initial key seeds using RSSI detected by each other. The two communicating parties typically synchronize the initial key seeds through the following four steps: (1) Measurement and Quantization of RSSI; (2) Grouping, Hamming Encoding and Hashing; (3) Error Detection and Correction; and (4) Generation of the Initial Key Seeds.
- Measurement and Quantization of CSI Samples (RSSI): A pair of sensors measures their CSI samples contained in the received pilot signals as follows:where and represent the ith CSI sample measured by sensors a and b, respectively. represents noise, and N is the number of samples. Due to the reciprocity of the channel, sensors a and b can detect highly correlated CSI measurements, which exhibit similar fluctuation patterns. These patterns are primarily determined by the distance and interference factors in the physical channel connecting a and b. Suppose an attacker, Eve, is beyond the coherence distance from either a or b. She will observe entirely different fluctuation patterns [,], preventing her from eavesdropping on similar CSI and thus synchronizing the same initial key seed.The measured CSI samples are then coded into binary sequences as follows:where and represent the ith quantized CSI value of sensors a and sensor b, respectively. if , otherwise .
- Grouping, Hamming Encoding, and Hashing: The quantized CSI sequences are divided into multiple groups as follows:where and represent the kth group of quantized CSI values of sensor a and sensor b, respectively. k is the group index, and m is the number of samples per group. Each group of data is then encoded using the Hamming code (7,4) as follows:The Hamming code (7,4) is chosen for its single-bit error correction, making it suitable for mild noise in sensor networks. It provides high efficiency with low complexity, which is ideal for resource-constrained devices. In the network environment considered, where channel variation is minimal, Hamming ensures reliable communication without added overhead. In more complex environments, stronger error correction codes like Reed–Solomon or LDPC may be needed for higher noise and error levels. Then, the encoded groups are hashed as follows:where and represent the hash values of the kth encoded group of sensors a and b, respectively.
- Error Detection and Correction: When sensors a and b exchange hash values to detect inconsistencies between groups, if , it indicates that group k contains errors. In such cases, Hamming decoding and correction are applied. Specifically, the syndrome is calculated usingwhere is the parity-check matrix of the Hamming code. The matrix is used to determine the positions of errors in the encoded data by comparing it with the transmitted codeword. If the syndrome is non-zero, it indicates the presence of an error. The position of the error j is located, and the error is corrected by flipping the bit:Equation (7) ensures that inconsistencies between the hash values are detected and corrected appropriately.
- Generation of the Initial Key Seeds: After correcting all errors, both sensors concatenate the corrected CSI sequences and hash the result to generate shared initial key seeds. The concatenation of all corrected groups is expressed aswhere represents the corrected sequence of group k, and K represents the total number of the groups. The shared initial key seed is then generated by hashing the concatenated result:
Through the above four steps, sensors a and b can synchronously generate the same initial key seeds, while ensuring that no confidential information is leaked to attackers eavesdropping on the wireless channels.
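To make the four negotiation steps concrete, the following Python sketch illustrates one plausible realization on a single node. The function names, the mean-threshold quantization rule, the four-data-bit group size, and the sample RSSI values are illustrative assumptions rather than the exact formulas of Equations (1)–(9); the Hamming (7,4) syndrome correction exchanged between the two sensors is only indicated in the comments.

```python
import hashlib

def quantize_rssi(samples):
    """Quantize CSI (RSSI) samples into bits: 1 if above the mean, else 0.
    The mean threshold is an illustrative assumption."""
    mean = sum(samples) / len(samples)
    return [1 if s > mean else 0 for s in samples]

def hamming74_encode(data_bits):
    """Encode 4 data bits into a 7-bit Hamming (7,4) codeword [p1, p2, d1, p3, d2, d3, d4]."""
    d1, d2, d3, d4 = data_bits
    p1 = d1 ^ d2 ^ d4
    p2 = d1 ^ d3 ^ d4
    p3 = d2 ^ d3 ^ d4
    return [p1, p2, d1, p3, d2, d3, d4]

def group_and_hash(bits, group_size=4):
    """Split the quantized bits into groups, Hamming-encode each group, and hash it."""
    groups, hashes = [], []
    for k in range(0, len(bits) - len(bits) % group_size, group_size):
        group = bits[k:k + group_size]
        codeword = hamming74_encode(group)
        groups.append(group)
        hashes.append(hashlib.sha256(bytes(codeword)).hexdigest())
    return groups, hashes

def initial_key_seed(corrected_groups):
    """Concatenate the corrected groups and hash the result to obtain the shared seed."""
    concatenated = ''.join(str(b) for g in corrected_groups for b in g)
    return hashlib.sha256(concatenated.encode()).hexdigest()

# Sensor a quantizes its RSSI measurements and hashes each encoded group.
rssi_a = [-41.2, -39.8, -40.5, -38.9, -42.1, -40.0, -39.5, -41.7]   # placeholder values
groups_a, hashes_a = group_and_hash(quantize_rssi(rssi_a))
# Sensor b computes its own group hashes the same way; any mismatching group is
# corrected via the Hamming (7,4) syndrome before both sides derive the same seed.
seed_a = initial_key_seed(groups_a)
print(seed_a)
```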
Then, each sensor will store all generated initial key seeds in a dictionary-type data structure called . For example, for a sensor a, its is as follows:
Equation (10) indicates that sensor b is the parent of sensor a, and sensors d and f are the children of sensor a (that is, the child node is on the left side of → and the parent node is on the right side of →). This identification reflects the relationship between connected nodes. After the initial negotiation, each sensor establishes its local , which is then used for subsequent key synchronization.
4.2. Negative Databases and Negotiation Key Generation
In order to build the same NDB for key synchronization, a pair of communicating nodes first retrieves the shared initial key seed from their local dictionaries. Then, the two-dimensional baseline points of the NDB are calculated. The NDB consists of a set of filters, each comprising a baseline point and a filtering radius. It should be noted that although the calculation can easily be extended to high-dimensional spaces, the NDB baseline points are generated in a two-dimensional space to reduce the computational cost for WSNs.
4.2.1. Negative Database Generation
Assume that there are two communication nodes, a and b, with a being the child node of b. The baseline points of the NDB are obtained through the following four steps.
- Initialization and Generating the Initial Hash Seed: Initially, a and b retrieve in their local , respectively. Then, is calculated based on :
- Setting the Initial Hash State: Set the initial hash state to :
- Generating as follows:in which the current hash state and are employed to generate the next hash state . Here, concat(x, y) concatenates two inputs, x and y, by appending the second input y to the first input x.
- Generating the Coordinate Pairs for : the coordinates and are generated as follows:here, the function rounds the number x to n decimal places. function converts a string x representing a hexadecimal number (base 16) into a decimal integer (base 10). function generates the hexadecimal representation of the hash value H.
Through the aforementioned steps, is generated. Then, with a constant radius R, circles are drawn centered at each element in the list, forming a two-dimensional filter library used as the NDB. Each with a filter radius is referred to as a filter. Figure 2 is an example of an NDB, illustrating only ten filters for clarity.
Figure 2.
Schematic diagram of the NDB.
Then, let us analyze the range of values of the generated coordinates.
- Hexadecimal substring and decimal conversion:
- “SHA256(⋅)” returns a 64-character hexadecimal string.
- “[:8]” extracts the first eight characters, whose hexadecimal value ranges from “00000000” to “FFFFFFFF”.
- Converting this hexadecimal string to a decimal integer yields values ranging from 0 to 16^8 − 1 (i.e., 4,294,967,295).
- Coordinate extraction:
- The first eight characters are used to calculate the first coordinate.
- The next eight characters are used to calculate the second coordinate.
- Since SHA256 generates a 64-character hexadecimal hash value and each character represents 4 bits, the hash totals 256 bits (32 bytes). Extracting any eight characters (32 bits) is sufficient to represent any value between 0 and 2^32 − 1 (i.e., 4,294,967,295), fully covering the required range.
- Division by 16^8:
- The resulting ratio ranges from 0 to 1, because the numerator’s maximum value is 16^8 − 1 (i.e., 4,294,967,295) and the denominator is 16^8.
- Result after multiplying by 10:
- Multiplying the above ratio by 10 scales the range to 0 to 10.
- Final rounding to five decimal places:
- The “Round(…, 5)” function rounds the value to five decimal places. Therefore, the final coordinate values range from 0.00000 to 9.99999.
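The range analysis above can be checked with a short Python sketch. The helper below maps a SHA256 digest to a coordinate pair in [0, 10) exactly as described (first and second eight-hex-character blocks, division by 16^8, multiplication by 10, rounding to five decimal places); the chained-rehash rule used to advance the hash state is an illustrative stand-in for the update of Equations (13) and (14).

```python
import hashlib

SCALE = 16 ** 8   # 4,294,967,296 possible values per eight-hex-character block

def hash_to_point(hex_digest):
    """Map a 64-character SHA256 hex digest to a coordinate pair in the stated range."""
    x = round(int(hex_digest[:8], 16) / SCALE * 10, 5)
    y = round(int(hex_digest[8:16], 16) / SCALE * 10, 5)
    return (x, y)

def generate_points(seed, count):
    """Iterate the hash state to produce `count` two-dimensional points.
    Rehashing the previous state is an assumption standing in for the
    paper's hash-state update."""
    state = hashlib.sha256(seed.encode()).hexdigest()
    points = []
    for _ in range(count):
        state = hashlib.sha256(state.encode()).hexdigest()
        points.append(hash_to_point(state))
    return points

# 25 baseline points for the NDB; the 128 negotiation values are produced the same way.
ndb_base = generate_points("example-initial-key-seed", 25)
print(ndb_base[:3])
print(all(0.0 <= c <= 10.0 for point in ndb_base for c in point))  # coordinates stay in range
```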
4.2.2. Negotiation Key Generation
Next, the two-dimensional negotiation values are generated in a manner similar to the baseline point elements. For the jth negotiation value, the current hash state is first updated, and the new hash value is computed as
where generates a byte with a value of . Subsequently, the is generated as follows:
Figure 3 is a schematic diagram showing the distribution of the generated NDB and negotiation key in a two-dimensional plane. The numbers in the yellow circles represent the indices of the .
Figure 3.
Schematic diagram showing the distribution of the generated NDB and negotiation key in a two-dimensional plane.
4.3. Second Negotiation for Communication Key Generation
After generating the NDB and the negotiation key, a second negotiation is conducted, using the NDB to filter the negotiation values generated by Equation (16). Considering that the distribution of negotiation values around each baseline point varies, some baseline points may have an excessive concentration of negotiation values around them, while others have sparser surroundings. With a fixed filtering radius, this uneven distribution may cause the resulting communication keys to lack sufficient randomness. Therefore, dynamic radius adjustment is introduced to fine-tune the filtering radius based on the density of negotiation values around each baseline point.
First, the density of negotiation values around each baseline point in the NDB is calculated, and the filtering radius is dynamically adjusted based on this density. For each baseline point, the number of negotiation values within a given initial radius R is counted, and this number is divided by the area of the circular region to obtain the density value. The radius R is an empirical value that needs to be adjusted according to actual conditions to ensure the randomness of the filtering process. The density calculation is as follows:
After obtaining the density at each baseline point, the initial radius R is dynamically adjusted to a new filtering radius. Specifically, if the density at a baseline point exceeds a preset high threshold, the filtering radius at that point is reduced to minimize the impact of overly concentrated negotiation values. Conversely, if the density is below a lower threshold, the filtering radius is increased to capture more potentially relevant negotiation values. By decreasing the radius for high-density points and increasing it for low-density points, the filtering process becomes more random. Let the high threshold, the low threshold, and the adjustment value be given; then, the new filtering radius is calculated as follows:
Note that the initial filtering radius R and the parameters for dynamically adjusting it, namely the adjustment value and the high and low density thresholds, are pre-shared among all nodes in the sensor network. After the filtering radius has been adjusted, the negotiation values in the negotiation key are filtered to generate the communication key. An empty key sequence of the same length as the negotiation key is created. For each negotiation value, if its distance to the nearest baseline point is less than the corresponding filtering radius, it is covered (or filtered) by the NDB and the corresponding position j in the key sequence is set to 1; otherwise, it is set to 0. Let the coordinates of the jth negotiation value, the coordinates of its nearest baseline point, and the distance between them be given; the filtering process is then shown below:
Thus, by filtering the negotiation key through a second negotiation using the NDB, a 128-bit key sequence is obtained. This communication key can be used directly for encryption and decryption, and its length is dynamically adjustable. The schematic diagram of the second negotiation process for generating the communication key is shown in Figure 4. In the figure, purple dots represent unfiltered negotiation values, red dots represent baseline points, green dots represent filtered negotiation values, and red circles represent the dynamic radii corresponding to the baseline points. The radii vary across baseline points because of the dynamic radius adjustment based on the density of nearby negotiation values. The negotiation values covered by the red circles are filtered. The numbers in the yellow circles indicate the identification numbers of the baseline points, while the numbers in the squares represent the assignments in the filtered key sequence after filtering. For example, the communication key sequence filtered out in Figure 4 is 1111111100000000. Note that this example is for illustrative purposes, and actual communication key generation is random.
Figure 4.
Schematic diagram of communication key generated by dual negotiation of the NDB.
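The second negotiation (density estimation, dynamic radius adjustment, and filtering) can be sketched as follows in Python. The density formula (count within R divided by the circle area), the threshold comparison, and the filtering against the nearest baseline point follow the prose description of Equations (17)–(19); the initial radius R = 1.5 matches the experimental setting in Section 6, while the adjustment value, the two density thresholds, and the randomly generated points are placeholder assumptions.

```python
import math
import random

R = 1.5                      # initial filtering radius (value used in the experiments)
DELTA = 0.5                  # adjustment value (illustrative assumption)
TH_HIGH, TH_LOW = 2.0, 0.5   # density thresholds (illustrative assumptions)

def density(base_point, negotiation_values, radius=R):
    """Number of negotiation values within `radius` of a baseline point, per unit area."""
    bx, by = base_point
    hits = sum(1 for (x, y) in negotiation_values
               if math.hypot(x - bx, y - by) <= radius)
    return hits / (math.pi * radius ** 2)

def adjusted_radius(rho):
    """Shrink the radius in dense regions, enlarge it in sparse ones."""
    if rho > TH_HIGH:
        return R - DELTA
    if rho < TH_LOW:
        return R + DELTA
    return R

def filter_key(ndb_base, negotiation_values):
    """Bit j is 1 if the j-th negotiation value lies inside the dynamically
    adjusted circle of its nearest baseline point, otherwise 0."""
    radii = [adjusted_radius(density(bp, negotiation_values)) for bp in ndb_base]
    key_bits = []
    for (x, y) in negotiation_values:
        distances = [math.hypot(x - bx, y - by) for (bx, by) in ndb_base]
        nearest = min(range(len(ndb_base)), key=lambda i: distances[i])
        key_bits.append(1 if distances[nearest] < radii[nearest] else 0)
    return key_bits

# Placeholder points; in NDPCS-KS both lists are derived from the shared hash chain.
random.seed(42)
ndb_base = [(random.uniform(0, 10), random.uniform(0, 10)) for _ in range(25)]
negotiation_values = [(random.uniform(0, 10), random.uniform(0, 10)) for _ in range(128)]
print(''.join(map(str, filter_key(ndb_base, negotiation_values))))   # 128-bit key
```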
The following Algorithm 1 describes the steps of the key generation method based on the NDB.
Algorithm 1 begins by taking the initial key seed as input and computing its SHA256 hash value, which is set as the initial hash state (Lines 1 and 3). An empty set is initialized to store the baseline points. The algorithm iterates from 1 to 25, updating the hash state using Equation (13) and calculating and storing the baseline points using Equation (14) (Lines 4–9). Next, an empty set is initialized to store the negotiation values. The algorithm iterates from 1 to 128, updating the hash state using Equation (15) and calculating and storing the negotiation values using Equation (16) (Lines 10–14). For each baseline point, the algorithm calculates its density using Equation (17) and dynamically adjusts its radius using Equation (18) (Lines 15–18). An empty set is then initialized to store the communication key. For each negotiation value, the algorithm performs the filtering of Equation (19) to obtain the corresponding key bit. Finally, the filtered key is converted to a binary string, which is returned as the communication key.
| Algorithm 1 Key Generation Method based on Negative Database |
| 1: Input: |
| 2: Output: Returns a byte array representing the communication key, |
| 3: |
| 4: |
| 5: |
| 6: for to 25 do |
| 7: update the hash state to using Equation (13). |
| 8: NDB_base[i] = , calculated using Equation (14). |
| 9: end for |
| 10: |
| 11: for to 128 do |
| 12: update the hash state to using Equation (15) |
| 13: negotiation_KEY[i] = , calculated using Equation (16). |
| 14: end for |
| 15: for each base_point in NDB_base do |
| 16: calculate the density of base_point using Equation (17). |
| 17: dynamically adjust the radius of base_point using Equation (18). |
| 18: end for |
| 19: |
| 20: for each negotiation_value = in negotiation_KEY do |
| 21: filter to obtain filtered_key[j] using Equation (19). |
| 22: end for |
| 23: convert to a binary string |
| 24: return |
4.4. Identity Verification Mechanism Based on SHA256
Considering that an attacker may send forged information to a node in a sensor network or tamper with legitimate information from a node in a sensor network and send it to other nodes again, an identity verification mechanism based on SHA256 is designed to complete the identity verification of the encryption party.
Specifically, when sensor a is about to send an encrypted message to sensor b, sensor a needs to calculate an identity verification value as follows:
In Equation (20), the first input is the initial key seed, and the encrypted message is the encrypted observation value. The resulting hash verification value is appended to the encrypted observation value and sent to sensor b. Sensor b extracts the verification value and retrieves the stored initial key seeds from its local dictionary. Sensor b then recomputes the verification value from each stored seed and the encrypted message using Equation (20). If a recomputed value matches the received one, the corresponding seed is used to generate the communication key for decryption. Algorithm 2 details the identity verification mechanism based on SHA256.
| Algorithm 2 Identity Verification Mechanism Based on SHA256 |
| 1: Input: and encrypted message |
| 2: Output: communication key or false |
| 3: calculated using Equation (20) |
| 4: for each initial key seed in of parent node b do |
| 5: calculated using Equation (20) |
| 6: if = then |
| 7: calculate using Algorithm 1 |
| 8: return |
| 9: else |
| 10: return false |
| 11: end if |
| 12: end for |
Algorithm 2 takes the received verification value and the encrypted message as inputs and calculates the local verification value using Equation (20) (Lines 1 and 3). The parent node b then searches the initial key seeds in its local dictionary, calculating the identity verification value for each using Equation (20) and comparing it with the received value (Lines 4–5). If a seed satisfies the comparison, it is used as input for Algorithm 1 to calculate the communication key for decryption (Lines 6–8). If no such seed exists, the encrypted information from child node a is either forged or tampered with (Lines 9–10). Since the attacker does not know the initial key seed, according to Equation (20), the attacker must have either tampered with the encrypted information or forged the verification value.
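A minimal sketch of the SHA256-based check in Algorithm 2 is shown below. It assumes the verification value of Equation (20) is the SHA256 hash of the initial key seed concatenated with the ciphertext; the exact concatenation order and the dictionary layout are assumptions.

```python
import hashlib

def verification_value(seed, ciphertext):
    """Hedged form of Equation (20): hash of the seed concatenated with the ciphertext."""
    return hashlib.sha256(seed.encode() + ciphertext).hexdigest()

def verify_sender(received_value, ciphertext, seed_dict):
    """Search the locally stored initial key seeds for one that reproduces the
    received verification value (Algorithm 2). Returns the matching seed, or
    None if the message is forged or tampered with."""
    for child, seed in seed_dict.items():
        if verification_value(seed, ciphertext) == received_value:
            return seed
    return None

# Sensor a sends its ciphertext plus verification value to parent node b.
seed_dict_b = {"a": "shared-initial-key-seed-ab", "d": "shared-initial-key-seed-db"}
ciphertext = b"\x12\x34\x56"                       # placeholder encrypted observation
h_a = verification_value(seed_dict_b["a"], ciphertext)
print(verify_sender(h_a, ciphertext, seed_dict_b) is not None)         # True: accepted
print(verify_sender(h_a, ciphertext + b"\x00", seed_dict_b) is None)   # True: tampering detected
```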
4.5. Lightweight Anti-Replay Mechanism Based on Timestamps and Rolling Hash Tokens
Since WSNs broadcast data packets, encrypting observation data alone makes them susceptible to replay attacks. In critical applications, such as temperature monitoring in chemical plants, such attacks can have severe consequences. This section introduces a lightweight scheme based on timestamps and rolling hash tokens to achieve anti-replay with lower resource consumption.
Specifically, each node maintains a library of rolling hash tokens called , recording the most recent n legitimate tokens. When sensor a sends an encrypted message to sensor b, sensor a attaches a timestamp that includes the generation time and the token expiration time , which is denoted as follows:
Here, the token expiration time is the generation time plus a fixed validity period. The timestamp is encrypted with the communication key to produce the encrypted timestamp, which is then used as input to SHA256 to output a token as follows:
The child node a sends the message group [encrypted message, identity verification value, encrypted timestamp, token] to the parent node b. In the proposed anti-replay mechanism, replay attacks are detected under two circumstances. The first occurs when the token expiration time has already passed the current system time of the receiving node. In this case, it is considered a replay attack, as it is highly likely that the attacker is replaying a previously intercepted message at a later time. The second occurs when the expiration time has not yet passed the current system time of the receiving node, but the token appears in the token library maintained by the receiving node. This is also considered a replay attack, because after intercepting the message, the attacker attempts to quickly replay it to pass the expiration time check. However, since the library stores the most recent valid tokens, the attacker’s attempt to replay the message quickly is likely to collide with the tokens in the receiving node’s library, as the valid tokens have not yet been updated during this period.
Algorithm 3 describes the process of a lightweight anti-replay mechanism based on timestamps and rolling hash tokens.
| Algorithm 3 Lightweight Anti-Replay Mechanism Based on Timestamps and Rolling Hash Tokens |
| 1: Input: , , |
| 2: Output: replay attack detected or not |
| 3: using as input to Algorithm 2 to generate |
| 4: ← using |
| 5: ← |
| 6: if current system time then |
| 7: return replay attack detected |
| 8: else |
| 9: return true |
| 10: end if |
| 11: for each token in of sensor b do |
| 12: if token = then |
| 13: return replay attack detected |
| 14: break |
| 15: else |
| 16: push onto |
| 17: if size exceeds limit n then |
| 18: pop the oldest token from |
| 19: end if |
| 20: end if |
| 21:end for |
In Algorithm 3, sensor b first extracts the identity verification value from the message group [encrypted message, identity verification value, encrypted timestamp, token]. The extracted value is used as input for Algorithm 2 to perform identity verification. If the identity is successfully verified, the communication key is generated and used to decrypt the encrypted timestamp. The expiration time is extracted from the timestamp and compared with the current system time of sensor b. If the expiration time has already passed the current system time, the token is considered expired (Lines 3–10). If the token has not expired, the next validation step is performed: the token is extracted from the message group and compared with each token in the library maintained by sensor b. If a matching token is found, a replay attack is indicated (Lines 11–14). Conversely, if no matching token is found, the message is considered legitimate, and the token is stored in the library. If the library is full, the oldest token is removed to make space for the new one (Lines 15–21). By adjusting the size of the token library and the validity period of the tokens, a lightweight and efficient anti-replay mechanism can be implemented.
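The timestamp-and-token check of Algorithm 3 can be sketched as follows. The token is the SHA256 hash of the (encrypted) timestamp, as in Equation (22), and the library is modeled as a bounded deque holding the most recent n tokens; encrypting the timestamp with the communication key is omitted here, so hashing the plaintext timestamp stands in for hashing its ciphertext.

```python
import hashlib
import time
from collections import deque

TOKEN_LIBRARY_SIZE = 10                      # most recent legitimate tokens kept per node
token_library = deque(maxlen=TOKEN_LIBRARY_SIZE)

def make_token(timestamp_bytes):
    """Equation (22): the token is the SHA256 hash of the (encrypted) timestamp."""
    return hashlib.sha256(timestamp_bytes).hexdigest()

def check_replay(token, t_expire, library):
    """Return True if a replay attack is detected (Algorithm 3)."""
    if t_expire < time.time():
        return True                          # expired timestamp: replayed old message
    if token in library:
        return True                          # token already seen: quick replay
    library.append(token)                    # legitimate: roll the token into the library
    return False

# Sensor a stamps a message that is valid for 60 seconds.
t_gen = time.time()
t_expire = t_gen + 60
token = make_token(f"{t_gen}|{t_expire}".encode())
print(check_replay(token, t_expire, token_library))   # False: first delivery accepted
print(check_replay(token, t_expire, token_library))   # True: immediate replay detected
```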
4.6. Data Transmission Process Between Nodes
Figure 5 illustrates an example sensor network topology and the steps of data transmission between sensors. In each session, each sensor transmits encrypted data to its parent node. Assume represents the start times of each session. At time , sensor a first derives the initial key seed from the CSI value between sensor a and its parent node b during the initial negotiation. Then, the second negotiation is used to obtain the communication key . After generating , sensor a uses to encrypt its observation , resulting in the encrypted observation . Subsequently, the identity verification value is obtained using Equation (20), and the is generated using Equation (21). The is then encrypted, resulting in . Next, the token is obtained using Equation (22). Finally, the above information is assembled into a message group ] and sent to sensor b.
Figure 5.
Data transmission flow.
Upon receiving the message group from sensor a, parent node b first extracts the identity verification value , the encrypted timestamp , and the token . Then, sensor b executes identity verification using Algorithm 2 with the extracted and then applies the lightweight anti-replay mechanism using Algorithm 3 with the extracted and . After successful verification and anti-replay checks, sensor b synchronizes the same communication key through the second negotiation using based on the CSI value at time . Finally, sensor b uses the synchronized key to decrypt the observation data by performing , accessing the observation data from child node a at time .
Subsequently, at time , parent node b integrates its observation data with the data from child node a into a data series . Node b then uses to generate the communication key for encryption. The encrypted data series , along with the identity verification value , the encrypted timestamp , and the token , are assembled into a message group . is then sent to the grandparent node c.
Upon receiving , node c performs identity verification using Algorithm 2 and applies the lightweight anti-replay mechanism using Algorithm 3. After successful verification and anti-replay checks, node c synchronizes the communication key using at time . Finally, node c decrypts the data series with , accessing the observation data .
Repeat the above steps until all child nodes have safely transmitted their observation values to the final root node. After each key synchronization and successful encryption-decryption communication, both communicating parties will update their locally maintained using SHA256 to update the initial key seed used in the current session. Specifically, considering sensors a and b as examples, sensor a uses at time to generate the communication key, encrypts the message, and transmits it to sensor b. Upon receiving the encrypted message, sensor b uses to generate the same communication key and successfully decrypts the message. Subsequently, both sensors a and b will update in their locally maintained as follows:
where is the key seed that sensors a and b will use in the next session start time . This ensures that each key seed is different in different sessions, generating a unique key for different sessions, achieving a “one-time pad”, and also ensuring that the keys generated by both communicating parties are synchronized.
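A minimal sketch of the per-session seed update is given below, assuming the next seed is simply the SHA256 hash of the current one; the omitted equation may additionally mix in session information.

```python
import hashlib

def update_seed(current_seed):
    """Both parties apply the same one-way update after a successful session,
    so the next session starts from a fresh, synchronized key seed."""
    return hashlib.sha256(current_seed.encode()).hexdigest()

seed_t1 = "shared-initial-key-seed-ab"
seed_t2 = update_seed(seed_t1)   # seed used at the next session start time
```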
It is important to note that the method proposed in this paper is suited for relatively static wireless sensor network environments and does not account for significant node mobility. In cases where nodes disconnect, since each node has already acquired and stored the CSI with its directly connected nodes during the initialization phase, the disconnection of certain leaf nodes will not affect the communication of the entire network. The security of communication relies on the CSI of the directly connected nodes rather than the global network topology. This ensures that even if some nodes leave, the remaining nodes can still communicate normally. However, if there are significant changes in the network topology, such as large-scale node disconnections or mobility that leads to substantial structural changes, the network will need to be re-initialized. This will involve re-acquiring the CSI and re-synchronizing the keys to ensure the security and reliability of communication.
5. Security Analysis
5.1. Key Space Size
The dual negotiation process generates the communication key, including the initial negotiation and the second negotiation through the filtering process of an NDB (Algorithm 1). For example, suppose the length of the final communication key is 128 bits, with each bit having a 50% probability of being 0 or 1. In that case, the key space size is 2^128, a considerable number that is sufficient to withstand brute-force attacks with current computing resources, especially since the key acts as a “one-time pad” and changes in every session. According to the randomness analysis in Section 6.4, the distribution probability of each key bit approaches 50% for both 0 and 1. Therefore, the key space size of NDPCS-KS approaches 2^128, which remains computationally infeasible for brute-force attacks.
5.2. Key Security
In WSNs, attackers cannot obtain the CSI between the communicating parties due to the isolation of the physical channels. They thus cannot calculate the initial key seeds. It is computationally infeasible for attackers to derive a legitimate communication key without the initial key seeds, as demonstrated below:
- Let S be the initial key seed, and let H(·) denote a hash function (specifically SHA256).
- Let B be the set of n baseline points generated from S.
- Let P be the set of m negotiation value points generated from S.
- Assume the initial filtering radius R, the adjustment value, and the high and low density thresholds are given. All these parameters are pre-shared by all nodes in the sensor network.
The attacker aims to deduce P filtered by B to derive the communication key without knowing S. Without S, it is difficult for the attacker to calculate B or P within an acceptable time frame (assuming a typical WSN session cycle of 15 min, after which a new session and key update process begins), because the unidirectional nature of the hash function makes it computationally infeasible to deduce the input from its output. Since the attacker cannot effectively find two different inputs that lead to the same output, they also cannot find a different seed that generates the same sets B and P. Brute-force guessing of valid B and P is likewise computationally infeasible. B and P consist of two-dimensional coordinates within the range [0,10] × [0,10], rounded to five decimal places, so each coordinate has 10^6 possible values (from 0.00000 to 9.99999). Each baseline point therefore has 10^6 × 10^6 = 10^12 possible positions, and since B contains 25 such points, the potential space size of B is (10^12)^25 = 10^300. Each member of P also has 10^12 possible positions, and since P includes 128 such points, the potential space size of P is (10^12)^128 = 10^1536. The attacker would need to guess B and P simultaneously, leading to a total potential space size of 10^300 × 10^1536 = 10^1836, which is clearly computationally infeasible.
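With each coordinate rounded to five decimal places in [0, 10), the counting argument can be written out explicitly (assuming all positions are equally likely):

```latex
\[
\underbrace{10^{6} \times 10^{6}}_{\text{positions per point}} = 10^{12}, \qquad
|B| = \left(10^{12}\right)^{25} = 10^{300}, \qquad
|P| = \left(10^{12}\right)^{128} = 10^{1536},
\]
\[
|B| \times |P| = 10^{300} \times 10^{1536} = 10^{1836}.
\]
```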
Furthermore, due to the second negotiation through the NDB, since the attacker does not know the initial filtering radius R, the adjustment value, or the density thresholds, it is very difficult to filter out a legitimate communication key.
In conclusion, an attacker cannot calculate and obtain the communication key without the initial key seed S within an acceptable time frame.
6. Experiments
The experiment arranged 10 ESP32 development boards with sensors into a tree topology, as shown in Figure 6. Every 30 s, the sensors generate a simulated observation value of 0 or 1. Following the procedure described in Section 4.6, after the key is generated, the observation values are encrypted using SIMON [], assembled into the corresponding message group, and sent to the parent node via the ESP-NOW protocol. Regarding the parameters of NDPCS-KS, the filtering radius R is set to 1.5 together with the corresponding dynamic adjustment value, and there are 25 baseline points and 128 negotiation values. The ESP32 development board used in the experiment is shown in Figure 7.
Figure 6.
Schematic diagram of sensor topology.
Figure 7.
Schematic diagram of ESP32 development board.
The experiments include the following tasks:
- Simulated replay attacks and calculated detection accuracy.
- Simulated attacks involving tampering and forging transmission packets and calculated detection accuracy.
- Analyzed the performance of NDPCS-KS, recorded the average key generation time, and compared its performance with the key generation method described in references [,,].
- The randomness of NDPCS-KS was validated through probabilistic statistics, Monte Carlo simulations, and entropy statistics experiments. Additionally, the NIST Statistical Test Suite [] was used to test the randomness of the keys generated by NDPCS-KS.
6.1. Replay Attack Detection Accuracy
Select any two sensors, a and b, and let sensor a send 1000 data messages to sensor b with the attacker intercepting the sent data with a probability of 0.2 and replaying it after a random delay. The expiration time in each data timestamp is set to the current system time +60s at the time of timestamp generation . Each sensor maintains a rolling hash token library storing the most recent ten legitimate tokens. When a replay attack is detected, a counter is incremented by 1, and the accuracy of replay attack detection can be calculated as follows:
The experiment was repeated ten times, resulting in the outcomes shown in Figure 8.
Figure 8.
Replay attack detection accuracy.
The results show that the detection accuracy is around 90%, with the highest reaching 92.2%, indicating effective performance.
More accurate anti-replay detection can be achieved by setting the size of the token library and the token expiration time appropriately. For instance, in a specific WSN environment, let the size of the token library maintained by each sensor be n, and let the communication frequency between sensors (representing the average number of token requests per unit time) be given. To maximize the detection rate of replay attacks, the following condition must be satisfied:
Equation (25) requires that the number of tokens expected to be received by the node during a token’s lifetime correspond to the size of the token library n. In this way, once a token expires, it is no longer in the library (as new tokens have replaced it), and any unexpired replayed token will be recognized because it is still in the library. Effective anti-replay can be achieved by customizing the expiration time and the library size n according to the communication frequency of different WSNs.
6.2. Tampering and Forging of Transmission Packets Detection Accuracy
The experiment uses the lightweight SIMON algorithm for encryption. After intercepting encrypted data packets sent by sensor a to sensor b, the attacker randomly tampers with part of the packet content, forming a tampered data packet. Moreover, the attacker uses randomly generated CSI to obtain his own key and encrypts forged observation values before sending them to sensor b. By counting the number of times the SHA256-based identity verification mechanism at sensor b returns false, it is possible to determine whether the data packets have been tampered with or forged and to calculate the corresponding detection accuracy. Figure 9 shows the detection accuracy for forged and tampered data.
Figure 9.
(a) Forgery detection accuracy. (b) Tamper detection accuracy.
Forgery and tampering were each conducted 1,000,000 times, and as shown in Figure 9, the detection accuracy for both forgery and tampering is 100%. This is because the SHA256 hash function is unidirectional and highly resistant to collisions: even if an attacker can access the hash verification value, he can neither deduce the initial key seeds or the encrypted message nor modify the encrypted message without detection. Furthermore, the verification value is generated from the ciphertext and a secret seed, so the correct verification value can only be produced when both are unaltered. Any modification to the ciphertext or the seed will result in a verification failure.
6.3. Performance Analysis
NDPCS-KS (generating a 128-bit key) and the methods from references [,,] were each executed 100 times on the sensors. The time cost of each execution was recorded, and the average execution time over the 100 key generations was calculated and plotted, as shown in Figure 10. As can be seen from Figure 10, the average execution time of NDPCS-KS is 0.21449 s, which is faster than the 0.47139 s of reference [], the 0.30205 s of reference [], and the 0.27638 s of reference [].
Figure 10.
Comparison of execution time. The execution time of NDPCS-KS is compared with that of Rangarajan et al. (2023) [], Moara-Nkwe et al. (2018) [], and Ji et al. (2022) [].
Due to experimental limitations, a simulation-based approach on a computer was adopted to compare the performance overhead of various methods under different network scales. Figure 11 presents the total time overhead for key generation and the average time overhead per node for key generation under different network scales for the methods referenced in [,,] as well as NDPCS-KS.
Figure 11.
(a) Total key generation time across different network scales. The execution time of NDPCS-KS is compared with the methods of Rangarajan et al. (2023) [], Moara-Nkwe et al. (2018) [], and Ji et al. (2022) []. (b) Average key generation time per node across different network scales. The comparison includes NDPCS-KS and the methods from Rangarajan et al. (2023) [], Moara-Nkwe et al. (2018) [], and Ji et al. (2022) [].
As shown in Figure 11, NDPCS-KS demonstrates better performance and lower computational complexity across different network scales. Specifically, its average key generation time per node is consistently lower than that of the methods in references [,,], indicating that NDPCS-KS reduces the computational burden on individual nodes. Furthermore, its total key generation time also shows a clear advantage across network scales, suggesting that NDPCS-KS scales more efficiently as the network size increases. This reduction in both per-node and total key generation time reflects lower computational complexity and, in turn, lower energy consumption, making NDPCS-KS more suitable for resource-constrained environments such as sensor networks.
6.4. Randomness Analysis
6.4.1. Probabilistic Statistics
A secure key generation method is expected to exhibit randomness and unpredictability, meaning the probability of each bit of the generated key being 0 or 1 should approach 50%. The experiment uses a common pseudorandom number generator to simulate the generation of CSI and obtain the initial key seeds. Subsequently, key generation is repeated 10,000 times, and the number of times $N_i$ that key bit $i$ takes the value 1 is recorded, where $i$ is a positive integer representing the key bit positions from 1 to 128. The probability of generating a "1" for key bit $i$ can then be calculated as
$$P_i = \frac{N_i}{10{,}000}.$$
The experimental results are shown in Figure 12. As can be seen from the figure, the probability of generating a 1 for each key bit approaches 50%. This indicates that the keys generated by NDPCS-KS are random: the probability of each bit being 0 or 1 approaches 50%, making the keys difficult to predict (a short script reproducing this statistic is given after Figure 12).
Figure 12.
Key distribution chart.
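The statistic above can be reproduced with a short script. In the sketch below, the keys are drawn from a placeholder pseudorandom generator standing in for NDPCS-KS output, and the per-bit probability follows $P_i = N_i / 10{,}000$.

```python
import numpy as np

rng = np.random.default_rng(42)
TRIALS, KEY_BITS = 10_000, 128

# Placeholder: random 128-bit keys standing in for NDPCS-KS output.
keys = rng.integers(0, 2, size=(TRIALS, KEY_BITS), dtype=np.uint8)

ones_count = keys.sum(axis=0)      # N_i: how often key bit i equals 1
p_one = ones_count / TRIALS        # P_i = N_i / 10,000
print(f"min P_i = {p_one.min():.3f}, max P_i = {p_one.max():.3f}")
```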
6.4.2. Monte Carlo Simulations
Next, we analyze why NDPCS-KS ensures that the probability of each key bit being 0 or 1 approaches 50%. NDPCS-KS includes three steps:
- Random generation of center points: 25 center points are randomly generated within the continuous two-dimensional space [0, 10] × [0, 10].
- Random generation of a test point: A point is randomly generated within the same two-dimensional interval, and it is determined whether this point falls within the neighborhood of any center point.
- Consideration of overlapping neighborhoods: The neighborhood of one center point may overlap with those of other center points, which increases the complexity of calculating the probability that a single point falls within a neighborhood.
Due to the involvement of continuous two-dimensional random variables and overlapping areas, precise calculation using traditional probability methods is too complex and may lack a closed-form solution.
Therefore, the Monte Carlo method is utilized to estimate, by generating many random samples, the probability that a random point falls within the neighborhood of at least one center point. We calculate the probability that a random point, uniformly distributed in the range [0, 10] × [0, 10], falls within the neighborhood (of a given radius) of at least one of the 25 center points defined within the same range. The specific simulation steps are as follows, and a minimal simulation sketch is given after the list:
- Generate center points: Randomly generate 25 center points in the two-dimensional interval [0, 10] × [0, 10].
- Generate random points: Generate many random points within the same range.
- Calculate neighborhood hits: For each random point, check whether it falls within the neighborhood of any center point.
- Estimate probability: Calculate the proportion of random points that fall within at least one neighborhood.
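The following Python sketch follows these steps; the neighborhood radius, sample counts, and use of NumPy are illustrative assumptions, since the paper specifies the radius only symbolically. A radius of 0.94 is chosen here so that roughly half of the area is covered by the 25 neighborhoods.

```python
import numpy as np

rng = np.random.default_rng(0)

NUM_CENTERS = 25        # number of center points
RADIUS = 0.94           # assumed neighborhood radius (illustrative)
NUM_SAMPLES = 20_000    # random points per round
NUM_ROUNDS = 100        # Monte Carlo rounds


def one_round() -> float:
    """Estimate P(a random point lies within RADIUS of at least one center point)."""
    centers = rng.uniform(0.0, 10.0, size=(NUM_CENTERS, 2))
    points = rng.uniform(0.0, 10.0, size=(NUM_SAMPLES, 2))
    # Distance from every sample point to every center point.
    d = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=-1)
    hits = (d <= RADIUS).any(axis=1)
    return hits.mean()


estimates = [one_round() for _ in range(NUM_ROUNDS)]
print(f"mean estimate over {NUM_ROUNDS} rounds: {np.mean(estimates):.4f}")
```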
The probabilities obtained from 100 rounds of Monte Carlo simulations are calculated, and the results are shown in Figure 13.
Figure 13.
Monte Carlo simulation results.
According to Figure 13, most of the estimated probabilities are around 50%, which also reflects the randomness of NDPCS-KS.
6.4.3. Entropy Statistics
Additionally, the entropy of the keys can also reflect their randomness. Specifically, entropy is calculated from the frequency and distribution of 0s and 1s in the key. The entropy formula involves the probabilities of each element (here, 0 and 1), which are used to compute the average information content of the entire key. The entropy is calculated as
$$H(X) = -\sum_{i=1}^{n} p(x_i) \log_2 p(x_i),$$
where $p(x_i)$ is the probability of the random variable $X$ taking the value $x_i$, and $n$ is the total number of possible values of $X$.
The closer the results are to the theoretical maximum (1 for binary keys), the more nearly each bit of the key is equally likely to be 0 or 1, which ensures satisfactory randomness and security. The key generation process is repeated 100 times, and the results are shown in Figure 14: the entropy values of the keys generated by NDPCS-KS all approach 1, indicating that the keys have excellent randomness and a high degree of uncertainty. Such keys are difficult to predict, thereby enhancing their security (a short sketch of this entropy calculation is given after Figure 14).
Figure 14.
Entropy statistics of keys.
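The sketch below computes the binary Shannon entropy of a key in the way described above; the 128-bit key is simulated with Python's secrets module as a stand-in for a key generated by NDPCS-KS.

```python
import secrets
from collections import Counter
from math import log2


def binary_entropy(bits: str) -> float:
    """Shannon entropy (in bits per symbol) of a string of '0'/'1' characters."""
    counts = Counter(bits)
    total = len(bits)
    return -sum((c / total) * log2(c / total) for c in counts.values() if c > 0)


# Simulated 128-bit key standing in for an NDPCS-KS key.
key_bits = format(secrets.randbits(128), "0128b")
print(f"entropy = {binary_entropy(key_bits):.4f}  (theoretical maximum: 1)")
```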
6.4.4. NIST Statistical Test Suite
Additional results of key randomness testing are shown in Table 3, which reports the outcome of testing NDPCS-KS with the NIST Statistical Test Suite []. The NIST Statistical Test Suite, developed by the National Institute of Standards and Technology (NIST), is a collection of tests designed to evaluate the randomness and statistical properties of random number generators. It includes the frequency test, block frequency test, cumulative sums test, runs test, longest run test, matrix rank test, discrete Fourier transform (FFT) test, template matching test, universal statistical test, approximate entropy test, and linear complexity test. These tests help ensure that random number generators produce sequences with the expected statistical characteristics, making them essential for cryptography, secure communications, and other fields requiring high levels of randomness.
Table 3.
Results of NIST randomness tests for NDPCS-KS.
A sequence of 10,000,000 bits was generated for testing, with the number of bitstreams set to 10. The minimum pass rate for each statistical test, with the exception of the random excursion (variant) test, is approximately 8 for a sample size of 10 binary sequences. For the random excursion (variant) test, the minimum pass rate is approximately 3 for a sample size of 4 binary sequences. The criterion for passing a test is a p-value greater than 0.01. As shown in Table 3, the random sequences generated by the proposed algorithm passed all randomness tests, validating the robustness and reliability of the generated keys in terms of randomness and making them suitable for applications requiring high security.
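For readers who wish to spot-check key material locally, the frequency (monobit) test, the first test in SP 800-22, can be implemented in a few lines. The sketch below is a simplified stand-alone check using a placeholder random bit string, not the full NIST suite used in the paper.

```python
from math import erfc, sqrt
import secrets


def monobit_p_value(bits: str) -> float:
    """NIST SP 800-22 frequency (monobit) test on a string of '0'/'1' characters."""
    n = len(bits)
    s = sum(1 if b == "1" else -1 for b in bits)   # map 0 -> -1, 1 -> +1 and sum
    s_obs = abs(s) / sqrt(n)
    return erfc(s_obs / sqrt(2))


# Placeholder bit string standing in for NDPCS-KS key material;
# a sequence passes this test when the p-value is not below 0.01.
sample = format(secrets.randbits(100_000), "0100000b")
print(f"monobit p-value = {monobit_p_value(sample):.4f}")
```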
7. Conclusions
In this paper, we proposed NDPCS-KS, a key synchronization method based on NDBs and physical channel state characteristics in WSNs. As the number of network nodes increases, centralized CA/PKI authorities face challenges in distributing, updating, and managing keys and are vulnerable to attacks. To achieve key distribution without CA/PKI or other trusted third-party facilities, a novel key synchronization method based on negative databases and the physical channel state characteristics of WSNs is designed. The communicating parties obtain the initial key seeds through the initial negotiation. The negotiation keys and NDBs are constructed from the initial key seeds, and the NDBs are then used in the second negotiation to obtain the final communication keys. The key synchronization process relies on a one-way hash function, the isolation of the physical channels, and the NP-hard problem of reversing the NDB. It is computationally infeasible for attackers to deduce the key seeds that produced the NDBs from any eavesdropped data. Furthermore, the second negotiation process based on the NDBs prevents attackers from guessing the legitimate communication keys. Additionally, an identity verification mechanism and a lightweight anti-replay mechanism are introduced to prevent replay, forgery, and tampering attacks in WSNs.
The theoretical analysis demonstrated that attackers cannot obtain legitimate communication keys without the initial key seeds. Experimental results showed that NDPCS-KS effectively counters replay, tampering, and forgery attacks, while its key distribution is more efficient than that of other solutions. Finally, the randomness of the keys generated by NDPCS-KS was also verified.
Author Contributions
Conceptualization, H.P. and W.C.; methodology, H.P. and W.C.; software, H.P.; validation, H.P., W.C. and H.W.; formal analysis, H.P., W.C. and S.B.; investigation, H.P.; resources, W.C.; data curation, H.P.; writing—original draft preparation, H.P.; writing—review and editing, W.C., H.W. and S.B.; visualization, H.W.; supervision, H.W.; project administration, W.C.; funding acquisition, W.C. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the National Key Research and Development Program of China (2024QY1000, 020YFB1805405, 2019QY0800), and the Natural Science Foundation of China (62372313, 62002248).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
No new data were created or analyzed in this study.
Acknowledgments
The authors thank the editors and reviewers for their hard work and valuable advice.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Al-Fuqaha, A.; Guizani, M.; Mohammadi, M.; Aledhari, M.; Ayyash, M. Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications. IEEE Commun. Surv. Tutor. 2015, 17, 2347–2376. [Google Scholar] [CrossRef]
- Shafique, K.; Khawaja, B.A.; Sabir, F.; Qazi, S.; Mustaqim, M. Internet of Things (IoT) for Next-Generation Smart Systems: A Review of Current Challenges, Future Trends and Prospects for Emerging 5G-IoT Scenarios. IEEE Access 2020, 8, 23022–23040. [Google Scholar] [CrossRef]
- Lombardi, M.; Pascale, F.; Santaniello, D. Internet of Things: A General Overview Between Architectures, Protocols and Applications. Information 2021, 12, 87. [Google Scholar] [CrossRef]
- Abiodun, O.I.; Oluwaranti, A.; Misra, S.; Chamola, V. A Review on the Security of the Internet of Things: Challenges and Solutions. Wirel. Pers. Commun. 2021, 119, 2603–2637. [Google Scholar] [CrossRef]
- Lin, Y.; Xie, Z.; Chen, T.; Cheng, X.; Wen, H. Image Privacy Protection Scheme Based on High-Quality Reconstruction DCT Compression and Nonlinear Dynamics. Expert Syst. Appl. 2024, 257, 124891. [Google Scholar] [CrossRef]
- Shen, S.; Chen, X.; Duan, H.; Zhang, J.; Xiang, Y. Differential Game-Based Strategies for Preventing Malware Propagation in Wireless Sensor Networks. IEEE Trans. Inf. Forensics Secur. 2014, 9, 1962–1973. [Google Scholar] [CrossRef]
- Chen, L.; Ji, J.; Zhang, Z. Wireless Network Security; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
- Zhou, L.; Kang, M.; Chen, W. Lightweight Security Transmission in Wireless Sensor Networks Through Information Hiding and Data Flipping. Sensors 2022, 22, 823. [Google Scholar] [CrossRef]
- Dai, H.; Xu, H. Key Predistribution Approach in Wireless Sensor Networks Using LU Matrix. IEEE Sens. J. 2010, 10, 1399–1409. [Google Scholar] [CrossRef]
- Kuang, W. Improved Random Key Pre-Distribution Scheme for Wireless Sensor Networks. Chin. J. Sens. Actuators 2010, 10, 225–234. [Google Scholar]
- Yin, L.; Liu, Q.; Liang, W.; Shen, H.; Yang, Y. Secure Pairwise Key Establishment for Key Predistribution in Wireless Sensor Networks. In Proceedings of the 2012 International Conference on Computer Science and Service System, Nanjing, China, 22–24 June 2012; pp. 1018–1021. [Google Scholar]
- Renyi, S. Dynamic Nonlinear Key Distribution in Wireless Sensor Networks. J. Yanbian Univ. 2011. Available online: http://en.cnki.com.cn/Article_en/CJFDTOTAL-YBDZ201103022.htm (accessed on 2 August 2024).
- Chen, S.; Zhang, H.; Wang, Q. Dynamic Key Management Scheme in Wireless Sensor Networks. In High Performance Networking, Computing, and Communication Systems: Second International Conference ICHCC 2011, Singapore, 5–6 May 2011, Selected Papers; Communications in Computer and Information Science; Springer: Berlin/Heidelberg, Germany, 2011; Volume 163. [Google Scholar]
- Yousefpoor, M.S.; Barati, H. DSKMS: A Dynamic Smart Key Management System Based on Fuzzy Logic in Wireless Sensor Networks. Wirel. Netw. 2020, 26, 2515–2535. [Google Scholar] [CrossRef]
- Xiong, H.; Li, X.; Gao, S.; Zhang, J.; Zhao, H. Efficient Secret Key Generation Scheme of Physical Layer Security Communication in Ubiquitous Wireless Networks. IET Commun. 2021, 15, 1123–1132. [Google Scholar] [CrossRef]
- Altun, U.; Emekligil, M.; Ozger, M. Scalable Secret Key Generation for Wireless Sensor Networks. IEEE Syst. J. 2022, 16, 6031–6041. [Google Scholar] [CrossRef]
- Wei, X.; Saha, D. KNEW: Key Generation Using Neural Networks From Wireless Channels. In Proceedings of the 2022 ACM Workshop on Wireless Security and Machine Learning, New Orleans, LA, USA, 18 March 2022; pp. 1–6. [Google Scholar]
- Rangarajan, J.; Sharma, N.; Gope, P.; Kundu, S. Crypto Analysis With Modified Diffie–Hellman Key Exchange Based Sensor Node Security Improvement in Wireless Sensor Networks. In Proceedings of the 2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS), Coimbatore, India, 24–26 March 2023; pp. 1–5. [Google Scholar]
- Moara-Nkwe, K.; Masek, P.; Barcelo-Ordinas, J.M.; De Poorter, E. A Novel Physical Layer Secure Key Generation and Refreshment Scheme for Wireless Sensor Networks. IEEE Access 2018, 6, 11374–11387. [Google Scholar] [CrossRef]
- Ji, Z.; Huang, H.; Zhang, J.; Yang, F.; Wen, H. Physical-Layer-Based Secure Communications for Static and Low-Latency Industrial Internet of Things. IEEE Internet Things J. 2022, 9, 18392–18405. [Google Scholar] [CrossRef]
- Aldaghri, N.; Mahdavifar, H. Physical Layer Secret Key Generation in Static Environments. IEEE Trans. Inf. Forensics Secur. 2020, 15, 2692–2705. [Google Scholar] [CrossRef]
- Chen, Y.; Li, X.; Du, X.; Zhao, H. Physical Layer Key Generation Scheme for MIMO System Based on Feature Fusion Autoencoder. IEEE Internet Things J. 2023, 10, 14886–14895. [Google Scholar] [CrossRef]
- Chen, Y.; Li, X.; Du, X.; Zhao, H. Physical Layer Secret Key Generation Based on Bidirectional Convergence Feature Learning Convolutional Network. IEEE Internet Things J. 2023, 10, 14846–14855. [Google Scholar] [CrossRef]
- Wunder, G.; Nieman, K.; Dahmen, T. Mimicking DH Key Exchange Over a Full Duplex Wireless Channel via Bisparse Blind Deconvolution. In Proceedings of the 2023 6th International Conference on Advanced Communication Technologies and Networking (CommNet), Marrakech, Morocco, 16–18 May 2023; pp. 1–5. [Google Scholar]
- Cao, Z.; Lin, Z.; Zhou, X.; Niu, H. Eliminating Privacy Amplification in Secret Key Generation from Wireless Channels. In Proceedings of the 2015 10th International Conference on Communications and Networking in China (ChinaCom), Shanghai, China, 15–17 August 2015; pp. 1–5. [Google Scholar]
- Hua, Y. Generalized Channel Probing and Generalized Pre-processing for Secret Key Generation. IEEE Trans. Signal Process. 2023, 71, 1067–1082. [Google Scholar] [CrossRef]
- Li, G.; Xia, Y.; Chen, Y.; Zhang, Z. The Optimal Preprocessing Approach for Secret Key Generation from OFDM Channel Measurements. In Proceedings of the 2016 IEEE Globecom Workshops (GC Wkshps), Washington, DC, USA, 4–8 December 2016; pp. 1–6. [Google Scholar]
- Li, G.; Zhang, Z.; Liu, L.; Xia, Y.; Chen, Y. High-Agreement Uncorrelated Secret Key Generation Based on Principal Component Analysis Preprocessing. IEEE Trans. Commun. 2018, 66, 3022–3034. [Google Scholar] [CrossRef]
- Wang, X.; Cao, Z.; Li, G.; Zhou, X. Secret Key Extraction with Quantization Randomness Using Hadamard Matrix on QuaDRiGa Channel. In Information and Communications Security: 17th International Conference, ICICS 2015, Beijing, China, 9–11 December 2015, Revised Selected Papers; Zhou, J., Ed.; Springer: Cham, Switzerland, 2016; pp. 1–12. [Google Scholar]
- Horie, S.; Watanabe, O. Hard Instance Generation for SAT. In Proceedings of the 8th International Symposium on Algorithms and Computation (ISAAC ’97), Singapore, 17–19 December 1997; pp. 21–30. [Google Scholar]
- Hwang, M.-S.; Li, L.-H. A New Remote User Authentication Scheme Using Smart Cards. IEEE Trans. Consum. Electron. 2000, 46, 28–30. [Google Scholar] [CrossRef]
- Esponda, F.; Forrest, S.; Helman, P. Negative Representations of Information. Int. J. Inf. Secur. 2009, 8, 331–345. [Google Scholar] [CrossRef]
- Esponda, F.; Ackley, E.; Forrest, S.; Helman, P. Protecting Data Privacy Through Hard-to-Reverse Negative Databases. Int. J. Inf. Secur. 2007, 6, 403–415. [Google Scholar] [CrossRef]
- Barthel, W.; Hartmann, A.K.; Leone, M.; Ricci-Tersenghi, F.; Weigt, M.; Zecchina, R. Hiding Solutions in Random Satisfiability Problems: A Statistical Mechanics Approach. Phys. Rev. Lett. 2002, 88, 188701. [Google Scholar] [CrossRef]
- Achlioptas, D.; Jia, H.; Moore, C. Hiding Satisfying Assignments: Two Are Better Than One. J. Artif. Intell. Res. 2005, 24, 623–639. [Google Scholar] [CrossRef]
- Liu, R.; Luo, W.; Wang, X. A Hybrid of the Prefix Algorithm and the Q-Hidden Algorithm for Generating Single Negative Databases. In Proceedings of the 2011 IEEE Symposium on Computational Intelligence in Cyber Security (CICS), Paris, France, 11–15 April 2011; pp. 36–43. [Google Scholar]
- Liu, R.; Luo, W.; Yue, L. The P-Hidden Algorithm: Hiding Single Databases More Deeply. Immune Comput. 2014, 2, 43–55. [Google Scholar]
- Zhao, D.; Du, D.; Liang, Y.; Xu, D.; Ji, S. A Fine-Grained Algorithm for Generating Hard-to-Reverse Negative Databases. In Proceedings of the 2015 International Workshop on Artificial Immune Systems (AIS), Taormina, Italy, 21–22 October 2015; pp. 1–8. [Google Scholar]
- Zhang, J.; Mao, J.; Li, X.; Yao, W. Experimental Study on Channel Reciprocity in Wireless Key Generation. In Proceedings of the 2016 IEEE 17th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Edinburgh, UK, 3–6 July 2016; pp. 1–5. [Google Scholar]
- Klement, F.; Voss, P.; Koller, F.; Obermeier, S.; Parzinger, M.; Hollick, M. Keep Your Enemies Closer: On the Minimal Distance of Adversaries When Using Channel-Based Key Extraction in SISO 6G Systems. In Proceedings of the 2023 19th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob), Thessaloniki, Greece, 9–11 October 2023; pp. 445–451. [Google Scholar]
- Beaulieu, R.; Shors, D.; Smith, J.; Treatman-Clark, S.; Weeks, B.; Wingers, L. The SIMON and SPECK Lightweight Block Ciphers. In Proceedings of the 52nd Annual Design Automation Conference (DAC ’15), San Francisco, CA, USA, 8–12 June 2015; pp. 1–6. [Google Scholar]
- National Institute of Standards and Technology. NIST Statistical Test Suite. Available online: https://csrc.nist.gov/publications/detail/sp/800-22/rev-1a/final (accessed on 30 June 2024).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).