SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT

Liu, Bai; Zhang, Xiangyi; Shi, Runhua; Zhang, Mingwu; Zhang, Guoxing

doi:10.3390/math10122120

Open AccessArticle

SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT

by

Bai Liu

^1,*

,

Xiangyi Zhang

¹

,

Runhua Shi

¹,

Mingwu Zhang

^1,*

and

Guoxing Zhang

²

¹

The School of Computer Science, Hubei University of Technology, Wuhan 430068, China

²

School of Management, Lanzhou University, Lanzhou 730000, China

^*

Authors to whom correspondence should be addressed.

Mathematics 2022, 10(12), 2120; https://doi.org/10.3390/math10122120

Submission received: 3 May 2022 / Revised: 31 May 2022 / Accepted: 14 June 2022 / Published: 17 June 2022

(This article belongs to the Special Issue Computational and Mathematical Methods in Information Science and Engineering)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The rapid development of the Internet of Things (IoT), big data and artificial intelligence (AI) technology has brought extensive IoT services to entities. However, most IoT services carry the risk of leaking privacy. Privacy-preserving set intersection in IoT is used for a wide range of basic services, and its privacy protection issues have received widespread attention. The traditional candidate protocols to solve the privacy-preserving set intersection are classical encryption protocols based on computational difficulty. With the emergence of quantum computing, some advanced quantum algorithms may undermine the security and reliability of traditional protocols. Therefore, it is important to design more secure privacy-preserving set intersection protocols. In addition, identity information is also very important compared to data security. To this end, we propose a quantum privacy-preserving set intersection protocol for IoT scenarios, which has higher security and linear communication efficiency. This protocol can protect identity anonymity while protecting private data.

Keywords:

private set intersection; quantum authentication; oblivious quantum key distribution; Internet of Things

MSC:

81P94

1. Introduction

In the Internet of Things (IoT), many devices are connected to exchange data through the internet [1,2]. The core components of IoT are smart devices, the internet and connectivity, where IoT devices collect information about personal behavior. In recent years, the development of IoT has brought about many practical scenarios, such as the Internet of Medical Things (IoMT) [3], smart cities [4], and smart homes [5]. IoT services bring great convenience to human life.

As a basic service, privacy-preserving set intersection (PSI) in IoT is widely used in various practical environments. For example, in IoMT, hospitals cannot share electronic medical records while protecting patient privacy. Patients with similar symptoms also cannot exchange and share medical information. Therefore, there exists the phenomenon of information islands in IoMT. In this regard, personal health information (PHI) can be securely shared through profile matching [6] based on PSI. In a cloud environment, Abadi et al. [7] proposed an efficient delegated privacy set intersection scheme on outsourced private datasets. In addition, private graph intersection operation also plays an important role in social networks. Zuo et al. [8] proposed an efficient and privacy-preserving verifiable graph intersection scheme using cryptographic accumulators in social networks.

Because of its importance and wide applicability, many privacy-preserving set intersection (PSI) protocols have been proposed. In 2004, Friedman et al. [9] proposed the first PSI protocol, where a set can be used with homomorphic encryption to ensure secure computation. In 2019, Le et al. [10] proposed a PSI protocol based on secret sharing, which removes the trusted third party of the protocol [11]. Kolesnikov et al. [12] proposed a new PSI protocol, which improved the communication efficiency of the protocol [13] by 2.9–3.3 times. In 2020, Chase et al. [14] proposed a novel lightweight multi-point oblivious pseudorandom function protocol based on oblivious OT extension and utilized it to construct a PSI scheme. In 2021, Badrina Rayanan et al. [15] proposed an updated privacy set intersection protocol, which allows two parties that have constantly updated sets to calculate their privacy set intersections.

However, most existing PSI protocols are based on difficulty assumptions, which are vulnerable to attacks by quantum technology. As a consequence, classical PSI protocols may not have long-term security and the design of quantum-resistant PSI protocols becomes a research hot spot. In addition, quantum cryptography [16,17] has emerged, which can guarantee information-theoretic security.

In this article, we propose a general system model of privacy-preserving set intersection in IoT, which is aided with edge computation (ED). Then, we present a quantum protocol for a private-preserving set intersection with identity authentication. A novel quantum PSI in IoT is designed with the help of obvious quantum key distribution, quantum authentication and count Bloom filter.

Our contributions, in this paper, are summarized as follows:

We propose a general system model aided with ED of PSI, which is suitable for IoT applications.
we present a novel quantum updatable PSI protocol in IoT, which can be roughly divided into three phases: key generation, encryption and decryption.
We analyze security and communication efficiency of the protocol. The protocol has efficient communication efficiency, i.e., linear communication complexity $O (τ) (τ ≪ N)$ qubits, where N is the size of the universal set. The proposed protocol has higher security. The protocol also provides identity authentication to protect identity information and to maintain the integrity of the transmitted information.

The remainder of this article is organized as follows. In Section 2, we introduce the related works of a privacy-preserving set intersection in a quantum setting. Then, we describe our system model, security model and design goals in Section 3. In Section 4, we present our quantum PSI protocol, followed by security analysis and performance evaluation in Section 5. Then, we have some discussions in Section 6. Finally, we draw our conclusions in Section 7.

2. Related Works

2.1. Quantum PSI Protocol

In 2015, Shi et al. [18] first proposed a cheat-sensitive quantum PSI protocol using phase-encoded private query. Then, Cheng et al. [19] presented a new quantum PSI protocol, which is cryptanalysis and an improvement of the protocol [18]. Cheng’s protocol shows that the protocol [18] is not as efficient as claimed because the communication complexity should be

O (n l o g N)

instead of

O (n)

. Later, Maitra [20] presented a fair quantum PSI protocol based on a set membership decision protocol [21]. However, these protocols need complicated oracle operators and multi-particle entangled states. Subsequently, in order to enhance the realizability, Kumar [22] introduced a feasible quantum private set intersection protocol with single photons using the flexible oblivious quantum key distribution (OQKD) [23]. Based on the quantum PSI protocol [22], Debnath et al. [24] presented an efficient quantum PSI protocol, which reduced communication complexity. However, a multi feasible OQKD protocol [25] was broken by the protocol [26] using the man-in-the-middle attack. Therefore, the security of protocols [22,24] may not be guaranteed.

2.2. Oblivious Quantum Key Distribution

In 2011, Jakobi et al. [27] proposed a practical oblivious quantum key distribution (OQKD) protocol, which guaranteed better efficiency and feasibility of a private quantum query. The oblivious key can be distributed between two parties by using SARG04 QKD [28], where the sender knows the whole key while the receiver only knows a single or a few bits of the key. The main process of OQKD can be briefly described as follows:

The sender, i.e., Alice, generates a long quantum sequence including states

| ↑ 〉, | ↓ 〉, | \leftarrow 〉, | \to 〉

, where two quantum states carry a bit of classical information, e.g.,

{| ↓ 〉, | ↑ 〉}

represent the bit 0 and

{| \leftarrow 〉, | \to 〉}

denote the bit 1. Then, Alice sends the quantum sequence to the receiver. After receiving it, the receiver, i.e., Bob, measures each qubit randomly in ↔ basis or ↕ basis.

Then, Bob announces that he successfully measured the positions of the qubits and discards the missed or undetected qubits. For each qubit that Bob successfully measured, Alice announces a pair of verification qubits to verify the correctness of Bob’s measured results. Due to the uncertainty of measurements in quantum mechanics, Bob only obtains partial values that match a pair of qubits published by Alice. In other words, Bob can only obtain partially correct values of the key. In order to reduce Bob’s information on the raw key, two parties cut the raw key into multiple substrings of length N and added these strings bitwise to obtain the final key with length N.

Then, Gao et al. [23] proposed a variant OQKD protocol in which a variable angle

θ

was introduced in the protocol [27]. That is, they use four generalized states

{| 0 〉, | 1 〉, | 0^{'} 〉, | 1^{'} 〉}

, where

| 0^{'} 〉 = c o s θ | 0 〉 + s i n θ | 1 〉

and

| 1^{'} 〉 = c o s θ | 0 〉 - s i n θ | 1 〉

.

Later, Xiao et al. [29] integrated an identity authentication mechanism into the OQDK protocol [27] to present a new OQKD protocol that can implement mutual identity authentication to resist malicious adversary attacks. First, two parties register with a trusted third party (Certificate Authority, CA) to obtain their respective identity information, i.e., Alice’s identity string

I D_{C}

and Bob’s identity string

I D_{S}

. Then, Alice sends the qubits used as the original key (QOK) along with the qubits for authentication (QA) to CA. All qubits need to be forwarded by the CA to Bob, where QA are modified by the CA based on the identity strings of both parties. Both parties can authenticate with QA to obtain a key K that can be used for subsequent anonymous authentication. Another difference with the OQDK protocol [27] is that instead of directly disclosing the quantum bit pairs used to verify Bob’s measurement results, Alice encrypts them with the key K and sends them to Bob. The system model is shown in Figure 1.

2.3. Quantum Authentication

Quantum message authentication is an important research direction in quantum cryptography and is divided into two parts: authentication of classical information [30] and authentication of quantum information [31]. Curty et al. [30] proposed the first protocol for classical information by quantum entangled states. Subsequently, Xi et al. [32] proposed a quantum authentication scheme that required only single photons. This protocol assumes that two parties pre-share a classical key and a pair of quantum operators. Then, the sender converts a classical message into quantum bits and transmits these qubits to the receiver through the quantum channel. Finally, the receiver verifies the authenticity of the qubits.

The main process of the protocol [32] is as follows:

Suppose that Alice has a classical set

{m_{1}, m_{2}, \dots m_{n}}

, where

m_{i} \in {0, 1}

and

i \in {1, 2, \dots, n}

. Two parties, Alice and Bob, share a secret key

{s_{1}, s_{2}, \dots s_{n + 1}}

in advance, where

s_{i} \in {0, 1}

and

i \in {1, 2, \dots, n + 1}

. Then, two parties also pre-share two publicly quantum unitary operations,

U_{0}

and

U_{1}

, which should satisfy the following conditions:

$U_{0} | v 〉 〈 v | U_{0}^{+} + U_{1} | v 〉 〈 v | U_{1}^{+} \neq 0$ .
There is no a unitary operation $U_{e}$ to make $〈 v | U_{i}^{+} U_{e} U_{i} | v 〉 = 0$ , where $i \in {0, 1}$ .
$〈 v | U_{0}^{+} U_{1} | v 〉 \neq 0$ .

where

| v 〉

is an arbitrary qubit.

Two parties select two pairs of arbitrary quantum states, i.e.,

| φ_{0} 〉, | φ_{1} 〉,

and

| ψ_{0} 〉, | ψ_{1} 〉

, where

〈 φ_{0} | φ_{1} 〉 = 0

and

〈 ψ_{0} | ψ_{1} 〉 = 0

. As shown in Table 1 and Table 2, Alice generates a pair of quantum states

{| a_{i} 〉 | t_{i} 〉}

, where the first qubit represents the quantization of

m_{i}

and the second qubit implies the relevant label of

m_{i}

. Alice transforms classical information

{m_{1}, m_{2}, \dots m_{n}}

to obtain a quantum sequence

{| a_{1} 〉, | t_{1} 〉, | a_{2} 〉, | t_{2} 〉, \dots | a_{n} 〉, | t_{n} 〉}

by the method in Table 1 and Table 2, then sends the quantum sequence to Bob.

After receiving the quantum sequence, Bob selects suitable measurement bases by the method in Table 3, then measures the quantum sequence

{| a_{1} 〉, | t_{1} 〉, | a_{2} 〉, | t_{2} 〉, \dots | a_{n} 〉, | t_{n} 〉}

. If each quantum pair satisfies the equation

| t_{i} 〉_{m} = U_{s_{i + 1}} {| a_{i} 〉}_{m}

, where

| t_{i} 〉_{m}

and

| a_{i} 〉_{m}

are measurement results of

| t_{i} 〉

and

| a_{i} 〉

, respectively, the quantum sequence passes the verification of Bob.

2.4. Count Bloom Filter

A Bloom filter is an efficient data structure that is mainly used to determine or find whether an element exists in a set. The Bloom filter was first proposed by B.H. Bloom in 1970 [33]. Since Bloom filters do not support delete operations, it cannot be adapted to dynamic data environments. A counting Bloom filter that can support a delete operation is proposed in the protocol in [34].

Figure 2 shows the composition of a counting Bloom filter. It mainly consists of two tools: an array of size m and k different collision-resistant hash functions

{H_{1}, \dots, H_{k}}

, where

H_{i} : {0, 1}^{*} ⟶ {1, \dots, m}

for

i \in {1, 2, \dots, k}

. Suppose Alice has a private set

S = {s_{1}, s_{2}, \dots, s_{n}}

. She wants to map all elements of S into the m-size array

C B F_{s}

by k hash functions. Initially, Alice obtains an empty array

C B F_{s}

, where all elements are set to 0. For each element x of S, Alice uses hash functions

{H_{1}, \dots, H_{k}}

to obtain positions

{H_{1} (x) t h, \dots, H_{k} (x) t h}

in

C B F_{s}

, and adds 1 to the values in these positions.

In general, if someone wants to insert an element into

C B F_{s}

, he can use hash functions to map the element to the corresponding positions in

C B F_{s}

and add one to the values in these positions. In addition, if someone wants to query whether an element x belongs to S, he only needs to map the element x to the corresponding positions in

C B F_{s}

by hash functions. Then, he determines whether the values in all these positions are non-zero. If there exists a position where the value is 0, then it means that the element x cannot belong to S. If Alice wants to delete an element x of S to

C B F_{s}

, she only needs to map the element x to the corresponding positions in

C B F_{s}

and reduces the value of all positions by one unit (value = value − 1). Please note that x must belong to S. However, if the values in all positions are non-zero, it is possible that x is not in S. That is, the count Bloom filter has false positives.

3. Models and Design Goal

3.1. System Model

In this section, we will illustrate our design of the privacy-preserving set intersection from a system perspective. Our system model consists of five groups of entities: (1) IoT devices; (2) devices for an edge device; (3) a server provider; (4) a client and (5) a certificate authority, as shown in Figure 3.

IoT Devices: IoT devices equipped with sensing and communication capabilities are deployed in areas of interest. IoT devices generate real-time data and periodically report data to the edge device. Communication between IoT devices and edge devices is classic communication.

Edge Devices (ED): In order to improve efficient communication, an edge device is deployed at the network edge, which receives the data reported from IoT devices. After receiving data, it locally processes, aggregates, and forwards data to a service provider.

Service Provider (SP): An SP might consist of servers equipped with quantum devices. The SP directly provides the IoT services to the end client. Specifically, we take the IoMT scenario as an example to describe the privacy-preserving set intersection. In hospitals, various IoT devices monitor patients’ physical health, such as physiological parameters and living habits. After IoT devices report data to an ED, ED first processes data locally. Then, the ED forwards processing results to SP through wireless communication. When physicians belonging to other hospitals want to obtain data on patients with similar diseases, SP will respond to the client according to this protocol.

Client: A client may be an end device that is equipped with quantum devices. She receives anonymous encrypted data from SP and calculates the privacy intersection of their sets.

Certificate Authority (CA): A CA is a trusted third party that generates identity information for clients and servers. CA is also equipped with quantum devices that can forward quantum states to the client. CA is only used in the basic building block of the protocol: oblivious key distribution scheme [29], which is introduced in Section 2.2.

In this paper, the quantum devices required above only need to support single-photon preparation, measurement, and simple single-bit operations. That is, the quantum device we describe is not a full-fledged quantum computer including quantum random memory but has some basic devices [35,36,37,38] and single-bit circuits that can support single-photon operations.

3.2. Security Model

We consider honest-but-curious parties, where adversaries may attempt to learn more information from a given protocol execution but are not able to deviate from the protocol.

Definition 1.

Privacy-preserving set intersection (PSI) protocol—there are two communicating parties, i.e., a client with a private set C and an SP with a private set S. After executing a PSI protocol, the client outputs the intersection of their respective private sets, i.e.,

C \cap S

, but the SP obtains nothing. Furthermore, a PSI protocol should meet the following privacy requirements:

(1) SP Privacy: The client learns no information about the SP’s private set except the intersection

C \cap S

.

(2) Client Privacy: SP cannot obtain any private information about the client’s private set.

Traditionally, PSI uses a static setting where computation is performed only once on both parties’ input sets. We also consider that parties can periodically calculate the intersection of their private updatable sets.

In addition, we also consider external adversary attacks and authentication analysis to enhance security. That is, the protocol should also meet the following security requirement:

(3) Authentication: If the tag passes authentication, the client will continue to execute the protocol, otherwise, terminate.

Due to the focus on privacy-preserving of two parties, i.e., a client and an SP, during the interaction, we do not consider the honesty of IoT devices and EDs. That is, they faithfully report data and are not subject to attack.

3.3. Design Goal

The design goals are as follows.

The proposed protocol can not only protect the private data of both parties but also protect the identity information of both parties. The protocol needs to ensure correctness without losing the ability to protect privacy. In order to enhance privacy protection, the protocol is required to protect the identity information of both parties. In addition, the protocol may be subject to external attacks with quantum devices, so it needs to have a certain resistance to external attacks.
The proposed protocol should have efficient communication efficiency. This protocol only needs the linear communication complexity of $O (τ)$ qubits.

4. Proposed Protocol

In this protocol, assume that a client has a private set

C = {c_{1}, c_{2}, \dots, c_{v}}

and an SP has a private set

S = {s_{1}, s_{2}, \dots, s_{w}}

, where

w > v

. All elements of sets C and S lie in

Z_{N}

, where

Z_{N} = {0, 1, 2, . ., N - 1}

.

Furthermore, SP and the client have the same count Bloom filter parameters, i.e., hash functions

{h_{1}, h_{2}, \dots, h_{λ}}

and the length

τ

of the count Bloom filter [34,39].

The protocol consists of three main parts, including key generation phase, encryption phase and decryption phase. Next, we will describe these phases. In addition, specific notations used in the following text are illustrated in Table 4.

4.1. Key Generation

In this section, two parties, i.e., a client and an SP, will be distributed a special asymmetric key. SP knows every bit of the key, while the client only knows partial bits of the key, where each bit that the client knows is associated with a unique element of her private set. For instance, assume that position indexes of the key bits start from 0 to

N - 1

. Suppose that Alice has a set

X = {x_{1}, x_{2} \dots, x_{n}}

, where

x_{i} \in {0, 1,, \dots N - 1}

and

n < N

. Then, Alice only knows the

x_{1} t h, x_{2} t h, \dots

and

x_{n}

th bits of the key.

Step 1: The client and SP invoke Xiao’s Oblivious Quantum Key Distribution (OQKD) protocol [29] to share a random secret

(τ + q)

-bit key

k_{B}

. SP knows the whole key

k_{B}

, and the client only knows

m + q

bits of key

k_{B}

(note that m is the number of non-zero items in the client’s array

C B F_{C}

during decryption phase,

τ

is the size of SP’s array BF in encryption phase and q is a security parameter).

Furthermore, as for reference [29], we can also obtain the

τ + 1

bits message authentication key K, which only are known by the SP and client.

Step 2: Then, the client randomly chooses q bits of the key to check whether SP is honest. That is, she requests SP to announce the values of these checked bits. If these values published by SP do not entirely match those that she has deciphered, it would indicate that SP is dishonest or there is an outside eavesdropper. If the client discovered a dishonest SP or any outside eavesdropping, she would terminate this protocol, otherwise, continue to the next step.

Step 3: SP and the client discard q checked bits of the raw key

k_{B}

and further obtain the intermediate key

k_{b}

of length

τ

. Similarly, the client only knows m bits of key

k_{b}

, while SP still knows all bits. Actually, the client knows not only m-bit values:

k_{b} (j_{1}), k_{b} (j_{2}), \dots, k_{b} (j_{m})

but also their respective position indexes:

{j_{1}, j_{2}, \dots, j_{m}}

, where

k_{b} (j_{i})

denotes the

j_{i}

th bit of

k_{b}

. In addition, SP does not know the bits which the client knows.

Step 4: The client generates a random permutation

π

of an

τ

-element sequence by position index set

{j_{1}, j_{2}, \dots, j_{m}}

and non-zero items’ position index set

{p_{1}, p_{2} \dots, p_{m}}

of the count Bloom filter

C B F_{C}

, which must meet the following condition

\{k_{b} (j_{1}), \dots, k_{b} (j_{m})\} = \{k_{b}^{*} (p_{1}),, \dots, k_{b}^{*} (p_{m})\}

(1)

where

k_{b}^{*}

is a new sequence after applying the permutation

π

to

τ

-element sequence

k_{b}

, i.e.,

k_{b}^{*} = π (k_{b})

. Then the client announces the permutation

π

to SP.

Step 5: SP obtains the final key

k_{b}^{*} = π (k_{b})

from key

k_{b}

by permutation

π

. Obviously, the client only knows partial bits:

k_{b}^{*} (p_{1})

,

k_{b}^{*} (p_{2})

, …,

k_{b}^{*} (p_{m})

, where

k_{b}^{*} (p_{i})

denotes the

p_{i} t h

bit of

k_{b}^{*}

for

i = {1, 2, \dots, m}

. However, SP does not know any secret information about position index set

{p_{1}, p_{2} \dots, p_{m}}

without

{j_{1}, j_{2}, \dots, j_{m}}

.

Here, we give a simple example to illustrate how to generate an oblivious key between the client and SP, as shown in Figure 4. The client and SP share the length

τ = 14

of the count Bloom filter. The client has position indexes,

{p_{1} = 4, p_{2} = 7, p_{3} = 8, p_{4} = 14}

, of non-zero items in the count Bloom filter, and thus finally, she only knows

k_{b}^{*} (4)

,

k_{b}^{*} (7)

,

k_{b}^{*} (8)

and

k_{b}^{*} (14)

, while SP knows all bits of

k_{b}^{*}

. The elements of Figure 4 with blue background are the checked qubits, such as

k_{B} (11)

and

k_{B} (15)

. The elements with black slashes are the checked qubits that have been discarded, such as

k_{b} (15)

and

k_{b} (16)

.

4.2. Encryption

Suppose that SP has a private set

S = {s_{1}, s_{2}, \dots, s_{w}}

, where every element lies in

Z_{N}

. She employs

λ

independent collision resistant hash functions

{h_{1}, h_{2}, \dots, h_{λ}}

.

Step 6: In this step, SP utilizes Algorithm 1 to generate an array of

τ

elements. First, SP maps the private set

S = {s_{1}, s_{2}, \dots, s_{w}}

to the counting Bloom filter

C B F = {C B F_{1}, C B F_{2}, \dots, C B F_{τ}}

through hash functions

{h_{1}, h_{2}, \dots, h_{λ}}

. Then, SP selects an array

B F = {B F_{1}, B F_{2}, \dots, B F_{τ}}

, where all elements initialize to 0. All elements of corresponding positions in

B F

are set to 1, according to non-zero items in

C B F

. SP has position indexes

{q_{1}, q_{2}, \dots, q_{l}}

of non-zero items in the array

B F

. The construction process is shown in Figure 5.

Furthermore, SP’s database is constantly changing in the actual environment. Therefore, SP synchronously modifies the local counting Bloom filter through Algorithms 2 and 3.

Algorithm 1 Generating an array of

τ

elements

Require :: ${s_{1}, s_{2}, \dots, s_{w}}$ .
Ensure :: $B F \in {0, 1}^{τ}$ .
1:: for $i = 1$ to $τ$ do
2:: $C B F [i] = 0$
3:: $B F [i] = 0$
4:: end for
5:: // All $τ$ elements in $C B F$ and $B F$ are set to 0 initially.
6:: for $i = 1$ to w do
7:: for $j = 1$ to $λ$ do
8:: $C B F [h_{j} (s_{i})] = C B F [h_{j} (s_{i})] + 1$ ;
9:: end for
10:: end for
11:: // That is, for each element $s_{i}$ of the private set S, the $h_{1} (s_{i}) t h, h_{2} (s_{i}) t h, \dots,$ and $h_{λ} s_{i} t h$ the elements of $C B F$ all plus 1.
12:: for $i = 1$ to $τ$ do
13:: if CBF[i] > 0 then
14:: BF[i] = 1;
15:: end if
16:: end for
17:: // That is, for each element $s_{i}$ of the private set S, the $h_{1} (s_{i}) t h, h_{2} (s_{i}) t h, \dots,$ and $h_{λ} (s_{i}) t h$ the elements of $B F$ all set 1.

Algorithm 2 Adding an element to count Bloom filter

Require :: x.
Ensure :: $B F$ and $C B F$ , where $C B F = C B F \cup x$ .
1:: Execute Algorithm 1 to generate CBF and BF
2:: for $i = 1$ to $λ$ do
3:: $C B F [h_{i} (x)] = C B F [h_{i} (x)] + 1$ ;
4:: if BF[i] = 0 then
5:: BF[i] = 1;
6:: end if
7:: end for

Algorithm 3 Deleting an existing element from count Bloom filter

Require :: x;
Ensure :: $B F$ and $C B F$ , where $C B F = C B F - x$ ;
1:: Execute Algorithm 1 to generate CBF and BF
2:: for $i = 1$ to k do
3:: CBF[i] = CBF[i]-1;
4:: if CBF[i] = 0 then
5:: BF[i] = 0;
6:: end if
7:: end for
8:: //Please note that it must guarantee that the element indeed belongs to the set associated with count Bloom filter before deleting it.

Step 7: After obtaining the array BF, SP encrypts it with the key

k^{*}

(k^{*} = k_{b}^{*})

to obtain

\begin{matrix} K B F & = k^{*} \oplus B F \\ = {k_{1}^{*} \oplus B F [1], k_{2}^{*} \oplus B F [2], \dots, {k_{n}^{*} \oplus B F [τ]} \\ = {K B F_{1}, \dots K B F_{τ}} . \end{matrix}

(2)

Then, as for reference [32], the client and SP publicly select two unitary quantum operations

U_{0}, U_{1}

, which should satisfy the conditions of Section 2.3.

According to the key K and operations

U_{0}, U_{1}

, SP transforms

K B F}

into

τ

pairs of qubits

{| a_{1} 〉, | t_{1} 〉, | a_{2} 〉, | t_{2} 〉, \dots, | a_{τ} 〉, | t_{τ} 〉}

, where each item

K B F_{j}

is associated with a pair of qubits

| a_{j} 〉, | t_{j} 〉

. First qubit

| a_{j} 〉

is the quantization of

K B F_{j}

and the second

| t_{j} 〉

is the tag of

K B F_{j}

. Finally, SP sends this quantum sequence to the client.

4.3. Decryption

Suppose that a client has a private set

C = {c_{1}, c_{2}, \dots, c_{v}}

, where every element lies in

Z_{N}

. He also employs

λ

independent collision resistant hash functions

{h_{1}, h_{2}, \dots, h_{λ}}

.

Step 8: The client also generates a count Bloom filter

C B F_{C}

of

τ

size and can obtain position indexes of non-zero items in

C B F_{C}

, i.e.,

{p_{1}, p_{2}, \dots, p_{m}}

.

Furthermore, the client’s database is also constantly changing in actual environments. Therefore, the client synchronously modifies the local counting Bloom filter through Algorithms 2 and 3. However, different from SP, the client does not need to generate the array

B F_{C}

that is similar to

B F

.

Step 9: After receiving the quantum sequence from SP, the client verifies each pair of qubits. As previously introduced in Section 2.3, if the client finds that equation

| t_{i} 〉_{m} = U_{K_{i + 1}} {| a_{i} 〉}_{m}

holds, where

i \in {1, 2, \dots, τ}

, the verification will succeed, otherwise, it will fail.

| t_{i} 〉_{m}

and

| a_{i} 〉_{m}

are measurement results of

| t_{i} 〉

and

| a_{i} 〉

, respectively. If the client discovered a dishonest SP or any outside eavesdropping, she would terminate this protocol, otherwise, continue to the next step. After successful authentication, the client obtains a correct encrypted array

K B F = {K B F_{1}, \dots, K B F_{τ}}

. Then the client decrypts

K B F

to obtain decrypted values of partial position indexes

{p_{1}, p_{2}, \dots, p_{m}}

in

K B F

by

k^{*}

, where the client only knows m bits of

k^{*}

. Furthermore, the decryption of array

K B F

is also reflected in Algorithm 4.

Finally, the client continues to execute Algorithm 4 to obtain the desired private set intersection

C \cap S

.

Algorithm 4 Obtaining the set intersection

Require :: $C = {c_{1}, c_{2}, \dots, c_{v}}, K B F, {p_{1}, p_{2}, \dots, p_{m}}, k^{*}$ ;
Ensure :: $χ \in {0, 1 \dots, N - 1}^{τ}$ , where $χ = C \cap S$ ;
1:: for $i = 1$ to $τ$ do
2:: PBF[i] = 0;
3:: $χ$ [i] = 0;
4:: end for
5:: for $i = p_{1}$ to $p_{m}$ do
6:: PBF[i] = KBF[i] $\oplus k^{*}$ [i];
7:: end for
8:: //Initialization and setting values;
9:: z = 0;
10:: for $i = 1$ to v do
11:: for $j = 1$ to $λ$ do
12:: if $P B F [h_{j} (c [i])] = 0$ then
13:: Break;
14:: end if
15:: end for
16:: $χ$ [++z] = c[i];
17:: end for
18:: //Testing membership tests

5. Security Analysis and Performance Evaluation

In this section, we mainly analyze the security and performance evaluation of this protocol. In the above definition 1, PSI protocol satisfies the following three security properties:

1. Correctness: After executing the protocol, the client should obtain the correct set intersection (

C \cap S

).

2. SP Privacy: The client learns no information about SP’s set except

C \cap S

.

3. Client Privacy: SP cannot obtain any private information about the client’s set.

Next, we specifically analyze three properties of this protocol.

5.1. Correctness

As we know, the client has a private set

C = {c_{1}, c_{2}, \dots, c_{v}}

and SP has a private set

S = {s_{1}, s_{2}, \dots, s_{w}}

, where

w > v

. All elements of sets, i.e., C and S, lie in

Z_{N}

, where

Z_{N} = {0, 1, 2, . ., N - 1}

.

Furthermore, SP and the client have same count Bloom filter parameters: hash functions

{h_{1}, h_{2}, \dots, h_{λ}}

and the size

τ

of the count Bloom filter. Then, SP has position indexes

{q_{1}, q_{2}, \dots, q_{l}}

of non-zero items in

B F

. The client also has position indexes

{p_{1}, p_{2}, \dots, p_{m}}

of non-zero items in the count Bloom filter

C B F_{C}

. Then, we will obtain

\begin{matrix} i \in S \cap C & ⟺ i \in S \land i \in C \\ ⟹ B F [j] \neq 0 \land C B F_{C} [j] \neq 0 \\ \land j \in {h_{1} (i), h_{2} (i), \dots, h_{λ} (i)} \\ (by hash functions {h_{1}, h_{2}, \dots, h_{λ}}) \\ ⟹ B F [j] \neq 0 \land j \in {p_{1}, p_{2}, \dots, p_{m}} \\ ⟹ B F [j] \land j \in {q_{1}, q_{2}, \dots, q_{l}} \land j \in {p_{1}, \dots, p_{m}} \\ ⟹ B F [j] \land j \in {q_{1}, q_{2}, \dots, q_{l}} \cap {p_{1}, p_{2}, \dots, p_{m}} \\ ⟹ K B F [j] \land j \in {q_{1}, q_{2}, \dots, q_{l}} \cap {p_{1}, p_{2}, \dots, p_{m}} \\ (by Equations (2)) \\ ⟹ P B F [j] \land j \in {q_{1}, q_{2}, \dots, q_{l}} \cap {p_{1}, p_{2}, \dots, p_{m}} \\ (by step 1 ∽ 7 of Algorithm 4) \\ ⟹ i \in χ ⟺ i \in S \cap C \\ (by step 10 ∽ 17 Algorithm 4) \end{matrix}

Therefore, the set of all parameters i satisfying condition

i \in χ

is equal to the intersection of their respective private sets, i.e.,

C \cap S

. Thus, the proposed protocol is correct.

Furthermore, we give an example to clearly illustrate correctness of the protocol from Figure 6. In this example, the client has a private set

C = {25, 34, 56, 36, 57}

and SP has a private set

S = {20, 34, 56, 38, 50}

, where all elements of sets C and S lie in

Z_{60}

.

Two parties have the same count Bloom filter parameters: hash functions

h_{1}, h_{2}

and the length of the count Bloom filter

τ = 16

. First, SP and the client successfully construct their own count Bloom filters, i.e.,

C B F

and

C B F_{C}

. In addition, SP extends count Bloom filter

C B F

to obtain an array

B F

. Then, SP has position indexes

{2, 3, 4, 6, 10, 12, 14}

of non-zero items in

B F

. The client also has position indexes

{2, 3, 4, 6, 10, 14}

of non-zero items in the count Bloom filter

C B F_{C}

.

In addition, the quantum sequence

{a_{1}, t_{1}, a_{2}, t_{2} \dots, a_{τ}, t_{τ}}

has no influence on the correctness of the protocol. Therefore, we do not consider the quantum sequence in the following example.

After the key generation phase, SP secretly obtains the final key

k^{*} (k^{*} = k_{b}^{*})

, where the client obtains values of position indexes of red digits in the key

k^{*}

. Obviously,

C B F_{C} [h_{1} (i)] \neq 0

and

C B F_{C} [h_{2} (i)] \neq 0

if

i \in C

. If

i \in S

,

B F [h_{1} (i)] \neq 0

and

B F [h_{2} (i)] \neq 0

, because

C B F [h_{1} (i)] \neq 0

and

C B F [h_{2} (i)] \neq 0

. Therefore,

{C B F_{C} [h_{1}, h_{2} (i)] \cap B F [h_{1}, h_{2} (i)]} \neq 0

, if

i \in C \cap S

. Please look at those positions in

B F

, where the number color is red and the number is 1. After encryption and decryption, these positions are still representations of set intersection in the array

P B F

, i.e.,

j \in r e d \land P B F [r e d] = 1

, if

i \in C \cap S

and

j \in {h_{1} (i), h_{2} (i)}

. Furthermore,

K B F

is an encrypted array of

B F

by the key

k^{*}

, where SP knows all elements. This array

P B F

is an array that partially decrypts

K B F

with the key

k^{*}

, where the client only knows part of the elements. In our example,

j \in {2, 4, 10, 14}

, if

i \in C \cap S

. Then, the client uses the array

P B F

to obtain the set intersection

χ

, i.e.,

{34, 56}

, by Algorithm 4.

5.2. Security

The protocol consists of three main parts, i.e., key generation phase, encryption phase and decryption phase. The security analysis of the protocol will be orderly presented.

5.2.1. SP privacy

During key generation, the security of Step 1 is guaranteed by Xiao et al.’s OQKD protocol [29]. By the analysis of reference [29], a dishonest client will not receive more bits than expected, i.e.,

m + q

-bit, even with more efficient measures, such as the optimal unambiguous state discrimination (USD) measurement.

During the encryption phase, SP firstly maps a private set

S = {s_{1}, s_{2}, \dots, s_{w}}

to an array

C B F = {C B F_{1}, C B F_{2}, \dots, C B F_{τ}}

through hash functions

{h_{1}, h_{2}, \dots, h_{λ}}

that the client also knows. Then, SP changes

C B F = {C B F_{1}, C B F_{2}, \dots, C B F_{τ}}

to obtain an array

B F = {B F_{1}, B F_{2}, \dots, B F_{τ}}

. That is, if a dishonest client obtains

B F

, she may obtain SP’s private set

S = {s_{1}, s_{2}, \dots, s_{w}}

. However, SP encrypts

B F

by the key

k^{*}

, where SP knows all the bits of the key, while the client only knows the partial bits. The security of

B F

has information-theoretic security because SP uses one-time pad encryption. During the decryption phase, the client can just decrypt the encrypted array

K B F

to obtain partial values of

B F

by

k^{*}

, where she only knows m-bit of the key. That is to say, the client cannot have more information about SP’s private set S.

In a word, the protocol can protect the privacy information of SP.

5.2.2. Client Privacy

Specifically, if a dishonest SP wants to eavesdrop on the client’s private key during the key generation phase, the probability that his dishonesty will be detected by his client is at least

1 - \frac{1}{2^{q}}

, where q is a secure parameter.

The security in Step 1 of key generation is guaranteed by Xiao et al.’s OQKD protocol [29]. Based on reference [29], a dishonest SP will introduce bit errors. That is, if SP obtains a message on the conclusiveness of the client’s bits, he will lose information on the bit values that the client has recorded. Actually, it is impossible for SP to have both correct bit value and conclusiveness message of the client’s measurement, i.e., position index of the correct basis. Therefore, SP cannot simultaneously obtain a bit value

k_{b} (j)

that is a correct result deciphered by the client and its corresponding index j.

In Step 2 of key generation, the client randomly compares q bits of the key with corresponding bits announced by SP to decide whether SP is dishonest. SP cannot know which bits will be taken as the checked bits before the client declares them.

Moreover, for each checked bit, if SP does not honestly execute the protocol, he will receive an error probability of

\frac{1}{2}

in the honesty test. Therefore, for a dishonest SP, the successful probability of completely passing the honest test is less than

\frac{1}{2^{q}}

.

Finally, in Step 4 of key generation, the client declares the permutation

π

to SP, which is defined by two sets

{j_{1}, j_{2}, \dots, j_{m}}

and

{p_{1}, p_{2}, \dots, p_{m}}

. Next, the condition probability

P ({j_{1}, j_{2}, \dots, j_{m}}, {p_{1}, p_{2}, \dots, p_{m}} | π)

will be analyzed. Although the permutation

π

is randomly selected by the client, it still must satisfy Equation (1). That is, the client announces a random permutation

π

with m fixed points, where fixed points are private, but the permutations are public. Accordingly, the number of permutations satisfies the condition

m! (τ - m)!

.

For simplicity, suppose that

J M

denotes two arrays

{j_{1}, j_{2}, \dots, j_{m}}

and

{p_{1}, p_{2}, \dots p_{m}}

.

p (|)

and

I (;)

denote the conditional probability and mutual information, respectively. Then, we deduce following results:

P (π) = \frac{1}{τ!}

(3)

P (π ∣ J M) = \frac{1}{m! (τ - m)!}

(4)

P (J M) = \frac{1}{C_{τ}^{m} \cdot C_{τ}^{m}}

(5)

\begin{matrix} I (π; J M) & = log \frac{P (π ∣ J M)}{P (π)} \\ = log \frac{\frac{1}{t! (τ - m)!}}{\frac{1}{τ!}} & = log \frac{τ!}{t! (τ - m)!} \end{matrix}

(6)

\begin{matrix} I (J M) & = - log P (J M) = - log \frac{1}{C_{τ}^{m} \cdot C_{τ}^{m}} \\ = 2 log C_{τ}^{m} = 2 log \frac{τ!}{t! (τ - m)!} \end{matrix}

(7)

\begin{matrix} I (J M ∣ π) & = I (J M) - I (π; J M) \\ = 2 log \frac{τ!}{t! (τ - m)!} - log \frac{τ!}{t! (τ - m)!} \\ = log \frac{τ!}{t! (τ - m)!} \end{matrix}

(8)

I (J M ∣ π) = - log P (J M ∣ π)

(9)

P (J M ∣ π) = \frac{1}{\frac{τ!}{m! (τ - m)!}} = \frac{1}{C_{τ}^{m}}

(10)

The probability of successfully guessing values of two arrays

{j_{1}, j_{2}, \dots, j_{m}}

and

{p_{1}, p_{2}, \dots, p_{m}}

through the public permutation

π

is negligible, i.e.,

\frac{1}{C_{τ}^{m}}

.

As we know that

p (M) = \frac{1}{C_{τ}^{m}}

, so

p (J M | π) = p (M)

. In other words, the probability of successfully guessing these sets

{j_{1}, j_{2}, \dots, j_{m}}

and

{p_{1}, p_{2}, \dots p_{m}}

with the public permutation

π

is equal to the probability of directly guessing values of set

{p_{1}, p_{2}, \dots p_{m}}

without

π

. In addition, the set

{j_{1}, j_{2}, \dots, j_{m}}

is the client’s private message. Therefore, it is difficult for the SP to obtain the private set

{p_{1}, p_{2}, \dots p_{m}}

even if the client declares the permutation

π

.

In a word, the honest test (i.e., q checked bits) ensures the honesty of SP during the key generation phase. The probability of successfully guessing the private sets by the public permutation

π

is negligible, i.e.,

\frac{1}{C_{τ}^{m}}

.

Furthermore, the client does not send any information during encryption and decryption phases, so private information is not leaked. Therefore, the protocol can protect the client’s private information.

5.3. External Security Analysis and Anonymity Analysis

In this protocol, we not only consider the three basic properties above but also consider external adversary security analysis and anonymity analysis.

5.3.1. External Security Analysis

During the key generation phase, the external security of Step 1 is guaranteed by Xiao et al.’s OQKD protocol [29]. Their protocol is resistant to external attacks, such as impersonation and man-in-the-middle attacks, through quantum bits for authentication (QA). Thus, our protocol can resist external attacks in the key generation phase.

Furthermore, they also use quantum bits to generate a key K, which is shared by SP and the client.

In the Step 7 of the encryption phase, even if a malicious adversary impersonates the client, she cannot obtain SP’s private set

S = {s_{1}, s_{2}, \dots, s_{w}}

by the encrypted quantum sequence

{| a_{1} 〉, | t_{1} 〉, | a_{2} 〉, | t_{2} 〉, \dots, | a_{τ} 〉, | t_{τ} 〉}

. First, the quantum sequence is obtained by encrypting the array

K B F

with the key K. The adversary cannot obtain values of K, which is only known by the client and SP. Secondly, the adversary also cannot know the values of the array

K B F

, where the security is information-theoretic security. Therefore, even if the adversary pretends to be the client to obtain the quantum sequence

{| a_{1} 〉, | t_{1} 〉, | a_{2} 〉, | t_{2} 〉, \dots, | a_{τ} 〉, | t_{τ} 〉}

, he cannot obtain SP’s private information.

Furthermore, a malicious adversary may apply man-in-the-middle attack in Step 7 of encryption and Step 9 of decryption phases. She first intercepts the quantum sequence sent by SP and then sends fake information to the client so that the client decrypts fake information. However, the client verifies the correctness of the transmitted information. Once any bit is wrong, the client will think there is an external adversary or SP is dishonest. The adversary cannot obtain values of the key K, so fake information cannot pass verification. Therefore, in the encryption and decryption phases, our protocol can resist impersonation and man-in-the-middle attacks.

In a word, the protocol can resist external attacks, such as impersonation and man-in-the-middle attacks.

5.3.2. Anonymity Analysis

In the key generation phase, the anonymity analysis of Step 1 is guaranteed by Xiao et al.’s OQKD protocol [29]. They send quantum sequences through CA.

During the encryption phase, SP only sends quantum sequences

{a_{1}, t_{1}, a_{2}, t_{2}, \dots, a_{τ}, t_{τ}}

to clients that SP already knows in step 1 of the key generation phase. However, in decryption phase, the client cannot directly determine whether the quantum sequence

{a_{1}, t_{1}, a_{2}, t_{2}, \dots, a_{τ}, t_{τ}}

is sent from the actual SP, even if quantum information indeed comes from SP. Because the client cannot determine the source of the quantum sequence. Therefore, the protocol provides an authentication function. That is, if the quantum sequence

{a_{1}, t_{1}, a_{2}, t_{2}, \dots, a_{τ}, t_{τ}}

passes authentication, the sequence is indeed sent by SP. After the quantum sequence

{a_{1}, t_{1}, a_{2}, t_{2}, \dots, a_{τ}, t_{τ}}

are authenticated, the client can obtain the actual encrypted array

K B F

from SP.

Therefore, the protocol can guarantee the anonymity of the communicating parties.

5.4. Performance

In the key generation of the protocol, it uses single photons as quantum resources. There are no complicated quantum operators except projective measurements of single photons and simple single-bit operators. In encryption and decryption, the protocol only uses simple single-bit operators and projective measurements of single photons; thus, it is easy to implement this protocol in a real-life setting.

Next, we will consider the role of protocol in updatable databases. In encryption and decryption, counting Bloom filters are employed to reduce communication overhead and accommodate dynamic databases. Counting Bloom filters are employed to handle the updated data from Algorithms 2 and 3. With the increase in data, we only need to change corresponding values in the count Bloom filter according to updatable values, instead of creating a completely new Bloom filter at each modification. At the same time, the size

τ

of the count Bloom filter will not be changed when updating the database on a small scale. Instead, the protocol only increases the size of the counting Bloom filter to reduce the false rate after that data increases to a certain threshold.

With the size

τ

of the count Bloom filter remaining the same, if the client needs more key bits due to the increase in data, the client only needs to request insufficient key bits from SP, not all bits of the key. For example, the client and SP had the key

k_{1}^{*}

of the

τ

length, where the client only knew k-bit values of the key, while SP knew all bits of the key. Now, the client has the size

l (l > k)

of position indexes of non-zero items in the count Bloom filter. Then, the client only needs to obtain a new key

k_{2}^{*}

of the

p (p < = τ)

length from SP, where the client only knows (

k - l

)-bit. Later, the client combines the key

k_{1}^{*}

and

k_{2}^{*}

to form a new key

k_{3}^{*}

of

τ

length after applying the permutation

p l

, where the client knows l-bit of

k_{3}^{*}

. The effect of

p l

is similar to the effect of

π

. Then, the client announces the permutation

p l

to SP. In this way, we can reduce the communication overhead of keys and the cost of preparing them. Of course, we consider the semi-honesty model, where the client should not deceive SP. Thus, the protocol can significantly reduce computation and storage overhead.

From Table 5, we can see a comparative summary of existing quantum private set intersection (QPSI) protocols. The communication complexity of our protocol is

O (τ)

-qubit. The transmitted qubits of the OQKD protocol [29] in key generation are

κ (τ + q) + z

qubits, where z is the number of the qubits for authentication,

κ

is a security parameter and

κ \approx l o g \sqrt{(τ + q)}

. Then, in the encryption phase, SP only transmits

2 τ

-qubit to the client. Therefore, the communication complexity (qubit) of our protocol depends on the communication complexity (qubit) of OQKD protocol, i.e.,

O (τ)

, because

τ ≫ q

in

O (κ (τ + q) + z)

. The client needs a single-photon measurements of the

κ (τ + q) + z

-qubit in the key generation phase. CA only needs to change the quantum state of the z-bit in the key generation. Therefore, the computation complexity of the key is

O (τ)

. SP needs to generate

2 τ

-qubit while performing quantum transformations on them in the encryption phase. The client needs single-photon measurements of

2 τ

-qubit in the decryption phase. Therefore, the computation complexity of transmitted messages is

O (τ)

. The computation complexity of this protocol is

O (τ)

.

Similarly, we analyze that the communication complexity of the protocol [24] should be

O (ς)

-qubit (

N ≫ ς ≫ q

), because the OQKD protocol [23] that they cite needs to transmit

ω (ς + q)

-qubit, where a security parameter is

ω \approx l o g \sqrt{(ς + q)}

. The communication complexity of the protocol [22] should be

O (N)

-qubit (

N ≫ ς ≫ q

) because the OQKD protocol [23] that they cite needs to transmit

ω (N + q)

-qubit, where a security parameter is

ω \approx l o g \sqrt{(N + q)}

.

In addition, our protocol only needs single photons, which are easier to achieve in a real-life setting. We also have a linear communication performance

O (τ)

, where

τ \approx ς ≪ N

and

τ < v

in large-scale data. We need only one round of communication during the data transfer phase, i.e.,

{| a_{1} 〉, | t_{1} 〉, | a_{2} 〉, | t_{2} 〉, \dots, | a_{τ} 〉, | t_{τ} 〉}

.

6. Discussion

PSI have a wide range of application environments in IoT. In this paper, a novel quantum PSI in IoT is designed with the help of OQKD, quantum authentication and count Bloom filter. We describe the correctness and security of this protocol by formal expressions. Of course, there is also some security analysis software for reference, such as AVISPA and SCYTHER. In this paper, we extend the OQKD method to PSI. In Table 6, we describe some differences between this paper and the underlying protocol.

Below we present some limitations of the protocol and the direction of future work. Limited by the current development of quantum technology, we are not able to conduct experiments and perform practical validation of the protocols in the IoT. Although the development of quantum facilities is still immature, there already exist some programming environments capable of simulating a small number of quantum bits, e.g., HiQ quantum cloud platform, IBM quantum cloud platform. The OQKD of key generation is similar to that of quantum key distribution (QKD). As far as we know, the key rate of QKD is 14.5 b/s under experimental conditions of 75 MHz clock rate and time bin encoding [40], which is the most advanced development [41]. Quantum devices are also subject to this protocol. We hope to perform experimental validation of the protocol in the future. The OQKD protocol [29] is the first protocol that combines OQKD methods with quantum authentication, but to our knowledge, its efficiency is not optimal. In the future, we can improve the efficiency of the overall protocol with other existing OQKD protocols [42,43,44]. The quantum authentication method used in the overall protocol requires relatively more conditions. In the future, we will improve the authentication method with better quantum authentication protocols. In addition, we hope to combine this protocol with existing classical methods so that the protocol can contribute to the development of research in other directions [7,45]. At the same time, we would like to promote a new idea: the most likely faster implementation of OQKD or QKD as a basic building block for other research topics. We hope to combine QKD and OQKD with other technologies to create a whole new security system.

7. Conclusions

In this paper, we proposed a generic system model aided with ED for PSI in IoT. Then, we presented a quantum PSI protocol in IoT. Our proposed quantum PSI protocol obtained higher security and only needed the communication complexity of

O (τ)

qubits. The proposed protocol can not only protect the private data of two parties but also protect identity information of two parties. The proposed protocol had an authentication function to prevent malicious adversary attacks and maintain information integrity.

Author Contributions

Conceptualization: B.L. and X.Z.; methodology, B.L. and X.Z.; validation: B.L., X.Z. and M.Z.; formal analysis: B.L. and M.Z.; investigation: B.L. and X.Z.; resources: M.Z.; data curation: B.L. and X.Z.; writing—original draft preparation: B.L. and X.Z.; writing—review and editing: B.L., X.Z. and R.S.; visualization: B.L. and X.Z.; supervision: M.Z. and G.Z.; project administration: B.L. and M.Z.; funding acquisition: B.L. and M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (62002105, 62072134, U2001205, 61902116) and The Key Research and Development Program of Hubei (2021BEA163).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Yang, Y.; Wu, L.; Yin, G.; Li, L.; Zhao, H. A survey on security and privacy issues in Internet-of-Things. IEEE Internet Things J. 2017, 4, 1250–1258. [Google Scholar] [CrossRef]
Xu, X.; He, Y. Blockchain application in modern logistics information sharing: A review and case study analysis. In Production Planning & Control; Taylor & Franics: Abingdon, UK, 2022; pp. 1–15. [Google Scholar]
Qadri, Y.A.; Nauman, A.; Zikria, Y.B.; Vasilakos, A.V.; Kim, S.W. The future of healthcare Internet of Things: A survey of emerging technologies. IEEE Commun. Surv. Tutorials 2020, 22, 1121–1167. [Google Scholar] [CrossRef]
Zhang, K.; Ni, J.; Yang, K.; Liang, X.; Ren, J.; Shen, X.S. Security and privacy in smart city applications: Challenges and solutions. IEEE Commun. Mag. 2017, 55, 122–129. [Google Scholar] [CrossRef]
Chakravorty, A.; Wlodarczyk, T.; Rong, C. Privacy preserving data analytics for smart homes. In Proceedings of the 2013 IEEE Security and Privacy Workshops, San Francisco, CA, USA, 23–24 May 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 23–27. [Google Scholar]
Qian, Y.; Shen, J.; Vijayakumar, P.; Sharma, P.K. Profile matching for IoMT: A verifiable private set intersection scheme. IEEE J. Biomed. Health Inform. 2021, 25, 3794–3803. [Google Scholar] [CrossRef] [PubMed]
Abadi, A.; Terzis, S.; Metere, R.; Dong, C. Efficient delegated private set intersection on outsourced private datasets. IEEE Trans. Dependable Secur. Comput. 2017, 16, 608–624. [Google Scholar] [CrossRef] [Green Version]
Zuo, X.; Li, L.; Luo, S.; Peng, H.; Yang, Y.; Gong, L. Privacy-Preserving Verifiable Graph Intersection Scheme With Cryptographic Accumulators in Social Networks. IEEE Internet Things J. 2020, 8, 4590–4603. [Google Scholar] [CrossRef]
Freedman, M.J.; Nissim, K.; Pinkas, B. Efficient private matching and set intersection. In Proceedings of the International Conference on the Theory and Applications of Cryptographic Techniques, Interlaken, Switzerland, 2–6 May 2004; Springer: Cham, Switzerland, 2004; pp. 1–19. [Google Scholar]
Le, P.H.; Ranellucci, S.; Gordon, S.D. Two-party private set intersection with an untrusted third party. In Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, London, UK, 11–15 November 2019; pp. 2403–2420. [Google Scholar]
Hazay, C.; Nissim, K. Efficient set operations in the presence of malicious adversaries. In Proceedings of the International Workshop on Public Key Cryptography, Xi’an, China, 30 May–3 June 2016; Springer: Berlin/Heidelberg, Germany, 2010; pp. 312–331. [Google Scholar]
Kolesnikov, V.; Kumaresan, R.; Rosulek, M.; Trieu, N. Efficient batched oblivious PRF with applications to private set intersection. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, 24–28 October 2016; pp. 818–829. [Google Scholar]
Pinkas, B.; Schneider, T.; Segev, G.; Zohner, M. Phasing: Private set intersection using permutation-based hashing. In Proceedings of the 24th USENIX Security Symposium (USENIX Security 15), Washington, DC, USA, 12–14 August 2015; pp. 515–530. [Google Scholar]
Chase, M.; Miao, P. Private set intersection in the internet setting from lightweight oblivious PRF. In Proceedings of the Annual International Cryptology Conference, Santa Barbara, CA, USA, 18–22 August 2020; Springer: Berlin/Heidelberg, Germany, 2020; pp. 34–63. [Google Scholar]
Badrinarayanan, S.; Miao, P.; Xie, T. Updatable Private Set Intersection. Cryptol. ePrint Arch. 2021. Available online: https://eprint.iacr.org/2021/1349 (accessed on 15 March 2022).
Cho, K.; Miyano, T. Chaotic cryptography using augmented Lorenz equations aided by quantum key distribution. IEEE Trans. Circuits Syst. I: Regul. Pap. 2014, 62, 478–487. [Google Scholar] [CrossRef]
Shi, R.H. Anonymous Quantum Sealed-bid Auction. IEEE Trans. Circuits Syst. II Express Briefs 2021, 69, 414–418. [Google Scholar] [CrossRef]
Shi, R.H.; Mu, Y.; Zhong, H.; Cui, J.; Zhang, S. An efficient quantum scheme for Private Set Intersection. Quantum Inf. Process. 2016, 15, 363–371. [Google Scholar] [CrossRef] [Green Version]
Cheng, X.; Guo, R.; Chen, Y. Cryptanalysis and improvement of a quantum private set intersection protocol. Quantum Inf. Process. 2017, 16, 37. [Google Scholar] [CrossRef]
Maitra, A. Quantum secure two party computation for set intersection with rational players. Quantum Inf. Process. 2018, 17, 197. [Google Scholar] [CrossRef] [Green Version]
Shi, R.H.; Mu, Y.; Zhong, H.; Zhang, S. Quantum oblivious set-member decision protocol. Phys. Rev. A 2015, 92, 022309. [Google Scholar] [CrossRef] [Green Version]
Debnath, S.K.; Dey, K.; Kundu, N.; Choudhury, T. Feasible private set intersection in quantum domain. Quantum Inf. Process. 2021, 20, 41. [Google Scholar] [CrossRef]
Gao, F.; Liu, B.; Wen, Q.Y.; Chen, H. Flexible quantum private queries based on quantum key distribution. Opt. Express 2012, 20, 17411–17420. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Debnath, S.K.; Srivastava, V.; Mohanty, T.; Kundu, N.; Sakurai, K. Quantum Secure Privacy Preserving Technique to Obtain the Intersection of Two Datasets for Contact Tracing. IACR Cryptol. ePrint Arch. 2021, 2021, 618. [Google Scholar] [CrossRef]
Ye, T.; Li, H.K.; Hu, J.L. Multi-User Quantum Private Query Protocol. Int. J. Theor. Phys. 2020, 59, 2867–2874. [Google Scholar] [CrossRef]
Zhu, D.; Wang, L.; Zhu, H. Cryptanalysis of Multi-User Quantum Private Query Protocol. Int. J. Theor. Phys. 2021, 60, 284–292. [Google Scholar] [CrossRef]
Jakobi, M.; Simon, C.; Gisin, N.; Bancal, J.D.; Branciard, C.; Walenta, N.; Zbinden, H. Practical private database queries based on a quantum-key-distribution protocol. Phys. Rev. A 2011, 83, 22301. [Google Scholar] [CrossRef] [Green Version]
Bennett, C.H. Quantum cryptography: Public key distribution and coin tossing. In Proceedings of the IEEE International Conference on Computers, Bangalore, India, 9–12 December 1984. [Google Scholar]
Xiao, M.; Lei, S. Quantum private query with authentication. Quantum Inf. Process. 2021, 20, 166. [Google Scholar] [CrossRef]
Curty, M.; Santos, D. Quantum authentication of classical messages. Phys. Rev. A 2012, 64, 168. [Google Scholar] [CrossRef] [Green Version]
Curty, M.; Santos, D.J.; Pérez, E.; García-Fernández, P. Qubit authentication. Phys. Rev. A 2002, 66, 022301. [Google Scholar] [CrossRef] [Green Version]
Xin, X.; Li, F. Quantum Authentication of Classical Messages without Entangled State as Authentication Key. Int. J. Multimed. Ubiquitous Eng. 2015, 10, 199–206. [Google Scholar] [CrossRef]
Bloom, B.H. Space/time trade-offs in hash coding with allowable errors. Commun. ACM 1970, 13, 422–426. [Google Scholar] [CrossRef]
Fan, L. Summary Cache: A Scalable Wide-area Web Cache Sharing Protocol. ACM SIGCOMM Comput. Commun. Rev. 1998, 28, 254–265. [Google Scholar] [CrossRef]
Xu, F.; Ma, X.; Zhang, Q.; Lo, H.K.; Pan, J.W. Secure quantum key distribution with realistic devices. Rev. Mod. Phys. 2020, 92, 025002. [Google Scholar] [CrossRef]
Liu, H.; Wang, W.; Wei, K.; Fang, X.T.; Li, L.; Liu, N.L.; Liang, H.; Zhang, S.J.; Zhang, W.; Li, H.; et al. Experimental demonstration of high-rate measurement-device-independent quantum key distribution over asymmetric channels. Phys. Rev. Lett. 2019, 122, 160501. [Google Scholar] [CrossRef] [Green Version]
Gisin, N.; Ribordy, G.; Tittel, W.; Zbinden, H. Quantum cryptography. Rev. Mod. Phys. 2002, 74, 145. [Google Scholar] [CrossRef] [Green Version]
Liu, B.; Xia, S.; Xiao, D.; Huang, W.; Xu, B.; Li, Y. Decoy-state method for quantum-key-distribution-based quantum private query. Sci. China Phys. Mech. Astron. 2022, 65, 240312. [Google Scholar] [CrossRef]
Liu, B.; Ruan, O.; Shi, R.; Zhang, M. Quantum private set intersection cardinality based on bloom filter. Sci. Rep. 2021, 11, 17332. [Google Scholar] [CrossRef]
Goldreich, O. Secure multi-party computation. Manuscript. Prelim. Version 1998, 78, 110. [Google Scholar]
Shor, P.W. Polynomial-Time Algorithms for Prime Factorization and Discrete Logarithms on a Quantum Computer. SIAM Rev. 1999, 41, 303–332. [Google Scholar] [CrossRef]
Gao, F.; Qin, S.; Huang, W.; Wen, Q. Quantum private query: A new kind of practical quantum cryptographic protocol. Sci. China Physics Mech. Astron. 2019, 62, 70301. [Google Scholar] [CrossRef]
Wei, C.Y.; Cai, X.Q.; Wang, T.Y.; Qin, S.J.; Gao, F.; Wen, Q.Y. Error tolerance bound in QKD-based quantum private query. IEEE J. Sel. Areas Commun. 2020, 38, 517–527. [Google Scholar] [CrossRef]
Wei, C.Y.; Cai, X.Q.; Liu, B.; Wang, T.Y.; Gao, F. A generic construction of quantum-oblivious-key-transfer-based private query with ideal database security and zero failure. IEEE Trans. Comput. 2017, 67, 2–8. [Google Scholar] [CrossRef] [Green Version]
Xu, X.; Wei, Z.; Ji, Q.; Wang, C.; Gao, G. Global renewable energy development: Influencing factors, trend predictions and countermeasures. Resour. Policy 2019, 63, 101470. [Google Scholar] [CrossRef]

Figure 1. System model of the OQKD protocol [29].

Figure 2. Counting bloom filter.

Figure 3. System model aided with ED of PSI in IoT scenarios.

Figure 4. Illustration of generating the key. (a) How to reduce the client’s information in the key. (b) How to obtain the final key

k_{b}^{*}

from the raw key

k_{B}

.

Figure 4. Illustration of generating the key. (a) How to reduce the client’s information in the key. (b) How to obtain the final key

k_{b}^{*}

from the raw key

k_{B}

.

Figure 5. The process of transforming data.

Figure 6. An example of privately computing

C \cap S

.

Figure 6. An example of privately computing

C \cap S

.

Table 1. The value of

| a_{i} 〉

.

Table 1. The value of

| a_{i} 〉

.

$s_{i}$ / $m_{i}$ ¹	0	1
0	$\| a_{i} 〉 = \| φ_{0} 〉$	$\| a_{i} 〉 = \| φ_{1} 〉$
1	$\| a_{i} 〉 = \| ψ_{0} 〉$	$\| a_{i} 〉 = ψ_{1} 〉$

¹ The row represents the value of m_i, while the column represents the value of s_i.

Table 2. The value of

| t_{i} 〉

.

Table 2. The value of

| t_{i} 〉

.

$s_{i + 1}$	$\| t_{i} 〉$
0	$U_{0} \| a_{i} 〉$
1	$U_{1} \| a_{i} 〉$

Table 3. Measurement basis of

| t_{i} 〉

.

Table 3. Measurement basis of

| t_{i} 〉

.

$s_{i + 1}$ / $s_{i}$ ¹	0	1
0	${U_{0} \| φ_{0} 〉, U_{0} \| φ_{1} 〉}$	${U_{1} \| φ_{0} 〉, U_{1} \| φ_{1} 〉}$
1	${U_{0} \| ψ_{0} 〉, U_{0} \| ψ_{1} 〉}$	${U_{1} \| ψ_{0} 〉, U_{1} \| ψ_{1} 〉}$

¹ The row represents the value of s_i, while the column represents the value of s_i+1.

Table 4. Definitions of notations.

Notations	Definitions
C	The client’s private set
S	The SP’s private set
${h_{1}, h_{2}, \dots, h_{λ}}$	The hash functions
$τ$	The length of the count Bloom filter
$k_{B}$	The raw key distributed by the SP
K	The message authentication key from the protocol [29]
$k_{b}$	The intermediate key after checking the SP’s honesty
$k_{b}^{}, k^{}$	The final key distributed by the SP
$C B F$	The SP’s count Bloom filter
$B F$	The variant of $C B F$
$K B F$	Encryption result of the array $B F$ by the key $k^{*}$
$\| a_{i} 〉, \| t_{i} 〉$	The ith element of the encryption result of the array $K B F$ by the key K
$C B F_{C}$	The client’s count Bloom filter
${p_{1},, p_{2} \dots, p_{m}}$	The positions index of non-zero items of $C B F_{C}$

Table 5. Comparison summary.

Protocol	Ours	[24]	[22]	[18]	[19]	[20]
Quantum resource	single photons	single photons	single photons	multi-particle entangled states	multi-particle entangled states	multi-particle entangled states
Complicated oracle operators	no	no	no	yes	yes	yes
Dimension of the Hilbert Space	2	2	2	N	N	N
Quantum measurements	single-photon measurements	single-photon measurements	single-photon measurements	projective measurements	projective measurements	projective measurements
Intersection cardinality revealed to SP	no	no	no	no	no	yes
Communication complexity (qubit)	$O (τ)$	$O (ς$ )	$O (N)$	$O (v l o g N)$	$O (v l o g N)$	$O (v + l) l o g N$
Computation complexity (qubit)	$O (τ)$	$O (ς)$	$O (N)$	$O (v)$	$O (v)$	$O (N + l)$
Round complexity in the set intersection	1	1	1	2	3	4
Resistant to external attacks	yes	no	no	no	no	no

Table 6. Comparison with the OQKD protocol [29].

Protocol	Research Themes	SP (Server) Honesty Test	The Data Process	Matching Method between Keys and Data
Ours	private query	no	no	a shift value
[29]	PSI	yes	count Bloom filter	a permutation

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, B.; Zhang, X.; Shi, R.; Zhang, M.; Zhang, G. SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT. Mathematics 2022, 10, 2120. https://doi.org/10.3390/math10122120

AMA Style

Liu B, Zhang X, Shi R, Zhang M, Zhang G. SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT. Mathematics. 2022; 10(12):2120. https://doi.org/10.3390/math10122120

Chicago/Turabian Style

Liu, Bai, Xiangyi Zhang, Runhua Shi, Mingwu Zhang, and Guoxing Zhang. 2022. "SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT" Mathematics 10, no. 12: 2120. https://doi.org/10.3390/math10122120

APA Style

Liu, B., Zhang, X., Shi, R., Zhang, M., & Zhang, G. (2022). SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT. Mathematics, 10(12), 2120. https://doi.org/10.3390/math10122120

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

$s_{i}$ / $m_{i}$ ¹	0	1
0	$\| a_{i} 〉 = \| φ_{0} 〉$	$\| a_{i} 〉 = \| φ_{1} 〉$
1	$\| a_{i} 〉 = \| ψ_{0} 〉$	$\| a_{i} 〉 = ψ_{1} 〉$

$s_{i + 1}$ / $s_{i}$ ¹	0	1
0	${U_{0} \| φ_{0} 〉, U_{0} \| φ_{1} 〉}$	${U_{1} \| φ_{0} 〉, U_{1} \| φ_{1} 〉}$
1	${U_{0} \| ψ_{0} 〉, U_{0} \| ψ_{1} 〉}$	${U_{1} \| ψ_{0} 〉, U_{1} \| ψ_{1} 〉}$

Article Menu

SEPSI: A Secure and Efficient Privacy-Preserving Set Intersection with Identity Authentication in IoT

Abstract

1. Introduction

2. Related Works

2.1. Quantum PSI Protocol

2.2. Oblivious Quantum Key Distribution

2.3. Quantum Authentication

2.4. Count Bloom Filter

3. Models and Design Goal

3.1. System Model

3.2. Security Model

3.3. Design Goal

4. Proposed Protocol

4.1. Key Generation

4.2. Encryption

4.3. Decryption

5. Security Analysis and Performance Evaluation

5.1. Correctness

5.2. Security

5.2.1. SP privacy

5.2.2. Client Privacy

5.3. External Security Analysis and Anonymity Analysis

5.3.1. External Security Analysis

5.3.2. Anonymity Analysis

5.4. Performance

6. Discussion

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI