PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services

Li, Jingyi; Song, Yuqi; Tian, Chengliang; Tian, Weizhong

doi:10.3390/modelling6020044

Open AccessArticle

PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services

¹

College of Computer Science and Technology, Qingdao University, Qingdao 266071, China

²

College of Big Data and Internet, Shenzhen Technology University, Shenzhen 518118, China

^*

Authors to whom correspondence should be addressed.

Modelling 2025, 6(2), 44; https://doi.org/10.3390/modelling6020044

Submission received: 11 April 2025 / Revised: 19 May 2025 / Accepted: 30 May 2025 / Published: 3 June 2025

Download

Browse Figures

Versions Notes

Abstract

The k-nearest- neighbor (kNN) algorithm is crucial in data mining and machine learning, yet its deployment on large-scale datasets within cloud environments presents significant security and efficiency challenges. This paper is dedicated to advancing the resolution of these challenges and presents novel contributions to the development of efficient and secure exact kNN query schemes tailored for spatial datasets in cloud-based location services. Addressing existing limitations, our approach focuses on accelerating query processing while ensuring robust privacy preservation and public verifiability. Key contributions include the establishment of a formal framework underpinned by stringent security definitions, providing a solid groundwork for future advancements. Leveraging Paillier’s homomorphic cryptosystem and public-key signature techniques, our design achieves heightened security by safeguarding databases, query access patterns, and result integrity while enabling public verification. Additionally, our scheme enhances computational efficiency through optimized data-packing techniques and simplified Voronoi diagram-based ciphertext index construction, leading to substantial savings in computational and communication overheads. Rigorous and transparent theoretical analysis substantiates the correctness, security, and efficiency of our design, while comprehensive experimental evaluations confirm the effectiveness of our approach, showcasing its practical applicability and scalability across datasets of varying scales.

Keywords:

cloud computing; kNN query; DSA; privacy preservation; verifiability

1. Introduction

The k-nearest-neighbor (kNN) algorithm is a fundamental data mining and machine learning tool used for classification and regression tasks. It identifies the k closest points in a dataset to a given query point and makes decisions based on the properties of these points. In large-scale datasets, particularly those hosted in cloud environments, kNN queries present unique challenges and opportunities in terms of efficiency and privacy. When dealing with large-scale datasets, the computational and memory requirements for performing kNN queries can be substantial. The sheer volume of data can overwhelm local processing capabilities, making cloud-based solutions attractive due to their scalability and resource availability. However, distributing this process across a cloud infrastructure introduces latency and potential bottlenecks in data transmission and processing [1].

Privacy is a critical concern when sensitive data is processed or stored outside local premises, such as in cloud environments. Traditional kNN implementations may expose private data to cloud providers, increasing the risk of unauthorized access and breaches. The verifiability of query results is another significant concern. During information processing, a malicious cloud provider could conceal or alter information, leaving the user unsure about the accuracy of the results and unable to assess them with certainty. Therefore, it is crucial to design privacy-preserving and verifiable mechanisms that protect data and ensure the integrity of query results while still leveraging the cloud’s computational power. To address these challenges, many privacy-preserving and verifiable designs for kNN have been developed [2,3,4,5,6].

To ensure the confidentiality of data during the query process, a range of techniques are employed, including encryption, data anonymization, and secure multi-party computation. For instance, homomorphic encryption allows computations to operate on encrypted data, yielding encrypted results that are decipherable solely by the data owner [3]. Another method involves distributing data across multiple independent servers, each processing segments of the kNN computation without access to the underlying data or final query outcomes [7]. To ensure the verifiability of query results, existing approaches incorporate cryptographic hash functions and signature-based authentication technologies [3,8]. Moreover, to enhance query efficiency, protocols often integrate cryptographic methods with data-efficient structures and algorithms such as tree-based dimensionality reduction. This technique simplifies data while preserving essential characteristics relevant to kNN queries. Additionally, indexing techniques can be tailored to function with encrypted data, thereby reducing query response times without compromising security [9].

1.1. Related Works

In recent years, researchers have proposed various schemes to enable privacy-preserving and verifiable kNN queries on data outsourced to untrusted cloud servers. Depending on their security functionalities, cloud-assisted kNN schemes can be categorized as privacy-preserving or verifiable schemes.

Privacy-preserving kNN schemes without verifiability: Privacy-preserving cloud-assisted kNN schemes aim to protect user data while leveraging cloud resources for query processing. These schemes are essential in scenarios where sensitive data is stored in an honest-but-curious cloud and needs to be analyzed without compromising individuals’ privacy. Traditionally, a secret distance-preserving transformation (DPT) was used to ensure data privacy [10]. However, concerns were raised by Wong et al. [11] regarding vulnerabilities to known-sample attacks (

KSA ʃ

) and known-plaintext attacks (

KPA ʃ

) in DPT-based privacy-preserving kNN query schemes. In response, they proposed an asymmetric scalar product preservation encryption (ASPE) method. Hu et al. [12] introduced a kNN query processing scheme utilizing lightweight order-preserving encryption (OPE) and deterministic random encryption (DRE) to effectively support kNN queries of encrypted data. Unfortunately, subsequent analyses revealed vulnerabilities to chosen-plaintext attacks [13]. Choi et al. [14] and Wang et al. [15] adopted mutable order-preserving encryption (OPE) [16] to improve security. However, these schemes assumed mutual trust between the data owner (DO) and query user (QU), which introduced a serious risk of privacy disclosure of the DO’s key. Recognizing potential mutual distrust among the DO, QU, and cloud server (CS), Zhu et al. [17,18] proposed solutions for securely outsourcing kNN queries by using random blind techniques and Paillier’s additively homomorphic cryptosystem. However, their schemes were susceptible to known-plaintext attacks and failed to conceal data access patterns. Lei et al. [2] addressed outsourced kNN queries in spatial databases, employing vector-based projection technology for data preprocessing but sacrificing query accuracy. Subsequent improvements [19] aimed at enhancing accuracy but led to increased communication complexity. Further advancements highlighted insecurities in the ASPE method against ciphertext-only attacks [20], prompting the proposal of a 1NN scheme against adaptive chosen-keyword attacks, but it lacked support for secure kNN queries. Recently, Zheng et al. [21] introduced a privacy-preserving kNN query scheme based on asymmetric matrix encryption (AME), offering practicality and security, and Qi et al. [5] presented a privacy-aware kNN protocol under the dual cloud-server model, supporting dynamic data update operations. Despite these advancements, most methods do not consider the privacy of access patterns. To protect access patterns, works exploring the two (or more)-non-colluding cloud servers model have been pursued [22,23]. Elmehdwi et al. [22] designed a secure kNN query outsourcing scheme with Paillier’s homomorphic cryptosystem, achieving data, query, and result privacy at the cost of significant overhead on the QU side. Guan et al. [23] proposed a more efficient oblivious location-based kNN query scheme over the cloud, which is resistant to rainbow-location attacks.

Privacy-preserving kNN schemes with verifiability: Verifiable cloud-assisted kNN schemes focus on ensuring the correctness and integrity of query results obtained from the cloud. These schemes are crucial in scenarios where the cloud provider may be malicious or prone to errors. Compared with privacy-preserving designs for data in kNN queries, few works have focused on the verifiability of the query results. Yiu et al. [24] proposed a framework based on Voronoi MR trees, which is a variant of Merkle hash trees (MHT), to verify kNN query results and security zones. However, their design does not address data privacy, and most existing approaches relying on tree-based indexes leak access patterns. Moreover, Rong et al. [25] proposed the Verifiable Secure kNN (VSkNN) query scheme. However, their verifiable framework is probabilistic and they employed ASPE as the encryption scheme, which has previously been shown to compromise security [13] by exposing data and access pattern privacy to the cloud. Jiang et al. [26] proposed a verifiable dynamic searchable symmetric encryption scheme based on additive trees, but it only supports Boolean queries, not kNN queries. Therewith, Wu et al. [27] introduced ServeDB for secure and verifiable range queries, but it is not applicable to kNN queries, does not support access pattern privacy, and reveals data privacy in the verification results. For the first time, Cui et al. [9] proposed a secure and verifiable kNN query scheme called SVkNN, which uses Paillier’s additively homomorphic cryptosystem for privacy, Voronoi graphs for fast queries, and a hash-based message authentication code for verification. To address the efficiency issues identified in the work by [9], Liu et al. [3] proposed SecVKQ, a two-stage framework integrating edge servers to optimize query performance. Later, Cui et al. [8] further considered secure and verifiable kNN queries in multi-user settings and designed an effective scheme called MSVkNN, which employs a two-trapdoor public-key cryptosystem (DT-PKC) [28] for privacy and a hash-based message authentication code for verification. Recently, Zhang et al. [4] developed a secure kNN query scheme that integrates DT-PKC with random projection forests. This innovation not only enhances query efficiency but also ensures the privacy of data, queries, results, and access patterns while allowing for the verification of result correctness.

For transparency, we provide a comparison of the aforementioned works in terms of privacy, verifiability, the type of kNN employed, and the system models, as presented in Table 1.

1.2. Challenges and Contributions

As shown in Table 1, most existing schemes focus solely on data privacy, with only a few considering verifiability. Schemes that address both security properties simultaneously are relatively rare. Among these, most have one or more of the following limitations:

Inability to protect the privacy of data access patterns [3].
Use of probabilistic verification methods [7,25].
Verification is limited to confirming the authenticity of the data source for the query results returned by the cloud.
Support is limited to private verification, where the verification algorithm requires the involvement of a private key. This restricts verification to designated verifiers and leads to the inability to achieve accurate arbitration when disputes arise between the querier and the cloud server.

Consequently, designing a privacy-preserving, publicly verifiable scheme that simultaneously ensures data and access pattern privacy while achieving verifiable query results—including both the authenticity of the data source and the completeness of the results—remains a significant challenge.

Aiming at the above challenges, this paper builds on prior work to propose an efficient cloud-assisted kNN query scheme that guarantees data privacy while fulfilling the requirements of public verifiability. Specifically, compared to previous approaches, our contributions are notable in the following key aspects:

Formal Framework Establishment and Rigorous Theoretical Analysis: Although numerous privacy-preserving or verifiable kNN query schemes exist, to the best of our knowledge, no strict formal definition of privacy-preserving, publicly verifiable kNN queries has been proposed. Our first contribution is the establishment of the first formal framework for this purpose, clearly identifying its fundamental components and security definitions. Additionally, we conduct a rigorous and detailed theoretical analysis of our proposed scheme designed within this framework, laying a robust foundation for future research and practical advancements in the field.
Robust Privacy: By adopting the two-non-colluding cloud servers model and leveraging the semantic security of Paillier’s homomorphic cryptosystem along with the Voronoi diagram-based index structure, we design and integrate a series of secure computation algorithms. These include Secure Division Computation (SDC), Secure Grid Computation (SGC), Secure Nearest-Neighbor (SNN) search, and Secure Voronoi Cell Read (SCR). Together, these algorithms enable secure kNN queries, ensuring the privacy of the original dataset, query data, and query results while also preserving the privacy of data access patterns.
Public Verifiability: By integrating Paillier’s homomorphic cryptosystem with public-key signature techniques and a Voronoi diagram-based index structure, our scheme enables public verification of the authenticity and completeness of query results. Unlike previous designs, which restricted verification to designated verifiers, our approach removes this limitation, allowing for accurate arbitration in disputes between the querier and the cloud server.
Optimized Computational Efficiency: To ensure the correctness of encryption and decryption, the proposed scheme refines the parameters of a series of secure protocols integrated with data packing techniques, addressing the ambiguous and inaccurate descriptions in Cui et al.’s design [8]. Additionally, it optimizes the Voronoi diagram-based ciphertext index structure. These improvements result in significant computational and communication savings compared to prior schemes, thereby enhancing overall performance and contributing to scalability and practical applicability in real-world scenarios.

1.3. Layout of This Paper

The remainder of this paper is organized as follows. Section 2 describes the system model and the associated security definitions. Section 3 covers the necessary preliminaries. Our main computation outsourcing protocol is detailed in Section 4. In Section 5, we analyze the correctness and security of the proposed protocol. Section 6 presents a theoretical efficiency analysis and a practical performance evaluation. Finally, we summarize our findings in Section 7.

2. System Architecture, Threat Models, and Design Goals

2.1. System Architecture and Threat Models

Following the setup of prior verifiable designs, our kNN query system over a cloud dataset, as illustrated in Figure 1, involves four entities: the data owner (DO), the query user (QU), and two cloud servers (CS₁ and CS₂). The DO possesses a large-scale spatial dataset

D = {D [i] = (i, P_{i}) | i \in [n]}

, and the QU issues a kNN query to the DO’s dataset. Due to limited storage and computing resources, the DO offloads the storage and associated computation tasks to the cloud servers while maintaining the privacy of the dataset. CS₁ and CS₂ are two non-colluding, resource-abundant, yet potentially untrusted cloud servers. To protect the privacy of the spatial dataset and support efficient kNN search operations, the DO must determine an appropriate blinding method and an efficient data structure to encrypt and organize the dataset

D

. Also, the design should ensure that the QU can verify the integrity and correctness of the cloud returned query result. Formally, the framework of the privacy-preserving publicly verifiable kNN query scheme, denoted as PVkNN = (Setup, DSEnc, QuEnc, Search, Verify, ResDec), consists of the following six algorithms:

${P K, S K} \leftarrow Setup (1^{λ}, n)$ : Given a security parameter $λ$ and dataset size n, this algorithm generates the public key $P K$ and the secret key $S K$ .
${ED} \leftarrow DSEnc (D, P K, S K)$ : On inputting the dataset $D$ , the public key $P K$ , and the secret key $S K$ , the DO encrypts the dataset $D$ into a ciphertext dataset $ED = {E D [i] | i \in I}$ , where $I$ denotes the label set.
${Q^{'}} \leftarrow QuEnc (Q, P K)$ : On inputting the query data point Q and the public key $P K$ , this algorithms generates the encrypted query point $Q^{'}$ .
${R^{'}, VO} \leftarrow Search (Q^{'}, ED, P K)$ : On inputting the encrypted query $σ_{i}$ , the encrypted dataset $ED$ , and the public key $P K$ , the search algorithm finds the query result $R^{'}$ and the associated proof $VO$ .
${δ} \leftarrow Verify (Q, R^{'}, VO, P K)$ : On inputting the encrypted query $Q^{'}$ , the query result $R^{'}$ , the proof $VO$ , and the key pair $(P K, S K)$ , the verification algorithm checks whether $R^{'}$ is authentic and complete, producing an output $δ \in {T r u e, F a l s e}$ .
${γ} \leftarrow ResDec (Q, R^{'}, S K, δ)$ : With the query Q, the query result $R$ , the secret key $S K$ and the indicator $δ$ , if $δ = T r u e$ , this algorithm decrypts the returned result $R^{'}$ and recovers the set $γ = R$ of k-nearest neighbors to Q. Otherwise, it rejects $R^{'}$ and outputs $γ = ⊥$ .

In this kNN query system, the security threat mainly comes from untrustworthy cloud servers or malicious queriers. In our design, we consider the following three threat behaviors: (1) the cloud servers are curious and may attempt to infer confidential information, such as the dataset, the query access pattern, and the query result; (2) the cloud servers are malicious and may deliberately tamper with the query result for financial incentives; and (3) the querier is malicious and may forge a false query result and attribute it to the cloud servers.

Remark 1.

It should be noted that the two-non-colluding cloud servers framework is reasonable in practice. For well-established cloud service providers such as Amazon AWS, Microsoft Azure, or Huawei Cloud, it is highly unlikely that any two of these companies would collude to damage their invaluable reputations. Also, this setup has been extensively employed in secure kNN query scenarios [3,4,5,7,8,9,22,25].

2.2. Design Goals

Considering our system architecture and threat models, we aim to design a secure and efficient kNN query protocol. Specifically, our design should meet the four requirements outlined below.

2.2.1. Correctness

Generally, correctness means that if the cloud servers perform the specified computation task, the design should ensure that an honest querier can finally obtain the k-nearest neighbors to his/her query data point. The formal definition is presented below.

Definition 1 (Correctness).

A

PV k NN

scheme is correct if, for any given security parameter λ,

{P K, S K}

\leftarrow Setup (1^{λ}, n)

,

{ED} \leftarrow DSEnc (D, P K, S K)

,

{Q^{'}} \leftarrow QuEnc (Q, P K)

, and

{R^{'}, VO} \leftarrow Search (Q^{'}, ED, P K)

are honestly performed, the probability

\begin{matrix} \Pr [{δ = T r u e} \leftarrow Verify (Q^{'}, R^{'}, VO, P K) \land {γ = R} \leftarrow ResDec (Q, R^{'}, S K, δ)] \geq 1 - negl (λ), \end{matrix}

where

negl (λ)

is a negligible function of λ.

2.2.2. Public Verifiability

In the context of verifiable kNN queries, verifiability refers to the ability to verify the authenticity and completeness of the query results returned by the cloud server. Specifically, this involves confirming whether the data in the query result was indeed uploaded by the DO and whether it corresponds precisely to the ciphertext of the k-nearest neighbors to the query point. Verification is considered private if it requires private keys, meaning that only the designated user who possesses the private keys can perform the verification [30,31]. Conversely, verification is public if it relies on public keys and can be performed by any entity [32,33,34]. Clearly, compared to private verifiability, public verifiability provides a more transparent verification mechanism. When disputes arise between the querier and the cloud server regarding the query results, public verifiability ensures the prompt resolution of such conflicts. Here, we define public verifiability using a private-key-independent verification algorithm called Verify, which guarantees that the probability of a cloud server successfully deceiving a verifier into accepting an incorrect result—either inauthentic or incomplete—is negligible. The formal definition is presented below.

{Exp}_{A}^{PV} [\prod, λ, n] :

{P K, S K} \leftarrow Setup (1^{λ}, n)

{ED} \leftarrow DSEnc (D, P K, S K)

For a = 0 to t - 1 :

Q_{a} \leftarrow A (P K, ED, Q_{0}, \dots, Q_{a - 1})

Q_{a}^{'} \leftarrow QuEnc (Q_{a}, P K)

{R_{a}, {VO}_{a}} \leftarrow Search (Q_{a}^{'}, E D, P K)

;

Q \leftarrow A (P K, ED, {(Q_{a}, Q_{a}^{'}, R_{a}^{'}, {VO}_{a})}_{0 \leq a \leq t - 1});

Q^{'} \leftarrow QuEnc (Q, P K)

{R^{'}, VO} \leftarrow A (P K, ED, Q, Q^{'}, {(Q_{a}, Q_{a}^{'}, R_{a}, {VO}_{a})}_{0 \leq a \leq t - 1})

;

{δ} \leftarrow Verify (Q, R^{'}, VO, P K);

{γ} \leftarrow ResDec (Q, R^{'}, S K, δ);

If δ = T r u e and γ \neq R :

output 1;

else

output 0;

Definition 2 (Public Verifiability).

Let

Π

be a

PV k NN

protocol, and let

A (•)

be a probabilistic polynomial-time (PPT) machine. We say that protocol

Π

is publicly verifiable if

A d v_{A}^{PV} (Π, λ, n) \leq n e g l (λ),

where

A d v_{A}^{PV} (Π, λ, n) = \Pr [{Exp}_{A}^{PV} [Π, λ, n] = 1]

.

2.2.3. Privacy

Our design aims to protect the privacy of the following information: the dataset, the query data, the query result, and query access pattern. The following privacy aspects are considered:

Dataset privacy. Cloud servers cannot obtain any valuable information about the plaintext data in the dataset $D$ .
Query data privacy. The QU’s query data should not be revealed to cloud servers.
Query result privacy. Apart from the QU, other participants cannot learn the plaintext query result.
Access pattern privacy. The identifiers corresponding to the k-nearest neighbors of the query point should not be revealed to the QU and the cloud servers (to prevent any inference attacks) [8,9,22].

In our analysis, we establish the privacy of our design by demonstrating the indistinguishability between the simulation model and the real model.

2.2.4. Efficiency

High efficiency is a generic requirement for any practical protocol. Under the premise of ensuring security, the design should reduce each participant’s computational and communication overheads as much as possible. Specifically, we should design an efficient index structure to support frequent query operations.

3. Preliminaries

3.1. Notations

For ease of description, we describe the frequently used notations in Table 2.

3.2. Permutation

Given a natural number

m > 0

, mathematically, a permutation

ρ

is a bijection over the set

{0, 1, \dots, m - 1}

. Generally, it can be represented as

(\begin{matrix} 0 & 1 & \dots & m - 1 \\ ρ (0) & ρ (1) & \dots & ρ (m - 1) \end{matrix}),

where

(ρ (0), ρ (1), \dots, ρ (m - 1))

is some rearrangement of

(0, 1, \dots, m - 1)

. For some m-tuple

v = (v_{0}, v_{1}, \dots, v_{m - 1})

, we define

ρ (v) = (v_{ρ (0)}, v_{ρ (1)}, \dots, v_{ρ (m - 1)})

.

3.3. Paillier’s Additively Homomorphic Cryptosystem

The well-known Paillier additively homomorphic cryptosystem [35] was proposed by Paillier in 1999. For some message

m \in Z_{N}

, its encryption function E is denoted as

c = E (m, r) = {(1 + N)}^{m} \times r^{N} \mod N^{2},

where N, which is public, is the product of two large prime numbers p and q, and

r \in Z_{N}^{★}

is randomly chosen. The decryption function D with the secret key

s k = λ (N) = L C M (p - 1, q - 1)

is

m = \frac{c^{λ (N)} \mod N^{2} - 1}{N} \cdot λ {(N)}^{- 1} \mod N .

The Paillier cryptographic system exhibits the following properties:

Homomorphic addition: The decrypted product of two ciphertexts equals the sum of their corresponding plaintexts, and the decrypted kth power of a ciphertext equals the product of k and its corresponding plaintext.

$\begin{matrix} D (E (x_{0}) E (x_{1}) \mod N^{2}) = x_{0} + x_{1} \mod N, \\ D (E {(x)}^{k} \mod N^{2}) = k x \mod N . \end{matrix}$
Semantic security: If the decisional composite residuosity problem is hard, then the Paillier cryptosystem is $CPA$ -secure.

3.4. Digital Signature Algorithm DSA

The Digital Signature Algorithm (DSA) is a standard algorithm for digital signatures and is a public-key cryptosystem based on the discrete logarithm problem [36]. For two large prime numbers p and q satisfying

q ∣ p - 1

, let

g \in Z_{p}^{★}

be an element of order q, and

y = g^{x} \mod p

for some randomly chosen

x \in Z_{q}^{★}

. Then, the signature key is

s k = {x}

, and the verification key is

p k = {y}

. For a message m, its signature is

Sig (m) = (r, s)

, where, for some randomly chosen

k \in Z_{q}^{★}

,

\begin{matrix} r = (g^{k} \mod p) \mod q, s = k^{- 1} (H (m) + x r) \mod q, \end{matrix}

and

H (\cdot)

is a public hash function. Given

(m, r, s)

, the verification algorithm

{0, 1} \leftarrow Ver (m, r, s)

proceeds as follows: (1) Verify

r, s \in Z_{q}^{★}

. (2) Calculate

w = s^{- 1} \mod q

,

u_{1} = w H (m) \mod q

,

u_{2} = w r \mod q

, and

v = (g^{u_{1}} y^{u_{2}} \mod p) \mod q

. (3) The algorithm outputs “1” (the signature is valid) if

v = r

. Otherwise, it outputs “0”.

3.5. Voronoi Diagram

Given a dataset

D

consisting of n points

{P_{0}, \dots, P_{n - 1}}

in the plane, the Voronoi region

V o r (P_{i})

of the point

P_{i} \in D

refers to the set of all points in the plane closer to

P_{i}

than to any other point in

D

. Precisely,

V o r (P_{i}) = {x \in R^{2} | ∥ x - P_{i} ∥ \leq ∥ x - P_{j} ∥, \forall j \neq i} .

(1)

Equivalently, for any

j \neq i

, let

H_{i j} = {x \in R^{2} | ∥ x - P_{i} ∥ \leq ∥ x - P_{j} ∥}

denote the half-plane, and we have

V o r (P_{i}) = \cap_{j \neq i} H_{i j}

. The point

P_{j}

is called a Voronoi-relevant vector of

P_{i}

if and only if

V o r (P_{j})

and

V o r (P_{i})

share a boundary line. The set of these Voronoi-relevant vectors is generally denoted as

V R V (P_{i})

.

Lemma 1

([37]). Given a dataset

D = {P_{0}, \dots, P_{n - 1}}

and a query point Q, the nearest neighbor of the query point Q is P if and only if

Q \in V o r (P)

.

Lemma 2

([38]). Let

P_{1}, P_{2}, \dots, P_{k}

be the k

(k \geq 2)

-nearest neighbors of a given query point Q. Then,

P_{k} \in V R V (P_{1}) \cup V R V (P_{2}) \cup \dots \cup V R V (P_{k - 1})

.

3.6. Data Packing with Paillier’s Cryptosystem

For a large-scale dataset, to encrypt and decrypt each data point item by item is time-consuming. To improve encryption/decryption efficiency, Liu et al. [39] introduced a data-packing encryption/decryption technique. Here, we present their design with Paillier’s cryptosystem. For

λ

σ

-bits integers

x_{1}, x_{2}, \dots, x_{λ}

with

2^{σ λ} < N

, unlike traditional methods that encrypt them into

E (x_{1}), E (x_{2}), \dots, E (x_{λ})

item by item, the data-packing technique encrypts them into a single ciphertext, which contains the following four algorithms:

Data packing $〈 x_{1} | x_{2} | \dots | x_{λ} 〉 \leftarrow Pack (x_{1}, \dots, x_{λ})$ :
$〈 x_{1} | x_{2} | \dots | x_{λ} 〉 = \sum_{i = 1}^{λ} x_{i} 2^{σ (λ - i)} .$
Data encryption $c \leftarrow DataEnc (〈 x_{1} | x_{2} | \dots | x_{λ} 〉)$ :
$c = E (x_{1} | x_{2} | \dots | x_{λ}) = E (\sum_{i = 1}^{λ} x_{i} 2^{σ (λ - i)}) .$
Data decryption $x \leftarrow DataDec (c)$ :
$x = D (c) .$
Data unpacking $(x_{1}, x_{2}, \dots, x_{λ}) \leftarrow Unpack (x)$ : $x_{λ} = {lsb}_{σ} (x), x_{λ - i} = {lsb}_{(i σ + 1) \to (i + 1) σ} (x)$ for $i = 1, \dots, λ - 1,$ where ${lsb}_{(i σ + 1) \to (i + 1) σ} (x)$ denotes the binary bits between the $(i σ + 1)$ th position and the $(i + 1) σ$ th position of x counting from the least significant bit.

The correctness of the data decryption follows the fact that, since

2^{λ σ} < N

,

\begin{matrix} D (c) = D (E (\sum_{i = 1}^{λ} x_{i} 2^{σ (λ - i)})) = \sum_{i = 1}^{λ} x_{i} 2^{σ (λ - i)} \mod N = \sum_{i = 1}^{λ} x_{i} 2^{σ (λ - i)} = x . \end{matrix}

The condition

2^{λ σ} < N

is critical to guarantee the equivalence

\sum_{i = 1}^{λ} x_{i} \cdot 2^{σ (λ - i)} \mod N = \sum_{i = 1}^{λ} x_{i} \cdot 2^{σ (λ - i)},

as it ensures that the summation remains strictly smaller than the modulus N, thereby avoiding modular reduction. Also, if we know the ciphertext

E (x_{i})

, we can get the ciphertext

E (x_{1} | x_{2} | \dots | x_{λ})

by using the homomorphic property, i.e.,

\begin{matrix} E (x_{1} | \dots | x_{λ}) = E (\sum_{i = 1}^{λ} x_{i} 2^{σ (λ - i)}) = \prod_{i = 1}^{λ} E {(x_{i})}^{2^{σ (λ - i)}} \mod N^{2} . \end{matrix}

4. Our Main Design

4.1. Design Intuition and Basic Idea

Inspired by [8,9], we adopt Paillier’s homomorphic cryptosystem to preserve the privacy of the dataset and the query data, as well as the “baby-step–giant-step” strategy to construct an efficient index data structure. Specifically, the spatial dataset is initially divided into several large rectangular regions, and each rectangle is further represented using a refined Voronoi diagram-based index. During the kNN search, we first perform a giant step to locate the target rectangular region, followed by a baby step to traverse the Voronoi diagram-based index and identify the individual points. To enable efficient search operations over the ciphertext dataset, we develop a series of secure protocols by integrating Paillier’s homomorphic encryption algorithm with the data-packing techniques, building upon previous designs. These protocols include the Secure Division Computation (SDC), Secure Grid Computation (SGC), Secure Nearest-Neighbor (SNN) search, and Secure Voronoi Cell Read (SCR) protocols, collectively enabling secure kNN queries. Since Paillier’s cryptosystem operates in a ring modulo a large integer and the data-packing technique is sensitive to parameter selection, we refine and tighten the previous designs to ensure correctness and optimize computational efficiency. This involves optimizing the index structure, improving parameter selection strategies, and addressing ambiguous or inaccurate descriptions. More importantly, previous designs lack support for public verifiability. In their approaches, the DO needs to send a secret verification key to the QU through a secure channel, or an additional Certified Authority (CA) is needed to generate and distribute keys through secure channels. To avoid the above-mentioned strong security assumption, as well as to prevent a malicious querier from forging a false query result and attributing it to the cloud servers, we integrate a public-key signature scheme with the Voronoi diagram-based index. This ensures both the authenticity and completeness of the query results while supporting public verifiability.

4.2. Our Main Outsourcing Protocol

4.2.1. DO Dataset Preprocessing Stage

This stage, carried out by the DO, is a one-time process. Assume that the spatial dataset

D = {(i, P_{i}) | i \in [n]}

, where each element comprises an identifier (

i d

) i, and its corresponding point

P_{i} = (x_{i}, y_{i}) \in R^{2}

. With loss of generality, we bound

x_{i} \in [0, X]

and

y_{i} \in [0, Y]

for any

i \in [n]

, and

\max {X, Y, n - 1} < 2^{σ}

. The aim of this stage is to construct two indexes (

I_{1}

and

I_{2}

) with the following steps:

Given a natural number m of appropriate size, the DO divides the region $[0, X] \times [0, Y]$ into $m^{2}$ small rectangular regions $S_{s t} = [\frac{X}{m} s, \frac{X}{m} (s + 1)] \times [\frac{Y}{m} t, \frac{Y}{m} (t + 1)]$ for $s, t \in [m]$ .
For each point $P_{i}$ , the DO finds its Voronoi-relevant vectors

$\begin{matrix} V R V (P_{i}) = {(i_{1}, P_{i_{1}}), (i_{2}, P_{i_{2}}), \dots, (i_{ℓ_{i}}, P_{i_{ℓ_{i}}})} . \end{matrix}$
The DO constructs the index $I_{1} = {G_{s t} | s, t \in [m]}$ , where $G_{s t} = {(s t_{1}, P_{s t_{1}}), (s t_{2}, P_{s t_{2}}), \dots,$ $(s t_{λ_{s t}}, P_{s t_{λ_{s t}}})}$ refers to a subset of $D$ and, for any point $P_{s t_{u}} \in G_{s t}$ , $V o r (P_{s t_{u}}) \cap S_{s t} \neq \emptyset$ .
The DO constructs the index $I_{2} = {〈 i, P_{i}, V R V (P_{i}) 〉 | i \in [n]}$ .

4.2.2. System Setup Stage: $Setup$

This stage is also performed by the DO and is a one-time task. Given a security parameter

κ

, the DO first invokes the key-generation algorithm in Paillier’s homomorphic cryptosystem to generate two random large prime numbers p and q with the same bit length

κ

and calculates

N = p q, λ (N) = L C M (p - 1, q - 1)

. Then, the DO generates the signature-verification key pair

(s i g k, v e r k)

for the DSA. Finally, the DO publishes the public key

P K = {p k_{1} = N, p k_{2} = v e r k}

and keeps the private key

S K = {s k_{1} = λ (N), s k_{2} = s i g k}

secret.

4.2.3. Dataset Encryption Stage: $DSEnc$

This stage is performed by the DO and is a one-time task. As shown in the example illustrated in Figure 2, with the public key

p k_{1}

for encryption and

s k_{2}

for signature, the DO takes

w = ⌈ \sqrt{n} ⌉

and encrypts the dataset

D

into

ED = (E (I_{1}), E (I_{2}))

with

\begin{matrix} E (I_{1}) = \{E (G_{s t}) | s, t \in [m]\}, E (I_{2}) = \{B_{j} | j = 0, 1, \dots, ⌈\frac{n}{w}⌉ - 1\} . \end{matrix}

Here, for

E (I_{1})

,

\begin{matrix} E (G_{s t}) = E (i_{1}^{(s t)} | P_{i_{1}^{(s t)}} | i_{2}^{(s t)} | P_{i_{2}^{(s t)}} | \dots | i_{λ}^{(s t)} | P_{i_{λ}^{(s t)}}) = E (i_{1}^{(s t)} | x_{i_{1}^{(s t)}} | y_{i_{1}}^{(s t)} | i_{2}^{(s t)} | x_{i_{2}^{(s t)}} | y_{i_{2}^{(s t)}} | \dots | i_{λ}^{(s t)} | x_{i_{λ}^{(s t)}} | y_{i_{λ}^{(s t)}}) \end{matrix}

with

P_{i_{u}^{(s t)}} = (x_{i_{u}^{(s t)}}, y_{i_{u}^{(s t)}})

for

u = 1, \dots, λ

and

λ = \max_{0 \leq s, t \leq m - 1} {λ_{s t}}

. Specifically, if the number of points contained in some grid is less than

λ

, we can pad several random points chosen from

D

. For

E (I_{2})

,

\begin{matrix} B_{j} & = {(E (j w), E (V R V (P_{j w})), E (S i g (P_{j w}))), \dots, \\ (E ((j + 1) w - 1), E (V R V (P_{(j + 1) w - 1})), E (S i g (P_{(j + 1) w - 1})))}, \\ B_{⌈ \frac{n}{w} ⌉ - 1} & = {(E ((⌈ \frac{n}{w} ⌉ - 1) w), E (V R V (P_{(⌈ \frac{n}{w} ⌉ - 1) w})), E (S i g (P_{(⌈ \frac{n}{w} ⌉ - 1) w}))), \dots, \\ (E ((⌈ \frac{n}{w} ⌉) w - 1), E (V R V (P_{(⌈ \frac{n}{w} ⌉) w - 1})), E (S i g (P_{(⌈ \frac{n}{w} ⌉) w - 1})))}, \end{matrix}

are

⌈ \frac{n}{w} ⌉

buckets, each bucket consisting of w three-tuples. Specifically, for the last bucket

B_{⌈ \frac{n}{w} ⌉ - 1}

, we pad the subscripts

n, n + 1, \dots, ⌈ \frac{n}{w} ⌉ - 1

with some random data points if it contains no more than w three-tuples. Moreover, for any

i \in [n]

,

E (V R V (P_{i})) = E (i_{1} | P_{i_{1}} | i_{2} | P_{i_{2}} | \dots | i_{L} | P_{i_{L}}), E (S i g (P_{i})) = E (Sig (P_{i} | | P_{i_{1}} | | \dots | | P_{i_{L}}))

. Similarly, if the number of points contained in some

V R V (P_{i})

is less than

L = \max {ℓ_{0}, \dots, ℓ_{n - 1}}

, we can pad several random points chosen from

D

. Finally, the DO uploads

(ED, E (X / m), E (Y / m))

to CS₁ and the private key

s k_{1}

to CS₂.

4.2.4. QU Query Encryption Stage: $QuEnc$

This stage is performed by the QU. When the QU issues a kNN query request

Q = (x_{q}, y_{q})

, he/she encrypts it into

E (Q) = (E (x_{q}), E (y_{q}))

with the public key

p k_{1}

and then queries CS₁ with

E (Q)

.

4.2.5. CS Search Stage: $Search$

This stage is cooperatively performed by the two non-colluding cloud servers CS₁ and CS₂ and is a query-based one-time operation. Algorithm 1 presents the details. In general, this stage proceeds as follows:

Algorithm 1 kNN (

E_{p k} (I), E_{p k} (X / m), E_{p k} (Y / m), E (Q), p k_{1}, s k_{1}

)

Input:

{CS}_{1} : E_{p k} (I) = (E (I_{1}), E (I_{2})), E_{p k} (X / m), E_{p k} (Y / m)), E (Q), p k,

CS₂: the public-private key pair

(p k_{1}, s k_{1})

Output: Result sets

R_{1}

,

R_{2}

, and verification sets

{VO}_{1}

,

{VO}_{2}

1:: CS₁ initializes two empty sets $R_{1} \leftarrow \emptyset$ , ${VO}_{1} \leftarrow \emptyset$ , CS₂ initializes two empty sets $R_{2} \leftarrow \emptyset$ , ${VO}_{2} \leftarrow \emptyset$
2:: CS₁ and CS₂ jointly perform Algorithm 2: $(E (⌊ \frac{x_{q}}{X / m} ⌋), E (⌊ \frac{y_{q}}{Y / m} ⌋)) \leftarrow SDC (E (Q), (E (X / m), E (Y / m)), p k_{1}, s k_{1})$
3:: CS₁ and CS₂ jointly perform Algorithm 3: $E (G_{\hat{s} \hat{t}}) \leftarrow$ SGC( $E (I_{1}), E (⌊ \frac{x_{q}}{X / m} ⌋), E (⌊ \frac{y_{q}}{Y / m} ⌋), p k_{1}, s k_{1}$ )
4:: ${CS}_{1}$ and CS₂ jointly perform Algorithm 4: $(E (i d_{\min}^{(1)}), E (x_{i d_{\min}^{(1)}}), E (y_{i d_{\min}^{(1)}})) \leftarrow$ SNN( $E (G_{\hat{s} \hat{t}}), E (Q), ⊥, p k_{1}, s k_{1})$
5:: CS₁ generates three random number $r_{1}^{(1)}, r_{2}^{(1)}, r_{3}^{(1)} \in Z_{N}$ and updates $R_{1} \leftarrow R_{1} \cup {(r_{2}^{(1)}, r_{3}^{(1)})}$
6:: CS₁ calculates $(E (i d_{\min}^{(1)'}), E (x_{i d_{\min}^{(1)}}^{'}), E (y_{i d_{\min}^{(1)}}^{'})) \leftarrow (E (i d_{\min}^{(1)}) \times E (r_{1}^{(1)}), E (x_{i d_{\min}^{(1)}}) \times E (r_{2}^{(1)}), E (y_{i d_{\min}^{(1)}}) \times E (r_{3}^{(1)}))$ and sends it to ${CS}_{2}$
7:: ${CS}_{2}$ decrypts $(i d_{\min}^{(1)'}, x_{i d_{\min}^{(1)}}^{'}, y_{i d_{\min}^{(1)}}^{'}) \leftarrow (D (E (i d_{\min}^{(1)'})), D (E (x_{i d_{\min}^{(1)}}^{'})), D (E (y_{i d_{\min}^{(1)}}^{'})))$ and updates $R_{2} \leftarrow R_{2} \cup {(x_{i d_{\min}^{(1)}}^{'}, y_{i d_{\min}^{(1)}}^{'})}$
8:: ${CS}_{1}$ and CS₂ jointly perform Algorithm 5: $(E (V R V (P_{i d_{\min}^{(1)}}), E (S i g (P_{i d_{\min}^{(1)}}))) \leftarrow$ SCR( $E (I_{2}), E (i d_{\min}^{(1)}), p k_{1}, s k_{1})$
9:: CS₁ generates $3 L + 1$ random numbers $r_{4, 1}^{(1)}, r_{4, 2}^{(1)}, \dots, r_{4, 3 L}^{(1)}, r_{5}^{(1)} \in Z_{N}$ , packs $r_{4, 1}^{(1)} \sim r_{4, 3 L}^{(1)}$ into $r_{0}^{(1)} = 〈 r_{4, 1}^{(1)} | r_{4, 2}^{(1)} | \dots | r_{4, 3 L}^{(1)} 〉$ and updates ${VO}_{1} \leftarrow {VO}_{1} \cup {((r_{4, 2}^{(1)}, r_{4, 3}^{(1)}), \dots, (r_{4, 3 k - 1}^{(1)}, r_{4, 3 k}^{(1)}), \dots, (r_{4, 3 L - 1}^{(1)}, r_{4, 3 L}^{(1)}), r_{5}^{(1)})}$
10:: CS₁ calculates $E (V R V {(P_{i d_{\min}^{(1)}})}^{'}) \leftarrow E (V R V (P_{i d_{\min}^{(1)}})) \times E (r_{0}^{(1)}), E (S i g {(P_{i d_{\min}^{(1)}})}^{'}) \leftarrow E (S i g (P_{i d_{\min}^{(1)}})) \times E (r_{5}^{(1)})$ and sends them to ${CS}_{2}$
11:: ${CS}_{2}$ decrypts $(V R V {(P_{i d_{\min}^{(1)}})}^{'}, S i g {(P_{i d_{\min}^{(1)}})}^{'}) \leftarrow (D (E (V R V {(P_{i d_{\min}^{(1)}})}^{'})), D (E (S i g {(P_{i d_{\min}^{(1)}})}^{'})))$
12:: ${CS}_{2}$ unpacks ${{i d_{1}^{(1)}}^{'}, P_{{i d_{1}^{(1)}}^{'}}^{'}, \dots, {i d_{L}^{(1)}}^{'}, P_{{i d_{L}^{(1)}}^{'}}^{'}} \leftarrow Unpack (V R V {(P_{i d_{\min}^{(1)}})}^{'})$ , where $P_{{i d_{1}^{(1)}}^{'}}^{'} = (x_{{i d_{1}^{(1)}}^{'}}^{'}, y_{{i d_{1}^{(1)}}^{'}}^{'}),$ $\dots,$ $P_{{i d_{L}^{(1)}}^{'}}^{'}$ $= (x_{{i d_{L}^{(1)}}^{'}}^{'}, y_{{i d_{L}^{(1)}}^{'}}^{'})$ .
13:: ${CS}_{2}$ updates ${VO}_{2} \leftarrow {VO}_{2} \cup {(P_{{i d_{1}^{(1)}}^{'}}^{'}, \dots, P_{{i d_{L}^{(1)}}^{'}}^{'}, S i g {(P_{i d_{\min}^{(1)}})}^{'})}$
14:: for $j = 2$ to k do
15:: ${CS}_{1}$ and CS₂ jointly perform Algorithm 4: $(E (i d_{\min}^{(j)}), E (x_{i d_{m i n}^{(j)}}), E (y_{i d_{\min}^{(j)}})) \leftarrow$ SNN $({E (V R V (P_{i d_{\min}^{(1)}})), \dots,$ $E (V R V (P_{i d_{\min}^{(j - 1)}})}, E (Q), {E (i d_{\min}^{(1)}), \dots, E (i d_{\min}^{(j - 1)})}, p k_{1}, s k_{1})$
16:: ${CS}_{1}$ generate three random numbers $r_{1}^{(j)}, r_{2}^{(j)}, r_{3}^{(j)} \in Z_{N}$ and updates $R_{1} \leftarrow R_{1} \cup {(r_{2}^{(j)}, r_{3}^{(j)})}$
17:: CS₁ calculates $(E (i d_{\min}^{(j)'}), E (x_{i d_{\min}^{(j)}}^{'}), E (y_{i d_{\min}^{(j)}}^{'})) \leftarrow (E (i d_{\min}^{(j)}) \times E (r_{1}^{(j)}), E (x_{i d_{\min}^{(j)}}) \times E (r_{2}^{(j)}), E (y_{i d_{\min}^{(j)}}) \times E (r_{3}^{(j)}))$ and sends it to ${CS}_{2}$
18:: ${CS}_{2}$ decrypts $(i d_{\min}^{(j)'}, x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'}) \leftarrow (D (E (i d_{\min}^{(j)'})), D (E (x_{i d_{\min}^{(j)}}^{'})), D (E (y_{i d_{\min}^{(j)}}^{'})))$ and updates $R_{2} \leftarrow R_{2} \cup {(x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'})}$
19:: ${CS}_{1}$ and ${CS}_{2}$ jointly perform Algorithm 5: $(E (V R V (P_{i d_{\min}^{(j)}}), E (S i g (P_{i d_{\min}^{(j)}}))) \leftarrow$ SCR( $E (I_{2}), E (i d_{\min}^{(j)}), p k_{1}, s k_{1})$
20:: ${CS}_{1}$ generates $3 L + 1$ random numbers $r_{4, 1}^{(j)}, r_{4, 2}^{(j)}, \dots, r_{4, 3 L}^{(j)}, r_{5}^{(j)} \in Z_{N}$ , packs $r_{4, 1}^{(j)} \sim r_{4, 3 L}^{(j)}$ into $r_{0}^{(j)} = 〈 r_{4, 1}^{(j)} | r_{4, 2}^{(j)} | \dots | r_{4, 3 L}^{(j)} 〉$ and updates ${VO}_{1} \leftarrow {VO}_{1} \cup {(r_{4, 2}^{(j)}, r_{4, 3}^{(j)}, \dots, r_{4, 3 L}^{(j)}, r_{5}^{(j)})}$
21:: ${CS}_{1}$ calculates $E (V R V {(P_{i d_{\min}^{(j)}})}^{'}) \leftarrow E (V R V (P_{i d_{\min}^{(j)}})) \times E (r_{0}^{(j)})$ , $E (S i g {(P_{i d_{\min}^{(j)}})}^{'}) \leftarrow E (S i g (P_{i d_{\min}^{(j)}})) \times E (r_{5}^{(j)})$ and sends them to ${CS}_{2}$
22:: ${CS}_{2}$ decrypts $(V R V {(P_{i d_{\min}^{(j)}})}^{'}, S i g {(P_{i d_{\min}^{(j)}})}^{'}) \leftarrow (D (E (V R V {(P_{i d_{\min}^{(j)}})}^{'})), D (E (S i g {(P_{i d_{\min}^{(j)}})}^{'})))$
23:: ${CS}_{2}$ unpacks ${{i d_{1}^{(j)}}^{'}, P_{{i d_{j}^{(1)}}^{'}}^{'}, \dots, {i d_{L}^{(j)}}^{'}, P_{{i d_{L}^{(j)}}^{'}}^{'}} \leftarrow Unpack (V R V {(P_{i d_{\min}^{(j)}})}^{'})$ , where $P_{{i d_{1}^{(j)}}^{'}}^{'} = (x_{{i d_{1}^{(j)}}^{'}}^{'}, y_{{i d_{1}^{(j)}}^{'}}^{'}), \dots,$ $P_{{i d_{L}^{(j)}}^{'}}^{'} =$ $(x_{{i d_{L}^{(j)}}^{'}}^{'}, y_{{i d_{L}^{(j)}}^{'}}^{'})$
24:: ${CS}_{2}$ updates ${VO}_{2} \leftarrow {VO}_{2} \cup {(P_{{i d_{1}^{(j)}}^{'}}^{'}, \dots, P_{{i d_{L}^{(j)}}^{'}}^{'}, S i g {(P_{i d_{\min}^{(j)}})}^{'})}$ .
25:: ${CS}_{1}$ sends $R_{1} = {(r_{2}^{(j)}, r_{3}^{(j)}) | j = 1, \dots, k}$ and ${VO}_{1} = {((r_{4, 2}^{(j)}, r_{4, 3}^{(j)}), \dots, (r_{4, 3 L - 1}, r_{4, 3 L}^{(j)}), r_{5}^{(j)}) | j = 1, \dots, k}$ to QU
26:: ${CS}_{2}$ sends $R_{2} = {(x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'}) | j = 1, \dots, k}$ and ${VO}_{2} = {(P_{{i d_{1}^{(j)}}^{'}}^{'}, \dots, P_{{i d_{L}^{(j)}}^{'}}^{'}, S i g {(P_{i d_{\min}^{(j)}})}^{'}) | j = 1, \dots, k}$ to QU

(1): With the inputs $E (Q) = (E (x_{q}), E (y_{q}))$ , $(E (X / m)$ , and $E (Y / m))$ , and the public–private key pair $(p k_{1}, s k_{1})$ , cloud servers ${CS}_{1}$ and ${CS}_{2}$ interact with each other to calculate the ciphertexts $E (⌊ \frac{x_{q}}{X / m} ⌋)$ and $E (⌊ \frac{y_{q}}{Y / m} ⌋)$ using Algorithm 2.
(2): With the inputs $E (I_{1}), E (⌊ \frac{x_{q}}{X / m} ⌋), E (⌊ \frac{y_{q}}{Y / m} ⌋), p k_{1},$ and $s k_{1}$ , cloud servers ${CS}_{1}$ and ${CS}_{2}$ collaborate to execute Algorithm 3, traversing $E (I_{1})$ to locate the target grid $E (G_{\hat{s} \hat{t}})$ with $\hat{s} = ⌊ \frac{x_{q}}{X / m} ⌋, \hat{t} = ⌊ \frac{y_{q}}{Y / m} ⌋$ , which means that $Q \in G_{\hat{s} \hat{t}}$ . The core idea of this algorithm is to determine $(\hat{s}, \hat{t})$ based on verifying whether both $E (⌊ \frac{x_{q}}{X / m} ⌋) \times {(E (s))}^{N - 1} = E (⌊ \frac{x_{q}}{X / m} ⌋ - s)$ and $E (⌊ \frac{y_{q}}{Y / m} ⌋) \times {(E (t))}^{N - 1} = E (⌊ \frac{y_{q}}{Y / m} ⌋ - t)$ represent ciphertexts of 0 for some $s, t \in 0, \dots, m - 1$ . Note that $⌊ \frac{x_{q}}{X / m} ⌋ - s$ may be negative; the parameter T ensures that $E (⌊ \frac{x_{q}}{X / m} ⌋) \times {(E (s))}^{N - 1} \times E (T) = E (⌊ \frac{x_{q}}{X / m} ⌋ - s + T)$ ensures that the results represents a ciphertext of a positive number.

Algorithm 2 SDC (

E (Q), (E (X / m), E (Y / m)), p k_{1}, s k_{1}

)

Input:

{CS}_{1}

:

E (Q) = (E (x_{q}), E (y_{q}))

,

(E (X / m), E (Y / m))

and a security parameter

τ < κ - \frac{σ}{2} - 1

,

{CS}_{2}

:

p k_{1}, s k_{1}

.

Output:

{CS}_{1}

obtains the quotients

E (⌊ \frac{x_{q}}{X / m} ⌋)

and

E (⌊ \frac{y_{q}}{Y / m} ⌋)

1:: ${CS}_{1}$ generates two random numbers $r_{1}, r_{2} \in Z_{N}$ with $τ$ bits
2:: ${CS}_{1}$ calculates $x^{'} \leftarrow E {(x_{q})}^{r_{1}} \times E {(X / m)}^{r_{1} r_{2}}$ , $u^{'} \leftarrow E {(X / m)}^{r_{1}}$ and $y^{'} \leftarrow E {(y_{q})}^{r_{1}} \times E {(Y / m)}^{r_{1} r_{2}}$ , $v^{'} \leftarrow E {(Y / m)}^{r_{1}}$
3:: ${CS}_{1}$ sends $(x^{'}, y^{'})$ and $(u^{'}, v^{'})$ to ${CS}_{2}$
4:: ${CS}_{2}$ decrypts $(x, y) \leftarrow (D (x^{'}), D (y^{'}))$ and $(u, v) \leftarrow (D (u^{'}), D (v^{'}))$
5:: ${CS}_{2}$ calculates the quotients $d_{x} = ⌊ \frac{x}{u} ⌋$ and $d_{y} = ⌊ \frac{y}{v} ⌋$
6:: ${CS}_{2}$ encrypts $d_{x}^{'} \leftarrow E (d_{x})$ and $d_{y}^{'} \leftarrow E (d_{y})$ with $p k$
7:: ${CS}_{2}$ sends $(d_{x}^{'}, d_{y}^{'})$ to ${CS}_{1}$
8:: ${CS}_{1}$ calculates the quotients $E (⌊ \frac{x_{q}}{X / m} ⌋) = d_{x}^{'} \times E {(r_{2})}^{N - 1}$ and $E (⌊ \frac{y_{q}}{Y / m} ⌋) = d_{y}^{'} \times E {(r_{2})}^{N - 1}$

Algorithm 3 SGC (

E (I_{1}), E (⌊ \frac{x_{q}}{X / m} ⌋), E (⌊ \frac{y_{q}}{Y / m} ⌋), p k_{1}, s k_{1}

)

Input:

{CS}_{1}

:

E (I_{1}) = {E (G_{s t}) | s, t \in [m]},

E (⌊ \frac{x_{q}}{X / m} ⌋),

E (⌊ \frac{y_{q}}{Y / m} ⌋)

,

p k_{1}

and security parameters

τ, σ^{'}

satisfying

2^{τ + σ} < 2^{σ^{'}}, 2^{σ^{'} m} < N

,

{CS}_{2}

:

p k_{1}, s k_{1}, m

Output: the grid

E (G_{\hat{s} \hat{t}})

with

\hat{s} = ⌊ \frac{x_{q}}{X / m} ⌋

and

\hat{t} = ⌊ \frac{y_{q}}{Y / m} ⌋

1:

Δ_{x} \leftarrow \emptyset, Δ_{y} \leftarrow \emptyset, Γ^{'} \leftarrow \emptyset, M \leftarrow \emptyset

2: for

j = 0

to

m - 1

do

3:

{CS}_{1}

generates random numbers

r_{x j}, r_{y j} \in Z_{N}

with

τ

bits and a random number

T \geq m

4:

{CS}_{1}

calculates

Δ_{x j} \leftarrow {(E (⌊ \frac{x_{q}}{X / m} ⌋) \times E {(j)}^{N - 1} \times E (T))}^{r_{x j}}, Δ_{y j} \leftarrow {(E (⌊ \frac{y_{q}}{Y / m} ⌋) \times E {(j)}^{N - 1} \times E (T))}^{r_{y j}}

5:

Δ_{x} \leftarrow (Δ_{x 0}, Δ_{x 1}, \dots, Δ_{x (m - 1)}), Δ_{y} \leftarrow (Δ_{y 0}, Δ_{y 1},

\dots,

Δ_{y (m - 1)})

6: for

s = 0

to

m - 1

do

7: for

t = 0

to

m - 1

do

8:

{CS}_{1}

generates a random numbers

r_{s t}

and calculates

E (r_{s t})

9:

{CS}_{1}

computes

E (G_{s t}^{'}) \leftarrow E (G_{s t}) \times E (r_{s t})

10:

E (G^{'}) \leftarrow {(E (G_{s t}^{'}))}_{0 \leq s, t \leq m - 1}

11:

{CS}_{1}

permutes the vectors

Δ_{x}, Δ_{y}

and grid

E (G^{'})

with two random permutations

ρ_{1}, ρ_{2}

:

12:

Δ_{x}^{'} = ρ_{1} (Δ_{x}) = (Δ_{x ρ_{1} (0)}, Δ_{x ρ_{1} (1)}, \dots, Δ_{x ρ_{1} (m - 1)}),

Δ_{y}^{'} = ρ_{2} (Δ_{y}) = (Δ_{y ρ_{2} (0)}, Δ_{y ρ_{2} (1)}, \dots, Δ_{y ρ_{2} (m - 1)}),

13:

Γ^{'} = ρ_{2} (ρ_{1} (E (G^{'}))) = {(E (G_{ρ_{1} (s) ρ_{2} (t)}^{'}))}_{0 \leq s, t \leq m - 1}

14:

{CS}_{1}

packs

Δ_{x}^{'}, Δ_{y}^{'}

, i.e., calculates

v_{x}^{'} \leftarrow \prod_{i = 0}^{m - 1} Δ_{x ρ_{1} (i)}^{2^{σ^{'} (m - (i + 1))}}, v_{y}^{'} \leftarrow \prod_{i = 0}^{m - 1} Δ_{y ρ_{2} (i)}^{2^{σ^{'} (m - (i + 1))}}

and sends

v_{x}^{'}, v_{y}^{'}, Γ^{'}, T

to

{CS}_{2}

15:

{CS}_{2}

decrypts and unpacks

v_{x}^{'}

and

v_{y}^{'}

:

(v_{x ρ_{1} (0)},

\dots,

v_{x ρ_{1} (m - 1)}) \leftarrow D (v_{x}^{'}), (v_{y ρ_{2} (0)},

\dots, v_{y ρ_{2} (m - 1)}) \leftarrow D (v_{y}^{'})

16: for

s = 0

to

m - 1

do

17: for

t = 0

to

m - 1

do

18: if

v_{x ρ_{1} (s)} \mod T = 0

and

v_{y ρ_{2} (t)} \mod T = 0

19:

20:

Γ^{'} \leftarrow Γ_{ρ_{1} (s) ρ_{2} (t)}^{'} = E (G_{ρ_{1} (s) ρ_{2} (t)}^{'}), M_{s t} \leftarrow E (1)

21:

22: else

M_{s t} \leftarrow E (0)

23:

{CS}_{2}

sends

Γ^{'}

and

M = {(M_{s t})}_{0 \leq s, t \leq m - 1}

to

{CS}_{1}

24:

{CS}_{1}

permutes the matrix M with

ρ_{1}, ρ_{2}

:

M^{'} \leftarrow ρ_{1}^{- 1} (ρ_{2}^{- 1} (M))

25:

{CS}_{1}

calculates

E (r) \leftarrow \prod_{s = 0}^{m - 1} \prod_{t = 0}^{m - 1} {(M_{s t}^{'})}^{r_{s t}}

26:

{CS}_{1}

gets the target grid

E (G_{\hat{s} \hat{t}})

containing the query point:

E (G_{\hat{s} \hat{t}}) \leftarrow Γ^{'} \times E {(r)}^{N - 1}

(3): After identifying the correct grid $E (G_{\hat{s} \hat{t}})$ , cloud servers CS1 and CS2 jointly execute Algorithm 4 to search within the grid $G \hat{s} \hat{t}$ in ciphertext form and locate the ciphertext ( $E (i d^{(1)} m i n), E (x_{i d_{m i n}^{(1)}}), E (y_{i d_{m i n}^{(1)}})$ ) corresponding to the nearest neighbor to Q, where $(i d_{m i n}^{(1)}, P_{i d_{m i n}^{(1)}} = (x_{i d_{m i n}^{(1)}}, y_{i d_{m i n}^{(1)}})) \in G_{\hat{s} \hat{t}}$ . It is worth noting that Algorithm 4 can trivially be adapted to handle scenarios where the input includes multiple packed ciphertext datasets and multiple ciphertext $i d$ values. Specifically, the input format is $({E (G^{(1)}), \dots, E (G^{(α)})}, E (Q), {E (i d_{1}), E (i d_{2}), \dots,$ $E (i d_{β})}, p k_{1}, s k_{1})$ , and the output is the ciphertext $(E (i d_{m i n}), E (x_{i d_{m i n}}), E (y_{i d_{m i n}}))$ , where $P_{i d_{m i n}} = (x_{i d_{m i n}}, y_{i d_{m i n}}) \in G^{(1)} \cup \dots \cup G^{(α)}$ represents the nearest-neighbor point to Q, with the constraint that $i d_{\min} \notin {i d_{1}, i d_{2}, \dots, i d_{β}}$ .

Algorithm 4 SNN (

E (G_{\hat{s} \hat{t}}), E (Q), E (i d), p k_{1}, s k_{1}

)

Input:

{CS}_{1}

:

E (Q) = (E (x_{q}), E (y_{q}))

,

E (G_{\hat{s} \hat{t}})

and

E (i d)

,

{CS}_{2}

: the private key

s k_{1}

Output: (

E (i d_{\min}), E (x_{i d_{\min}}), E (y_{i d_{\min}})

): the ciphertext of the nearest neighbor

(i d_{\min}, P_{i d_{\min}} = (x_{i d_{\min}}, y_{i d_{\min}})) \in G_{\hat{s} \hat{t}}

to Q with

i d_{\min} \neq i d

1:: ${CS}_{1}$ generates $3 λ + 1$ random numbers $r_{0}, r_{1}, \dots, r_{3 λ} \in Z_{N}$
2:: ${CS}_{1}$ packs $r_{1} \sim r_{3 λ}$ into $Φ_{0} = 〈 r_{1} | r_{2} | \dots | r_{3 λ} 〉$ , packs $r_{2}, r_{3}, r_{5}, r_{6}, \dots, r_{3 λ - 1}, r_{3 λ}$ into $Φ_{1} = 〈 r_{2} | r_{3} | \dots | r_{3 λ - 1} | r_{3 λ} 〉$ and packs $r_{1}, r_{4}, r_{7}, \dots, r_{3 λ - 2}$ into $Φ_{2} = 〈 r_{1} | r_{4} | \dots | r_{3 λ - 2} 〉$
3:: ${CS}_{1}$ calculates $v_{0}^{'} \leftarrow {(E (G_{\hat{s} \hat{t}}) \times E (Φ_{0}))}^{r_{0}}$
4:: ${CS}_{1}$ calculates $E (Q^{'}) \leftarrow E (x_{q} | y_{q} | x_{q} | y_{q} | \dots | x_{q} | y_{q}) = E {(x_{q})}^{2^{σ (2 λ - 1)}}$ $E {(y_{q})}^{2^{σ (2 λ - 2)}} \dots$ $E {(x_{q})}^{2^{σ}} E {(y_{q})}^{2^{0}}$
5:: ${CS}_{1}$ calculates $v_{1}^{'} \leftarrow {(E (Q^{'}) \times E (Φ_{1}))}^{r_{0}}$
6:: ${CS}_{1}$ calculates $E (i d^{'}) \leftarrow E (i d | i d | i d | i d | \dots | i d | i d) = E {(i d)}^{2^{σ (λ - 1)}} E {(i d)}^{2^{σ (λ - 2)}} \dots$ $E {(i d)}^{2^{σ}} E {(i d)}^{2^{0}}$
7:: ${CS}_{1}$ calculates $v_{2}^{'} \leftarrow {(E (i d^{'}) \times E (Φ_{2}))}^{r_{0}}$
8:: ${CS}_{1}$ sends $v_{0}^{'}, v_{1}^{'}, v_{2}^{'}$ to ${CS}_{2}$
9:: ${CS}_{2}$ decrypts $v_{0} \leftarrow D (v_{0}^{'})$ , $v_{1} \leftarrow D (v_{1}^{'})$ and $v_{2} \leftarrow D (v_{2}^{'})$ with
10:: $v_{0} = r_{0} (i_{1}^{(s t)} + r_{1}) 2^{σ (3 λ - 1)} + r_{0} (x_{i_{1}^{(s t)}} + r_{2}) 2^{σ (3 λ - 2)} + r_{0} (y_{i_{1}^{(s t)}} + r_{3}) 2^{σ (3 λ - 3)} + \dots + r_{0} (y_{i_{λ}^{(s t)}} + r_{3 λ}) 2^{0}$ ,
11:: $v_{1} = r_{0} (r_{2} + x_{q}) 2^{σ (2 λ - 1)} + r_{0} (r_{3} + y_{q}) 2^{σ (2 λ - 2)} + \dots + r_{0} (y_{q} + r_{3 λ}) 2^{0}$
12:: $v_{2} = r_{0} (i d + r_{1}) 2^{σ (λ - 1)} + r_{0} (i d + r_{4}) 2^{σ (λ - 2)} + \dots + r_{0} (i d + r_{3 λ - 2}) 2^{0}$
13:: ${CS}_{2}$ unpacks $v_{0}$ , $v_{1}$ , $v_{2}$ to get (ID, P), Q and id i.e.,
14:: $ID = \{r_{0} (i_{1}^{(s t)} + r_{1}), r_{0} (i_{2}^{(s t)} + r_{4}), r_{0} (i_{3}^{(s t)} + r_{7}), \dots, r_{0} (i_{λ}^{(s t)} + r_{3 λ - 2})\}$ ,
15:: $P = \{(r_{0} (x_{i_{1}^{(s t)}} + r_{2}), r_{0} (y_{i_{1}^{(s t)}} + r_{3})), (r_{0} (x_{i_{2}^{(s t)}} + r_{5}), r_{0} (y_{i_{2}^{(s t)}} + r_{6})), \dots,$
16:: $(r_{0} (x_{i_{λ}^{(s t)}} + r_{3 λ - 1}), r_{0} (y_{i_{λ}^{(s t)}} + r_{3 λ}))\}$
17:: $Q = \{(r_{0} (x_{q} + r_{2}), r_{0} (y_{q} + r_{3})), (r_{0} (x_{q} + r_{5}), r_{0} (y_{q} + r_{6})), \dots, (r_{0} (x_{q} + r_{3 λ - 1}), r_{0} (y_{q} + r_{3 λ}))\}$
18:: $id = \{r_{0} (i d + r_{1}), r_{0} (i d + r_{4}), r_{0} (i d + r_{7}), \dots, r_{0} (i d + r_{3 λ - 2})\}$
19:: $d_{m i n} \leftarrow + \infty$ , $d [] \leftarrow \emptyset$ , $δ [] \leftarrow \emptyset$ , $p o s \leftarrow ⊥$
20:: for $ℓ = 0$ to $λ - 1$ do
21:: $d [ℓ] \leftarrow {(P [ℓ] . x - Q [ℓ] . x)}^{2} + {(P [ℓ] . y - Q [ℓ] . y)}^{2}$
22:: if $d [ℓ] < d_{\min}$ and $ID [ℓ] \neq id [ℓ]$ then

23:

24:: $p o s \leftarrow ℓ$
25:: $d_{\min} \leftarrow d [p o s]$
26:: $δ [p o s] \leftarrow E (1)$ and $\forall j \neq p o s, δ [j] \leftarrow E (0)$
27:: ${CS}_{2}$ sends $δ, E (ID [p o s]), E (P [p o s] . x), E (P [p o s] . y)$ to ${CS}_{1}$
28:: ${CS}_{1}$ calculates $E (r_{i d}) \leftarrow \prod_{i = 0}^{λ - 1} δ {[i]}^{r_{3 i + 1}}, E (r_{x}) \leftarrow \prod_{i = 0}^{λ - 1} δ {[i]}^{r_{3 i + 2}}$ and $E (r_{y}) \leftarrow \prod_{i = 0}^{λ - 1} δ {[i]}^{r_{3 i + 3}}$
29:: ${CS}_{1}$ calculates $E (i d_{m i n}) \leftarrow E {(ID [p o s])}^{r_{0}^{- 1}} \times E {(r_{i d})}^{N - 1},$ $E (x_{i d_{m i n}}) \leftarrow E {(P [p o s] . x)}^{r_{0}^{- 1}} \times E {(r_{x})}^{N - 1},$ $E (y_{i d_{m i n}}) \leftarrow E {(P [p o s] . y)}^{r_{0}^{- 1}}$ $\times E {(r_{y})}^{N - 1}$

(4): According to Lemma 2, the second-nearest neighbor to Q resides in the set $V R V (P_{i d_{\min}^{(1)}})$ . Therefore, leveraging $E (i d_{m i n})$ , cloud servers ${CS}_{1}$ and ${CS}_{2}$ collaboratively execute Algorithm 5 to explore $E (I_{2})$ and identify $(E (V R V (P_{i d_{\min}^{(1)}})), S i g (P_{i d_{\min}^{(1)}}))$ . These results facilitate the discovery of the second-nearest neighbor and verification of the correctness of $P_{i d_{m i n}^{(1)}}$ .

Algorithm 5 SCR (

E (I_{2}), E (i d_{m i n}), p k_{1}, s k_{1}

)

Input:

{CS}_{1}

:

E (I_{2})

,

E (i d_{m i n})

and

p k_{1}

.

{CS}_{2}

: the public-private key pair

(p k_{1}, s k_{1})

Output:

E (V R V (P_{i d_{\min}}))

and

E (S i g (P_{i d_{\min}}))

in

E (I_{2})

1:: for each $B_{j} \in I_{2}$ ( $i . e .$ , $j = 0$ to $⌈ \frac{n}{w} ⌉ - 1$ ) do
2:: ${CS}_{1}$ generates random numbers $r_{0 j}, r_{1 j} \in Z_{N}$ with $τ$ bits
3:: ${CS}_{1}$ calculates $E (η_{M j}) \leftarrow {(E ((j + 1) w - 1) \times E (r_{0 j}))}^{r_{1 j}}$ , $E (η_{j}) \leftarrow {(E (i d_{m i n}) \times E (r_{0 j}))}^{r_{1 j}}$ , $E (η_{m j}) \leftarrow {(E (j w) \times E (r_{0 j}))}^{r_{1 j}}$
4:: for $k = 0$ to $w - 1$ do
5:: ${CS}_{1}$ generates three packing random numbers $r_{(j w + k) 0}, r_{(j w + k) 1}, r_{(j w + k) 2} \in Z_{N}$
6:: ${CS}_{1}$ calculates $Ψ_{(j w + k) 0} = {(E (j w + k) \times E {(i d_{\min})}^{N - 1})}^{r_{(j w + k) 0}}$ ,
7:: $Ψ_{(j w + k) 1} = E (V R V (P_{j w + k})) \times E (r_{(j w + k) 1})$ , $Ψ_{(j w + k) 2} = E (S i g (P_{j w + k})) \times E (r_{(j w + k) 2})$
8:: $Ψ_{j 0} \leftarrow {(Ψ_{(j w + k) 0})}_{0 \leq k \leq w - 1}$ , $Ψ_{j 1} \leftarrow {(Ψ_{(j w + k) 1})}_{0 \leq k \leq w - 1}$ , $Ψ_{j 2} \leftarrow {(Ψ_{(j w + k) 2})}_{0 \leq k \leq w - 1}$
9:: ${CS}_{1}$ calculates $η_{M}^{'} \leftarrow (E (η_{M 0}), \dots, E (η_{M (⌈ \frac{n}{w} ⌉ - 1)})),$ $η_{m}^{'} \leftarrow (E (η_{m 0}), \dots, E (η_{m (⌈ \frac{n}{w} ⌉ - 1)})),$ $η^{'} \leftarrow (E (η_{0}), \dots,$ $E (η_{⌈ \frac{n}{w} ⌉ - 1}))$
10:: ${CS}_{1}$ calculates $Ψ_{0} \leftarrow (Ψ_{00}, \dots, Ψ_{⌈ \frac{n}{w} ⌉ - 1, 0}) = {(Ψ_{(j w + k) 0})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1},$
11:: $Ψ_{1} \leftarrow (Ψ_{01}, \dots, Ψ_{⌈ \frac{n}{w} ⌉ - 1, 1}) = {(Ψ_{(j w + k) 1})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1},$
12:: $Ψ_{2} \leftarrow (Ψ_{02}, \dots, Ψ_{⌈ \frac{n}{w} ⌉ - 1, 1}) = {(Ψ_{(j w + k) 2})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1}, B^{'} \leftarrow (Ψ_{0}, Ψ_{1}, Ψ_{2})$
13:: ${CS}_{1}$ permutes $η_{M}^{'}, η_{m}^{'}, η^{'}$ and buckets $B^{'}$ with two random permutations $ρ_{1}, ρ_{2}$ :
14:: $η_{M}^{''} \leftarrow ρ_{1} (η_{M}^{'}), η_{m}^{''} \leftarrow ρ_{1} (η_{m}^{'}), η^{''} = ρ_{1} (η^{'}),$
15:: $B^{''} \leftarrow ρ_{2} (ρ_{1} (B^{'})) = (Ψ_{0}^{'}, Ψ_{1}^{'}, Ψ_{2}^{'}) = ((Ψ_{ρ_{1} (0) 0}^{'}, \dots, Ψ_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1), 0}^{'}), (Ψ_{ρ_{1} (0) 1}^{'}, \dots, Ψ_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1), 1}^{'}),$
16:: $(Ψ_{ρ_{1} (0) 2}^{'}, \dots, Ψ_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1), 2}^{'})) = ({(Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 0})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1},$
17:: ${(Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 1})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1}, {(Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 2})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1})$
18:: ${CS}_{1}$ sends $η_{M}^{''}, η_{m}^{''}, η^{''}, B^{''}$ to ${CS}_{2}$
19:: ${CS}_{2}$ decrypts $η_{M}^{''}, η_{m}^{''}, η^{''}$ : $(η_{M ρ_{1} (0)}, \dots, η_{M ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)}) \leftarrow D (η_{M}^{''}),$
20:: $(η_{m ρ_{1} (0)}, \dots, η_{m ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)}) \leftarrow D (η_{m}^{''}),$ $(η_{ρ_{1} (0)}, \dots,$ $η_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)}) \leftarrow D (η^{''})$
21:: for $j = 0$ to $⌈ \frac{n}{w} ⌉ - 1$ do
22:: if $(η_{M ρ_{1} (j)} - η_{ρ_{1} (j)}) \geq 0$ and $(η_{m ρ_{1} (j)} - η_{ρ_{1} (j)}) < 0$ then
23:: ${CS}_{2}$ decrypts $Ψ_{ρ_{1} (j) 0}^{'}$ : $(ψ_{(ρ_{1} (j) w + ρ_{2} (0)) 0}, \dots, ψ_{(ρ_{1} (j) w + ρ_{2} (w - 1)) 0}) \leftarrow D (Ψ_{ρ_{1} (j) 0}^{'})$
24:: for $k = 0$ to $w - 1$ do
25:: if $ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 0} = 0$ then
26:: $Θ \leftarrow (Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 1}, Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 2})$ , $M_{j k} \leftarrow E (1)$
27:: else $M_{j k} \leftarrow E (0)$
28:: ${CS}_{2}$ sends $Θ, M$ to ${CS}_{1}$
29:: ${CS}_{1}$ permutes the matrix M with $ρ_{1}, ρ_{2}$ : $M^{'} \leftarrow ρ_{1}^{- 1} (ρ_{2}^{- 1} (M))$
30:: ${CS}_{1}$ calculates $E (ψ^{(1)}) \leftarrow \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{j k}^{'})}^{r_{(j w + k) 1}}$ and $E (ψ^{(2)}) \leftarrow \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{j k}^{'})}^{r_{(j w + k) 2}}$
31:: ${CS}_{1}$ calculates $E (V R V (P_{i d_{\min}})) \leftarrow Θ_{1} \times E {(ψ^{(1)})}^{N - 1}$ and $E (S i g (P_{i d_{\min}})) \leftarrow Θ_{2} \times E {(ψ^{(2)})}^{N - 1}$

(5): Similarly, the jth ( $j \geq 2$ ) nearest neighbor to Q is in the set $V R V (P_{i d_{\min}^{(1)}}) \cup V R V (P_{i d_{\min}^{(2)}}) \cup \dots \cup V R V (P_{i d_{\min}^{(j - 1)}})$ . Thus, for $j = 2$ to k, cloud servers ${CS}_{1}$ and ${CS}_{2}$ recursively perform the following operations. First, they jointly invoke Algorithm 4 to traverse $E (V R V (P_{i d_{\min}^{(1)}})) \cup \dots \cup E (V R V (P_{i d_{\min}^{(j - 1)}}))$ and find the ciphertext ( $E (i d_{m i n}^{(j)}), E (x_{i d_{m i n}^{(j)}}), E (y_{i d_{m i n}^{(j)}})$ ) of the jth nearest neighbor to Q. Then, they jointly perform Algorithm 5 to search $E (I_{2})$ in the ciphertext form and find $(E (V R V (P_{i d_{\min}^{(j)}})), S i g (P_{i d_{\min}^{(j)}}))$ , which is used to find the $(j + 1)$ th nearest neighbor and verify the correctness of $P_{i d_{\min}^{(j)}} = (x_{i d_{m i n}^{(j)}}, y_{i d_{m i n}^{(j)}})$ .
(6): Through the above five steps, ${CS}_{1}$ can obtain the encrypted query result and the encryption verification information

$\begin{matrix} \{(E (i d_{\min}^{(j)}), E (x_{i d_{\min}^{(j)}}), E (y_{i d_{\min}^{(j)}})) | j = 1, \dots, k\}, \{(E (V R V (P_{i d_{\min}^{(j)}})), S i g (P_{i d_{\min}^{(j)}})) | j = 1, \dots, k\} . \end{matrix}$

However, without the secret key $s k_{1}$ , the QU cannot recover the plaintext of the query result and the verification information. Therefore, ${CS}_{1}$ must leverage the private key stored in ${CS}_{2}$ to assist the QU in obtaining the plaintext. To achieve this, through the homomorphic property, ${CS}_{1}$ adds some random numbers to blind the encrypted query result and the verification information and sends them to ${CS}_{2}$ . Finally, ${CS}_{1}$ returns the sets $R_{1}$ and ${VO}_{1}$ of random numbers to the QU, while ${CS}_{2}$ decrypts the blinded results sent from ${CS}_{1}$ and returns the decrypted sets $R_{2}$ and ${VO}_{2}$ to the QU.

4.2.6. QU Verification and Decryption Stage: $Verify$ and $ResDec$

After receiving the

〈 R_{1}, {VO}_{1} 〉

from CS₁ and

〈 R_{2}, {VO}_{2} 〉

from CS₂, the QU performs Algorithm 6 to verify the correctness of the query result. If the verification algorithm returns

T r u e

, the QU treats the set

R

in Equation (2) as the final query result. Otherwise, the QU rejects the result.

Algorithm 6 Verify (

〈 R_{1}, {VO}_{1} 〉, 〈 R_{2}, {VO}_{2} 〉, p k_{2}

)

Input:

〈 R_{1}, {VO}_{1} 〉, 〈 R_{2}, {VO}_{2} 〉

.

Output:

T r u e

or

F a l s e

1:: for $j = 1$ to k
2:: Calculate the difference between the jth element in $R_{2}$ and that in $R_{1}$ and let

$\begin{matrix} R = \{(x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'}) - (r_{2}^{(j)}, r_{3}^{(j)}) | j = 1, \dots, k\} = \{P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}}) | j = 1, \dots, k\} \end{matrix}$

(2)
3:: Calculate the difference between the jth element in ${VO}_{2}$ and that in ${VO}_{1}$ and let

$\begin{matrix} VO & = \{(P_{{i d_{1}^{(j)}}^{'}}^{'}, \dots, P_{{i d_{L}^{(j)}}^{'}}^{'}, S i g {(P_{i d_{\min}^{(j)}})}^{'}) - ((r_{4, 2}^{(j)}, r_{4, 3}^{(j)}), \dots, (r_{4, 3 L - 1}^{(j)}, r_{4, 3 L}^{(j)}), r_{5}^{(j)}) | j = 1, \dots, k\} \\ = \{(P_{i d_{1}^{(j)}}, \dots, P_{i d_{L}^{(j)}}, S i g (P_{i d_{\min}^{(j)}})) | j = 1, \dots, k\} \end{matrix}$
4:: For the message $P_{i d_{\min}^{(j)}} | | P_{i d_{1}^{(j)}} | | \dots | | P_{i d_{L}^{(j)}}$ and its signature $S i g (P_{i d_{\min}^{(j)}}) = Sig (P_{i d_{\min}^{(j)}} | | P_{i d_{1}^{(j)}} | | \dots | | P_{i d_{L}^{(j)}})$ , invoke the verification algorithm $Ver$ of DSA to check the signature.
5:: if ${0} \leftarrow Ver (P_{i d_{\min}^{(j)}} | | P_{i d_{1}^{(j)}} | | \dots | | P_{i d_{L}^{(j)}}, S i g (P_{i d_{\min}^{(j)}}))$ , return $F a l s e$
6:: For the point $P_{i d_{\min}^{(1)}}$ , calculate the distance $d i s t {(P_{i d_{\min}^{(1)}}, Q)}^{2} = {(x_{i d_{\min}^{(1)}} - x_{q})}^{2} + {(y_{i d_{\min}^{(1)}} - y_{q})}^{2}$
7:: Calculate ${MIN}_{1} = \min {d i s t {(P_{i d_{1}^{(1)}}, Q)}^{2}, \dots, d i s t {(P_{i d_{L}^{(1)}}, Q)}^{2}}$
8:: if $d i s t {(P_{i d_{\min}^{(1)}}, Q)}^{2} > {MIN}_{1}$ , return $F a l s e$
9:: for $j = 2$ to k
10:: Initialize ${MIN}_{j} \leftarrow + \infty$
11:: for $v = 1$ to $j - 1$
12:: for $u = 1$ to L
13:: if $P_{i d_{u}^{(v)}} \notin \{P_{i d_{\min}^{(1)}}, \dots, P_{i d_{\min}^{(j - 1)}}\}$ and $d i s t {(P_{i d_{u}^{(v)}}, Q)}^{2} < {MIN}_{j}$ then
14:: ${MIN}_{j} \leftarrow d i s t {(P_{u, i d_{\min}^{(j)}}, Q)}^{2}$
15:: if $d i s t {(P_{i d_{\min}^{(j)}}, Q)}^{2} > {MIN}_{j}$ , return $F a l s e$
16:: return $T r u e$

5. Correctness and Security Analysis

5.1. Correctness Analysis

In this section, we analyze the correctness of our proposed protocol. First, we prove the correctness of each algorithm.

Lemma 3.

Algorithm 2 is correct. That is, for any valid input

E (Q) = (E (x_{q}), E (y_{q}))

,

(E (X / m), E (Y / m))

, and

s k_{1}

,

{CS}_{1}

can indeed obtain the ciphertexts

E (⌊ \frac{x_{q}}{X / m} ⌋)

and

E (⌊ \frac{y_{q}}{Y / m} ⌋)

.

Proof.

See Appendix B.1. □

Lemma 4.

Algorithm 3 is correct. That is, for any valid input

E (G_{s t}), E (⌊ \frac{x_{q}}{X / m} ⌋), E (⌊ \frac{y_{q}}{Y / m} ⌋)

,

p k_{1}

, and

s k_{1}

,

{CS}_{1}

can indeed obtain the target grid

E (G_{s t})

with

(s, t) = (⌊ \frac{x_{q}}{X / m} ⌋, ⌊ \frac{y_{q}}{Y / m} ⌋)

.

Proof.

See Appendix B.2. □

Lemma 5.

Algorithm 4 is correct. That is, for any valid input

E (Q) = (E (x_{q}), E (y_{q}))

,

E (G_{s t})

,

p k_{1}

, and

s k_{1}

,

{CS}_{1}

can indeed obtain the ciphertext (

E (i d_{m i n}), E (x_{i d_{m i n}}), E (y_{i d_{m i n}})

) such that

i d_{\min} \neq i d

, and

P_{i d_{m i n}} = (x_{i d_{m i n}}, y_{i d_{m i n}}) \in G_{s t}

is the nearest-neighbor point to Q with

i d_{\min} \neq i d

.

Proof.

See Appendix B.3. □

Lemma 6.

Algorithm 5 is correct. That is, for any valid input

E (I_{2})

,

E (i d_{m i n})

, and

s k

,

{CS}_{1}

can indeed obtain

E (V R V (P_{i d_{\min}}))

and

S i g (P_{i d_{\min}})

in

E (I_{2})

.

Proof.

See Appendix B.4. □

Lemma 7.

If

{CS}_{1}

and

{CS}_{2}

faithfully execute Algorithm 1, then, for any

1 \leq j \leq k

, the difference between the jth element in

R_{2}

(resp.

{VO}_{2}

) and that in

R_{1}

(resp.

{VO}_{1}

) satisfies

\begin{matrix} (x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'}) - (r_{2}^{(j)}, r_{3}^{(j)}) = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}}), \end{matrix}

\begin{matrix} (r e s p . (P_{{i d_{1}^{(j)}}^{'}}^{'}, \dots, P_{{i d_{L}^{(j)}}^{'}}^{'}, S i g {(P_{i d_{\min}^{(j)}})}^{'}) - ((r_{4, 2}^{(j)}, r_{4, 3}^{(j)}), \dots, (r_{4, 3 L - 1}^{(j)}, r_{4, 3 L}^{(j)}), r_{5}^{(j)}) = (P_{i d_{1}^{(j)}}, \dots, P_{i d_{L}^{(j)}}, S i g (P_{i d_{\min}^{(j)}}))), \end{matrix}

where

P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}})

is exactly the jth nearest neighbor to Q,

{P_{i d_{1}^{(j)}}, \dots, P_{i d_{L}^{(j)}}}

is the set of Voronoi-relevant vectors of

P_{i d_{\min}^{(j)}}

, and

S i g (P_{i d_{\min}^{(j)}}))

is the signature of

P_{i d_{\min}^{(j)}}

.

Building on the foundation provided by the aforementioned lemmas, we are able to establish the correctness of our protocol.

Theorem 1.

According to Definition 1, our protocol is correct. That is, if the cloud servers are honest, an honest QU can get the exact k-nearest neighbors to the query point Q.

Proof.

As outlined in Definition 1, establishing correctness merely requires demonstrating that Algorithm 6 indeed returns

T r u e

and that

P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}}) \in R

is exactly the jth nearest neighbor to Q. In fact, by Lemma 7, the set

R

(resp.

VO

) in Algorithm 6 is

\begin{matrix} R = {(x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}}) | j = 1, \dots, k}, (r e s p . VO = {(P_{i d_{1}^{(j)}}, \dots, P_{i d_{L}^{(j)}}, S i g (P_{i d_{\min}^{(j)}})) | j = 1, \dots, k}), \end{matrix}

where

P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}})

is indeed the jth nearest neighbor to Q,

{P_{i d_{1}^{(j)}}, \dots, P_{i d_{L}^{(j)}}}

is the set of Voronoi-relevant vectors of

P_{i d_{\min}^{(j)}}

, and

S i g (P_{i d_{\min}^{(j)}})

is the signature of

P_{i d_{\min}^{(j)}}

. Therefore, in Step 5, the signature

S i g (P_{i d_{\min}^{(j)}})

of the message

P_{i d_{\min}^{(j)}} | | P_{i d_{1}^{(j)}} | | \dots | | P_{i d_{L}^{(j)}}

will pass the verification algorithm. Moreover, in conjunction with Lemma 2, we know that, after the for-loop in Step 9,

P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}})

will pass the distance verification. That is, Algorithm 6 will return

T r u e

and

R

is correct. □

5.2. Public Verifiability

Theorem 2.

If the hash functions

H (\cdot)

and the signature algorithm are secure, then our protocol achieves public verifiability, as defined in Definition 2. That is, the advantage probability

A d v_{A}^{PV} (Π, λ, n) = \Pr [{Exp}_{A}^{PV} [Π, λ, n] = 1]

that the adversary

A

obtains in the experiment

{Exp}_{A}^{PV} [Π, λ, n]

is negligible.

Proof.

According to Definition 2, we only need to analyze the probability of the event that the experiment

{Exp}_{A}^{PV} [Π, λ, n]

outputs 1, which means that

{T r u e} \leftarrow Verify (Q, R^{'}, VO, P K)

in Algorithm 6 and

{γ \neq R} \leftarrow ResDec (Q, R^{'}, S K, δ)

.

In essence, the event

{T r u e} \leftarrow Verify (Q, R^{'}, VO, P K)

entails two conditions:

(1): In Step 4, $S i g (P_{i d_{\min}^{(j)}})$ is the correct signature of the message $P_{i d_{\min}^{(j)}} | | P_{i d_{1}^{(j)}} | | \dots | | P_{i d_{L}^{(j)}}$ .
(2): In Step 15, $d i s t (P_{i d_{\min}^{(j)}}, Q) \leq M I N_{j}$ .

Furthermore, condition (1) implies that

P_{i d_{\min}^{(j)}},

P_{i d_{1}^{(j)}},

\dots, P_{i d_{L}^{(j)}}

are the data points from the DO and

{P_{i d_{1}^{(j)}}, \dots, P_{i d_{L}^{(j)}}}

are the Voronoi-relevant vectors of

P_{i d_{\min}^{(j)}}

. Condition (2) implies that

P_{i d_{\min}^{(j)}}

is the jth nearest neighbor of Q. Consequently, the output of the decryption algorithm yields

γ = R

. In simpler terms,

A d v_{A}^{PV} (Π, λ, n) = 0

. □

5.3. Privacy

For privacy analysis, we leverage the formal definition of multiparty computation, as outlined in [39,40], within the framework of the simulation paradigm [41]. The overarching concept is described below.

Theorem 3.

(Composition Theorem [41]) Given a protocol Ω composed of several sub-protocols, if all sub-protocols are secure and all intermediate results are either random or pseudorandom, then the protocol Ω is secure.

In the simulation paradigm, it is essential that the perspective of each participating party in a protocol can be replicated based solely on its input and output. This requirement ensures that parties do not gain any additional information from the protocol beyond what their inputs and outputs imply. In other words, the simulated view of each sub-protocol must be computationally indistinguishable from the actual execution view. To illustrate this concept, we formally demonstrate the security of the SDC protocol (Algorithm 2). Although we focus on the SDC protocol here, other protocols can be elucidated in a similar manner.

Theorem 4.

If the hash functions

H (\cdot)

and Paillier’s homomorphic cryptosystem are secure, then the

S D C

protocol is secure. That is, for any probability polynomial-time adversary

A

, there exists a simulator

S

such that the probability

\Pr ({Real}_{SDC}^{A}) - \Pr ({Sim}_{SDC}^{A})

is negligible, i.e.,

\begin{matrix} |\Pr ({Real}_{SDC}^{A}) - \Pr ({Sim}_{SDC}^{A})| \leq negl (λ) . \end{matrix}

Proof.

We first define the real view

{Real}_{S D C}^{A}

and the simulated view

{Sim}_{S D C}^{A}

.

{Real}_{S D C}^{A}

: With the inputs

E (Q) = (E (x_{q}), E (y_{q}))

,

(E (X / m), E (Y / m))

, and a security parameter

τ < κ - \frac{σ}{2} - 1

. CS₁ first generates two random numbers

r_{1}, r_{2} \in Z_{N}

and then calculates

x^{'} = E {(x_{q})}^{r_{1}} \times E {(X / m)}^{r_{1} r_{2}}

,

u^{'} = E {(X / m)}^{r_{1}}

,

y^{'} = E {(y_{q})}^{r_{1}} \times E {(Y / m)}^{r_{1} r_{2}}

,

v^{'} = E {(Y / m)}^{r_{1}}

. Then, it sends

(x^{'}, y^{'})

and

(u^{'}, v^{'})

to

{CS}_{2}

.

{CS}_{2}

first decrypts them as

(x, y) = (D (x^{'}), D (y^{'}))

and

(u, v) = (D (u^{'}), D (v^{'}))

and calculates the quotients

d_{x} = ⌊ \frac{x}{u} ⌋

and

d_{y} = ⌊ \frac{y}{v} ⌋

. Then,

{CS}_{2}

encrypts

d_{x}^{'} = E (d_{x})

,

d_{y}^{'} = E (d_{y})

and sends

(d_{x}^{'}, d_{y}^{'})

to CS₁. Finally,

{CS}_{1}

calculates the quotients

E (⌊ \frac{x_{q}}{X / m} ⌋) = d_{x}^{'} \times E {(r_{2})}^{N - 1}

and

E (⌊ \frac{y_{q}}{\bar{Y / m}} ⌋) = d_{y}^{'} \times E {(r_{2})}^{N - 1}

.

{Sim}_{S D C}^{A}

: The simulation contains two simulators

{S_{1}, S_{2}}

.

S_{1}

first chooses four random numbers

{\bar{x}}_{q}, {\bar{y}}_{q}, \bar{X / m}, \bar{Y / m} \in Z_{N}

and calculates

\bar{E} (Q) = (E ({\bar{x}}_{q}), E ({\bar{y}}_{q}))

and

(E (\bar{X / m}), E (\bar{Y / m}))

. Then, it generates two random numbers

r_{1}

and

r_{2} \in Z_{N}

and calculates

({\bar{x}}_{1}^{'}, {\bar{u}}_{1}^{'}, {\bar{y}}_{1}^{'}, {\bar{v}}_{1}^{'}) = (E {({\bar{x}}_{q})}^{r_{1}} \times E {(\bar{X / m})}^{r_{1} r_{2}}, E {(\bar{X / m})}^{r_{1}}, E {({\bar{y}}_{q})}^{r_{1}} \times E {(\bar{Y / m})}^{r_{1} r_{2}}, E {(\bar{Y / m})}^{r_{1}}) \in Z_{N^{2}}^{★} \times Z_{N^{2}}^{★} \times Z_{N^{2}}^{★} \times Z_{N^{2}}^{★}

. Finally, it sends

({\bar{x}}_{1}^{'}, {\bar{u}}_{1}^{'}, {\bar{y}}_{1}^{'}, {\bar{v}}_{1}^{'})

to

S_{2}

.

S_{2}

first chooses four random numbers

({\bar{x}}_{2}^{'}, {\bar{u}}_{2}^{'}, {\bar{y}}_{2}^{'}, {\bar{v}}_{2}^{'}) \in Z_{N^{2}}^{★} \times Z_{N^{2}}^{★} \times Z_{N^{2}}^{★} \times Z_{N^{2}}^{★}

and decrypts them as

({\bar{x}}_{2}, {\bar{y}}_{2}, {\bar{u}}_{2}, {\bar{v}}_{2}) = (D ({\bar{x}}_{2}^{'}), D ({\bar{y}}_{2}^{'}), D ({\bar{u}}_{2}^{'}), D ({\bar{v}}_{2}^{'})) \in Z_{N} \times Z_{N} \times Z_{N} \times Z_{N}

. Then,

S_{2}

calculates the quotients

{\bar{d}}_{x} = ⌊ \frac{{\bar{x}}_{2}}{{\bar{u}}_{2}} ⌋

and

{\bar{d}}_{y} = ⌊ \frac{{\bar{y}}_{2}}{{\bar{v}}_{2}} ⌋

and sends

({\bar{d}}_{x}^{'}, {\bar{d}}_{y}^{'}) = (E ({\bar{d}}_{x}), E ({\bar{d}}_{y}))

to

S_{1}

. Finally,

S_{1}

calculates the quotients

E (⌊ \frac{{\bar{x}}_{q}}{X / m} ⌋) = {\bar{d}}_{x}^{'} \times E {(r_{2})}^{N - 1}

and

E (⌊ \frac{{\bar{y}}_{q}}{Y / m} ⌋) = {\bar{d}}_{y}^{'} \times E {(r_{2})}^{N - 1}

.

Since Paillier’s homomorphic cryptosystem is semantically secure, for any two invalid plaintexts

x^{(0)}

and

x^{(1)}

, no probability polynomial-time (PPT) adversary can distinguish their ciphertexts

E (x^{(0)})

and

E (x^{(1)})

. That is,

({\bar{x}}_{2}, {\bar{y}}_{2}, {\bar{u}}_{2}, {\bar{v}}_{2}) = (D ({\bar{x}}_{2}^{'}), D ({\bar{y}}_{2}^{'}), D ({\bar{u}}_{2}^{'}), D ({\bar{v}}_{2}^{'}))

in the simulated view is computationally indistinguishable from

(x^{'}, y^{'}, u^{'}, v^{'})

in the actual view. Similarly,

({\bar{x}}_{2}, {\bar{y}}_{2}, {\bar{u}}_{2}, {\bar{v}}_{2})

is computationally indistinguishable from

(x, y, u, v)

. Therefore, the output distribution of the simulated view and that of the real view are computationally indistinguishable. In other words, the adversary cannot trace back to the corresponding data records, which preserves the privacy of the dataset and access patterns. To sum up, the SDC protocol is secure. □

Similarly, we can prove that the kNN (Algorithm 1), SGC (Algorithm 3), SNN (Algorithm 4) and SCR (Algorithm 5) protocols are secure under our security model; thus, according to the composition theorem, we can obtain the following theorem.

Theorem 5.

If the hash functions

H (\cdot)

and Paillier’s homomorphic cryptosystem are secure, our protocol achieves dataset privacy, query data privacy, query result privacy, and access pattern privacy.

Remark 2.

For clarity, we use access pattern privacy as an example and provide an intuitive explanation. That is, the identifiers corresponding to the k-nearest neighbors of the query point are kept confidential from both the cloud servers and the querier QU . In fact, during the kNN search process (as described in Algorithm 1), cloud server

{CS}_{1}

receives the encrypted data

{(E (i d_{\min}^{(j)}), E (x_{i d_{m i n}^{(j)}}), E (y_{i d_{\min}^{(j)}})) | j = 1, \dots, k}

, while cloud server

{CS}_{2}

receives the blinded data

{(i d_{\min}^{(j)'}, x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'}) | j = 1, \dots, k}

. Here,

(i d_{\min}^{(j)'},

x_{i d_{\min}^{(j)}}^{'}, y_{i d_{\min}^{(j)}}^{'}) = (i d_{\min}^{(j)}, x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}}) + (r_{1}^{(j)}, r_{2}^{(j)}, r_{3}^{(j)}) \mod N

, where

(r_{1}^{(j)}, r_{2}^{(j)}, r_{3}^{(j)}) \in Z_{N} \times Z_{N} \times Z_{N}

are uniformly random numbers owned by

{CS}_{1}

. The decryption key

s k_{1}

is owned by

{CS}_{2}

, and since

{CS}_{1}

and

{CS}_{2}

are assumed to be non-collusive, the identifier

i d^{(j)}

remains concealed from both cloud servers. The privacy of

i d^{(j)}

against the querier QU is clear. As shown in Steps 25 and 26 of Algorithm 1, the query results

〈 R_{1}, {VO}_{1} 〉

and

〈 R_{2}, {VO}_{2} 〉

received by the QU contain no information about

i d^{(j)}

.

6. Efficiency Analysis and Performance Evaluation

In this section, we present a comprehensive efficiency evaluation of our scheme from both theoretical and practical perspectives.

6.1. Evaluation Methodology

A widely recognized methodology for assessing the efficiency of a new scheme is to compare it against prior designs. However, it is unfair and meaningless to compare the efficiency of two schemes without considering the different system models and security intentions. Ideally, a well-constructed scheme should satisfy two conditions: (1) it outperforms earlier designs that offer the same or fewer security guarantees, and (2) any previous designs delivering higher security levels should exhibit significantly lower efficiency.

Since our new scheme simultaneously considers two secure functionalities—privacy and verifiability—Table 1 highlights existing schemes that also address these functionalities concurrently. These schemes include those proposed by Rong et al. [25], Sundarapandi et al. [7], Liu et al. [3], Zhang et al. [4], Cui et al. [9], and Cui et al. [8]. It is worth noting that Rong et al.’s and Liu et al.’s schemes fail to preserve the privacy of data access patterns. Furthermore, the verification approaches in Rong et al.’s scheme [25] and Sundarapandi et al.’s scheme [7] are probabilistic. Zhang et al.’s scheme [4], on the other hand, only supports verifying the authenticity of the query results. As for Cui et al.’s scheme [9], the authors themselves evaluated it in [8] and found it to have lower efficiency and unsatisfactory performance. Additionally, Cui et al. [8] comprehensively evaluated their proposed scheme called

MSV k NN

, demonstrating its efficiency advantages over previous designs. Therefore, based on our two comparison principles, we only need to evaluate the efficiency of our scheme relative to the currently most efficient scheme [8].

6.2. Theoretical Analysis

For ease of description, we first introduce some necessary notations. The time complexities of encryption and decryption in Paillier’s HE cryptosystem and DT-PKC [28] are almost the same:

O (\log N) Mul s

. Let

Mul

and

Div

denote the time cost of one multiplication in

Z_{N}

or

Z_{N^{2}}

and the time cost of one division of two integers less than N, respectively. Considering that the encryption and decryption time complexities in both Paillier’s HE cryptosystem and DT-PKC [28] are nearly identical, both at

O (\log N) Mul s

, we denote the time cost of a single encryption operation as

Enc

and the time cost of a single decryption operation as

Dec

in either Paillier’s HE cryptosystem or DT-PKC. Moreover,

Sig

and

Ver

refer to the time cost of one signature operation and oce verification operation in the DSA, respectively. With these notations, we initially analyze the theoretical computational and communication costs of each algorithm in Table 3 and Table 4, respectively. Subsequently, we compare the theoretical computational cost of our protocol with that of Cui et al.’s protocol [8] in Table 5, along with the communication cost in Table 6.

6.3. Experimental Analysis

To comprehensively evaluate the practical performance of the proposed scheme, we conducted experimental comparisons of the time and communication costs between our design and Cui et al.’s protocol [8] across multiple dimensions, including but not limited to the dataset size, grid granularity m, query parameter k in kNN, and size of security parameter modulus.

All experiments were conducted on a laptop featuring an Intel^® Core^TM i5-8250U CPU (1.60 GHz, with eight logical cores, Hewlett-Packard, Palo Alto, CA, USA) with 8GB of RAM, running on Windows 10. The implementations were developed in Java using the JCA Library. Furthermore, we adopted the NIST-recommended parameters for the Digital Signature Algorithm (DSA), where the prime modulus p and subgroup order q were configured with bit lengths of 1024 and 160, respectively. Subsequently, we analyzed the impact of the following parameters:

(1): Impact of varying n: With fixed parameters $m = 16$ , $k = 5$ , and $K = 1024$ , we systematically varied the dataset size n from 1000 to 20,000 to evaluate scalability. Table 7 presents the stage-wise execution times for both Cui et al.’s protocol [8] and our proposed protocol. Visually, Figure 3 further illustrates the comparative trends in the total cost of these two protocols as n increases. The results demonstrate that our protocol achieved a 58.5–65.5% reduction in time cost compared to the baseline, with the performance gap widening significantly for larger n.
(2): Impact of varying m: Under fixed parameters $n = 2000$ , $k = 5$ , and $K = 1024$ , we systematically evaluated the grid granularity $m \in {4, 8, 12, 16, 32, 64}$ to analyze algorithmic scalability. Table 8 presents a comparative analysis of computational latency (in seconds) between Cui et al.’s protocol [8] and our proposed method across these configurations. As shown in the table, the total cost of our design was about 32.7–35.4% of that of the baseline. Furthermore, as the grid granularity m primarily influences the search stages, Figure 4 visualizes the combined latencies of these phases. Notably, minimal computational overhead was achieved at $m = 8$ , aligning closely with the theoretical optimum derived for uniform random datasets:

$m^{2} \approx \sqrt{n} \Rightarrow m \approx n^{1 / 4} = 2000^{1 / 4} \approx 6.68 .$

The empirically observed optimum ( $m = 8$ ) reflects practical implementation constraints while remaining consistent with this theoretical boundary.
(3): Impact of varying k: As shown in Table 9, under fixed parameters $n = 2000$ , $m = 16$ , and $K = 1024$ , we systematically evaluated the computational efficiency of each stage of our protocol against Cui et al.’s baseline [8] by varying the query parameter k in k-nearest-neighbor (kNN) searches from 1 to 10. Further, since the search, verification, and decryption stages are inherently dependent on k, whereas the setup, dataset encryption, and query encryption stages remain protocol-level invariants independent of k, Figure 5 illustrates the variance of the time cost of these two stages as k increases, demonstrating that the efficiency gains of our design became more pronounced as k increased.
(4): Impact of varying K: Given that the modulus size K of N determines the security strength of both the DT-PKC and Paillier cryptosystems employed in our scheme, we conducted a comprehensive performance comparison between our proposed scheme and Cui et al.’s protocol [8] under varying security levels ( $K = 512, 1024, 2048$ ) for fixed parameters $m = 16$ , $k = 5$ , and $n = 2000$ . Table 10 presents the stage-wise computational latencies (e.g., setup, encryption, search, and verification) for both schemes, explicitly quantifying the trade-off between cryptographic robustness and operational efficiency. Also, Figure 6 illustrates the total execution time scaling with increasing K, showing that the total cost of our design was about 33.8–38.1% of that of Cui et al.’s protocol.
(5): Communication cost: We conducted an experimental evaluation of communication costs with fixed parameters, $m = 16$ , $k = 5$ , and $K = 1024$ , while systematically varying the dataset size n. As shown in our theoretical analysis (Table 6), the primary difference in communication cost between our protocol and Cui et al.’s protocol occurred during the $Search$ stage. Figure 7 illustrates the difference in communication overhead between ${CS}_{1}$ and ${CS}_{2}$ in our protocol compared to Cui et al.’s protocol across various dataset sizes. The polyline in Figure 8 represents the comparison of data transfer times between CS1 and CS2 during the Search phase across varying dataset sizes. The data transfer time (communication latency) was calculated as the communication volume divided by the transfer rate, with the transfer rate simulated as 390 Mbps ≈ 48.75 MB/s. Our findings demonstrated that our protocol incurred lower costs, consistent with our theoretical analysis.

7. Conclusions

This paper presents our investigation and proposal of a faster, privacy-preserving, and publicly verifiable protocol for exact k-nearest-neighbor queries, termed PPVkNN. Leveraging Paillier’s homomorphic encryption and a series of meticulously designed secure protocols, PPVkNN not only supports exact kNN query functionality but also preserves the privacy of data, queries, results, and query access patterns. Furthermore, it guarantees result correctness and enables public verification. Theoretical analysis confirms the correctness and security of our proposed protocols. Additionally, efficiency analysis and performance evaluation demonstrate significant computational and communication savings compared to prior works, enhancing the practicality of our scheme.

Author Contributions

J.L. and C.T.: Conceptualization, methodology, validation, investigation, resources, supervision, project administration, visualization, and writing—original draft preparation; Y.S. and W.T.: Formal analysis, data curation, writing—review and editing, and visualization. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Natural Science Foundation of Shandong Province (ZR2022MF250), the National Natural Science Foundation of China (61702294), and the Natural Science Foundation of Top Talent of SZTU (GDRC202214).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

The authors would like to thank the editor and the three anonymous referees for their careful reading of this article and their constructive suggestions, which considerably improved this article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Notations.

Parameter	Description
$ρ$	a permutation function
$D$	the plaintext spatial dataset
$ED$	the ciphertext spatial dataset
$L C M (a, b)$	the least common multiple of two integers a and b
Q	$Q = (x_{q}, y_{q})$ is the query data point
$i d_{\min}^{(j)}$	the index of the jth nearest neighbor to Q
$P_{i d_{\min}^{(j)}}$	$P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}})$ denotes the jth
	nearest neighbor to Q
$V R V (P_{i d_{\min}^{(j)}})$	the set of Voronoi-relevant vectors of $P_{i d_{\min}^{(j)}}$
$H (\cdot)$	a cryptographic hash function
$Sig (\cdot)$	the signature algorithm in DSA
N	a large integer that is the product of two
	prime numbers p and q
K	the size of N
$Z_{N^{2}}^{★}$	the multiplication group of the residue
	class modulo $N^{2}$
$Z_{N}^{★}$	the residue class ring modulo N
$x \| \| y$	the concatenation of two numbers x and y
$⌊ x ⌋$	the greatest integer, no larger than x
$[n]$	the set ${0, \dots, n - 1}$
$[m]$	the set ${0, \dots, m - 1}$
$τ$	the security parameter
$negl (\cdot)$	a negligible function of the security parameter
n	the dataset size
m	the grid granularity
k	the query parameter
$ω$	the number of lines in one bucket
$λ$	the number of packed points

Appendix B

Appendix B.1. The Proof of Lemma 3

Proof.

In Algorithm 2, due to the homomorphic property of Paillier’s cryptosystem, we know that

\begin{matrix} x^{'} = E {(x_{q})}^{r_{1}} \times E {(X / m)}^{r_{1} \times r_{2}} = E (r_{1} x_{q} + r_{1} r_{2} X / m), u^{'} = E {(X / m)}^{r_{1}} = E (r_{1} X / m), \\ y^{'} = E {(y_{q})}^{r_{1}} \times E {(Y / m)}^{r_{1} \times r_{2}} = E (r_{1} y_{q} + r_{1} r_{2} Y / m), v^{'} = E {(Y / m)}^{r_{1}} = E (r_{1} Y / m) . \end{matrix}

Thus,

x = D (x^{'}) = r_{1} x_{q} + r_{1} r_{2} X / m \mod N,

y = D (y^{'}) = r_{1} y_{q} + r_{1} r_{2} Y / m \mod N,

u = D (u^{'}) = r_{1} X / m \mod N, v = D (v^{'}) = r_{1} Y / m \mod N .

Since

r_{1}

and

r_{2}

are at most

τ

bits, and

x_{q}

and

X / m

are at most

σ

bits, while N is

2 κ

bits, with the security parameter satisfying

τ < κ - \frac{σ + 1}{2}

, we have

x = D (x^{'}) = r_{1} x_{q} + r_{1} r_{2} X / m, u = D (u^{'}) = r_{1} X / m, y = D (y^{'}) = r_{1} y_{q} + r_{1} r_{2} Y / m, v = D (v^{'}) = r_{1} Y / m .

Then,

\begin{matrix} d_{x} = ⌊\frac{x}{u}⌋ = \frac{r_{1} x_{q} + r_{1} r_{2} X / m}{r_{1} X / m} = ⌊\frac{x_{q}}{X / m}⌋ + r_{2}, d_{y} = ⌊\frac{y}{u}⌋ = \frac{r_{1} y_{q} + r_{1} r_{2} Y / m}{r_{1} Y / m} = ⌊\frac{y_{q}}{Y / m}⌋ + r_{2} . \end{matrix}

Consequently, the output is

\begin{matrix} d_{x}^{'} \times E {(r_{2})}^{N - 1} & = E (⌊\frac{x_{q}}{X / m}⌋ + r_{2}) \times E {(r_{2})}^{N - 1} = E (⌊\frac{x_{q}}{X / m}⌋ + r_{2} - r_{2}) = E (⌊\frac{x_{q}}{X / m}⌋), \\ d_{y}^{'} \times E {(r_{2})}^{N - 1} & = E (⌊\frac{y_{q}}{Y / m}⌋ + r_{2}) E {(r_{2})}^{N - 1} = E (⌊\frac{y_{q}}{Y / m}⌋ + r_{2} - r_{2}) = E (⌊\frac{y_{q}}{Y / m}⌋) . \end{matrix}

□

Appendix B.2. The Proof of Lemma 4

Proof.

According to the homomorphic property of Paillier’s cryptosystem and the data-packing technique, from Step 2 of Algorithm 3, we know that

\begin{matrix} Δ_{x j} = {(E (⌊\frac{x_{q}}{X / m}⌋) \times E {(j)}^{N - 1} \times E (T))}^{r_{x j}} = E (r_{x j} (⌊\frac{x_{q}}{X / m}⌋ - j + T)), \\ Δ_{y j} = {(E (⌊\frac{y_{q}}{Y / m}⌋) \times E {(j)}^{N - 1} \times E (T))}^{r_{y j}} = E (r_{y j} (⌊\frac{x_{q}}{X / m}⌋ - j + T)), \end{matrix}

and, in Step 9 of Algorithm 3, we have

E (G_{s t}^{'}) = E (G_{s t}) \times E (r_{s t}) = E (G_{s t} + r_{s t}) .

Thus, in Step 11,

\begin{matrix} Δ_{x}^{'} & = ρ_{1} (Δ_{x}) = (Δ_{x ρ_{1} (0)}, \dots, Δ_{x ρ_{1} (m - 1)}) \\ = (E (r_{x ρ_{1} (0)} (⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (0) + T)), \dots, E (r_{x ρ_{1} (m - 1)} (⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (m - 1) + T))), \\ Δ_{y}^{'} & = ρ_{2} (Δ_{y}) = (Δ_{y ρ_{2} (0)}, \dots, Δ_{y ρ_{2} (m - 1)}) \\ = (E (r_{y ρ_{2} (0)} (⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (0) + T)), \dots, E (r_{y ρ_{2} (m - 1)} (⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (m - 1) + T))), \\ Γ^{'} & = ρ_{2} (ρ_{1} (E (G^{'}))) = {(E (G_{ρ_{1} (s) ρ_{2} (t)}^{'}))}_{0 \leq s, t \leq m - 1} = (E (G_{ρ_{1} (0) ρ_{2} (0)}^{'}), \dots, E (G_{ρ_{1} (m - 1) ρ_{2} (m - 1)}^{'})), \end{matrix}

and, in Step 14,

\begin{matrix} v_{x}^{'} & = \prod_{i = 0}^{m - 1} Δ_{x ρ_{1} (i)}^{2^{σ^{'} (m - (i + 1))}} \\ = {(Δ_{x ρ_{1} (0)})}^{2^{σ^{'} (m - 1)}} \dots (Δ_{x ρ_{1} (m - 1)}) = E (\sum_{i = 0}^{m - 1} (r_{x ρ_{1} (i)} (⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (i) + T)) 2^{σ^{'} (m - (i + 1))}), \\ v_{y}^{'} & = \prod_{i = 0}^{m - 1} Δ_{y ρ_{2} (i)}^{2^{σ^{'} (m - (i + 1))}} \\ = {(Δ_{y ρ_{2} (0)})}^{2^{σ^{'} (m - 1)}} \dots (Δ_{y ρ_{2} (m - 1)}) = E (\sum_{i = 0}^{m - 1} (r_{y ρ_{2} (i)} (⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (i) + T)) 2^{σ^{'} (m - (i + 1))}) . \end{matrix}

Also, in Step 15, according to the property of the data-packing technique, as long as

\begin{matrix} 0 < r_{x ρ_{1} (i)} (⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (i) + T) < 2^{σ^{'}}, \end{matrix}

(A1)

\begin{matrix} 0 < r_{y ρ_{2} (i)} (⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (i) + T) < 2^{σ^{'}}, \end{matrix}

(A2)

\begin{matrix} \sum_{i = 0}^{m - 1} (r_{x ρ_{1} (i)} (⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (i) + T)) 2^{σ^{'} (m - (i + 1))} < N, \end{matrix}

(A3)

\begin{matrix} \sum_{i = 0}^{m - 1} (r_{y ρ_{2} (i)} (⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (i) + T)) 2^{σ^{'} (m - (i + 1))} < N, \end{matrix}

(A4)

we have

\begin{matrix} (v_{x ρ_{1} (0)}, \dots, v_{x ρ_{1} (m - 1)}) = D (v_{x}^{'}) = ((⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (0) + T) r_{x ρ_{1} (0)}, \dots, (⌊ \frac{x_{q}}{X / m} ⌋ - ρ_{1} (m - 1) + T) r_{x ρ_{1} (m - 1)}), \\ (v_{y ρ_{2} (0)}, \dots, v_{y ρ_{2} (m - 1)}) = D (v_{y}^{'}) = ((⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (0) + T) r_{y ρ_{2} (0)}, \dots, (⌊ \frac{y_{q}}{Y / m} ⌋ - ρ_{2} (m - 1) + T) r_{y ρ_{2} (m - 1)}) . \end{matrix}

Since

r_{x j}

and

r_{y j} (0 \leq j \leq m - 1)

are at most

τ

bits, and

x_{q}, y_{q}, X / m, Y / m

and m are at most

σ

bits, with

T \geq m

and

2^{τ + σ} < 2^{σ^{'}}, 2^{σ^{'} m} < N

, the conditions (Equations (A1)–(A4)) hold. Subsequently, in Step 18, the decisional conditions

v_{x ρ_{1} (s)} \mod T = 0

and

v_{y ρ_{2} (t)} \mod T = 0

mean that

⌊ \frac{x_{q}}{X / m} ⌋ = ρ_{1} (s)

and

⌊ \frac{y_{q}}{Y / m} ⌋ = ρ_{2} (t)

. Consequently, in Step 25,

\begin{matrix} E (r) = \prod_{s = 0}^{m - 1} \prod_{t = 0}^{m - 1} {(M_{s t}^{'})}^{r_{s t}} = \prod_{s = 0}^{m - 1} \prod_{t = 0}^{m - 1} {(M_{ρ_{1}^{- 1} (s) ρ_{2}^{- 1} (t)})}^{r_{s t}} = \prod_{s = 0}^{m - 1} \prod_{t = 0}^{m - 1} {(M_{s t})}^{r_{ρ_{1} (s) ρ_{2} (t)}} = E (\sum_{s = 0}^{m - 1} \sum_{t = 0}^{m - 1} r_{ρ_{1} (s) ρ_{2} (t)} δ_{s t}) = E (r_{\hat{s} \hat{t}}) \end{matrix}

with

\hat{s} = ⌊\frac{x_{q}}{X / m}⌋, \hat{t} = ⌊\frac{y_{q}}{Y / m}⌋

and

δ_{s t} = \{\begin{matrix} 1 & (ρ_{1} (s), ρ_{2} (t)) = (\hat{s}, \hat{t}) = (⌊ \frac{x_{q}}{X / m} ⌋, ⌊ \frac{y_{q}}{Y / m} ⌋) \\ 0 & o t h e r w i s e \end{matrix} .

In the last step (Step 26),

\begin{matrix} Γ^{'} \times E {(r)}^{N - 1} = E (G_{ρ_{1} (s) ρ_{2} (t)}^{'}) \times E {(r_{\hat{s} \hat{t}})}^{N - 1} = E (G_{⌊ \frac{x_{q}}{X / m} ⌋, ⌊ \frac{y_{q}}{Y / m} ⌋} + r_{⌊ \frac{x_{q}}{X / m} ⌋, ⌊ \frac{y_{q}}{Y / m} ⌋} - r_{\hat{s} \hat{t}}) = E (G_{\hat{s} \hat{t}}) . \end{matrix}

□

Appendix B.3. The Proof of Lemma 5

Proof.

Due to the homomorphic properties of Paillier’s cryptosystem and the data-packing technique, we know that

\begin{matrix} v_{0}^{'} = & {(E (G_{s t}) \times E (Φ_{0}))}^{r_{0}} = (E (i_{1}^{(s t)} | x_{i_{1}^{(s t)}} | y_{i_{1}}^{(s t)} | \dots | i_{λ}^{(s t)} | x_{i_{λ}^{(s t)}} | y_{i_{λ}^{(s t)}} {) \times E (r_{1} | r_{2} | \dots | r_{3 λ}))}^{r_{0}} \\ = & E (r_{0} (i_{1}^{(s t)} + r_{1}) 2^{σ^{'} (3 λ - 1)} + r_{0} (x_{i_{1}^{(s t)}} + r_{2}) 2^{σ^{'} (3 λ - 2)} + r_{0} (y_{i_{1}^{(s t)}} + r_{3}) 2^{σ^{'} (3 λ - 3)} + \dots + r_{0} (y_{i_{λ}^{(s t)}} + r_{3 λ}) 2^{0}), \\ v_{1}^{'} = & {(E (Q^{'}) \times E (Φ_{1}))}^{r_{0}} = {(E (x_{q} | y_{q} | \dots | x_{q} | y_{q} | \dots | x_{q} | y_{q}) \times E (r_{2} | r_{3} | \dots | r_{3 k - 1} | r_{3 k} | \dots | r_{3 λ - 1} | r_{3 λ}))}^{r_{0}} \\ = & E (r_{0} (x_{q} + r_{2}) 2^{σ^{'} (2 λ - 1)} + r_{0} (y_{q} + r_{3}) 2^{σ^{'} (2 λ - 2)} + \dots + r_{0} (y_{q} + r_{3 λ}) 2^{0}), \\ v_{2}^{'} = & {(E (i d^{'}) \times E (Φ_{2}))}^{r_{0}} = (E (i d | i d | i d | \dots | i d) \times E (r_{1} | r_{4} | r_{7} | \dots | r_{3 λ - 2} {))}^{r_{0}} \\ = & E (r_{0} (i d + r_{1}) 2^{σ^{'} (λ - 1)} + r_{0} (i d + r_{4}) 2^{σ^{'} (λ - 2)} + r_{0} (i d + r_{7}) 2^{σ^{'} (λ - 3)} + \dots + r_{0} (i d + r_{3 λ - 2}) 2^{0}) . \end{matrix}

Thus,

\begin{matrix} v_{0} & = D (v_{0}) = r_{0} (i_{1}^{(s t)} + r_{1}) 2^{σ^{'} (3 λ - 1)} + r_{0} (x_{i_{1}^{(s t)}} + r_{2}) 2^{σ^{'} (3 λ - 2)} + r_{0} (y_{i_{1}^{(s t)}} + r_{3}) 2^{σ^{'} (3 λ - 3)} + \dots + r_{0} (y_{i_{λ}^{(s t)}} + r_{3 λ}) 2^{0} \mod N, \\ v_{1} & = D (v_{1}) = r_{0} (x_{q} + r_{2}) 2^{σ^{'} (2 λ - 1)} + r_{0} (y_{q} + r_{3}) 2^{σ^{'} (2 λ - 2)} + \dots + r_{0} (y_{q} + r_{3 λ}) 2^{0} \mod N, \\ v_{2} & = D (v_{2}) = r_{0} (i d + r_{1}) 2^{σ^{'} (λ - 1)} + r_{0} (i d + r_{4}) 2^{σ^{'} (λ - 2)} + r_{0} (i d + r_{7}) 2^{σ^{'} (λ - 3)} + \dots + r_{0} (i d + r_{3 λ - 2}) 2^{0} \mod N . \end{matrix}

Since

r_{j} (0 \leq j \leq 3 λ)

are at most

τ

bits,

i_{j}^{(s t)}, x_{i_{j}^{(s t)}}, y_{i_{j}^{(s t)}} (1 \leq j \leq λ), x_{q}

,

y_{q}

and

i d

are at most

σ

bits, and the security parameters

τ

and

σ^{'}

satisfy

2^{τ + σ} < 2^{σ^{'}}

and

2^{σ^{'} 3 λ} < N

, we have

\begin{matrix} v_{0} = D (v_{0}) = r_{0} (i_{1}^{(s t)} + r_{1}) 2^{σ^{'} (3 λ - 1)} + r_{0} (x_{i_{1}^{(s t)}} + r_{2}) 2^{σ^{'} (3 λ - 2)} + r_{0} (y_{i_{1}^{(s t)}} + r_{3}) 2^{σ^{'} (3 λ - 3)} + \dots + r_{0} (y_{i_{λ}^{(s t)}} + r_{3 λ}) 2^{0}, \\ v_{1} = D (v_{1}) = r_{0} (x_{q} + r_{2}) 2^{σ^{'} (2 λ - 1)} + r_{0} (y_{q} + r_{3}) 2^{σ^{'} (2 λ - 2)} + \dots + r_{0} (y_{q} + r_{3 λ}) 2^{0}, \\ v_{2} = D (v_{2}) = r_{0} (i d + r_{1}) 2^{σ^{'} (λ - 1)} + r_{0} (i d + r_{4}) 2^{σ^{'} (λ - 2)} + r_{0} (i d + r_{7}) 2^{σ^{'} (λ - 3)} + \dots + r_{0} (i d + r_{3 λ - 2}) 2^{0}, \\ ID = {r_{0} (i_{1}^{(s t)} + r_{1}), r_{0} (i_{2}^{(s t)} + r_{4}), r_{0} (i_{3}^{(s t)} + r_{7}), \dots, r_{0} (i_{λ}^{(s t)} + r_{3 λ - 2})} \\ P = \{(r_{0} (x_{i_{1}^{(s t)}} + r_{2}), r_{0} (y_{i_{1}^{(s t)}} + r_{3})), (r_{0} (x_{i_{2}^{(s t)}} + r_{5}), r_{0} (y_{i_{2}^{(s t)}} + r_{6})), \dots, (r_{0} (x_{i_{λ}^{(s t)}} + r_{3 λ - 1}), r_{0} (y_{i_{λ}^{(s t)}} + r_{3 λ}))\} \\ Q = {(r_{0} (x_{q} + r_{2}), r_{0} (y_{q} + r_{3})), (r_{0} (x_{q} + r_{5}), r_{0} (y_{q} + r_{6})), \dots, (r_{0} (x_{q} + r_{3 λ - 1}), r_{0} (y_{q} + r_{3 λ}))} \\ id = {r_{0} (i d + r_{1}), r_{0} (i d + r_{4}), r_{0} (i d + r_{7}), \dots, r_{0} (i d + r_{3 λ - 2})} \end{matrix}

Subsequently, after the for-loop in Step 20, we have

\begin{matrix} d_{\min} & = \min_{0 \leq j \leq λ - 1, i_{j + 1}^{(s t)} \neq i d} r_{0}^{2} ({(x_{i_{j + 1}^{(s t)}} - x_{q})}^{2} + {(y_{i_{j + 1}^{(s t)}} - y_{q})}^{2}) = r_{0}^{2} ({(x_{i_{p o s + 1}^{(s t)}} - x_{q})}^{2} + {(y_{i_{p o s + 1}^{(s t)}} - y_{q})}^{2}), \end{matrix}

and, in Step 28,

\begin{matrix} E (r_{i d}) = \prod_{i = 0}^{λ - 1} δ {[i]}^{r_{3 i + 1}} = δ {[0]}^{r_{1}} δ {[1]}^{r_{4}} δ {[2]}^{r_{7}} \dots δ {[λ - 1]}^{r_{3 λ - 2}} = E (r_{1} \cdot 0 + \dots + r_{3 p o s + 1} \cdot 1 + \dots + r_{3 λ - 2} \cdot 0) = E (r_{3 p o s + 1}), \\ E (r_{x}) = \prod_{i = 0}^{λ - 1} δ {[i]}^{r_{3 i + 2}} = δ {[0]}^{r_{2}} δ {[1]}^{r_{5}} δ {[2]}^{r_{8}} \dots δ {[λ - 1]}^{r_{3 λ - 1}} = E (r_{2} \cdot 0 + \dots + r_{3 p o s + 2} \cdot 1 + \dots + r_{3 λ - 1} \cdot 0) = E (r_{3 p o s + 2}), \\ E (r_{y}) = \prod_{i = 0}^{λ - 1} δ {[i]}^{r_{3 i + 3}} = δ {[0]}^{r_{3}} δ {[1]}^{r_{6}} δ {[2]}^{r_{9}} \dots δ {[λ - 1]}^{r_{3 λ}} = E (r_{3} \cdot 0 + \dots + r_{3 p o s + 3} \cdot 1 + \dots + r_{3 λ} \cdot 0) = E (r_{3 p o s + 3}) . \end{matrix}

Therefore, in the last step (Step 29), by the homomorphic property,

\begin{matrix} E (i d_{m i n}) & = E {(ID [p o s])}^{r_{0}^{- 1}} \times E {(r_{i d})}^{N - 1} = E (r_{0}^{- 1} ID [p o s] - r_{i d}) = E (r_{0}^{- 1} (r_{0} (i_{p o s + 1}^{(s t)} + r_{3 p o s + 1})) - r_{3 p o s + 1}) = E (i_{p o s + 1}^{(s t)}), \\ E (x_{i d_{m i n}}) & = E {(P [p o s] . x)}^{r_{0}^{- 1}} \times E {(r_{x})}^{N - 1} = E (r_{0}^{- 1} P [p o s] . x - r_{x}) = E (r_{0}^{- 1} (r_{0} (x_{i_{p o s + 1}^{(s t)}} + r_{3 p o s + 2})) - r_{3 p o s + 2}) = E (x_{i_{p o s + 1}^{(s t)}}), \\ E (y_{i d_{m i n}}) & = E {(P [p o s] . y)}^{r_{0}^{- 1}} \times E {(r_{y})}^{N - 1} = E (r_{0}^{- 1} P [p o s] . y - r_{y}) = E (r_{0}^{- 1} (r_{0} (y_{i_{p o s + 1}^{(s t)}} + r_{3 p o s + 3})) - r_{3 p o s + 3}) = E (y_{i_{p o s + 1}^{(s t)}}) . \end{matrix}

□

Appendix B.4. The Proof of Lemma 6

Proof.

According to the homomorphic property, in Step 3, we know that for

0 \leq j \leq ⌈ \frac{n}{w} ⌉ - 1

,

\begin{matrix} E (η_{M j}) = {(E ((j + 1) w - 1) \times E (r_{0 j}))}^{r_{1 j}} = E (r_{1 j} ((j + 1) w - 1 + r_{0 j})), \\ E (η_{j}) = {(E (i d_{\min}) \times E (r_{0 j}))}^{r_{1 j}} = E (r_{1 j} (i d_{\min} + r_{0 j})), \\ E (η_{m j}) = {(E (j w) \times E (r_{0 j}))}^{r_{1 j}} = E (r_{1 j} (j w + r_{0 j})), \end{matrix}

and, in Step 7, for

0 \leq j \leq ⌈ \frac{n}{w} ⌉ - 1

and

0 \leq k \leq w - 1

, we have

\begin{matrix} Ψ_{(j w + k) 0} = {(E (j w + k) \times E {(i d_{\min})}^{N - 1})}^{r_{(j w + k) 0}} = E (r_{(j w + k) 0} (j w + k - i d_{\min})), \\ Ψ_{(j w + k) 1} = E (V R V (P_{j w + k})) \times E (r_{(j w + k) 1}) = E (V R V (P_{j w + k}) + r_{(j w + k) 1}), \\ Ψ_{(j w + k) 2} = E (S i g (P_{j w + k})) \times E (r_{(j w + k) 2}) = E (S i g (P_{j w + k}) + r_{(j w + k) 2}) . \end{matrix}

Thus, in Step 13,

\begin{matrix} η_{M}^{''} = ρ_{1} (η_{M}^{'}) = (E (η_{M ρ_{1} (0)}), \dots, E (η_{M ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)})), η_{m}^{''} = ρ_{1} (η_{m}^{'}) = (E (η_{m ρ_{1} (0)}), \dots, E (η_{m ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)})), \\ η^{''} = ρ_{1} (η^{'}) = (E (η_{ρ_{1} (0)}), \dots, E (η_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)})), B^{''} = ρ_{2} (ρ_{1} (B^{'})) = (Ψ_{0}^{'}, Ψ_{1}^{'}, Ψ_{2}^{'}), \\ (Ψ_{0}^{'}, Ψ_{1}^{'}, Ψ_{2}^{'}) = ((Ψ_{ρ_{1} (0) 0}^{'}, \dots, Ψ_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1), 0}^{'}), (Ψ_{ρ_{1} (0) 1}^{'}, \dots, Ψ_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1), 1}^{'}), (Ψ_{ρ_{1} (0) 2}^{'}, \dots, Ψ_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1), 2}^{'})) = \\ ({(Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 0})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1}, {(Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 1})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1}, {(Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 2})}_{0 \leq j \leq (⌈ \frac{n}{w} ⌉ - 1), 0 \leq k \leq w - 1}), \end{matrix}

which results in the following in Step 19:

\begin{matrix} (η_{M ρ_{1} (0)}, \dots, η_{M ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)}) = D (η_{M}^{''}), (η_{m ρ_{1} (0)}, \dots, η_{m ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)}) = D (η_{m}^{''}), (η_{ρ_{1} (0)}, \dots, η_{ρ_{1} (⌈ \frac{n}{w} ⌉ - 1)}) = D (η^{''}) . \end{matrix}

Consequently, in Step 22, the condition

(η_{M ρ_{1} (j)} - η_{ρ_{1} (j)}) \geq 0 \land (η_{m ρ_{1} (j)} - η_{ρ_{1} (j)}) < 0

is equivalent to

\begin{matrix} (r_{1 ρ_{1} (j)} ((ρ_{1} (j) + 1) w - 1 + r_{0 ρ_{1} (j)})) \mod N - (r_{1 ρ_{1} (j)} (i d_{\min} + r_{0 ρ_{1} (j)})) \mod N \geq 0, \end{matrix}

(A5)

\begin{matrix} (r_{1 ρ_{1} (j)} ((ρ_{1} (j)) w + r_{0 ρ_{1} (j)})) \mod N - (r_{1 ρ_{1} (j)} (i d_{\min} + r_{0 ρ_{1} (j)})) \mod N < 0 . \end{matrix}

(A6)

Since

r_{0 j}

and

r_{1 j} > 0

are at most

τ

bits,

i d_{\min}

and

(⌈ \frac{n}{w} ⌉) w - 1 < n

are at most

σ

bits, and the security parameter

τ

satisfies

2^{τ + σ} < N

and

2^{2 τ} < N

, Equations (A5) and (A6) are

\begin{matrix} r_{1 ρ_{1} (j)} ((ρ_{1} (j) + 1) w - 1 + r_{0 ρ_{1} (j)}) - r_{1 ρ_{1} (j)} (i d_{\min} + r_{0 ρ_{1} (j)}) \geq 0 \Leftrightarrow (ρ_{1} (j) + 1) w - 1 \geq i d_{\min}, \end{matrix}

(A7)

\begin{matrix} r_{1 ρ_{1} (j)} ((ρ_{1} (j)) w + r_{0 ρ_{1} (j)}) - r_{1 ρ_{1} (j)} (i d_{\min} + r_{0 ρ_{1} (j)}) < 0 \Leftrightarrow ρ_{1} (j) w < i d_{\min}, \end{matrix}

(A8)

which means that

i d_{\min}

lies in the bucket

B_{ρ_{1} (j)}

. Then, Step 23 decrypts the entry in each row of this bucket:

\begin{matrix} (ψ_{(ρ_{1} (j) w + ρ_{2} (0)) 0}, \dots, ψ_{(ρ_{1} (j) w + ρ_{2} (w - 1)) 0}) = D (Ψ_{ρ_{1} (j) 0}^{'}) . \end{matrix}

Thus, the condition in Step 25 is

ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 0} = r_{(ρ_{1} (j) w + ρ_{2} (k)) 0} (ρ_{1} (j) w + ρ_{2} (k) - i d_{\min}) = 0,

which means that

i d_{\min} = ρ_{1} (j) w + ρ_{2} (k)

. At this point, we record the ciphertexts of the

V R V (P_{i d_{\min}})

and the signature in Step 26:

\begin{matrix} Θ = (Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 1}, Ψ_{(ρ_{1} (j) w + ρ_{2} (k)) 2}) = (Ψ_{(i d_{\min}) 1}, Ψ_{(i d_{\min}) 2}) = (E (V R V (P_{i d_{\min}}) + r_{i d_{\min} 1}), E (S i g (P_{i d_{\min}}) + r_{i d_{\min} 1}))) . \end{matrix}

Consequently, in Step 30,

\begin{matrix} E (ψ^{(1)}) & = \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{j k}^{'})}^{r_{(j w + k) 1}} = \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{ρ_{1}^{- 1} (j) ρ_{2}^{- 1} (k)})}^{r_{(j w + k) 1}} = \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{j k})}^{r_{(ρ_{1} (j) w + ρ_{2} (k)) 1}} \\ = E (\sum_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \sum_{k = 0}^{w - 1} δ_{j k} r_{(ρ_{1} (j) w + ρ_{2} (k)) 1}) = E (r_{i d_{\min} 1}), \end{matrix}

\begin{matrix} E (ψ^{(2)}) & = \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{j k}^{'})}^{r_{(j w + k) 2}} = \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{ρ_{1}^{- 1} (j) ρ_{2}^{- 1} (k)})}^{r_{(j w + k) 2}} = \prod_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \prod_{k = 0}^{w - 1} {(M_{j k})}^{r_{(ρ_{1} (j) w + ρ_{2} (k)) 2}} \\ = E (\sum_{j = 0}^{⌈ \frac{n}{w} ⌉ - 1} \sum_{k = 0}^{w - 1} δ_{j k} r_{(ρ_{1} (j) w + ρ_{2} (k)) 2}) = E (r_{i d_{\min} 2}) \end{matrix}

with

δ_{j k} = \{\begin{matrix} 1 & ρ_{1} (j) w + ρ_{2} (k) = i d_{\min} \\ 0 & o t h e r w i s e \end{matrix},

and, in the last step (Step 31),

\begin{matrix} Θ_{1} \times E {(ψ^{(1)})}^{N - 1} = E (V R V (P_{i d_{\min}}) + r_{i d_{\min} 1}) \times E {(r_{i d_{\min} 1})}^{N - 1} = E (V R V (P_{i d_{\min}})), \\ Θ_{2} \times E {(ψ^{(2)})}^{N - 1} = E (S i g (P_{i d_{\min}}) + r_{i d_{\min} 2}) \times E {(r_{i d_{\min} 2})}^{N - 1} = E (S i g (P_{i d_{\min}})) . \end{matrix}

□

Appendix B.5. The Proof of Lemma 7

Proof.

Given the correctness established by Lemmas 3–5 for Algorithms 2–4, it follows that in Step 4, the two-tuple

(E (x_{i d_{m i n}^{(1)}}), E (y_{i d_{m i n}^{(1)}}))

represents the encrypted nearest-neighbor point to Q. Due to the homomorphic property of Paillier’s cryptosystem, after Steps 5–7,

(x_{i d_{\min}^{(1)}}^{'}, y_{i d_{\min}^{(1)}}^{'}) = (x_{i d_{\min}^{(1)}} + r_{2}^{(1)}, y_{i d_{\min}^{(1)}} + r_{3}^{(1)})

, and

P_{i d_{\min}^{(1)}} = (x_{i d_{\min}^{(1)}}, y_{i d_{\min}^{(1)}})

is the nearest neighbor to Q. Also, by Lemma 6 and the homomorphic property, after Steps 8–11, we have

S i g {(P_{i d_{\min}^{(1)}})}^{'} = S i g (P_{i d_{\min}^{(1)}}) + r_{5}^{(1)}

and

\begin{matrix} V R V {(P_{i d_{\min}^{(1)}})}^{'} & = (i d_{1}^{(1)} + r_{4, 1}^{(1)}) 2^{σ (3 L - 1)} + (x_{i d_{1}^{(1)}} + r_{4, 2}^{(1)}) 2^{σ (3 L - 2)} + (y_{i d_{1}^{(1)}} + r_{4, 3}^{(1)}) 2^{σ (3 L - 3)} + \dots + (y_{i d_{L}^{(1)}} + r_{4, 3 L}^{(1)}) 2^{0} . \end{matrix}

Thus, after the unpacking operation in Step 12,

\begin{matrix} \{{i d_{1}^{(1)}}^{'}, P_{{i d_{1}^{(1)}}^{'}}^{'}, \dots, {i d_{L}^{(1)}}^{'}, P_{{i d_{L}^{(1)}}^{'}}^{'}\} = \{i d_{1}^{(1)} + r_{4, 1}^{(1)}, P_{i d_{1}^{(1)}} + (r_{4, 2}^{(1)}, r_{4, 3}^{(1)}), \dots, i d_{L}^{(1)} + r_{4, 3 L - 2}^{(1)}, P_{i d_{L}^{(1)}} + (r_{4, 3 L - 1}^{(1)}, r_{4, 3 L}^{(1)})\} . \end{matrix}

Through a similar analysis, we can also prove the cases for

j = 2, \dots, k

. □

References

Wang, J.; Chen, X. Efficient and Secure Storage for Outsourced Data: A Survey; Springer: Berlin/Heidelberg, Germany, 2016; Volume 1, pp. 178–188. [Google Scholar] [CrossRef]
Lei, X.; Liu, A.X.; Li, R.; Tu, G.H. SecEQP: A Secure and Efficient Scheme for SkNN Query Problem Over Encrypted Geodata on Cloud. In Proceedings of the 2019 IEEE 35th International Conference on Data Engineering (ICDE), Macao, China, 8–11 April 2019; pp. 662–673. [Google Scholar] [CrossRef]
Liu, Q.; Hao, Z.; Peng, Y.; Jiang, H.; Wu, J.; Peng, T.; Wang, G.; Zhang, S. SecVKQ: Secure and verifiable kNN queries in sensor–cloud systems. J. Syst. Archit. 2021, 120, 102300. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, B.; Zhao, Z. Secure k-NN Query With Multiple Keys Based on Random Projection Forests. IEEE Internet Things J. 2024, 11, 15205–15218. [Google Scholar] [CrossRef]
Qi, J.; Jia, X.; Luo, M.; Feng, Q. A Privacy-Aware K-Nearest Neighbor Query Scheme for Location-Based Services. IEEE Internet Things J. 2024, 11, 10831–10842. [Google Scholar] [CrossRef]
Cheng, K.; Wang, L.; Shen, Y.; Wang, H.; Wang, Y.; Jiang, X.; Zhong, H. Secure k-NN Query on Encrypted Cloud Data with Multiple Keys. IEEE Trans. Big Data 2021, 7, 689–702. [Google Scholar] [CrossRef]
Sundarapandi, G.P.; Bokhary, S.; Samanthula, B.K.; Dong, B. A Probabilistic Approach for Secure and Verifiable Computation of kNN Queries in Cloud. In Proceedings of the 2023 IEEE Cloud Summit, Baltimore, MD, USA, 6–7 July 2023; pp. 15–20. [Google Scholar] [CrossRef]
Cui, N.; Qian, K.; Cai, T.; Li, J.; Yang, X.; Cui, J.; Zhong, H. Towards Multi-User, Secure, and Verifiable kNN Query in Cloud Database. IEEE Trans. Knowl. Data Eng. 2023, 35, 9333–9349. [Google Scholar] [CrossRef]
Cui, N.; Yang, X.; Wang, B.; Li, J.; Wang, G. SVkNN: Efficient Secure and Verifiable k-Nearest Neighbor Query on the Cloud Platform. In Proceedings of the 2020 IEEE 36th International Conference on Data Engineering (ICDE), Dallas, TX, USA, 20–24 April 2020; pp. 253–264. [Google Scholar] [CrossRef]
Oliveira, S.R.; Zaiane, O.R. Privacy preserving clustering by data transformation. J. Inf. Data Manag. 2010, 1, 37. [Google Scholar]
Wong, W.K.; Cheung, D.W.l.; Kao, B.; Mamoulis, N. Secure kNN computation on encrypted databases. In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, New York, NY, USA, 29 June–2 July 2009; SIGMOD ’09. pp. 139–152. [Google Scholar] [CrossRef]
Hu, H.; Xu, J.; Ren, C.; Choi, B. Processing private queries over untrusted data cloud through privacy homomorphism. In Proceedings of the 2011 IEEE 27th International Conference on Data Engineering, Hannover, Germany, 11–16 April 2011; pp. 601–612. [Google Scholar] [CrossRef]
Yao, B.; Li, F.; Xiao, X. Secure nearest neighbor revisited. In Proceedings of the 2013 IEEE 29th International Conference on Data Engineering (ICDE), Brisbane, QLD, Australia, 8–12 April 2013; pp. 733–744. [Google Scholar] [CrossRef]
Choi, S.; Ghinita, G.; Lim, H.S.; Bertino, E. Secure kNN Query Processing in Untrusted Cloud Environments. IEEE Trans. Knowl. Data Eng. 2014, 26, 2818–2831. [Google Scholar] [CrossRef]
Wang, B.; Hou, Y.; Li, M. QuickN: Practical and Secure Nearest Neighbor Search on Encrypted Large-Scale Data. IEEE Trans. Cloud Comput. 2022, 10, 2066–2078. [Google Scholar] [CrossRef]
Popa, R.A.; Li, F.H.; Zeldovich, N. An Ideal-Security Protocol for Order-Preserving Encoding. In Proceedings of the 2013 IEEE Symposium on Security and Privacy, Berkeley, CA, USA, 19–22 May 2013; pp. 463–477. [Google Scholar] [CrossRef]
Zhu, Y.; Xu, R.; Takagi, T. Secure k-NN computation on encrypted cloud data without sharing key with query users. In Proceedings of the 2013 International Workshop on Security in Cloud Computing, Hangzhou, China, 8 May 2013; Cloud Computing ’13. pp. 55–60. [Google Scholar] [CrossRef]
Zhu, Y.; Huang, Z.; Takagi, T. Secure and controllable k-NN query over encrypted cloud data with key confidentiality. J. Parallel Distrib. Comput. 2016, 89, 1–12. [Google Scholar] [CrossRef]
Lei, X.; Tu, G.H.; Liu, A.X.; Xie, T. Fast and Secure kNN Query Processing in Cloud Computing. In Proceedings of the 2020 IEEE Conference on Communications and Network Security (CNS), Avignon, France, 29 June–1 July 2020; pp. 1–9. [Google Scholar] [CrossRef]
Li, R.; Liu, A.X.; Xu, H.; Liu, Y.; Yuan, H. Adaptive Secure Nearest Neighbor Query Processing Over Encrypted Data. IEEE Trans. Dependable Secur. Comput. 2022, 19, 91–106. [Google Scholar] [CrossRef]
Zheng, Y.; Lu, R.; Zhang, S.; Shao, J.; Zhu, H. Achieving Practical and Privacy-Preserving kNN Query over Encrypted Data. In IEEE Transactions on Dependable and Secure Computing; IEEE: Piscataway, NJ, USA, 2024; pp. 1–13. [Google Scholar] [CrossRef]
Elmehdwi, Y.; Samanthula, B.K.; Jiang, W. Secure k-nearest neighbor query over encrypted data in outsourced environments. In Proceedings of the 2014 IEEE 30th International Conference on Data Engineering, Chicago, IL, USA, 31 March–4 April 2014; pp. 664–675. [Google Scholar] [CrossRef]
Guan, Y.; Lu, R.; Zheng, Y.; Shao, J.; Wei, G. Toward Oblivious Location-Based k-Nearest Neighbor Query in Smart Cities. IEEE Internet Things J. 2021, 8, 14219–14231. [Google Scholar] [CrossRef]
Yiu, M.L.; Lo, E.; Yung, D. Authentication of moving kNN queries. In Proceedings of the 2011 IEEE 27th International Conference on Data Engineering, Hannover, Germany, 11–16 April 2011; pp. 565–576. [Google Scholar] [CrossRef]
Rong, H.; Wang, H.; Liu, J.; Wu, W.; Xian, M. Efficient Integrity Verification of Secure Outsourced kNN Computation in Cloud Environments. In Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, Tianjin, China, 23–26 August 2016; pp. 236–243. [Google Scholar] [CrossRef]
Jiang, S.; Zhu, X.; Guo, L.; Liu, J. Publicly Verifiable Boolean Query Over Outsourced Encrypted Data. IEEE Trans. Cloud Comput. 2019, 7, 799–813. [Google Scholar] [CrossRef]
Wu, S.; Li, Q.; Li, G.; Yuan, D.; Yuan, X.; Wang, C. ServeDB: Secure, Verifiable, and Efficient Range Queries on Outsourced Database. In Proceedings of the 2019 IEEE 35th International Conference on Data Engineering (ICDE), Macao, China, 8–11 April 2019; pp. 626–637. [Google Scholar] [CrossRef]
Liu, X.; Deng, R.H.; Choo, K.K.R.; Weng, J. An Efficient Privacy-Preserving Outsourced Calculation Toolkit With Multiple Keys. IEEE Trans. Inf. Forensics Secur. 2016, 11, 2401–2414. [Google Scholar] [CrossRef]
Yi, X.; Paulet, R.; Bertino, E.; Varadharajan, V. Practical Approximate k Nearest Neighbor Queries with Location and Query Privacy. IEEE Trans. Knowl. Data Eng. 2016, 28, 1546–1559. [Google Scholar] [CrossRef]
Benabbas, S.; Gennaro, R.; Vahlis, Y. Verifiable Delegation of Computation over Large Datasets. In Proceedings of the Advances in Cryptology—CRYPTO 2011, Santa Barbara, CA, USA, 14–18 August 2011; Rogaway, P., Ed.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 111–131. [Google Scholar]
Gennaro, R.; Gentry, C.; Parno, B. Non-interactive Verifiable Computing: Outsourcing Computation to Untrusted Workers. In Proceedings of the Advances in Cryptology—CRYPTO 2010, Santa Barbara, CA, USA, 15–19 August 2010; Rabin, T., Ed.; Springer: Berlin/Heidelberg, Germany, 2010; pp. 465–482. [Google Scholar]
Parno, B.; Raykova, M.; Vaikuntanathan, V. How to delegate and verify in public: Verifiable computation from attribute-based encryption. In Proceedings of the 9th International Conference on Theory of Cryptography, Sicily, Italy, 19–21 March 2012; Springer: Berlin/Heidelberg, Germany, 2012. TCC’12. pp. 422–439. [Google Scholar] [CrossRef]
Wang, Q.; Zhou, F.; Zhou, B.; Xu, J.; Chen, C.; Wang, Q. Privacy-Preserving Publicly Verifiable Databases. IEEE Trans. Dependable Secur. Comput. 2022, 19, 1639–1654. [Google Scholar] [CrossRef]
Liu, J.; Zhang, L.F. Privacy-Preserving and Publicly Verifiable Matrix Multiplication. IEEE Trans. Serv. Comput. 2023, 16, 2059–2071. [Google Scholar] [CrossRef]
Paillier, P. Public-Key Cryptosystems Based on Composite Degree Residuosity Classes. In Advances in Cryptology—EUROCRYPT ’99; Stern, J., Ed.; Springer: Berlin/Heidelberg, Germany, 1999; pp. 223–238. [Google Scholar]
National Institute of Standards and Technology. FIPS-186–3 FIPS 186-3, Digital Signature Standard (DSS)-NIST CSRC. Available online: https://csrc.nist.gov/files/pubs/fips/186-3/final/docs/fips_186-3.pdf (accessed on 10 April 2025).
Okabe, A.; Boots, B.; Sugihara, K. Spatial Tessellations: Concepts and Applications of Voronoi Diagrams; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 1992. [Google Scholar]
Kolahdouzan, M.; Shahabi, C. Voronoi-based K nearest neighbor search for spatial network databases. In Proceedings of the Thirtieth International Conference on Very Large Data Bases-Volume 30. VLDB Endowment, 2004, VLDB ’04, Toronto, ON, Canada, 30 August–3 September 2004; pp. 840–851. [Google Scholar]
Liu, A.; Zhengy, K.; Liz, L.; Liu, G.; Zhao, L.; Zhou, X. Efficient secure similarity computation on encrypted trajectory data. In Proceedings of the 2015 IEEE 31st International Conference on Data Engineering, Seoul, Republic of Korea, 13–17 April 2015; pp. 66–77. [Google Scholar] [CrossRef]
Liu, J.; Yang, J.; Xiong, L.; Pei, J. Secure Skyline Queries on Cloud Platform. In Proceedings of the 2017 IEEE 33rd International Conference on Data Engineering (ICDE), San Diego, CA, USA, 19–22 April 2017; pp. 633–644. [Google Scholar] [CrossRef]
Yao, A.C.C. How to generate and exchange secrets. In Proceedings of the 27th Annual Symposium on Foundations of Computer Science (focs 1986), Toronto, ON, Canada, 27–29 October 1986; pp. 162–167. [Google Scholar] [CrossRef]

Figure 1. The system model.

Figure 2. An example of

ED = (E (I_{1}), E (I_{2}))

with the grid granularity

m = 2

,

E (I_{1}) = \{E (G_{s t}) | s, t = 0, 1\}

, and

E (I_{2}) = \{B_{j} | j = 0, 1, 2, 3\}

. The points with red are padding records.

Figure 2. An example of

ED = (E (I_{1}), E (I_{2}))

with the grid granularity

m = 2

,

E (I_{1}) = \{E (G_{s t}) | s, t = 0, 1\}

, and

E (I_{2}) = \{B_{j} | j = 0, 1, 2, 3\}

. The points with red are padding records.

Figure 3. Time cost comparison for varying dataset sizes n.

Figure 4. Time cost comparison for varying grid granularities m.

Figure 5. Time cost comparison for varying query parameters k.

Figure 6. Time cost comparison for varying key sizes K.

Figure 7. Communication cost comparison for

{CS}_{1} \leftrightarrow {CS}_{2}

in the

Search

stage.

Figure 7. Communication cost comparison for

{CS}_{1} \leftrightarrow {CS}_{2}

in the

Search

stage.

Figure 8. Communication latency for

{CS}_{1} \leftrightarrow {CS}_{2}

in the

Search

stage.

Figure 8. Communication latency for

{CS}_{1} \leftrightarrow {CS}_{2}

in the

Search

stage.

Table 1. Comparison of system models and security properties of existing kNN schemes.

Scheme	Privacy				Verifiability		kNN		System Model
Scheme	Dataset	Query	Result	Access Patterns	Private	Public	Appro	Exact	System Model
Wong et al. [11]	×	×	×	×	×	×	×	✓	1 server
Hu et al. [12]	×	×	×	×	×	×	×	✓	1 server
Yao et al. [13]	✓	✓	✓	×	×	×	✓	×	1 server
Choi et al. [14]	✓	✓	✓	×	×	×	×	✓	1 server
Zhu et al. [17,18]	✓	✓	✓	×	×	×	×	✓	1 server
Yi [29]	✓	✓	✓	×	×	×	✓	×	1 server
Lei et al. [2]	✓	✓	✓	×	×	×	✓	×	1 server
Lei et al. [19]	✓	✓	✓	×	×	×	×	✓	1 server
Li et al. [20]	✓	✓	✓	×	×	×	×	✓	1 server
Zheng et al. [21]	✓	✓	✓	×	×	×	×	✓	1 server
Elmehdwi et al. [22]	✓	✓	✓	✓	×	×	×	✓	2 servers
Guan et al. [23]	✓	✓	✓	✓	×	×	×	✓	2 servers
Qi et al. [5]	✓	✓	✓	×	×	×	×	✓	2 servers
Yiu et al. [24]	×	×	×	×	✓	×	×	✓	1 server
Rong et al. [25]	×	✓	✓	×	✓^★	×	×	✓	2 servers
Sundarapandi et al. [7]	✓	✓	✓	✓	✓^★	×	×	✓	2 servers
Liu et al. [3]	✓	✓	✓	×	✓	×	×	✓	3 servers (2 clouds + 1 edge)
Zhang et al. [4]	✓	✓	✓	✓	✓^★★	×	×	✓	2 servers + 1 KGC
Cui et al. [9]	✓	✓	✓	✓	✓	×	×	✓	2 servers
Cui et al. [8]	✓	✓	✓	✓	✓	×	×	✓	2 servers + 1 CA
Ours	✓	✓	✓	✓	×	✓	×	✓	2 servers

Regarding verifiability, ‘✓^★’ indicates that the verification approach for query results is probabilistic, while ‘✓^★★’ signifies that the scheme only supports verifying whether the query results returned by the cloud correspond to the authentic data uploaded by the DO.

Table 2. Notations.

Notation	Description
$ρ$	a permutation function
$D$	the plaintext spatial dataset
$ED$	the ciphertext spatial dataset
$L C M (a, b)$	the least common multiple of two integers a and b
Q	$Q = (x_{q}, y_{q})$ is the query data point
$i d_{\min}^{(j)}$	the index of the jth nearest neighbor to Q
$P_{i d_{\min}^{(j)}}$	$P_{i d_{\min}^{(j)}} = (x_{i d_{\min}^{(j)}}, y_{i d_{\min}^{(j)}})$ denotes the jth
	nearest neighbor to Q
$V R V (P_{i d_{\min}^{(j)}})$	the set of Voronoi-relevant vectors of $P_{i d_{\min}^{(j)}}$
$H (\cdot)$	a cryptographic hash function
$Sig (\cdot)$	a DSA signature
N	a large integer that is the product of two
	prime numbers p and q
$Z_{N^{2}}^{★}$	the multiplicative group of the residue
	class modulo $N^{2}$
$Z_{N}^{★}$	the residue class ring modulo N
$x \| \| y$	the concatenation of two numbers x and y
$⌊ x ⌋$	the greatest integer no larger than x
$[n]$	the set ${0, \dots, n - 1}$
$[m]$	the set ${0, \dots, m - 1}$
$negl (\cdot)$	a negligible function of some input parameter

Table 3. Computational cost of each algorithm.

	${CS}_{1}$	${CS}_{2}$
Algorithm	${CS}_{1}$	${CS}_{2}$
Algorithm 1	$O ((k λ + k ⌈ \frac{n}{w} ⌉) τ + (m^{2} + k n) \log N) Mul s$ $+ O (m^{2} + k n) Enc s$	$O (k ⌈ \frac{n}{w} ⌉) Dec s + 2 Div s +$ $O (k λ) Mul s + O (k) Enc s$
Algorithm 2	$O (τ + \log N) Mul s$	$4 Dec s + 2 Div s + 2 Enc s$
Algorithm 3	$O (m^{2} \log N) Mul s + O (m^{2}) Enc s$	$2 Enc s + 2 Dec s$
Algorithm 4	$O (λ σ + \log N) Mul s + 3 Enc s$	$3 Dec s + 5 Enc s + 2 λ Mul s$
Algorithm 5	$O (⌈ \frac{n}{w} ⌉ τ + n \log N) Mul s + O (n) Enc s$	$O (⌈ \frac{n}{w} ⌉) Dec s + 2 Enc s$

Table 4. Communication cost of each algorithm (unit: bits).

	${CS}_{1} \to$ ${CS}_{2}$	${CS}_{2} \to$ ${CS}_{1}$	${CS}_{1} \to$ QU	${CS}_{2} \to$ QU
Algorithm	${CS}_{1} \to$ ${CS}_{2}$	${CS}_{2} \to$ ${CS}_{1}$	${CS}_{1} \to$ QU	${CS}_{2} \to$ QU
Algorithm 1	$O ((m^{2} + k n) \log N + \log m)$	$O ((m^{2} + k λ + k n) \log N)$	$O (k τ)$	$O (k \log N)$
Algorithm 2	$O (\log N)$	$O (\log N)$	−	−
Algorithm 3	$O (m^{2} \log N + \log m)$	$O (m^{2} \log N)$	−	−
Algorithm 4	$O (\log N)$	$O (λ \log N)$	−	−
Algorithm 5	$O (⌈ \frac{n}{w} ⌉ \log N + n \log N)$	$O (n \log N)$	−	−

Table 5. Comparison of computational costs between our protocol and Cui et al.’s protocol.

Protocol		Cui et al.’s Protocol [8]					Our Protocol
	Stages	$Setup$	$DSEnc$	$QUEnc$	$Search$	$Verify$ $ResDec$	$Setup$	$DSEnc$	$QUEnc$	$Search$	$Verify$ $ResDec$
Entities		$Setup$	$DSEnc$	$QUEnc$	$Search$	$Verify$ $ResDec$	$Setup$	$DSEnc$	$QUEnc$	$Search$	$Verify$ $ResDec$
$DO$		−	$(2 m^{2} + 4 n +$ $2) Enc s +$ $O (n) Hash s$	−	−	−	$O (\log p) Muls$	$(m^{2} + 3 n +$ $2) Enc s + n Sig s$	−	−	−
${CS}_{1}$		−	−	−	$O ((m^{2} + t m$ $+ k (λ + n)) τ + (t m$ $+ k n) \log N) Mul s$ $+ O (m^{2} + k n) Enc s$	−	−	−	−	$O ((k λ + k ⌈ \frac{n}{w} ⌉) τ$ $+ (m^{2} + k n) \log N) Mul s$ $+ O (m^{2} + k n) Enc s$	−
${CS}_{2}$		−	−	−	$2 Div s +$ $+ O (k) Enc s +$ $O (k ⌈ \frac{n}{w} ⌉) Dec s$	−	−	−	−	$2 Div s +$ $O (k) Enc s +$ $O (k ⌈ \frac{n}{w} ⌉) Dec s$	−
$QU$		−	−	$6 Enc s$	−	$O (2 k) Dec s$ $+ O (k^{2}) Hash s$	−	−	$2 Enc s$	−	$O (k) Hash s +$ $O (k) Ver s$

Table 6. Comparison of communication costs between our protocol and Cui et al.’s protocol (unit: bits).

Protocol		Cui et al.’s Protocol [8]			Our Protocol
	Stages	$DSEnc$	$QUEnc$	$Search$	$DSEnc$	$QUEnc$	$Search$
Entities		$DSEnc$	$QUEnc$	$Search$	$DSEnc$	$QUEnc$	$Search$
$CA \to DO$		$O (\log N)$	−	−	−	−	−
$CA \to {CS}_{1}$		$O (\log N)$	−	−	−	−	−
$CA \to {CS}_{2}$		$O (\log N)$	−	−	−	−	−
$CA \to QU$		−	$O (\log N)$	−	−	−	−
$DO \to {CS}_{1}$		$O ((m^{2} + n) \log N)$	−	−	$O ((m^{2} + n) \log N)$	−	−
$DO \to {CS}_{2}$		−	−	−	$O (\log N)$	−	−
$QU \to {CS}_{1}$		−	$O (\log N)$	−	−	$O (\log N)$	−
${CS}_{1} \to {CS}_{2}$		−	−	$O ((m^{2} + k n) \log N$ $+ \log T)$	−	−	$O ((m^{2} + k n) \log N + \log m)$
${CS}_{2} \to {CS}_{1}$		−	−	$O ((m^{2} + k λ + k n) \log N)$	−	−	$O ((m^{2} + k λ + k n) \log N)$
${CS}_{1} \to QU$		−	−	$O (k \log N)$	−	−	$O (k τ)$
${CS}_{2} \to QU$		−	−	−	−	−	$O (k \log N)$

Table 7. Time cost comparison on synthesized datasets with different sizes of 1000, 5000, 10,000, and 20,000 (unit: seconds).

Protocol		Cui et al.’s Protocol [8]				Our Protocol
	Dataset Size $n$	1000	5000	10,000	20,000	1000	5000	10,000	20,000
Stages		1000	5000	10,000	20,000	1000	5000	10,000	20,000
$Setup$		$1.448$	$1.181$	$1.211$	$1.156$	$0.312$	$0.544$	$0.279$	$0.497$
$DSEnc$		$97.933$	$723.832$	$1458.838$	$3658.052$	$47.716$	$337.935$	$727.206$	$2046.939$
$QUEnc$		$0.058$	$0.057$	$0.056$	$0.059$	$0.022$	$0.022$	$0.023$	$0.023$
$Search$		$58.655$	$303.406$	$598.363$	$1388.696$	$8.287$	$16.914$	$24.610$	$47.330$
$Verify$ and $ResDec$		$0.160$	$0.534$	$0.602$	$0.722$	$0.005$	$0.005$	$0.006$	$0.008$
$Total$		$158.254$	$1029.010$	$2059.070$	$5048.685$	$56.342$	$355.420$	$752.124$	$2094.797$

Table 8. Time cost comparison with different grid granularities on a synthesized dataset of size 2000 (unit: seconds).

Protocol		Cui et al.’s Protocol [8]						Our Protocol
	Grid Granularity $m$	4	8	12	16	32	64	4	8	12	16	32	64
Stages		4	8	12	16	32	64	4	8	12	16	32	64
$Setup$		1.009	1.188	1.462	1.692	1.914	1.687	0.291	0.285	0.240	0.207	0.463	0.871
$DSEnc$		181.751	182.137	184.899	172.719	218.232	225.192	86.298	85.202	90.337	92.901	103.423	115.173
$QUEnc$		0.057	0.058	0.056	0.056	0.058	0.055	0.023	0.022	0.023	0.023	0.023	0.024
$Search$		114.947	102.155	110.785	120.766	133.233	137.680	11.147	10.255	10.811	11.175	11.708	12.942
$Verify$ and $ResDec$		0.243	0.221	0.230	0.205	0.277	0.251	0.004	0.004	0.005	0.004	0.005	0.005
$Total$		$298.007$	$285.759$	$297.432$	$309.036$	$353.714$	$364.865$	$97.763$	$95.768$	$101.456$	$104.420$	$115.622$	$129.015$

Table 9. Time cost comparison with different query parameters k on a synthesized dataset of size 2000 (unit: seconds).

Protocol		Cui et al.’s Protocol [8]					Our Protocol
	Query Parameter $k$	1	3	5	7	9	1	3	5	7	9
Stages		1	3	5	7	9	1	3	5	7	9
$Setup$		1.317	1.402	1.692	1.566	1.477	0.286	0.256	0.317	0.279	0.263
$DSEnc$		180.479	185.852	186.317	178.708	183.617	89.623	88.044	92.901	91.075	92.367
$QUEnc$		0.057	0.058	0.056	0.057	0.057	0.023	0.022	0.023	0.022	0.023
$Search$		37.263	70.774	120.766	201.474	261.221	2.827	6.855	11.175	16.411	21.933
$Verify$ and $ResDec$		0.031	0.094	0.205	0.258	0.322	0.002	0.003	0.004	0.008	0.013
$Total$		$219.147$	$258.180$	$309.036$	$382.063$	$446.694$	$92.761$	$95.180$	$104.420$	$107.795$	$114.599$

Table 10. Time cost comparison with different key sizes on a synthesized dataset of size 2000 (unit: seconds).

Protocol		Cui et al.’s Protocol [8]			Our Protocol
	Key Size $K$	512	1024	2048	512	1024	2048
Stages		512	1024	2048	512	1024	2048
$Setup$		$0.349$	$1.192$	$6.582$	$0.192$	$0.317$	$0.535$
$DSEnc$		$36.517$	$172.719$	$867.321$	$20.978$	$92.901$	$494.262$
$QUEnc$		$0.009$	$0.056$	$0.267$	$0.004$	$0.023$	$0.154$
$Search$		$24.160$	$120.766$	$650.312$	$2.092$	$11.175$	$53.270$
$Verify$ and $ResDec$		$0.055$	$0.205$	$1.067$	$0.005$	$0.004$	$0.006$
$Total$		$61.090$	$309.036$	$1525.549$	$23.271$	$104.420$	$548.227$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, J.; Song, Y.; Tian, C.; Tian, W. PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services. Modelling 2025, 6, 44. https://doi.org/10.3390/modelling6020044

AMA Style

Li J, Song Y, Tian C, Tian W. PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services. Modelling. 2025; 6(2):44. https://doi.org/10.3390/modelling6020044

Chicago/Turabian Style

Li, Jingyi, Yuqi Song, Chengliang Tian, and Weizhong Tian. 2025. "PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services" Modelling 6, no. 2: 44. https://doi.org/10.3390/modelling6020044

APA Style

Li, J., Song, Y., Tian, C., & Tian, W. (2025). PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services. Modelling, 6(2), 44. https://doi.org/10.3390/modelling6020044

Article Menu

PVkNN: A Publicly Verifiable and Privacy-Preserving Exact kNN Query Scheme for Cloud-Based Location Services

Abstract

1. Introduction

1.1. Related Works

1.2. Challenges and Contributions

1.3. Layout of This Paper

2. System Architecture, Threat Models, and Design Goals

2.1. System Architecture and Threat Models

2.2. Design Goals

2.2.1. Correctness

2.2.2. Public Verifiability

2.2.3. Privacy

2.2.4. Efficiency

3. Preliminaries

3.1. Notations

3.2. Permutation

3.3. Paillier’s Additively Homomorphic Cryptosystem

3.4. Digital Signature Algorithm DSA

3.5. Voronoi Diagram

3.6. Data Packing with Paillier’s Cryptosystem

4. Our Main Design

4.1. Design Intuition and Basic Idea

4.2. Our Main Outsourcing Protocol

4.2.1. DO Dataset Preprocessing Stage

4.2.2. System Setup Stage: Setup

4.2.3. Dataset Encryption Stage: DSEnc

4.2.4. QU Query Encryption Stage: QuEnc

4.2.5. CS Search Stage: Search

4.2.6. QU Verification and Decryption Stage: Verify and ResDec

5. Correctness and Security Analysis

5.1. Correctness Analysis

5.2. Public Verifiability

5.3. Privacy

6. Efficiency Analysis and Performance Evaluation

6.1. Evaluation Methodology

6.2. Theoretical Analysis

6.3. Experimental Analysis

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

Appendix B

Appendix B.1. The Proof of Lemma 3

Appendix B.2. The Proof of Lemma 4

Appendix B.3. The Proof of Lemma 5

Appendix B.4. The Proof of Lemma 6

Appendix B.5. The Proof of Lemma 7

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2.2. System Setup Stage: $Setup$

4.2.3. Dataset Encryption Stage: $DSEnc$

4.2.4. QU Query Encryption Stage: $QuEnc$

4.2.5. CS Search Stage: $Search$

4.2.6. QU Verification and Decryption Stage: $Verify$ and $ResDec$