Abstract
Cloud computing is widely used by organizations and individuals to outsource computation and data storage. With the growing adoption of machine learning as a service (MLaaS), machine learning models are being increasingly deployed on cloud platforms. However, operating MLaaS on the cloud raises significant privacy concerns, particularly regarding the leakage of sensitive personal data and proprietary machine learning models. This paper proposes a privacy-preserving decision tree (PPDT) framework that enables secure predictions on sensitive inputs through homomorphic matrix multiplication within a three-party setting involving a data holder, a model holder, and an outsourced server. Additionally, we introduce a leaf node pruning (LNP) algorithm designed to identify and retain the most informative leaf nodes during prediction with a decision tree. Experimental results show that our approach reduces prediction computation time by approximately 85% compared to conventional protocols, without compromising prediction accuracy. Furthermore, the LNP algorithm alone achieves up to a 50% reduction in computation time compared to approaches that do not employ pruning.
1. Introduction
It has been a long time since data was referred to as “the oil of the 21st century”. Today, many of the world’s most valuable companies thrive by collecting and monetizing vast amounts of data. However, such data often include sensitive personal information, making it inaccessible for public or collaborative use—not only due to commercial interests but also to mitigate privacy risks. Despite the significant potential of data sharing in addressing pressing societal challenges such as crime prevention, public health, and elder care, privacy concerns continue to severely limit the practical use of sensitive datasets across institutional boundaries. To address this challenge, privacy-preserving machine learning (PPML) has emerged as a key paradigm for enabling data-driven insights without compromising confidentiality.
One of the most prominent PPML frameworks is federated learning (FL) [], which allows multiple parties to collaboratively train machine learning models without sharing their raw data. Each participant trains a local model and shares only model updates with a central server, preserving data privacy. However, FL assumes that participants have sufficient computational capacity for local training. In many real-world settings, particularly for devices or institutions with limited resources, local computation must be outsourced, raising new privacy concerns–even under semi-honest threat models.
This paper focuses on another type of PPML scenario in which a local entity securely outsources prediction computations. In our setting, both the data and the model remain encrypted, ensuring the confidentiality of sensitive user input and the intellectual property of the model provider. This setup is especially effective in protecting against model inversion attacks [], which constitute a well-known vulnerability in machine learning-as-a-service (MLaaS) platforms. To implement secure outsourced prediction, we employ homomorphic encryption (HE) [], particularly its somewhat homomorphic variant, somewhat homomorphic encryption (SHE) [], which supports a limited number of encrypted additions and multiplications with significantly improved efficiency over fully homomorphic schemes. This enables practical encrypted inference in real-world scenarios. Among the various machine learning models, decision trees are particularly attractive for privacy-preserving inference due to their interpretability and low computational cost. As a result, a growing body of research has focused on designing cryptographic protocols for privacy-preserving decision tree (PPDT) inference that minimize both communication and computational overhead while preserving privacy.
1.1. Related Work
Existing PPDT protocols can be broadly classified based on the cryptographic primitives they employ.
- HE-based protocols: HE supports computation over encrypted data without requiring decryption. Akavia et al. [] used low-degree polynomial approximations to support non-interactive inference with communication costs independent of tree depth. Frery et al. [], Hao et al. [], and Shin et al. [] adopted the TFHE, BFV, and CKKS schemes, respectively, for efficient computation with low multiplicative depth. Cong et al. [] reported ciphertext size comparisons and proposed homomorphic traversal algorithms across various commonly used HE schemes. Most existing HE-based PPDT protocols are designed for two-party settings, similar to the linear-function-based scheme proposed by Tai et al. [].
- Protocols based on other cryptographic primitives: Zheng et al. [] employed additive secret sharing in a two-server setting, ensuring that no single party gains access to both the model and the data, while maintaining low communication overhead. MPC-based approaches, such as those reported by Wu et al. [], combine additive HE with oblivious transfer (OT), offering strong privacy guarantees but often suffering from scalability issues, particularly with deep trees. Differential privacy (DP), although more commonly used during training [], can complement cryptographic inference methods by adding noise to outputs to mask sensitive patterns. However, DP typically compromises utility and operates orthogonally to encrypted inference techniques.
Within this landscape, several notable HE-based PPDT approaches stand out. Tai et al. [] proposed a prediction protocol using linear functions and DGK-based integer comparison [], reducing complexity at the cost of requiring bitwise encryption. Lu et al. [] enhanced efficiency via the XCMP protocol, based on Ring-LWE SHE, although it struggled with large input bit lengths and unstable ciphertexts. Saha and Koshiba [] addressed this with SK17, encoding integer bits as polynomial coefficients to support larger comparisons. Wang et al. [] further improved SK17 by introducing a faster comparison protocol and a non-interactive variant with reduced ciphertext functionality.
Despite these advancements, existing protocols still face challenges in simultaneously achieving high efficiency, scalability, low communication overhead, and full tree structure confidentiality—especially under realistic semi-honest adversary models.
1.2. Our Contributions
To overcome the above-mentioned limitations, we propose a novel three-party PPDT inference protocol that achieves secure, structure-hiding, and communication-efficient decision tree predictions over encrypted data and models. The key innovations of our work are as follows:
- Homomorphic matrix multiplication-based inference: Departing from polynomial approximation and linear path evaluation methods [], we introduce homomorphic matrix multiplication as the primary operation for encrypted path computation. This novel application supports structured and scalable evaluation of encrypted inputs.
- Leaf node pruning during inference: We propose leaf node pruning at inference time, a novel runtime optimization that reduces the number of nodes involved in computation, significantly improving performance. Unlike traditional model pruning during training, this technique operates during encrypted inference.
- Structure-hiding inference protocol: The decision tree structure—including internal nodes and branching conditions—is fully hidden from both the client and the server. By ensuring that all path computations are homomorphically encrypted, the protocol mitigates leakage risks present in prior PPDT methods.
- Semi-interactive three-party architecture: Our protocol requires only one round of interactive communication between the data holder (client) and the outsourced server. No interaction is required from the model holder during inference. This design enables low-latency, real-world deployment scenarios.
To achieve these properties, we adopt the efficient integer comparison protocol by Wang et al. [] and the homomorphic matrix multiplication techniques from [,] for encrypted path evaluation. By integrating homomorphic matrix multiplication, inference-time leaf node pruning, and tree structure hiding into a semi-interactive three-party framework, our method enables efficient, secure, and practical decision tree inference in untrusted environments. We evaluated the proposed PPDT protocol on standard UCI datasets [] to demonstrate its performance and practicality.
2. Preliminaries
In this section, we summarize the related approaches used in our proposal. In Section 2.1, we first review the concept of decision trees and the linear-function-based decision tree classification introduced by Tai et al. [], which involves two kinds of secure computation: comparison at each tree node and linear function evaluation for prediction. We focus on how to improve the efficiency of these two operations:
- Comparison: We use Wang et al.’s protocol [] instead of the DGK approach [] to improve the efficiency of secure comparison in each node.
- Prediction: We use secure inner product/matrix multiplication in place of the linear function to outsource prediction while keeping the tree model secret.
Therefore, we recall the secure comparison scheme proposed by Wang et al. [] in Section 2.2 and the secure matrix multiplication introduced by Duong et al. [] in Section 2.3. Both schemes are constructed on ring-LWE-based homomorphic encryption (see Appendix A for details). The notations used in this paper are listed in the Notations section.
2.1. Existing Decision Tree Classification
2.1.1. Decision Tree
Figure 1 presents an example of a decision tree. The decision tree is a function that takes as input the feature vector X and outputs the class to which X belongs. Generally, the feature vector space is ; however, in this paper, we denote it as because the input is encrypted attribute data. The decision tree in this paper is a binary tree consisting of two types of nodes: decision nodes and leaf nodes. A decision node outputs a Boolean value , where is the feature vector's index and is the threshold. A leaf node holds the output value . In a binary tree, there are m decision nodes and m + 1 leaf nodes. For example, Figure 1 shows the case of , .
Figure 1.
Decision tree example.
2.1.2. Decision Tree Classification via Linear Function []
The Boolean output of the decision node indicates whether the next node is the left or the right child. For the edges and , respectively connecting the decision node to its left child and right child, we define the corresponding edge costs as follows:
From the root node of the decision tree to each leaf node, only one path is determined. We define the path cost as the sum of the edge costs in —the path from root node to the leaf node . The edge cost and are determined by , which is the comparison result of decision node . Thereafter, the path cost for each leaf node is defined by a linear function of . For example, the following , and denote the path costs for leaf nodes , and in Figure 1, respectively:
The comparison results indicate that the edge cost to the next node is 0, while the edge cost to the other node is 1. As a result, only the path leading to one specific leaf node has a total cost of 0; all other paths have non-zero costs. The classification result is returned if and only if , meaning the output corresponds to the label stored in the leaf node .
This can be illustrated with a concrete example (see Bob in Figure 1): Bob’s attributes are height , weight , and age , which yield comparison results , , and , respectively. Substituting these into Equation (2), we compute the path costs as follows: , and only . Therefore, the output for Bob is , the label held by leaf node .
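Since the concrete thresholds and attribute values from Figure 1 are not reproduced in the text, the following minimal Python sketch re-creates the same edge-cost/path-cost computation with hypothetical numbers; the tree shape, the thresholds, Bob's attribute values, and the comparison convention (x ≥ t) are all illustrative assumptions rather than the figure's actual values.

```python
# Plaintext illustration of path-cost classification (Section 2.1).
# The tree shape, thresholds, and Bob's attribute values below are
# hypothetical stand-ins, not the actual numbers from Figure 1.

# Decision nodes: (feature_index, threshold). Comparison b_i = 1 if x[v_i] >= t_i.
decision_nodes = [(0, 170), (1, 65), (2, 30)]   # height, weight, age (assumed)

# Each leaf is described by the (node_index, expected_bit) pairs on its
# root-to-leaf path, together with the class label it stores.
leaves = [
    ([(0, 0), (1, 0)], "c1"),   # left, left
    ([(0, 0), (1, 1)], "c2"),   # left, right
    ([(0, 1), (2, 0)], "c1"),   # right, left
    ([(0, 1), (2, 1)], "c3"),   # right, right
]

def classify(x):
    # Comparison result at every decision node.
    b = [1 if x[v] >= t else 0 for (v, t) in decision_nodes]
    # Edge cost is 0 on the branch that matches b, 1 otherwise; the path cost
    # is the sum of edge costs, so exactly one leaf ends up with cost 0.
    costs = [sum(0 if b[i] == expected else 1 for (i, expected) in path)
             for (path, _) in leaves]
    return leaves[costs.index(0)][1], costs

bob = [175, 60, 25]                 # hypothetical height, weight, age
label, costs = classify(bob)
print(costs, "->", label)
```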
2.2. Secure Comparison Protocol
In this subsection, we describe the protocol proposed by Wang et al. [], which securely computes the comparison of decision trees.
2.2.1. -bit Integer Comparison []
Assume that Alice and Bob respectively have two -bit integers, a and b, and consider how to compare the two integers without revealing the values of a and b to each other. Let us define the binary vectors for a and b as follows: and . In addition, let us define the following two binary vectors and , where the first i bits are the same as those in and while the remaining upper bits are set to zero.
To compare the two integers a and b, let us define the following :
where
Here, is an inner product that counts how many bits differ between and . Therefore, implies that the first i bits of a and b are the same (i.e., ). Next, we look at the -th bit of a and b when is satisfied. Here, can take three values: 0, 1, or 2. If (i.e., ), ; otherwise, = 1 or 2. That is, if , is satisfied under ; otherwise, is satisfied. Therefore, to identify whether is satisfied, we merely need to check whether in Equation (4) is 0 for any position of i ().
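Because the exact definition of the per-position term in Equation (4) is not reproduced above, the short Python check below assumes the common construction implied by the description: scan the bits from the most significant end, and at each position add the number of differing higher-order bits to the term b - a + 1 at the current bit (which takes the value 0, 1, or 2). The indexing convention and the comparison direction are assumptions; the exhaustive test only confirms that this particular reconstruction behaves as described.

```python
def bits_msb_first(x, mu):
    """mu-bit binary expansion of x, most significant bit first."""
    return [(x >> (mu - 1 - j)) & 1 for j in range(mu)]

def greater_than(a, b, mu):
    A, B = bits_msb_first(a, mu), bits_msb_first(b, mu)
    for i in range(mu):                                   # position i, counted from the MSB
        prefix_diff = sum(A[j] ^ B[j] for j in range(i))  # do the i higher-order bits agree?
        c_i = prefix_diff + B[i] - A[i] + 1               # 0 only if prefix equal, A[i] = 1, B[i] = 0
        if c_i == 0:
            return True
    return False

# Exhaustive check of the reconstruction against the plain comparison.
mu = 6
assert all(greater_than(a, b, mu) == (a > b)
           for a in range(2 ** mu) for b in range(2 ** mu))
print("reconstructed comparison verified for", mu, "bit integers")
```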
2.2.2. Packing Method
For a -bit length integer u whose binary vector is denoted as , the following packing polynomials are defined:
Using this packing method, we have []
where denotes the binary vector of the -bit integer , and and can be computed offline in advance. The coefficient of in is .
2.2.3. Secure Comparison Protocol
Wang et al. proposed three enhanced secure comparison protocols in []. Here, we recall the most efficient one, which uses the packing method defined by Equation (6). There are three participants in this protocol: Alice and Bob, who hold -bit integers a and b, respectively, compare a and b through a server without revealing the data, and the server obtains the comparison result, which is the output of the protocol. The protocol proceeds as follows (a plaintext sketch of the masked-decryption round in Steps 4–6 is given after the list):
1. Alice generates a secret–public key pair and sends to Bob and the server.
2. Alice and Bob compute and for , respectively, and send the results to the server.
3. The server computes
4. The server masks in encrypted form using a random polynomial and sends to Alice.
5. Alice decrypts and sends back to the server.
6. The server unmasks d from as follows: and then verifies whether any -th term for is 0. If so, ; otherwise, .
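The interaction in Steps 4–6 is a blinded-decryption round: the server never holds the secret key, and Alice never sees the unmasked comparison polynomial. The short Python sketch below simulates that round with ordinary modular arithmetic standing in for the homomorphic operations; the additive mask, the toy modulus p, and the example coefficient vector d are assumptions made only for illustration.

```python
# Toy simulation of the masked-decryption round (Steps 4-6).  Plaintext
# arithmetic over Z_p stands in for the homomorphic operations, and a simple
# additive mask is assumed; the real protocol masks the ciphertext of d.
import random

p = 2 ** 13 - 1                        # toy plaintext modulus (illustrative)
d = [3, 0, 5, 1]                       # coefficients holding the per-position values;
                                       # a zero coefficient signals the comparison outcome

# Step 4 (server): add a random mask so Alice learns nothing when she decrypts.
gamma = [random.randrange(p) for _ in d]
masked = [(di + gi) % p for di, gi in zip(d, gamma)]

# Step 5 (Alice): decrypts the masked polynomial and returns its coefficients.
returned = masked                      # decryption is the identity in this plaintext stand-in

# Step 6 (server): remove the mask and look for a zero coefficient.
unmasked = [(ri - gi) % p for ri, gi in zip(returned, gamma)]
print("comparison holds:", any(c == 0 for c in unmasked))
```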
2.3. Ring-LWE-Based Secure Matrix Multiplication
Duong et al. proposed secure matrix multiplication [] using the packing method proposed by Yasuda et al. []. For an ℓ-dimension vector , the following two polynomials are defined:
Note that for any two vectors and , using the above packing method, we have
where the constant term of provides the inner product .
Let be a (k, ℓ) matrix and let denote the row vectors of . For matrix , the packing method is defined as follows:
Assume that (n: dimension of polynomial ). Letting matrix be
and letting B denote an ℓ-dimensional vector, we have
where the coefficient of is , the inner product of vectors and B.
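The packing polynomials in this subsection are only partially reproduced above, so the numpy sketch below uses one common convention for a Yasuda/Duong-style packing: the k-th matrix row is placed at offset kℓ, the vector is packed with reversed and negated coefficients, and a single negacyclic product mod x^n + 1 then carries every inner product of a row with B in the coefficient at position kℓ. The helper names and toy parameters are assumptions for illustration, not necessarily the paper's exact construction.

```python
import numpy as np

def negacyclic_mul(f, g, n):
    """Multiply two polynomials (coefficients, low degree first) in Z[x]/(x^n + 1)."""
    full = np.polynomial.polynomial.polymul(f, g)
    res = np.zeros(n)
    for e, c in enumerate(full):
        res[e % n] += c if (e // n) % 2 == 0 else -c   # reduce using x^n = -1
    return res

def pack_vector(B, n):
    """Vector packing: reversed, negated coefficients so inner products land on slots k*ell."""
    g = np.zeros(n)
    g[0] = B[0]
    for j in range(1, len(B)):
        g[n - j] = -B[j]
    return g

def pack_matrix(M, n):
    """Place row k of the matrix at offset k*ell (requires K*ell <= n)."""
    K, ell = M.shape
    f = np.zeros(n)
    for k in range(K):
        f[k * ell : k * ell + ell] = M[k]
    return f

n = 16                              # toy ring degree; must satisfy K*ell <= n
M = np.array([[1, 2, 0, 3],
              [0, 1, 1, 1],
              [2, 0, 0, 1]])        # 3 x 4 path-style matrix
B = np.array([1, 0, 1, 1])          # comparison-result-style vector

prod = negacyclic_mul(pack_matrix(M, n), pack_vector(B, n), n)
recovered = [round(prod[k * M.shape[1]]) for k in range(M.shape[0])]
print(recovered, "==", (M @ B).tolist())   # coefficients at positions k*ell equal M.B
```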
3. PPDT Classification Model
In this section, we propose a PPDT classification model that mainly consists of the following two processing parts:
(1) Path cost calculation and (2) secure integer comparison.
The basic idea of the path cost calculation comes from Tai et al.'s decision tree classification protocol via linear functions [], in which the decision tree model is treated as plaintext under a two-party computation setting (see Section 2.1). In contrast, we extend Tai et al.'s protocol so that not only the input data but also the decision tree model is encrypted, hiding the actual contents from both the data provider and the model provider. To realize this, we propose a secure three-party path cost calculation that extends the PPDT protocol of Tai et al. [] with the integer comparison protocol of Wang et al. [] described in Section 2.2.
3.1. Computation Model
To address the practical considerations of deploying our proposed algorithm in real-world cloud environments, we consider a scenario in which an organization that owns a decision tree classification model outsources the inference task by sending encrypted data to a cloud service provider (e.g., AWS and Google Cloud). Figure 2 illustrates the structure of our computational model, which involves three entities: the client, who possesses the feature vector to be classified; the model holder, who holds the trained decision tree; and the cloud server, which performs the encrypted computation on behalf of the model holder.
- The client encrypts the feature vectors and sends them to the model holder, who then encrypts the information needed to calculate the decision tree's path costs and the thresholds.
- The client's encrypted feature vectors are sent to the server via the model holder to conceal the information () about which element of the feature vector is compared with the threshold of each decision node.
- On behalf of the model holder, the server performs the necessary calculations for the decision tree prediction and sends the client the encrypted classification results.
- The client decrypts the information and obtains the classification results.
Figure 2.
Our computation model.
Even if the organization does not maintain its own servers, decision tree classification with privacy preservation is possible. The concrete process of the protocol is shown in Section 3.3.
A Representative Application of Online Medical Diagnostics
Machine-learned diagnostic models developed by medical institutions represent valuable intellectual property that is central to their competitive advantage. These institutions seek to offer diagnostic services without disclosing proprietary models, and cloud-based computation provides a practical solution to scale such services while reducing local computational costs.
However, concerns related to data confidentiality and model privacy present challenges for real-world deployment. Clients demand privacy-preserving services that do not expose their sensitive health data, while cloud service providers typically prefer not to manage or store sensitive information due to increased compliance and security risks.
Therefore, enabling secure and efficient inference over encrypted data and models aligns with the interests of all stakeholders. Our approach allows computations to be carried out on encrypted inputs and models, thereby mitigating privacy concerns and reducing the operational burdens associated with sensitive data management on cloud platforms.
3.2. Computation of Path Cost by Matrix × Vector Operation
In our method, we encrypt the path costs described in Section 2.1 and send them to the server, keeping the decision tree model secret while allowing the server to calculate the path costs. Specifically, we transform the path cost for a leaf node . Therefore, we introduce two vectors, and B, that can be used to calculate the path cost:
Here, B is a comparison result vector while is a path vector. We define path matrix of decision tree as follows:
Therefore, it is possible to replace the calculation of the path cost of the decision tree by . To obtain the correct multiplication result for a matrix and a vector of length ℓ using Equation (18), the constraint must be satisfied. Path matrix is an matrix; thus must be satisfied. Therefore, we divide the unconstrained path matrix into several groups of rows and compute each of them. Specifically, is an matrix; thus, rows can be computed at once. The path matrix is divided into matrices. When , we append the vector several times so that each partitioned matrix has rows, which hides the size of the decision tree. With the above operations, we obtain S path matrices as follows:
Therefore, the secure matrix × vector operation in Section 2.3 allows us to securely compute the path cost with the encrypted path matrix.
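As the exact entries of the path vectors and the comparison result vector B are not fully reproduced above, the numpy sketch below assumes one concrete encoding that reproduces the linear path costs of Section 2.1 (B augmented with a constant slot, signed entries in P) and then shows how P is split into S equally sized blocks with padded rows, as described in this subsection; the toy ring degree n and the example tree are illustrative assumptions.

```python
# Sketch of the path-matrix formulation of the path costs and of splitting it
# into S sub-matrices that each fit one packed polynomial.  The encoding of B
# and of the path vectors is an assumption chosen to reproduce Section 2.1.
import numpy as np

m = 3                                    # decision nodes; m + 1 = 4 leaves
# Paths as (node_index, expected_bit) pairs, matching a small tree like Figure 1.
paths = [[(0, 0), (1, 0)], [(0, 0), (1, 1)], [(0, 1), (2, 0)], [(0, 1), (2, 1)]]

P = np.zeros((len(paths), m + 1), dtype=int)
for row, path in enumerate(paths):
    for node, expected in path:
        if expected == 0:                # edge cost b_i
            P[row, node] += 1
        else:                            # edge cost 1 - b_i
            P[row, node] -= 1
            P[row, m] += 1               # constant column

b = np.array([1, 0, 0])                  # comparison results at the m nodes
B = np.append(b, 1)                      # augmented with the constant 1
print("path costs:", P @ B)              # exactly one entry is 0

# Split P into S blocks of at most n // ell rows, padding with repeated rows
# so every block has the same shape (this hides the tree size from the server).
n, ell = 8, P.shape[1]                   # toy ring degree and row length
rows_per_block = n // ell
S = -(-P.shape[0] // rows_per_block)     # ceiling division
pad = S * rows_per_block - P.shape[0]
P_padded = np.vstack([P, np.repeat(P[-1:], pad, axis=0)])
blocks = np.split(P_padded, S)
print(S, "block(s) of shape", blocks[0].shape)
```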
3.3. Proposed Protocol
The protocol’s detailed procedure is illustrated in Figure 3 and Algorithm 1. It comprises eight steps involving data transmission and computation. Let K represent the number of classes in the classification task .
Figure 3.
Flowchart of the proposed protocol, where denotes the ciphertext of “·”.
The following provides a step-by-step description of the protocol (a plaintext walk-through in Python is given after Algorithm 1):
- Step 1 (client): The client generates a secret–public key pair and sends to the model holder and the server.
- Step 2 (client): The client encrypts each element of a feature vector by packing it using Equation (6). For , the client sends ciphertexts to the model holder.
- Step 3 (model holder): For , the model holder encrypts threshold by packing it using Equation (6). For , the model holder generates path vector multiplied by a random number , and the following classification result vector is generated: where . As described in Section 3.2, the model holder generates S path matrices and S classification result matrices . Note that the path vector and classification result vector corresponding to the same path should be in the same matrix row. For , path matrix and classification result matrix are encrypted as follows: where Equation (16) is used to compute . A pair of ciphertexts of the elements of a threshold and its comparative feature vector and ciphertext pairs of the path matrices and classification result matrices are sent to the server for and , respectively.
- Step 4 (server): The server calculates in Equation (10) as . The server masks in encrypted form using random polynomial and then sends to the client.
- Step 5 (client): The client decrypts and obtains , which is then returned to the server.
- Step 6 (server): The server obtains from Equation (13) and the comparison result vector . If any of the -th coefficients in is zero, then ; otherwise, .
- Step 7 (server): The server packs the comparison result vector B using Equation () to obtain . The server calculates for and sends to the client.
- Step 8 (client): The client obtains the number of path matrices using Equation (21), where . Then, for , it decrypts to obtain a polynomial with randomized non-zero coefficients for the path matrix , in which the coefficients of are the k-th path costs according to Equations (19) and (20). Since , it is verified whether any -th term for is 0, which implies that the corresponding path cost is . If so, the corresponding leaf of is the classification result according to Equation (25), and the client decrypts and obtains from the coefficient of .
| Algorithm 1 Efficient PPDT Inference via Homomorphic Matrix Multiplication |
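For readers who prefer code to flowcharts, the following plaintext walk-through mirrors the eight steps with ordinary arithmetic in place of the homomorphic operations. The tree, thresholds, feature values, and the particular way the non-matching labels are blinded are assumptions made for illustration; only the overall pattern follows the protocol: randomized path costs with a single zero, and a label that is readable only at the zero-cost slot.

```python
# Plaintext walk-through of the protocol steps, with ordinary lists standing in
# for ciphertexts.  All homomorphic operations are simulated in the clear; the
# concrete tree, thresholds, and feature values are illustrative assumptions.
import random

# --- Model holder's (hypothetical) tree: thresholds, paths, leaf labels -------
thresholds = [170, 65, 30]                      # node i compares x[feature_idx[i]] with thresholds[i]
feature_idx = [0, 1, 2]
paths = [[(0, 0), (1, 0)], [(0, 0), (1, 1)], [(0, 1), (2, 0)], [(0, 1), (2, 1)]]
labels = [1, 2, 1, 3]                           # class stored in each leaf

def path_cost(path, b):
    # Edge cost is 0 on the branch matching the comparison bit, 1 otherwise.
    return sum(b[i] if expected == 0 else 1 - b[i] for i, expected in path)

# --- Steps 1-2 (client): the feature vector that would be sent encrypted ------
x = [175, 60, 25]

# --- Steps 4-6 (server + client): comparison result at every decision node ----
b = [1 if x[feature_idx[i]] >= thresholds[i] else 0 for i in range(len(thresholds))]

# --- Steps 3 and 7 (model holder + server): randomized costs, blinded labels --
r = [random.randrange(1, 50) for _ in paths]    # non-zero randomizers per path
costs = [r[j] * path_cost(p, b) for j, p in enumerate(paths)]       # zero only on the true path
blinded = [labels[j] + random.randrange(1, 50) * path_cost(p, b)    # label survives only where the cost is 0
           for j, p in enumerate(paths)]

# --- Step 8 (client): the zero-cost position reveals the class ----------------
j = costs.index(0)
print("class =", blinded[j])                    # -> 1 for this feature vector
```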
3.4. Leaf Node Pruning (LNP)
In this subsection, we describe how to reduce the number of path costs to be calculated. Let the number of leaf nodes of decision tree be and the number of classes to be classified be . In this case, we do not calculate the path costs for one class (e.g., ) but only the path costs for the other classes. We compute the path costs corresponding to classes, and if any of them is 0, the output is the class corresponding to that path. If none of them is 0, the output is the class corresponding to the uncalculated paths. The detailed procedure is illustrated in Algorithm 2; a plaintext sketch of the pruning step is given after the task-specific notes below.
| Algorithm 2 Efficiency-Enhanced PPDT with LNP (Algorithm 1+LNP) |
Note that the default class must be securely shared in advance between the client and the model holder. Assigning the majority class as the default is generally more effective. The selection of the class excluded from computation depends on the classification task, as outlined below:
- Binary classification tasks (e.g., disease detection, abnormality detection): Only the decision path for class 1 (positive or anomalous) is evaluated. If the result is 0, class 1 is returned; otherwise, class 0 (negative or normal) is output.
- Multi-class classification tasks: Only classes are evaluated. If the dataset exhibits class imbalance (e.g., ImageNet), the majority class is assigned as the default. In the absence of such imbalance (e.g., MNIST), one class is randomly selected to serve as the default (i.e., the class not evaluated).
We next use a simple example based on Figure 1 to illustrate homomorphic matrix multiplication and the leaf node pruning process.
Example 1.
Figure 1 shows a decision tree example for three classes , in which the majority class is assigned as the default. Feature vector , where , and denote height, weight, and age, respectively. For example, client Bob, whose feature vector is , tries to use the service to infer his health status. Decision tree has nodes with thresholds . In this case, the comparison result vector can be expressed as .
From Figure 1, it can be easily seen that Bob’s data should be classified to leaf node by . In the following, we show how to perform classification inference using client Bob’s data and the model holder’s decision tree model while keeping them encrypted. According to Equations (2) and (19), we have
Therefore,
LNP process:
Pruning the leaf nodes with class , i.e., and , while multiplying by random non-zero numbers , we obtain the path matrix
and the corresponding classification result matrix
Instead of using the original matrices P and V, we only need to use the pruned matrices and for calculation, thus saving computational cost, as shown in Figure 4.
Figure 4.
Image for LNP.
The following two processes are run on encrypted data.
Comparison on each node:
To check whether at node , let μ denote the bit length of the two integers and , where . For ease of explanation, we use as an example, with a bit length of .
- The server runs homomorphic operations to obtain the polynomials of the comparison result and then adds a random polynomial using Equation (10) over a polynomial ring , which implies , as follows: where .
- Bob decrypts and sends back to the server. After removing the random polynomial γ from , the server can obtain the comparison results from by checking whether ∃ any coefficient of . If so, it sets ; otherwise, . For example, , which is the coefficient of , so . Similarly, ; therefore, .
Classification inference:
- Bob decrypts and obtains the polynomial corresponding to a -dimension path matrix and then checks the coefficients of its terms. In this example, , so whether the coefficient of or is zero is checked. In fact, and since ,
- the coefficient of is zero, and
- the coefficient of is zero,
which link to leaf nodes and , respectively. As shown in Equation (38), since the coefficient of in the first polynomial is zero, if and only if , which means X is classified to leaf node . Then, the coefficient of in the second polynomial is the classification result; that is, .
3.5. Complexity
The client encrypts each element of the feature vector with two different packing methods for the comparison protocols and sends them to the model holder. The model holder encrypts the threshold for each decision node, times in total. In addition, the path and classification result matrices are each encrypted times. The model holder sends the pairs of threshold and feature vector elements and the pairs of path and classification result matrices to the server. Next, the server and client cooperate to perform the comparison protocol m times, during which the client performs decryption m times. The server performs homomorphic multiplication of the path and classification result matrices times and sends the results to the client. The client obtains the classification result by decrypting the received path and classification result matrices times.
3.6. Security Analysis
For a homomorphic encryption scheme, indistinguishability under chosen-plaintext attack (IND-CPA) is the basic security requirement. The SHE scheme used in the experiments in Section 4 is proven IND-CPA secure under the Ring-LWE assumption. In our proposed approach, since the homomorphic computations operate only on ciphertexts, the IND-CPA security of the underlying SHE scheme is not weakened.
Consider the entities involved in the protocol one by one (see Figure 3) and assume that they are all honest but curious and that they do not collude with one another.
- Client. The client can decrypt the ciphertexts sent from the cloud server in Steps 5 and 7 using the secret key and obtain related to the comparison result and the decision tree structure. However, in Step 4, noise from the cloud server is added for randomization. Therefore, the client receives a polynomial with completely random coefficients in . Similarly, is also a polynomial with random coefficients except for the zero coefficient that leads to the classification ; that is, the coefficient corresponding to . Thus, the client can obtain the classification without knowing the decision tree model.
- Model holder. The model holder only obtains the ciphertext of data X in Step 2, and IND-CPA security guarantees the privacy and security of the data.
- Cloud server. The server honestly performs homomorphic operations for the system but is curious about the data provided by the client and the model holder. In Step 3, it receives only encrypted inputs using an SHE scheme that satisfies IND-CPA security, ensuring the confidentiality of both user data and model parameters. Even if the server obtains the comparison result vector from d sent in Step 5, it cannot recover the original data X nor infer meaningful information about the decision tree (threshold t, path matrix P, and classification result V). This is due not only to encryption but also to randomization of P using noise terms and in Step 3, which prevents linkage to specific nodes of .
Beyond technical guarantees, our approach also addresses ethical challenges inherent to PPML in cloud environments. Specifically, it supports secure inference while protecting sensitive user data, ensuring model confidentiality, and reducing the risks of unauthorized access or data misuse. These safeguards are especially crucial in sensitive domains such as healthcare and finance, where balancing performance with privacy and intellectual property protection is key to responsible AI deployment.
4. Experiments
4.1. Experimental Setup
We implemented the proposed PPDT protocol in C++ using the BFV scheme [] in SEAL 3.3 [] as the encryption library. In our implementation, two comparison protocols were adopted: Wang et al.'s efficiency-enhanced scheme (Protocol-1 in []) and Lu et al.'s non-interactive private comparison (XCMP) []. The path cost calculation in Section 3.2 was implemented.
Parameter Choice. First, to ensure that the above comparison methods can accurately decrypt the comparison results, the parameters , and of the two methods must respectively satisfy the following two inequality requirements. Second, they must be selected to meet the 128-bit security level. The following parameters were used to ensure that the proposed model worked accurately and had 128-bit security, which is the currently recommended security level in []:
- Efficiency-enhanced scheme []: These satisfy the following inequality: .
- XCMP []: 8192, 8191, . These satisfy the following inequality: .
Note that is the default noise standard deviation used in Microsoft SEAL. The inequalities involving , and ensure the correctness of the corresponding comparison schemes. To support a maximum input bit length of ℓ, we chose n to satisfy the conditions from Wang et al. [] (i.e., ) and Lu et al. [] (i.e., ). Based on these constraints, the comparison protocols could handle 32-bit and 13-bit integers for Wang et al.’s and Lu et al.’s schemes, respectively. This implies that the efficiency-enhanced scheme [] supports comparisons of larger integers than does XCMP [], even with a relatively smaller n. Given the selected n, we chose q to ensure a 128-bit security level as recommended in [] and then determined p such that it satisfied the necessary inequalities while remaining efficient to compute.
The experiments were conducted on a standard PC with a Core i7-7700K (4.20 GHz) processor in single-thread mode. We used scikit-learn, a standard open-source machine learning library in Python 3.7.2, to train the decision trees. In both the BFV scheme and the comparison protocol by Lu et al. [], an input value must be represented as an integer . Whenever an appropriate number of multiplications has been performed, the fractional part of a value must be truncated, and the value must be normalized to within the range of .
In the performance evaluations, the computational time was measured by averaging over 10 trials for the following 8 datasets from the UCI machine learning repository []: Heart Disease (HD), Spambase (SP) (which were used for the benchmark in []), Breast Cancer Wisconsin (BC), Nursery (NS), Credit-Screening (CS), Shuttle (ST), EEG Eye State (EEG), and Bank Marketing (BK). The dataset information and the parameters of the decision trees used are listed in Table 1.
Table 1.
Data set information and the size of trained decision tree.
4.2. Performance Comparisons
4.2.1. Path Cost Calculation
We compared the computational time to obtain the path costs under the three-party protocol when the proposed matrix multiplication in Section 3.2 and Yasuda et al.'s homomorphic inner product calculation [] were used. Table 2 shows the computational time for the path costs. We also show, in parentheses, the computation time when LNP was introduced into the proposed path cost calculation.
Table 2.
Computation time for path cost calculation in two methods: matrix multiplication of Section 3.2 and the homomorphic inner product calculation of Yasuda et al. [] that we proposed in []. The computation time for the matrix multiplication with the addition of LNP is provided in parentheses. The computation time for LNP was measured as an average of 10 trials for each dataset class.
In calculating path costs by using the naive inner product, homomorphic multiplications are required. Therefore, as the number of nodes m in a tree increases, the computation time for path costs also increases. On the other hand, when the path costs were calculated by matrix multiplication, the computation time was the same for the decision trees of in Table 2. The reason is that, for parameter n, the entire path cost can be calculated with one matrix in the case of . Therefore, for datasets other than BK, more than one path cost can be computed with one matrix. The computation time for the path cost was reduced by more than 90% compared with the inner product case. However, in the case of BK, only path cost could be calculated with a single matrix multiplication. Hence, there was no difference in the computation time between matrix multiplication and inner product when .
With LNP applied, the path cost computation time remained unchanged for decision trees (e.g., BC, ST, CS, HD) with , as all path costs could be computed within a single matrix. However, for larger trees (e.g., NS, SP, EEG, BK), LNP significantly reduced the computation time—by 26% for NS, 50% for SP, 48% for EEG, and 47% for BK. The effect correlated with the number of classes: datasets with fewer classes (SP, EEG, and BK each have 2) benefited more than NS (5 classes), as fewer classes enhance pruning efficiency. See Table 1 and Figure 5 for dataset-specific details.
Figure 5.
Comparison of computation times with and without LNP across all datasets. For each dataset, the bar chart shows the path cost and total computation times, highlighting the reduction achieved by applying LNP. Datasets with fewer classes (e.g., SP, EEG, BK) showed more significant improvements due to more efficient pruning.
4.2.2. Protocol Efficiency
In this experiment, we used the XCMP of Lu et al. [] for comparisons. We implemented two protocols and measured their computation times.
- To evaluate the efficiency of different homomorphic encryption approaches, we also integrated the XCMP scheme into the comparison component of the three-party protocol described in Section 3.3. The path cost computation was performed following the procedure outlined in Steps 6–8 of Section 3.3.
- A naive two-party protocol using XCMP by Lu et al. computed the result of the two-party comparison (see Appendix B). Then, we computed the path cost under encrypted conditions by additive homomorphism.
Table 3 provides the measurement results.
Table 3.
Computation time with XCMP [] in the following scenarios: three-party with matrix multiplication and LNP; two-party with homomorphic addition of XCMP results for computing the path cost.
In the naive two-party protocol, the path cost calculation is performed only by homomorphic addition, so additions are required. Therefore, the path cost calculation accounted for more than 80% of the total time. In contrast, when applied to the proposed protocol, the path cost calculation time was reduced to 0.5% of the total time. As a result, the time reduction was achieved by applying XCMP to the proposed protocol on the experimental datasets. In particular, the reduction rate was higher for the EEG and BK datasets, which have a large number of decision nodes.
4.3. Time Complexity
To verify the computational complexity discussed in Section 3.5, we measured the computation time of the proposed protocol for decision trees constructed with the number of decision nodes m varied within [10, 1000], using the BK dataset.
Figure 6 provides the experimental results, which demonstrate that the computation time was linear in the number of decision nodes. There were several points at which the number of encryptions performed by the model holder, the number of decryptions performed by the client, and the number of path cost computations performed by the server increased rapidly. As explained in Section 3.5, these computation times depend on , where . Thus, as m increases, the computation time increases rapidly at these points.
Figure 6.
Computation time for decision nodes.
5. Conclusions
In this paper, we proposed a privacy-preserving decision tree prediction protocol that enables secure outsourcing using homomorphic encryption. The proposed method uses the comparison protocol proposed by Wang et al. [] or Lu et al. [] to encrypt both the feature vectors of a data holder and the decision tree parameters of a model holder, and the structure of the decision tree model is hidden from both the data holder and the outsourced server by calculating the path costs using the homomorphic matrix multiplication proposed by Duong et al. [].
Our experimental results demonstrate that the computation time for the three-party protocol using XCMP [] is drastically reduced, by more than 85% compared with the naive two-party protocol, without a reduction in prediction accuracy. In addition, we compared the method using homomorphic matrix multiplication [] with the homomorphic inner product []. As a result, we confirmed that there were cases where the computation time was reduced by more than 90%. We also proposed a leaf node pruning (LNP) algorithm to accelerate the three-party prediction protocol. The computation time was reduced by up to 50% when the proposed LNP was applied.
However, the proposed method has certain limitations. In particular, while the comparison results at each decision node are revealed to the outsourced server, the server cannot determine which specific feature each node corresponds to. This contrasts with the protocol by Lu et al. [], which prevents such leakage but incurs significantly higher computational costs. This highlights a fundamental trade-off between security and computational efficiency: our approach enhances scalability and performance at the cost of limited information leakage. In many practical scenarios, this trade-off may be acceptable; however, minimizing data exposure remains a critical objective. In future work, we will aim to design a more efficient protocol that preserves performance while eliminating any information leakage, thereby strengthening both privacy and practicality.
Author Contributions
Conceptualization, S.F., L.W. and S.O.; methodology, S.F., L.W. and S.O.; software, S.F. and S.O.; validation, S.F. and S.O.; formal analysis, S.F. and L.W.; investigation, S.F. and L.W.; resources, S.O.; data curation, S.F.; writing—original draft preparation, S.F., L.W. and S.O.; writing—review and editing, L.W.; supervision, S.O.; project administration, S.O. and L.W.; funding acquisition, S.O. and L.W. All authors have read and agreed to the published version of the manuscript.
Funding
This work was partially supported by JST CREST (grant number JPMJCR21M1) and JST AIP accelerated program (grant number JPMJCR22U5), Japan.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.
Acknowledgments
We would like to thank Takuya Hayashi for the useful discussion.
Conflicts of Interest
The authors declare no conflicts of interest.
Notations
The following notations are used in this manuscript:
| Function that returns 1 if · is true and 0 if false | |
| Maximum bit length of an integer | |
| A | Vector of length ℓ |
| d-bit subvector of A | |
| -bit binary vector of integer a, | |
| where is the least significant bit of integer a. | |
| N | Dimension of the feature vector |
| X | Feature vector |
| Decision node | |
| Leaf node | |
| Index of feature vectors to be compared by decision node | |
| Threshold of decision node | |
| Class to be output by leaf node | |
| m | Number of decision nodes in the decision tree |
| Number of leaf nodes in the decision tree |
Parameters being used in the ring-LWE-based encryption schemes:
| n | An integer of power of 2 that denotes the degree of polynomial . Define a polynomial ring , |
| q | An integer composed of ( is a prime number). Define a polynomial ring representing a ciphertext space , |
| p | Define a plaintext space , where p and q are mutually prime natural numbers with the relation , |
| Standard deviation of the discrete Gaussian distribution defining the secret key space . The elements of are polynomials on the ring . Each coefficient is independently sampled from the discrete Gaussian distribution of the variance . |
Appendix A. Ring-LWE-Based Homomorphic Encryption
This study used the ring-LWE-based public key homomorphic encryption library called the Simple Encrypted Arithmetic Library (SEAL) v.3.3 [] to implement our protocols. SEAL implements the somewhat homomorphic encryption scheme proposed by Fan et al. []. Due to the additive and multiplicative homomorphism, packing plaintexts provides an efficient homomorphic inner product and matrix multiplication calculations. See [,] for details on the encryption scheme.
The somewhat homomorphic encryption scheme consists of the following four basic algorithms:
- : input security parameter and output system parameter .
- : input system parameter and output public key and secret key .
- : input plaintext m and output ciphertext c.
- : input ciphertext c and output plaintext m.
Homomorphic addition and multiplication algorithms are defined by and , and the corresponding decryption algorithms are represented by and , respectively. Let and denote the ciphertexts of the two plaintexts m and , respectively. Then, the sum and product of m and can be calculated as follows.
Hereafter, we write , and . The difference between c and can be obtained using homomorphic addition, and we define .
Appendix B. XCMP []
Assume that the client and the server hold two ℓ-bit integers a and b, respectively. Following the algorithm in Figure A1, the server computes to obtain the comparison result in encrypted form. In this scheme, the comparison result is the constant term of the polynomial C, as shown in . It is possible to perform scalar multiplications, additions, and subtractions on the resulting ciphertext .
Figure A1.
XCMP [].
References
- Konečný, J.; McMahan, H.B.; Ramage, D.; Richtárik, P. Federated Optimization: Distributed Machine Learning for On-Device Intelligence. arXiv 2016, arXiv:1610.02527. [Google Scholar] [CrossRef]
- Fredrikson, M.; Jha, S.; Ristenpart, T. Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures. In Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, Denver, CO, USA, 12–16 October 2015; Ray, I., Li, N., Kruegel, C., Eds.; ACM: New York, NY, USA, 2015; pp. 1322–1333. [Google Scholar] [CrossRef]
- Rivest, R.L.; Dertouzos, M.L. On Data Banks and Privacy Homomorphisms. In Foundations of Secure Computation; DeMillo, R., Ed.; Academic Press: Cambridge, MA, USA, 1978; Volume 4, pp. 169–180. [Google Scholar]
- Gentry, C. A Fully Homomorphic Encryption Scheme. Ph.D. Thesis, Stanford University, Stanford, CA, USA, 2009. [Google Scholar]
- Akavia, A.; Leibovich, M.; Resheff, Y.S.; Ron, R.; Shahar, M.; Vald, M. Privacy-Preserving Decision Trees Training and Prediction. In Proceedings of the Machine Learning and Knowledge Discovery in Databases—European Conference, ECML PKDD 2020, Ghent, Belgium, 14–18 September 2020; Proceedings, Part I. Hutter, F., Kersting, K., Lijffijt, J., Valera, I., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2020; Volume 12457, pp. 145–161. [Google Scholar] [CrossRef]
- Fréry, J.; Stoian, A.; Bredehoft, R.; Montero, L.; Kherfallah, C.; Chevallier-Mames, B.; Meyre, A. Privacy-Preserving Tree-Based Inference with TFHE. In Proceedings of the Mobile, Secure, and Programmable Networking—9th International Conference, MSPN 2023, Paris, France, 26–27 October 2023; Revised Selected Papers. Bouzefrane, S., Banerjee, S., Mourlin, F., Boumerdassi, S., Renault, É., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2023; Volume 14482, pp. 139–156. [Google Scholar] [CrossRef]
- Hao, Y.; Qin, B.; Sun, Y. Privacy-Preserving Decision-Tree Evaluation with Low Complexity for Communication. Sensors 2023, 23, 2624. [Google Scholar] [CrossRef] [PubMed]
- Shin, H.; Choi, J.; Lee, D.; Kim, K.; Lee, Y. Fully Homomorphic Training and Inference on Binary Decision Tree and Random Forest. In Proceedings of the Computer Security—ESORICS 2024—29th European Symposium on Research in Computer Security, Bydgoszcz, Poland, 16–20 September 2024; Proceedings, Part III. García-Alfaro, J., Kozik, R., Choras, M., Katsikas, S.K., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2024; Volume 14984, pp. 217–237. [Google Scholar] [CrossRef]
- Cong, K.; Das, D.; Park, J.; Pereira, H.V.L. SortingHat: Efficient Private Decision Tree Evaluation via Homomorphic Encryption and Transciphering. In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, CCS 2022, Los Angeles, CA, USA, 7–11 November 2022; Yin, H., Stavrou, A., Cremers, C., Shi, E., Eds.; ACM: New York, NY, USA, 2022; pp. 563–577. [Google Scholar] [CrossRef]
- Tai, R.K.H.; Ma, J.P.K.; Zhao, Y.; Chow, S.S.M. Privacy-Preserving Decision Trees Evaluation via Linear Functions. In Proceedings of the Computer Security—ESORICS 2017—22nd European Symposium on Research in Computer Security, Oslo, Norway, 11–15 September 2017; Proceedings, Part II. Foley, S.N., Gollmann, D., Snekkenes, E., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2017; Volume 10493, pp. 494–512. [Google Scholar] [CrossRef]
- Zheng, Y.; Duan, H.; Wang, C.; Wang, R.; Nepal, S. Securely and Efficiently Outsourcing Decision Tree Inference. IEEE Trans. Dependable Secur. Comput. 2022, 19, 1841–1855. [Google Scholar] [CrossRef]
- Wu, D.J.; Feng, T.; Naehrig, M.; Lauter, K.E. Privately Evaluating Decision Trees and Random Forests. Proc. Priv. Enhancing Technol. 2016, 2016, 335–355. [Google Scholar] [CrossRef]
- Maddock, S.; Cormode, G.; Wang, T.; Maple, C.; Jha, S. Federated Boosted Decision Trees with Differential Privacy. In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security, CCS 2022, Los Angeles, CA, USA, 7–11 November 2022; Yin, H., Stavrou, A., Cremers, C., Shi, E., Eds.; ACM: New York, NY, USA, 2022; pp. 2249–2263. [Google Scholar] [CrossRef]
- Damgård, I.; Geisler, M.; Krøigaard, M. A correction to ’efficient and secure comparison for on-line auctions’. Int. J. Appl. Cryptogr. 2009, 1, 323–324. [Google Scholar] [CrossRef]
- Lu, W.; Zhou, J.; Sakuma, J. Non-interactive and Output Expressive Private Comparison from Homomorphic Encryption. In Proceedings of the 2018 on Asia Conference on Computer and Communications Security, AsiaCCS 2018, Incheon, Republic of Korea, 4–8 June 2018; Kim, J., Ahn, G., Kim, S., Kim, Y., López, J., Kim, T., Eds.; ACM: New York, NY, USA, 2018; pp. 67–74. [Google Scholar] [CrossRef]
- Saha, T.K.; Koshiba, T. An Efficient Privacy-Preserving Comparison Protocol. In Proceedings of the Advances in Network-Based Information Systems, The 20th International Conference on Network-Based Information Systems, NBiS 2017, Ryerson University, Toronto, ON, Canada, 24–26 August 2017; Barolli, L., Enokido, T., Takizawa, M., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2018; Volume 7, pp. 553–565. [Google Scholar] [CrossRef]
- Wang, L.; Saha, T.K.; Aono, Y.; Koshiba, T.; Moriai, S. Enhanced Secure Comparison Schemes Using Homomorphic Encryption. In Proceedings of the Advances in Networked-Based Information Systems—The 23rd International Conference on Network-Based Information Systems, NBiS 2020, Victoria, BC, Canada, 31 August–2 September 2020; Barolli, L., Li, K.F., Enokido, T., Takizawa, M., Eds.; Advances in Intelligent Systems and Computing. Springer: Berlin/Heidelberg, Germany, 2021; Volume 1264, pp. 211–224. [Google Scholar] [CrossRef]
- Duong, D.H.; Mishra, P.K.; Yasuda, M. Efficient Secure Matrix Multiplication Over LWE-Based Homomorphic Encryption. Tatra Mt. Math. Publ. 2016, 67, 69–83. [Google Scholar] [CrossRef]
- Yasuda, M.; Shimoyama, T.; Kogure, J.; Yokoyama, K.; Koshiba, T. Practical Packing Method in Somewhat Homomorphic Encryption. In Proceedings of the Data Privacy Management and Autonomous Spontaneous Security–8th International Workshop, DPM 2013, and 6th International Workshop, SETOP 2013, Revised Selected Papers, Egham, UK, 12–13 September 2013; García-Alfaro, J., Lioudakis, G.V., Cuppens-Boulahia, N., Foley, S.N., Fitzgerald, W.M., Eds.; Lecture Notes in Computer Science. Springer: Berlin/Heidelberg, Germany, 2014; Volume 8247, pp. 34–50. [Google Scholar] [CrossRef]
- Kelly, M.; Longjohn, R.; Nottingham, K. The UCI Machine Learning Repository. Available online: https://archive.ics.uci.edu (accessed on 31 March 2025).
- Fan, J.; Vercauteren, F. Somewhat Practical Fully Homomorphic Encryption. IACR Cryptology ePrint Archive 2012. p. 144. Available online: https://eprint.iacr.org/2012/144 (accessed on 31 March 2025).
- Microsoft SEAL (Release 3.3); Microsoft Research: Redmond, WA, USA, 2019; Available online: https://github.com/Microsoft/SEAL (accessed on 31 March 2025).
- Albrecht, M.; Chase, M.; Chen, H.; Ding, J.; Goldwasser, S.; Gorbunov, S.; Halevi, S.; Hoffstein, J.; Laine, K.; Lauter, K.; et al. Homomorphic Encryption Security Standard; Technical report; HomomorphicEncryption.org: Toronto, ON, Canada, 2018. [Google Scholar]
- Fukui, S.; Wang, L.; Hayashi, T.; Ozawa, S. Privacy-Preserving Decision Tree Classification Using Ring-LWE-Based Homomorphic Encryption. In Proceedings of the Computer Security Symposium 2019, Nagasaki, Japan, 21–24 October 2019; pp. 321–327. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).