Dynamic Multi-Key Block Binary Ring-Compact Bootstrapping

Xiao, Qiwei; Huang, Ruwei

doi:10.3390/math14061045


by Qiwei Xiao and Ruwei Huang *

School of Computer and Electronic Information, Guangxi University, Nanning 530004, China

* Author to whom correspondence should be addressed.

Mathematics 2026, 14(6), 1045; https://doi.org/10.3390/math14061045

Submission received: 28 February 2026 / Revised: 17 March 2026 / Accepted: 18 March 2026 / Published: 19 March 2026

(This article belongs to the Section E1: Mathematics and Computer Science)


Abstract

Multi-Key Fully Homomorphic Encryption (MK-FHE) is essential for secure multi-party computation but currently faces significant scalability bottlenecks due to linear computational growth and low bootstrapping throughput. To address these limitations, we propose DMBB-RCB, a novel fully homomorphic, bit-wise Dynamic Multi-Key Block-Binary Ring-Compact Bootstrapping scheme. Our contribution is threefold. First, we integrate the Block Binary Distribution into the dynamic setting, reducing the complexity of the core blind rotation operation from O(p⋅n) to O(p⋅k) by leveraging key sparsity, where p is the number of participants, n is the LWE dimension, and k ≪ n is the number of key blocks. Second, we implement an amortized ring packing strategy that aggregates multiple Learning with Errors (LWE) ciphertexts into the coefficients of a single Ring Learning with Errors (RLWE) polynomial, enabling the parallel refreshing of messages. Third, we introduce a Ring-Compact extraction architecture that natively translates RLWE states into Multi-Key Regev–Gentry–Sahai–Waters (RGSW) ciphertexts via scheme switching. Unlike traditional pipelines that suffer from severe network latency due to interactive multi-party key-switching after each bootstrapping, our architecture keeps the data entirely within the ring domain. This eliminates intermediate interaction rounds, enabling depth-unbounded homomorphic evaluations with zero interaction between participants during the computation phase (interaction is strictly reserved for the final joint decryption step). The proposed scheme supports the dynamic addition of participants without parameter re-generation. Theoretical analysis indicates that DMBB-RCB significantly reduces latency and enhances throughput compared to existing dynamic MKHE solutions.

Keywords:

multi-key FHE; block binary keys; amortized bootstrapping; ring-compact; dynamic MPC

MSC:

94A60

1. Introduction

Fully Homomorphic Encryption (FHE) is considered the “holy grail” of cryptography, allowing arbitrary computations to be performed on encrypted data without access to the secret decryption key. Since Gentry’s first feasible construction [1], FHE has evolved significantly, with the Torus FHE (TFHE) scheme [2,3] emerging as a leading candidate for practical applications. TFHE relies on the hardness of the Learning With Errors (LWE) problem [4] and allows for the evaluation of exact logic gates with constant-time noise refreshing via fast gate-by-gate bootstrapping.

However, in the era of cloud computing and collaborative privacy-preserving analytics, single-key FHE is often insufficient. Real-world scenarios, such as secure genome analysis or financial auditing, typically involve multiple mutually distrusting parties who wish to compute jointly on their private data. This necessitates Multi-Key Homomorphic Encryption (MKHE) [5,6], which extends FHE to support operations on ciphertexts encrypted under different independent keys. The resulting ciphertext can only be decrypted if all involved parties collaborate. While theoretical constructions for MKHE exist [7,8], adapting them to the efficient TFHE framework presents significant challenges. Current state-of-the-art Multi-Key TFHE (MK-TFHE) schemes [9,10] suffer from three severe scalability bottlenecks:

High Computational Complexity of Blind Rotation: The core blind rotation step requires iterating through every bit of the expanded multi-user key. As the number of participants $p$ grows, the required external product operations increase linearly or even quadratically, leading to prohibitive latency [9]. The computational complexity typically scales as $O(p \cdot n)$, where $n$ is the LWE dimension.
Low Amortized Throughput: Standard TFHE bootstrapping refreshes only one LWE ciphertext per execution. For Boolean circuits requiring massive parallelism, this bit-wise processing is highly inefficient. Although packing techniques (SIMD) [11] exist for BGV [12]/CKKS [13] schemes, applying them to TFHE’s accumulation-based bootstrapping remains non-trivial [14].
Expensive Format Conversion Overhead: Traditional MKHE pipelines require frequent Sample Extraction operations at the end of bootstrapping. This forces the data out of the ring domain, meaning it must be key-switched back to a format suitable for the accumulator, wasting computational cycles and hindering deep circuit evaluations.

To address these limitations, we propose the Dynamic Multi-Key Block-Binary Ring-Compact Bootstrapping (DMBB-RCB) scheme. Our construction synthesizes three advanced optimization strategies into a unified, scalable framework:

Block Binary Keys for Accelerated Blind Rotation: We adopt a Block Binary Distribution [15] for the LWE secret keys. By structuring the key as a concatenation of $k$ blocks (each with a Hamming weight of at most one), we redesign the blind rotation algorithm to iterate over $k$ blocks rather than $n$ bits. This structural sparsity reduces the core complexity from $O(p \cdot n)$ to $O(p \cdot k)$ (where $k \ll n$), providing a theoretical speedup proportional to the block length $l$.
Amortized Ring Packing: We implement a PackLWE algorithm [15] tailored for the multi-key setting. This enables the packing of multiple scalar LWE ciphertexts into the coefficients of a single RLWE polynomial. Consequently, a single execution of our bootstrapping circuit refreshes multiple messages simultaneously, significantly increasing the amortized throughput.
Dynamic Ring-Compact Architecture: Instead of extracting LWE samples at the end of bootstrapping, we utilize a Scheme Switching technique inspired by recent dynamic MKHE frameworks [16,17] to output Multi-Key RGSW ciphertexts directly. This keeps the data entirely within the Ring domain, enabling seamless, continuous homomorphic evaluation without intermediate LWE-to-RLWE conversions. Furthermore, our scheme supports the dynamic addition of participants without requiring a global parameter reset, making it highly suitable for flexible Multi-Party Computation (MPC) environments.

MKHE has advanced rapidly in 2024 and 2025. For instance, Kwak et al. [10] introduced parallelizable techniques to reduce quasi-linear complexity, and recent hardware/algorithm co-designs and algorithmic optimizations [18,19,20] target circuit bootstrapping and automorphism evaluation. While these works significantly improve specific TFHE bottlenecks, they predominantly operate within static configurations or still incur the expensive LWE-to-RLWE format conversion overhead. In contrast, our DMBB-RCB framework combines block-binary sparsity with a dynamic ring-compact architecture, addressing both latency and dynamic participant scalability simultaneously, a gap not fully resolved by the 2024–2025 literature.

2. Preliminaries

In this section, we establish the mathematical notations, algebraic structures, probability distributions, and the fundamental cryptographic primitives underlying the Dynamic Multi-Key Block-Binary Ring-Compact Bootstrapping (DMBB-RCB) scheme.

2.1. Notation

Let $\mathbb{Z}$ and $\mathbb{R}$ denote the sets of integers and real numbers, respectively. We define the real torus as $\mathbb{T} = \mathbb{R} / \mathbb{Z}$ (the reals modulo 1), and the integer ring modulo $q \geq 1$ as $\mathbb{Z}_{q} = \mathbb{Z} / q \mathbb{Z}$. Vectors and matrices are denoted by bold lowercase (e.g., $\mathbf{a}, \mathbf{s}$) and bold uppercase (e.g., $\mathbf{A}, \mathbf{G}$) letters, respectively. For two vectors $\mathbf{u}, \mathbf{v}$, their inner product is denoted by $\langle \mathbf{u}, \mathbf{v} \rangle = \sum_{i} u_{i} v_{i}$. The operations $\lfloor \cdot \rceil$, $\lfloor \cdot \rfloor$, and $\lceil \cdot \rceil$ denote the nearest-integer rounding, floor, and ceiling functions, respectively. For a finite set $S$, $x \leftarrow U(S)$ denotes sampling $x$ uniformly at random from $S$. Let $N$ be a power of 2. We define the cyclotomic polynomial rings $\mathcal{R} = \mathbb{Z}[X] / (X^{N} + 1)$ and $\mathcal{R}_{q} = \mathbb{Z}_{q}[X] / (X^{N} + 1)$.

2.2. Probability Distributions

The security of our scheme relies on properties of specific error distributions and structured sparse keys.

Definition 1

(Error Distributions).

Discrete Gaussian ($\mathcal{D}_{\mathbb{Z}, \sigma}$): A discrete Gaussian distribution defined over the integers $\mathbb{Z}$ with standard deviation $\sigma > 0$, where the probability of sampling an integer $x \in \mathbb{Z}$ is proportional to $\exp(-\pi |x|^{2} / \sigma^{2})$.
Modular Gaussian ($\mathcal{D}_{\mathbb{T}, \alpha}$): A Gaussian error distribution defined over the real torus $\mathbb{T} = \mathbb{R} / \mathbb{Z}$ with standard deviation $\alpha$, concentrated around $0 \pmod{1}$.
Polynomial Error Distribution ($\mathcal{D}_{\mathcal{R}, \beta}$): A distribution of polynomials in the ring $\mathcal{R} = \mathbb{Z}[X] / (X^{N} + 1)$. A polynomial $e(X) \leftarrow \mathcal{D}_{\mathcal{R}, \beta}$ is generated by sampling each of its $N$ integer coefficients independently from the discrete Gaussian distribution $\mathcal{D}_{\mathbb{Z}, \beta}$.

Definition 2

(Block Binary Distribution). Let $l \in \mathbb{Z}^{+}$ be the block length and $k \in \mathbb{Z}^{+}$ be the number of blocks, such that the total dimension is $n = l \cdot k$. The block distribution $\mathcal{B}_{l}$ over $\mathbb{Z}^{l}$ outputs, uniformly at random, a vector $\mathbf{b} \in \{0,1\}^{l}$ with Hamming weight $\|\mathbf{b}\|_{1} \leq 1$. The Block Binary Distribution $\mathcal{B}_{l, k}$ over $\mathbb{Z}^{n}$ is defined as the concatenation of $k$ independent samples from $\mathcal{B}_{l}$. Formally, for $\mathbf{s} \leftarrow \mathcal{B}_{l, k}$, we have $\mathbf{s} = (\mathbf{s}_{0} \,\|\, \mathbf{s}_{1} \,\|\, \dots \,\|\, \mathbf{s}_{k - 1})$, where $\mathbf{s}_{j} \leftarrow \mathcal{B}_{l}$ for $0 \leq j < k$.
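As a minimal sketch of Definition 2, the sampler below draws each block uniformly from the $l + 1$ binary vectors of Hamming weight at most one and concatenates $k$ such blocks. The function names `sample_block` and `sample_block_binary` are illustrative only, and Python's `random` module stands in for a cryptographically secure source of randomness.

```python
import random

def sample_block(l, rng):
    """One sample from B_l: uniform over the l + 1 vectors in {0,1}^l of weight <= 1."""
    block = [0] * l
    pos = rng.randrange(l + 1)  # the value l selects the all-zero block
    if pos < l:
        block[pos] = 1
    return block

def sample_block_binary(l, k, rng=None):
    """One sample from B_{l,k}: concatenation of k independent blocks (length n = l * k)."""
    rng = rng or random.Random()  # assumption: NOT cryptographically secure, demo only
    key = []
    for _ in range(k):
        key.extend(sample_block(l, rng))
    return key
```

A key sampled this way has at most $k$ non-zero entries out of $n = l \cdot k$, which is the sparsity the blind rotation later exploits.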

2.3. Cryptographic Primitives

We recall the fundamental definitions of the LWE and RLWE encryption schemes, along with the gadget decomposition mechanism utilized in TFHE.

Definition 3

(Gadget Decomposition). Let $B \geq 2$ be an integer base and $d_{g} \geq 1$ be the decomposition depth such that $B^{d_{g}} \geq q$. The gadget vector is defined as $\mathbf{g} = (1, B, B^{2}, \dots, B^{d_{g} - 1}) \in \mathbb{Z}^{d_{g}}$. We define the decomposition function $\mathrm{Dec}_{\mathbf{g}} : \mathcal{R}_{q} \to \mathcal{R}^{d_{g}}$. For any input polynomial $a \in \mathcal{R}_{q}$, the function outputs a vector of polynomials $\mathbf{u} = (u_{0}, u_{1}, \dots, u_{d_{g} - 1})^{\top} \in \mathcal{R}^{d_{g}}$ such that:

$$a = \langle \mathbf{u}, \mathbf{g} \rangle = \sum_{i = 0}^{d_{g} - 1} u_{i} B^{i} \pmod{q}$$

where every integer coefficient of each polynomial $u_{i} \in \mathcal{R}$ lies in the interval $[-B/2, B/2)$.
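The decomposition in Definition 3 acts coefficient by coefficient, so it can be illustrated on a single integer. The sketch below assumes $q = B^{d_{g}}$ exactly (the usual power-of-two setting), because balanced digits in $[-B/2, B/2)$ reproduce a residue only modulo $B^{d_{g}}$. The helper names are hypothetical.

```python
def gadget_decompose(a, B, d):
    """Balanced base-B digits (u_0, ..., u_{d-1}) of a, each in [-B/2, B/2).

    Assumption: the modulus is q = B**d, so sum(u_i * B**i) == a (mod q).
    """
    q = B ** d
    a %= q
    digits = []
    for _ in range(d):
        r = a % B
        if 2 * r >= B:      # move the digit into the balanced range
            r -= B
        digits.append(r)
        a = (a - r) // B    # carry the remainder into the next digit
    return digits

def gadget_recompose(digits, B, q):
    """Inner product <u, g> mod q with g = (1, B, ..., B**(d-1))."""
    return sum(u * B ** i for i, u in enumerate(digits)) % q
```

For a polynomial in $\mathcal{R}_{q}$, the same routine would be applied to each of the $N$ coefficients, yielding $d_{g}$ polynomials with small coefficients.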

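Definition 4

(LWE Encryption). For dimension $n = l \cdot k$, the secret key is sampled as $\mathbf{s} \leftarrow \mathcal{B}_{l, k}$. To encrypt a message $\mu \in \mathbb{T}$, sample a mask vector $\mathbf{a} \leftarrow U(\mathbb{T}^{n})$ and an error term $e \leftarrow \mathcal{D}_{\mathbb{T}, \alpha}$. The ciphertext is $\mathrm{LWE}_{\mathbf{s}}(\mu) = (\mathbf{a}, b) \in \mathbb{T}^{n + 1}$, where $b = -\langle \mathbf{a}, \mathbf{s} \rangle + \mu + e \pmod{1}$.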

Remark 1

(Practical Message Encoding and Discretization). While theoretical formulations define the message $\mu$ over the continuous torus $\mathbb{T}$, practical implementations cannot process arbitrary real numbers (e.g., $1/\sqrt{3}$) due to finite machine precision. In practice, we utilize a discrete message space and encode it into $\mathbb{T}$.

Let $\mathbb{Z}_{p}$ be the finite plaintext space (e.g., $p = 2$ for standard Boolean circuits).

Encoding: A discrete message $m \in \mathbb{Z}_{p}$ is encoded into the torus $\mathbb{T}$ via the function $\mu = \mathrm{Encode}(m) = \frac{m}{p} \pmod{1}$.
Decoding: During decryption, the recovered noisy phase $\mu^{*} = \mu + e \in \mathbb{T}$ is decoded back to the exact message by rescaling and rounding to the nearest integer: $m = \mathrm{Decode}(\mu^{*}) = \lfloor p \cdot \mu^{*} \rceil \pmod{p}$.

In our software implementation, the continuous torus $\mathbb{T} = \mathbb{R} / \mathbb{Z}$ is discretized and mapped to standard unsigned integer data types (e.g., 32-bit or 64-bit integers), where the interval $[0, 1)$ is represented by the range $[0, 2^{32} - 1]$ or $[0, 2^{64} - 1]$, effectively handling the modulo-$1$ arithmetic via natural CPU integer overflow.
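The encoding of Remark 1 and the encryption of Definition 4 can be combined in a small sketch over a 32-bit discretized torus. Everything here is illustrative: the names are hypothetical, the noise is bounded uniform rather than Gaussian, and Python integers do not overflow, so the wrap-around is emulated with an explicit reduction modulo $2^{32}$.

```python
import random

TORUS_BITS = 32
MOD = 1 << TORUS_BITS  # the integer x in [0, 2^32) stands for the torus element x / 2^32

def encode(m, p):
    """mu = m / p (mod 1), rounded to the nearest 32-bit torus value."""
    return (((m % p) * MOD + p // 2) // p) % MOD

def decode(phase, p):
    """round(p * phase) mod p, where phase is a (possibly noisy) 32-bit torus value."""
    return (((phase % MOD) * p + MOD // 2) // MOD) % p

def lwe_encrypt(mu, s, rng, noise_bound=1 << 10):
    """Return (a, b) with b = -<a, s> + mu + e (mod 1), as in Definition 4."""
    a = [rng.randrange(MOD) for _ in s]
    e = rng.randint(-noise_bound, noise_bound)  # assumption: stand-in for the Gaussian error
    b = (-sum(ai * si for ai, si in zip(a, s)) + mu + e) % MOD
    return a, b

def lwe_phase(ct, s):
    """Noisy phase mu + e = b + <a, s> (mod 1)."""
    a, b = ct
    return (b + sum(ai * si for ai, si in zip(a, s))) % MOD
```

Decoding succeeds as long as the error stays below half the gap between adjacent encodings, that is, below $2^{32} / (2p)$ in this representation.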

Definition 5

(RLWE Encryption [21]). Let the secret key be a polynomial $z(X) \in \mathcal{R}$. To encrypt a message polynomial $m(X) \in \mathcal{R}_{q}$, sample $a(X) \leftarrow U(\mathcal{R}_{q})$ and $e(X) \leftarrow \mathcal{D}_{\mathcal{R}, \beta}$. The ciphertext is $\mathrm{RLWE}_{z}(m) = (a, b) \in \mathcal{R}_{q}^{2}$, defined as $b(X) = -a(X) \cdot z(X) + m(X) + e(X) \pmod{X^{N} + 1}$.

Definition 6

(RGSW Encryption [22]). Let $\mathbf{H} = \mathrm{diag}(\mathbf{g}, \mathbf{g}) \in \mathcal{R}_{q}^{2 d_{g} \times 2}$ be the gadget matrix. An RGSW ciphertext encrypting a message $m \in \mathcal{R}_{q}$ under secret key $z$ is a matrix $\mathbf{C} = \mathbf{Z} + m \cdot \mathbf{H} \in \mathcal{R}_{q}^{2 d_{g} \times 2}$. Here, $\mathbf{Z}$ is a matrix in which each row is a valid RLWE encryption of 0.

2.4. Standard TFHE Operations

The homomorphic evaluation logic is built upon the External Product and the CMUX gate:

External Product ($\odot$): Let $\mathbf{C} \in \mathcal{R}_{q}^{2 d_{g} \times 2}$ be an RGSW ciphertext encrypting $\mu \in \{0,1\}$ and $c = (a, b)^{\top} \in \mathcal{R}_{q}^{2}$ be an RLWE ciphertext encrypting $m(X)$. The external product is computed as $\mathbf{C} \odot c = \mathrm{Dec}_{\mathbf{g}}(c)^{\top} \cdot \mathbf{C}$. The result is a valid RLWE ciphertext encrypting $\mu \cdot m(X)$.
CMUX Gate: The Controlled Multiplexer (CMUX) acts as a homomorphic “if-then-else” gate. For an RGSW ciphertext $\mathbf{C}$ encrypting a selection bit $\mu \in \{0,1\}$ and two RLWE ciphertexts $c_{0}, c_{1}$ encrypting $m_{0}, m_{1}$, the CMUX operation is defined as $\mathrm{CMUX}(\mathbf{C}, c_{0}, c_{1}) = c_{0} + \mathbf{C} \odot (c_{1} - c_{0})$. This outputs an RLWE ciphertext encrypting $m_{\mu}$.

3. Construction of the DMBB-RCB Scheme

In this section, we present the formal construction of the Dynamic Multi-Key Block-Binary Ring-Compact Bootstrapping (DMBB-RCB) scheme. The protocol is composed of five distinct phases: Setup and Key Generation, Amortized Input Preparation (Packing), Block-Binary Dynamic Blind Rotation, Ring-Compact Extraction, and Distributed Decryption.

3.1. Advanced Building Blocks

Before detailing the main protocol, we formally define the advanced cryptographic primitives tailored for our multi-key and ring-compact setting.

Amortized Packing ($\mathsf{PackLWE}$): This procedure aggregates a set of scalar LWE ciphertexts $\{\mathrm{ct}_{i}\}_{i = 0}^{\nu - 1}$ into the coefficients of a single RLWE ciphertext using a set of key-switching keys $\mathcal{K}_{P}$. The algorithm evaluates the decryption circuit homomorphically in the ring domain:

$$C_{packed} = \mathsf{PackLWE}(\{\mathrm{ct}_{i}\}, \mathcal{K}_{P}) = \sum_{i} \mathsf{KeySwitch}(\mathrm{ct}_{i}, \mathcal{K}_{P}) \cdot X^{p_{i}}$$

Homomorphic Trace ($\mathsf{HomTrace}$): To isolate a specific message encrypted in a packed RLWE ciphertext $\mathrm{ct}(X)$, we define the trace map relative to the sub-ring $\mathbb{Z}[X] / (X^{N} + 1)$ via the sum of automorphisms. Let $\psi_{j} : X \mapsto X^{j}$ be automorphisms of the ring $\mathcal{R}$.

$$C_{trace} = \mathsf{HomTrace}(\mathrm{ct}) = \sum_{j \in \mathrm{Gal}} \psi_{j}(\mathrm{ct})$$

Scheme Switching ($\mathsf{SchemeSwitch}$): This algorithm converts an RLWE ciphertext $c$ directly into an RGSW ciphertext $C_{out}$ using a scheme-switching key $\mathcal{K}_{SS}$. It applies the gadget decomposition $\mathbf{u} = \mathrm{Dec}_{\mathbf{g}}(c)$ and reconstructs the matrix format:

$$C_{out} = \mathsf{SchemeSwitch}(c, \mathcal{K}_{SS})$$

3.2. Setup and Key Generation

Let $\mathrm{Sub}$ denote the set of active participants in the dynamic MPC environment:

$\mathsf{Setup}(1^{\lambda}, l, k) \to \mathrm{pp}$: Given the security parameter $\lambda$, block length $l$, and block count $k$, output the public parameters $\mathrm{pp} = \{n, N, q, Q, B, d_{g}, a_{pub}, l, k\}$, where $n = l \cdot k$ is the LWE dimension, $N \geq n$ is the degree of the polynomial modulus $X^{N} + 1$, and $a_{pub} \leftarrow U(\mathcal{R}_{Q})$ is the common reference string.
$\mathsf{KeyGen}(\mathrm{pp}, u) \to (\mathbf{s}^{(u)}, z^{(u)}(X), \mathrm{PK}_{u})$: Each participant $u \in \mathrm{Sub}$ samples an LWE secret key $\mathbf{s}^{(u)} \leftarrow \mathcal{B}_{l, k}$. The corresponding RLWE key $z^{(u)}(X) \in \mathcal{R}$ is constructed to embed $\mathbf{s}^{(u)}$:

$$z^{(u)}(X) = \sum_{i = 0}^{n - 1} s_{i}^{(u)} X^{i} + \sum_{i = n}^{N - 1} r_{i} X^{i}$$

where $r_{i} \leftarrow U(\{-1, 0, 1\})$ are small random ternary coefficients. The public key is generated as $\mathrm{PK}_{u} = \mathrm{RLWE}_{z^{(u)}}(0) = (b^{(u)}, a_{pub})$.

$\mathsf{EvalKeyGen}(\mathrm{pp}, \mathbf{s}^{(u)}, z^{(u)}) \to (\mathrm{BRK}_{u}, \mathcal{K}_{P}^{(u)}, \mathcal{K}_{SS}^{(u)})$: Participant $u$ generates Blind Rotation Keys $\mathrm{BRK}_{u, i} \leftarrow \mathsf{MK\text{-}RGSW.Enc}(s_{i}^{(u)}, z^{(u)})$, Packing Keys $\mathcal{K}_{P}^{(u)}$ (for LWE-to-RLWE key-switching), and a Scheme Switching Key $\mathcal{K}_{SS}^{(u)} \leftarrow \mathrm{RGSW}(z^{(u)})$.
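The key embedding in KeyGen can be sketched as a coefficient list: the first $n$ coefficients of $z^{(u)}(X)$ are the LWE key bits and the remaining $N - n$ are fresh ternary values. The function name is hypothetical, and Python's `random` module is used only for illustration rather than as a secure sampler.

```python
import random

def embed_lwe_key(s, N, rng):
    """Coefficients of z(X) = sum_{i<n} s_i X^i + sum_{n<=i<N} r_i X^i, with r_i in {-1, 0, 1}."""
    n = len(s)
    if N < n:
        raise ValueError("the ring degree N must be at least the LWE dimension n")
    tail = [rng.choice((-1, 0, 1)) for _ in range(N - n)]  # assumption: demo-only randomness
    return list(s) + tail
```

For the dimensions discussed later ($n = 650$ or $n = 720$), any power-of-two degree $N \geq 1024$ leaves room for the ternary tail.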

Remark 2

(Symmetry and Asymmetry in DMBB-RCB). It is worth noting that our framework employs a hybrid cryptographic architecture that leverages both symmetric and asymmetric properties to optimize overall performance. Specifically, the initial encryption of private user data is strictly symmetric (Secret-Key LWE, as per Definition 4) to minimize client-side computational overhead and ciphertext expansion before transmission. In contrast, the homomorphic evaluation phase relies on an asymmetric (Public-Key) paradigm; the evaluation materials, including the Blind Rotation Keys (BRK) and Packing Keys, are published as Public-Key RLWE/RGSW ciphertexts. This allows any untrusted third-party server to process the data without access to the secret keys. Finally, the decryption process is structured as a distributed threshold protocol, where the publicly evaluated ciphertext is jointly decrypted using the participants’ individual symmetric secret keys.

Remark 3

(Cost of Dynamic Participant Addition). A significant advantage of our CRS-based dynamic setup is the minimal overhead required when a new participant joins the computation. Because the global public parameters $\mathrm{pp}$ and the common reference string $a_{pub}$ remain static, existing participants incur zero computational and communication overhead; they do not need to update, regenerate, or re-broadcast their keys. The new participant $u_{new}$ only needs to locally execute KeyGen and EvalKeyGen once. The cost for $u_{new}$ strictly consists of generating one LWE/RLWE key pair and producing their local evaluation keys ($\mathrm{BRK}$, $\mathcal{K}_{P}$, and $\mathcal{K}_{SS}$). The server simply appends these newly broadcast keys to its storage without halting ongoing independent computations.

3.3. The DMBB-RCB Evaluation Protocol

High-Level Overview of the DMBB-RCB Pipeline:

Before detailing the specific algorithms, we outline how the cryptographic primitives smoothly transition through our evaluation pipeline.

First, users encrypt their private inputs using standard Secret-Key LWE to minimize client-side overhead.

To initiate the bootstrapping, the Amortized Input Preparation phase homomorphically packs multiple such LWE ciphertexts into a single Public-Key RLWE accumulator polynomial.

Subsequently, the Blind Rotation is executed homomorphically over this RLWE accumulator using the users’ public RGSW evaluation keys.

Instead of reverting to LWE via key-switching, our Ring-Compact Extraction natively transforms the RLWE output into a Multi-Key RGSW ciphertext, keeping the data entirely within the ring domain.

Finally, when the computation is complete, the parties invoke the Distributed Decryption protocol to jointly decrypt the resulting ring-based ciphertext using their secret RLWE key shares.

3.3.1. Amortized Input Preparation

To amortize the bootstrapping cost, we aggregate $\nu$ independent scalar multi-key LWE ciphertexts $\{\mathrm{ct}_{\tau} = (\mathbf{a}_{\tau}, b_{\tau})\}_{\tau = 0}^{\nu - 1}$ into a single RLWE polynomial. The target polynomial slots are defined by a mapping function $p(\tau)$. The initialized accumulator $\mathrm{ACC}_{in} \in \mathcal{R}_{Q} \times \mathcal{R}_{Q}$ is constructed by homomorphically subtracting the secret-dependent parts using the aggregate Packing Keys $\mathcal{K}_{P} = \{\mathcal{K}_{P}^{(u)}\}_{u \in \mathrm{Sub}}$:

$$\mathrm{ACC}_{in} = \sum_{\tau = 0}^{\nu - 1} \Big( b_{\tau} - \sum_{u \in \mathrm{Sub}} \sum_{j = 0}^{n - 1} a_{\tau, j}^{(u)} s_{j}^{(u)} \Big) X^{p(\tau)}$$

This results in a valid multi-key RLWE encryption $\mathrm{ACC}_{in} \approx \mathrm{RLWE}_{\sum_{u} z^{(u)}} \big( \sum_{\tau = 0}^{\nu - 1} m_{\tau} X^{p(\tau)} \big)$.

3.3.2. Block-Binary Dynamic Blind Rotation

The core innovation reduces the external product complexity from $O(|\mathrm{Sub}| \cdot n)$ to $O(|\mathrm{Sub}| \cdot k)$ by exploiting the block sparsity. The algorithm computes $\mathrm{ACC}_{out} = \mathrm{ACC}_{in} \cdot X^{-\sum_{u \in \mathrm{Sub}} \langle \mathbf{a}^{(u)}, \mathbf{s}^{(u)} \rangle}$.

3.3.3. Ring-Compact Extraction

To extract a specific target message $m_{\tau^{*}}$ from the packed accumulator without leaving the ring domain, we execute the Ring-Compact sequence. First, a trivial rotation $\mathrm{ACC} \leftarrow \mathrm{ACC} \cdot X^{-p(\tau^{*})}$ shifts $m_{\tau^{*}}$ to the constant term. The homomorphic trace then isolates that constant term:

$$C_{trace} = \mathsf{HomTrace}(\mathrm{ACC}) = \sum_{i \in (\mathbb{Z} / 2N\mathbb{Z})^{\times}} \psi_{i}(\mathrm{ACC})$$

Subsequently, we convert the isolated RLWE ciphertext into a multi-key RGSW ciphertext using Scheme Switching:

$$\mathrm{CT}_{out} = \mathsf{SchemeSwitch}(C_{trace}, \{\mathcal{K}_{SS}^{(u)}\}_{u \in \mathrm{Sub}})$$

This closed-loop design ensures that $\mathrm{CT}_{out}$ can be fed directly into subsequent CMUX gates.
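The effect of the rotation followed by the trace can be checked on plaintext polynomials, without any encryption. The sketch below works in $\mathbb{Z}[X] / (X^{N} + 1)$ with hypothetical helper names. In this plaintext model the sum over all odd $j$ returns the constant term multiplied by $N$ and cancels every other coefficient; how that factor $N$ is compensated in the scheme is not stated in the text and is left open here.

```python
def automorphism(f, j, N):
    """psi_j: X -> X^j on Z[X]/(X^N + 1), for odd j."""
    out = [0] * N
    for i, c in enumerate(f):
        e = (i * j) % (2 * N)      # X has multiplicative order 2N
        if e < N:
            out[e] += c
        else:
            out[e - N] -= c        # X^N = -1
    return out

def rotate(f, shift, N):
    """Multiply f by X^{-shift} in Z[X]/(X^N + 1)."""
    out = [0] * N
    for i, c in enumerate(f):
        e = (i - shift) % (2 * N)
        if e < N:
            out[e] += c
        else:
            out[e - N] -= c
    return out

def trace(f, N):
    """Sum of psi_j(f) over all odd j in [1, 2N), i.e. over (Z/2NZ)^x."""
    out = [0] * N
    for j in range(1, 2 * N, 2):
        out = [x + y for x, y in zip(out, automorphism(f, j, N))]
    return out
```

With messages packed at slots 0, 2, 4, and 6 of a degree-8 ring, calling `trace(rotate(f, 4, 8), 8)` leaves only the message from slot 4, scaled by 8, in the constant term.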

3.4. Distributed Decryption

To recover the computation result while preserving circuit privacy, we apply a distributed decryption protocol with smudging noise [23].

First, a representative multi-key RLWE sample is extracted: $\mathrm{ct}_{rlwe} = (b, a) \leftarrow \mathsf{SampleExtract}(\mathrm{CT}_{out})$. Each participant $u \in \mathrm{Sub}$ computes a partial decryption share $\mu_{u}(X)$ by masking the exact key multiplication with an independent Gaussian smudging noise $e_{smudge}^{(u)} \leftarrow \mathcal{D}_{\mathcal{R}, \sigma_{smudge}}$:

$$\mu_{u}(X) = a(X) \cdot z^{(u)}(X) + e_{smudge}^{(u)}(X) \pmod{Q}$$

To guarantee statistical indistinguishability, the standard deviation must satisfy $\sigma_{smudge} \geq 2^{\lambda_{stat}} \cdot B_{noise}$. The shares are aggregated to cancel the public mask:

$$M_{noisy}(X) = b(X) + \sum_{u \in \mathrm{Sub}} \mu_{u}(X) \pmod{Q}$$

Finally, the exact message $m_{dec}$ is recovered by rounding the constant term of $M_{noisy}(X)$ relative to the scaling factor $\Delta$.
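The share-and-aggregate step can be exercised end to end on a toy instance. In the sketch below, a fresh RLWE encryption under the summed key $\sum_{u} z^{(u)}$ stands in for the bootstrapped output ciphertext, rounded continuous Gaussians stand in for the discrete Gaussian, and $\Delta = Q / p$. All names and parameter values are assumptions chosen for illustration, not the scheme's concrete instantiation.

```python
import random

def negacyclic_mul(f, g, N, Q):
    """f * g in Z_Q[X]/(X^N + 1), schoolbook multiplication."""
    out = [0] * N
    for i, fi in enumerate(f):
        for j, gj in enumerate(g):
            if i + j < N:
                out[i + j] += fi * gj
            else:
                out[i + j - N] -= fi * gj   # X^N = -1
    return [c % Q for c in out]

def small_poly(N, rng, sigma):
    """Rounded Gaussian coefficients (stand-in for the discrete Gaussian)."""
    return [round(rng.gauss(0, sigma)) for _ in range(N)]

def encrypt_under_joint_key(m, keys, N, Q, delta, rng):
    """(b, a) with b = -a * sum_u z^(u) + delta * m + e; stand-in for the evaluated ciphertext."""
    z_sum = [sum(col) for col in zip(*keys)]
    a = [rng.randrange(Q) for _ in range(N)]
    e = small_poly(N, rng, 3.0)
    az = negacyclic_mul(a, z_sum, N, Q)
    b = [(-az[i] + delta * m[i] + e[i]) % Q for i in range(N)]
    return b, a

def partial_share(a, z_u, N, Q, rng, sigma_smudge):
    """mu_u = a * z^(u) + e_smudge^(u) (mod Q)."""
    az = negacyclic_mul(a, z_u, N, Q)
    smudge = small_poly(N, rng, sigma_smudge)
    return [(az[i] + smudge[i]) % Q for i in range(N)]

def combine(b, shares, Q, p):
    """Round the constant term of b + sum_u mu_u relative to delta = Q / p."""
    m0 = (b[0] + sum(share[0] for share in shares)) % Q
    return ((m0 * p + Q // 2) // Q) % p
```

No participant ever reveals $z^{(u)}$: the aggregator sees only the shares, each of which is masked by its own smudging term.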

4. Correctness and Security Analysis

In this section, we formally establish the mathematical correctness of the Block-Binary Blind Rotation, derive the strict noise propagation bounds required for correct decryption, and prove the IND-CPA security and circuit privacy of the DMBB-RCB scheme.

4.1. Correctness Analysis

The correctness of DMBB-RCB hinges on the accurate homomorphic evaluation of the decryption phase within the accumulator. We first prove the correctness of the block selectors.

Theorem 1

(Correctness of Block Selectors). Let $\mathbf{s}_{\mathrm{Block}_{j}}^{(u)}$ be the $j$-th block of user $u$'s secret key, sampled from $\mathcal{B}_{l}$, and let $\mathrm{BRK}_{u, i}$ encrypt the bits of this block. The block selector $H_{u, j}$ constructed in Algorithm 1 accurately isolates the active rotation factor such that:

$$H_{u, j} \approx_{noise} \mathrm{TRGSW}\big( X^{-\langle \mathbf{a}_{\mathrm{Block}_{j}}^{(u)}, \mathbf{s}_{\mathrm{Block}_{j}}^{(u)} \rangle} - 1 \big)$$
Algorithm 1: Block-Binary Dynamic Blind Rotation.

Input: Initialized accumulator $\mathrm{ACC}_{in} \in \mathcal{R}_{Q}^{2}$; discretized mask vectors $\{\mathbf{a}^{(u)}\}_{u \in \mathrm{Sub}}$, where $\mathbf{a}^{(u)} \in \mathbb{T}^{n}$; Blind Rotation Keys $\mathrm{BRK} = \{\mathrm{BRK}_{u, i}\}$, where $\mathrm{BRK}_{u, i} \in \mathcal{R}_{Q}^{2 d_{g} \times 2}$.

Output: Rotated accumulator $\mathrm{ACC}_{out} \in \mathcal{R}_{Q}^{2}$.

$\mathrm{ACC} \in \mathcal{R}_{Q}^{2} \leftarrow \mathrm{ACC}_{in}$
for $j \leftarrow 0$ to $k - 1$ do (Global Block Iteration)
    for each $u \in \mathrm{Sub}$ do (User Contribution Loop)
        $I_{j} \leftarrow \{j \cdot l, \dots, j \cdot l + l - 1\}$
        Construct Block Selector: $H_{u, j} \in \mathcal{R}_{Q}^{2 d_{g} \times 2} \leftarrow \sum_{i \in I_{j}} \mathrm{BRK}_{u, i} \odot_{const} (X^{-a_{i}^{(u)}} - 1)$
        Accumulator Update: $\mathrm{ACC} \leftarrow \mathrm{ACC} + (\mathrm{ACC} \odot H_{u, j})$
    end for
end for
return $\mathrm{ACC}$

Proof.

By definition of the Block Binary Distribution, the Hamming weight satisfies $\|\mathbf{s}_{\mathrm{Block}_{j}}^{(u)}\|_{1} \leq 1$. We analyze the two possible cases for the selector construction $H_{u, j} = \sum_{i \in I_{j}} \mathrm{BRK}_{u, i} \odot_{const} (X^{-a_{i}^{(u)}} - 1)$:

Case 1 ($\|\mathbf{s}_{\mathrm{Block}_{j}}^{(u)}\|_{1} = 0$): For all $i \in I_{j}$, $s_{i}^{(u)} = 0$. Consequently, all $\mathrm{BRK}_{u, i}$ encrypt $0$. The linear combination homomorphically yields $0$, which trivially satisfies $X^{-\langle \mathbf{a}, \mathbf{0} \rangle} - 1 = 1 - 1 = 0$.
Case 2 ($\|\mathbf{s}_{\mathrm{Block}_{j}}^{(u)}\|_{1} = 1$): There exists a unique index $\kappa \in I_{j}$ such that $s_{\kappa}^{(u)} = 1$, and $s_{i}^{(u)} = 0$ for all $i \neq \kappa$. The homomorphic sum collapses to the single non-zero term corresponding to $\kappa$:

$$H_{u, j} \approx_{noise} 1 \cdot (X^{-a_{\kappa}^{(u)}} - 1) + \sum_{i \neq \kappa} 0 = X^{-a_{\kappa}^{(u)}} - 1$$

Since $\langle \mathbf{a}_{\mathrm{Block}_{j}}^{(u)}, \mathbf{s}_{\mathrm{Block}_{j}}^{(u)} \rangle = a_{\kappa}^{(u)}$, the theorem holds. □

Theorem 2

(Correctness of Blind Rotation). Algorithm 1 transforms the initialized accumulator $\mathrm{ACC}_{in}$ into $\mathrm{ACC}_{out} \approx_{noise} \mathrm{ACC}_{in} \cdot X^{-\sum_{u \in \mathrm{Sub}} \langle \mathbf{a}^{(u)}, \mathbf{s}^{(u)} \rangle}$.

Proof.

The iterative update per block is $\mathrm{ACC} \leftarrow \mathrm{ACC} + (\mathrm{ACC} \odot H_{u, j}) = \mathrm{ACC} \odot (1 + H_{u, j})$. Substituting Theorem 1, the multiplier $1 + H_{u, j}$ evaluates to $X^{-\langle \mathbf{a}_{\mathrm{Block}_{j}}^{(u)}, \mathbf{s}_{\mathrm{Block}_{j}}^{(u)} \rangle}$. By the homomorphism of the external product over $k$ blocks and $|\mathrm{Sub}|$ users, the per-block phase shifts accumulate:

$$\mathrm{ACC}_{out} \approx_{noise} \mathrm{ACC}_{in} \cdot \prod_{u \in \mathrm{Sub}} \prod_{j = 0}^{k - 1} X^{-\langle \mathbf{a}_{\mathrm{Block}_{j}}^{(u)}, \mathbf{s}_{\mathrm{Block}_{j}}^{(u)} \rangle} = \mathrm{ACC}_{in} \cdot X^{-\sum_{u \in \mathrm{Sub}} \langle \mathbf{a}^{(u)}, \mathbf{s}^{(u)} \rangle}$$

This completes the homomorphic evaluation of the decryption phase. □
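The argument of Theorems 1 and 2 can be replayed on plaintexts: if each selector is replaced by the polynomial it encrypts, one accumulator update per block and per user yields the full rotation. The sketch below does exactly that in $\mathbb{Z}[X] / (X^{N} + 1)$, assuming the mask entries have already been rescaled to integers modulo $2N$. It carries no encryption and no noise, and all function names are hypothetical.

```python
import random

def negacyclic_mul(f, g, N):
    """f * g in Z[X]/(X^N + 1), schoolbook multiplication."""
    out = [0] * N
    for i, fi in enumerate(f):
        for j, gj in enumerate(g):
            if i + j < N:
                out[i + j] += fi * gj
            else:
                out[i + j - N] -= fi * gj   # X^N = -1
    return out

def monomial(e, N):
    """X^e in Z[X]/(X^N + 1) for any integer e (X has order 2N)."""
    e %= 2 * N
    m = [0] * N
    if e < N:
        m[e] = 1
    else:
        m[e - N] = -1
    return m

def block_selector(a_block, s_block, N):
    """Plaintext of H_{u,j}: sum over the block of s_i * (X^{-a_i} - 1)."""
    h = [0] * N
    for a_i, s_i in zip(a_block, s_block):
        if s_i:
            term = monomial(-a_i, N)
            term[0] -= 1
            h = [x + y for x, y in zip(h, term)]
    return h

def blind_rotate_plain(acc, masks, keys, l, N):
    """Plaintext analogue of Algorithm 1; returns (accumulator, number of updates)."""
    n = len(next(iter(keys.values())))
    k = n // l
    updates = 0
    for j in range(k):                       # global block iteration
        for u in keys:                       # user contribution loop
            blk = slice(j * l, (j + 1) * l)
            h = block_selector(masks[u][blk], keys[u][blk], N)
            acc = [x + y for x, y in zip(acc, negacyclic_mul(acc, h, N))]
            updates += 1
    return acc, updates
```

The update count is $|\mathrm{Sub}| \cdot k$ regardless of the key values, which is the source of the claimed factor-$l$ saving over a bit-by-bit rotation.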

4.2. Noise Bounding Analysis

We track the worst-case variance bounds of the noise through the protocol to establish the parameter constraints. Let $V_{LWE}$, $V_{KS}$, and $V_{ext}$ denote the noise variances of a fresh LWE ciphertext, an LWE-to-RLWE key-switching operation, and a TRGSW external product, respectively.

Theorem 3 (Total Error Bound).

For a packing factor $\nu$, block count $k = n / l$, and $|\mathrm{Sub}|$ participants, correct decryption is guaranteed if the final output variance $V_{out}$ satisfies $V_{out} \leq \frac{1}{6^{2}} \left( \frac{Q}{2 \cdot B_{msg}} \right)^{2}$, where

$$V_{out} \leq \underbrace{(\nu V_{LWE} + |\mathrm{Sub}| \, n \, V_{KS})}_{V_{pack}} + \underbrace{(|\mathrm{Sub}| \, k \, V_{ext})}_{V_{drift}} + V_{SS}$$

Proof.

Packing Noise ($V_{pack}$): Initializing the accumulator via $\mathsf{PackLWE}$ aggregates noise linearly with the $\nu$ input ciphertexts and the required $|\mathrm{Sub}| \cdot n$ key-switching operations.
Blind Rotation Drift ($V_{drift}$): In standard MK-TFHE, noise accumulates additively over $n$ external products per user ($V_{std} \leq |\mathrm{Sub}| \, n \, V_{ext}$). Our block-binary optimization executes only one external product per block. Thus, the drift is bounded by $V_{drift} \leq |\mathrm{Sub}| \, k \, V_{ext} = |\mathrm{Sub}| \, \frac{n}{l} \, V_{ext}$, reducing the dominant noise growth by a factor of $l$.
Extraction ($V_{SS}$): The Scheme Switching introduces controlled noise proportional to its gadget decomposition depth. Combining these terms yields the total variance, which must remain below the 6-sigma decoding gap ($\sigma_{gap} \approx 6$). □
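To see how the terms of Theorem 3 trade off, the bound can be evaluated numerically. The sketch below is a direct transcription of the two inequalities. The per-operation variances passed in are placeholders chosen by the caller, not values derived from any parameter set, and the function names are hypothetical.

```python
def total_variance(nu, n, l, users, v_lwe, v_ks, v_ext, v_ss):
    """Upper bound on V_out from Theorem 3, split into its named terms."""
    if n % l:
        raise ValueError("n must be a multiple of the block length l")
    k = n // l
    v_pack = nu * v_lwe + users * n * v_ks   # packing noise
    v_drift = users * k * v_ext              # one external product per block and user
    return {"pack": v_pack, "drift": v_drift, "total": v_pack + v_drift + v_ss}

def variance_budget(Q, b_msg):
    """Right-hand side of the decryption condition: (Q / (2 * B_msg))^2 / 6^2."""
    return (Q / (2 * b_msg)) ** 2 / 6 ** 2
```

Calling `total_variance` with `l=1` and then with a larger block length, all else equal, shows the drift term shrinking by exactly the factor $l$ while the packing term is unchanged.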

4.3. Security Analysis

Our security relies on the standard Decision RLWE assumption and the Block-Binary LWE assumption.

Assumption 1

(Block-Binary LWE). Let $\mathbf{s} \leftarrow \mathcal{B}_{l, k}$. The Block-Binary LWE assumption posits that for appropriate parameters, the distribution of samples $(\mathbf{A}, \mathbf{A}\mathbf{s} + \mathbf{e})$ is computationally indistinguishable from uniform over $\mathbb{Z}_{q}^{m \times n} \times \mathbb{Z}_{q}^{m}$. Recent cryptanalysis confirms its hardness against hybrid dual attacks with minimal security loss for small block lengths (e.g., $l \in \{2, 3, 4\}$).

Theorem 4

(IND-CPA Security). Under the RLWE and Block-Binary LWE assumptions, the DMBB-RCB scheme is IND-CPA secure against a probabilistic polynomial-time (PPT) adversary $\mathcal{A}$.

Proof.

Proof Sketch (Hybrid Argument). Let $\mathrm{Adv}_{\mathcal{A}}^{Game_{i}}$ be the advantage of $\mathcal{A}$ in Game $i$:

Game 0 (Real): The standard IND-CPA game.
Game 1 (Random Public Keys): We replace $\mathrm{PK}_{u} = \mathrm{RLWE}_{z^{(u)}}(0)$ with random pairs from $U(\mathcal{R}_{Q}^{2})$. By the RLWE assumption, $|\mathrm{Adv}_{\mathcal{A}}^{Game_{0}} - \mathrm{Adv}_{\mathcal{A}}^{Game_{1}}| \leq \mathrm{negl}(\lambda)$.
Games 2 & 3 (Simulated Eval Keys): We replace the Packing Keys $\mathcal{K}_{P}$ and Blind Rotation Keys $\mathrm{BRK}$ with encryptions of 0. Since these keys are valid RLWE/RGSW ciphertexts and their underlying RLWE secret $z^{(u)}$ is protected (from Game 1), semantic security guarantees $|\mathrm{Adv}_{\mathcal{A}}^{Game_{1}} - \mathrm{Adv}_{\mathcal{A}}^{Game_{3}}| \leq \mathrm{negl}(\lambda)$. The sparse structure of $\mathbf{s}^{(u)}$ as the message in the BRK does not compromise the RLWE hardness.
Game 4 (Random Challenge): We replace the challenge LWE ciphertext $(\mathbf{a}^{*}, b^{*})$ with a uniform random vector. By Assumption 1, $|\mathrm{Adv}_{\mathcal{A}}^{Game_{3}} - \mathrm{Adv}_{\mathcal{A}}^{Game_{4}}| \leq \mathrm{negl}(\lambda)$. In Game 4, the adversary’s advantage is exactly 0. Thus, the scheme is IND-CPA secure. □

Theorem 5

(1-Hop Circuit Privacy). If the smudging noise standard deviation satisfies $\sigma_{smudge} \geq 2^{\lambda_{stat}} \cdot B_{out}$, the partial decryption shares $\mu_{u}$ can be simulated given only the final output, ensuring circuit privacy.

Proof.

The share computation $\mu_{u} = a \cdot z^{(u)} + e_{smudge}^{(u)}$ masks the input noise structure. By the Smudging Lemma, the statistical distance between $(a, a \cdot z^{(u)} + e_{smudge} + e_{input})$ and $(a, a \cdot z^{(u)} + e_{smudge})$ is bounded by $2^{-\lambda_{stat}}$. Thus, the joint decryption reveals no information about the secret keys $z^{(u)}$ beyond the final plaintext result. □

5. Theoretical Performance and Complexity Evaluation

In this section, we evaluate the theoretical performance of the proposed DMBB-RCB scheme. We explicitly note that this section presents an analytical and theoretical evaluation grounded in established cryptographic metrics, rather than an implementation-based software benchmark. We incorporate the complexity analysis deferred from Section 4, establish concrete parameter sets satisfying the derived noise bounds, and compare the asymptotic latency and throughput against existing state-of-the-art Multi-Key FHE schemes.

5.1. Asymptotic Complexity Comparison

The efficiency of a TFHE-style scheme is heavily dominated by the sequence of external products in the Blind Rotation phase. We compare DMBB-RCB against standard Static MK-TFHE and Dynamic MK-TFHE. Let $p = |\mathrm{Sub}|$ be the number of active participants, $n$ be the LWE dimension, $l$ be the block length, and $\nu$ be the packing factor. Let $T_{ext}$ and $T_{ks}$ denote the computational cost of a single TRGSW external product and an LWE-to-RLWE key-switching operation, respectively.

Existing schemes mandate iterating through every bit of the expanded multi-key, resulting in a blind rotation complexity of $O(p \cdot n \cdot T_{ext})$. Furthermore, standard schemes output a scalar LWE sample, requiring an additional $O(n \cdot T_{ks})$ key-switching overhead to return to an RLWE format for subsequent gates.

In contrast, DMBB-RCB reduces the blind rotation complexity to $O(p \cdot \frac{n}{l} \cdot T_{ext})$ by iterating over $k = n / l$ blocks. Concurrently, the amortized cost per bit is reduced to $O(p \cdot \frac{n}{l \cdot \nu} \cdot T_{ext})$ via the $\mathsf{PackLWE}$ mechanism. The Ring-Compact architecture eliminates the post-bootstrapping key-switching overhead entirely, as summarized in Table 1.
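The asymptotic expressions above translate into simple operation counts. The sketch below tallies external products only, in units of $T_{ext}$, and ignores constants hidden by the $O(\cdot)$ notation. The participant count and packing factor in any call are the caller's choice, and the function name is hypothetical.

```python
def blind_rotation_costs(p, n, l, nu):
    """External-product counts (in units of T_ext) for one bootstrapping."""
    if n % l:
        raise ValueError("n must be a multiple of the block length l")
    k = n // l
    return {
        "standard": p * n,                       # bit-by-bit blind rotation
        "block_binary": p * k,                   # one update per block and participant
        "amortized_per_message": p * k / nu,     # shared across nu packed messages
    }
```

For instance, with four participants and eight packed messages (both illustrative), a dimension of $n = 720$ with $l = 4$ gives 2880 external products for the bit-by-bit rotation, 720 for the block-binary rotation, and 90 per message after amortization.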

5.2. Concrete Parameter Selection

To validate the feasibility of DMBB-RCB, we select parameters targeting 128-bit security. According to recent cryptanalysis, the LWE dimension $n$ must be increased compared to standard binary keys to resist hybrid dual attacks when utilizing block-binary keys. Specifically, to maintain a strict 128-bit security level, standard binary keys ($l = 1$) typically require an LWE dimension of $n \approx 600$. Our concrete parameter selection confirms that employing block lengths of $l = 2$ (Set-I) and $l = 4$ (Set-II) necessitates expanding the dimensions to $n = 650$ and $n = 720$, respectively. Relative to $n \approx 600$, these are increases of roughly 8% and 20%, which provide the margin required to offset the structural sparsity. The polynomial modulus parameters $(N, Q)$ and the standard deviations of the error distributions ($α$ for LWE, $β$ for RLWE/RGSW) must carefully balance the noise constraints in Theorem 3 and the standard Lattice Estimator security bounds. Specifically, to guarantee a 128-bit security level against known lattice attacks (e.g., uSVP and dual attacks), the error standard deviations are chosen appropriately relative to the dimensions. In our concrete instantiation, we set the LWE error rate to $α \approx 2^{-15}$ and the RLWE error rate to $β \approx 2^{-25}$ for Set-I, and adjust them marginally for Set-II to accommodate the larger packing factor. As presented in Table 2, we propose two parameter sets derived from the noise bounds in Theorem 3: Set-I targets low-latency execution, while Set-II maximizes high-throughput packing.
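The two parameter sets can be written down as plain records to check the derived quantities. This is an illustrative sketch only: the `ParamSet` class and its method names are hypothetical, the field values are copied from Table 2, and the reference dimension of 600 for binary keys is the approximate figure quoted above rather than an exact estimator output.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ParamSet:
    name: str
    l: int       # block length
    n: int       # LWE dimension
    N: int       # RLWE degree
    log2_Q: int  # log2 of the RLWE modulus
    nu: int      # packing factor

    @property
    def k(self) -> int:
        """Block count k = n / l; l must divide n."""
        if self.n % self.l != 0:
            raise ValueError("block length l must divide n")
        return self.n // self.l

    def dimension_margin(self, n_binary: int = 600) -> float:
        """Relative increase of n over the binary-key dimension."""
        return self.n / n_binary - 1.0


SET_I = ParamSet("Set-I", l=2, n=650, N=1024, log2_Q=25, nu=1)
SET_II = ParamSet("Set-II", l=4, n=720, N=2048, log2_Q=25, nu=1024)

for ps in (SET_I, SET_II):
    print(ps.name, "k =", ps.k, "margin =", f"{ps.dimension_margin():.1%}")
```

A record like this does not establish security; it only checks that $k = n / l$ is an integer and makes the dimension overhead explicit. The security level itself would still have to be confirmed with a lattice estimator.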

The packing factor $ν$ directly dictates the initial noise accumulation. As defined in Theorem 3, the noise variance from packing grows linearly with $ν$. To ensure the final noise variance remains below the decoding gap ($V_{out} \leq V_{max}$), the packing factor is bounded approximately by $ν_{max} \approx \lfloor (V_{max} - V_{drift} - V_{SS}) / V_{LWE} \rfloor$. For our Set-II parameters ($N = 2048, Q = 2^{25}$), $ν = 1024$ approaches this upper boundary; exceeding it would cause decryption failures due to noise overflow.
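The bound on the packing factor is a one-line computation once the variance terms are known. The sketch below assumes the additive model implied by the formula above, $V_{out} \approx ν \cdot V_{LWE} + V_{drift} + V_{SS}$. The function name is hypothetical, and the numeric variances are placeholders chosen as powers of two so the arithmetic is exact; they are not the values from Theorem 3.

```python
import math


def max_packing_factor(v_max: float, v_drift: float, v_ss: float, v_lwe: float) -> int:
    """Largest nu with nu * v_lwe + v_drift + v_ss <= v_max (0 if none)."""
    if v_lwe <= 0:
        raise ValueError("per-ciphertext LWE variance must be positive")
    budget = v_max - v_drift - v_ss  # variance left over for packing noise
    if budget <= 0:
        return 0
    return math.floor(budget / v_lwe)


# Placeholder variances (NOT from the paper), chosen only for illustration.
nu_max = max_packing_factor(
    v_max=2.0 ** -8,
    v_drift=2.0 ** -10,
    v_ss=2.0 ** -10,
    v_lwe=2.0 ** -18,
)
print(nu_max)
```

With real variances from the noise analysis substituted in, a chosen $ν$ is admissible only if it does not exceed the returned value.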

5.3. Theoretical Performance Projections

We project the execution time based on standard AVX-512 accelerated CPU single-thread execution, estimating one TRGSW External Product ($T_{ext}$) at approximately 10 ms for $N = 1024$ and 35 ms for $N = 2048$. We consider a collaborative group of $p = 4$ participants:

Latency: For standard MK-TFHE ($n = 650$), evaluating one gate takes $\approx 4 \times 650 \times 10\,\text{ms} = 26\,\text{s}$. Utilizing Set-I ($l = 2$), DMBB-RCB halves this execution time to $\approx 4 \times 325 \times 10\,\text{ms} = 13\,\text{s}$.
Amortized Throughput: Utilizing Set-II ($l = 4, ν = 1024, N = 2048$), one bootstrapping execution takes $\approx 4 \times 180 \times 35\,\text{ms} = 25.2\,\text{s}$. However, this single execution refreshes 1024 logic gates simultaneously. The standard scheme achieves an effective throughput of $\approx 1 / 26\,\text{s} \approx 0.038$ gates/s, whereas DMBB-RCB achieves $\approx 1024 / 25.2\,\text{s} \approx 40.6$ gates/s. This represents a theoretical throughput improvement of approximately $1000\times$.
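The projections above follow from multiplying the participant count, the number of blind rotation iterations, and the per-product cost. The sketch below reproduces that arithmetic under the same assumptions as the text ($p = 4$, $T_{ext}$ of 10 ms and 35 ms); the function and variable names are hypothetical, and these are projected figures, not measurements.

```python
def bootstrap_seconds(p: int, iterations: int, t_ext_ms: float) -> float:
    """Projected time of one bootstrapping: p * iterations * T_ext."""
    return p * iterations * t_ext_ms / 1000.0


P = 4  # participants

standard_latency = bootstrap_seconds(P, 650, 10)  # bit-wise, n = 650, N = 1024
set_i_latency = bootstrap_seconds(P, 325, 10)     # Set-I, k = 650 / 2
set_ii_latency = bootstrap_seconds(P, 180, 35)    # Set-II, k = 720 / 4, N = 2048

standard_throughput = 1 / standard_latency        # one gate per bootstrapping
set_ii_throughput = 1024 / set_ii_latency         # nu = 1024 gates per bootstrapping
speedup = set_ii_throughput / standard_throughput

print(f"latency:    {standard_latency:.1f} s vs {set_i_latency:.1f} s")
print(f"throughput: {standard_throughput:.3f} vs {set_ii_throughput:.1f} gates/s")
print(f"speedup:    {speedup:.0f}x")
```

Note that the Set-II latency of a single execution is close to the bit-wise baseline; the gain comes entirely from refreshing $ν$ gates per execution.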

5.4. Storage and Communication Trade-Offs

While computation is highly optimized, the Ring-Compact property necessitates specific storage trade-offs:

Key Storage: The Blind Rotation Keys ($BRK$) still require encryptions of individual bits to construct the block selector $H_{u, j}$, maintaining a size of $n \times |RGSW|$, comparable to standard schemes. However, embedding the LWE key into the RLWE key (Section 3.2) saves $\approx 50\%$ of the key generation storage overhead compared to separate key formulations.
Ciphertext Expansion: The output is an MK-RGSW ciphertext containing $2 d_{g}$ RLWE ciphertexts. For a depth of $d_{g} = 3$, the output size is $\approx 48$ KB per user share, compared to $\approx 2.5$ KB for a scalar LWE sample. While this represents a roughly $19\times$ ciphertext expansion, it is a favorable trade-off in realistic MPC scenarios. In modern cloud environments and wide-area networks (WANs), bandwidth is generally abundant, whereas network round-trip latency is the primary bottleneck for distributed systems. Traditional schemes require frequent, interactive key-switching protocols between all parties after each bootstrapping, causing severe latency degradation. By strictly containing the output within the 48 KB MK-RGSW format, DMBB-RCB achieves zero interaction during the depth-unbounded circuit evaluation phase. Multi-party communication is deferred to, and required only for, the final distributed decryption protocol. Exchanging 48 KB of data per share at the very end of the computation is negligible compared to the reduction in interactive communication rounds during the evaluation phase.
Runtime Memory Consumption: Beyond static storage, we must also account for dynamic memory (RAM) consumption during the bootstrapping evaluation. In MK-FHE, the primary memory footprint stems from loading the multi-key Blind Rotation Keys (BRK) and the accumulator states into active memory. For $p$ participants, the total evaluation key size scales as $O(p \cdot n \cdot |RGSW|)$. Using our Set-II parameters ($n = 720, N = 2048$), the BRK size per user is on the order of several hundred megabytes. While our block-binary logic reduces the computational iterations, the entire key must still reside in memory. Consequently, evaluating deep circuits for multiple users requires gigabytes of RAM. This memory consumption is manageable for modern cloud servers (the intended environment for DMBB-RCB), but it necessitates proper memory provisioning and precludes deployment on highly memory-constrained edge devices.
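The ciphertext sizes quoted above are consistent with a simple byte count under two assumptions that the text does not state explicitly: coefficients are stored in 32-bit words, and the Set-I dimensions ($N = 1024$, $n = 650$) are used. The sketch below makes that count explicit; the helper names are hypothetical, and the figures are a plausibility check rather than the authors' accounting.

```python
WORD_BYTES = 4  # assumption: each coefficient (log2 Q = 25) is stored in 32 bits


def rlwe_bytes(N: int) -> int:
    """One RLWE ciphertext: two polynomials of N coefficients each."""
    return 2 * N * WORD_BYTES


def rgsw_share_bytes(N: int, d_g: int) -> int:
    """One per-user RGSW share: 2 * d_g RLWE ciphertexts."""
    return 2 * d_g * rlwe_bytes(N)


def lwe_bytes(n: int) -> int:
    """One scalar LWE sample: n mask coefficients plus the body."""
    return (n + 1) * WORD_BYTES


rgsw = rgsw_share_bytes(N=1024, d_g=3)  # assumed Set-I ring degree
lwe = lwe_bytes(n=650)                  # assumed Set-I LWE dimension
expansion = rgsw / lwe

print(f"RGSW share: {rgsw / 1024:.0f} KiB")
print(f"LWE sample: {lwe / 1024:.2f} KiB")
print(f"expansion:  {expansion:.1f}x")
```

Under the same word-size assumption, doubling the ring degree to $N = 2048$ (Set-II) would double the RGSW share size, so the expansion factor depends on which parameter set is in use.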

5.5. Evaluation Summary

The theoretical evaluation confirms that DMBB-RCB resolves the primary scalability bottlenecks of MK-TFHE. The sparse key structure reduces computational latency by a factor of $l$, while amortized packing increases throughput by a factor of $ν$. The computational cost grows linearly with $p$ but is strictly bounded by a shallower slope coefficient ($n / l$ vs. $n$), verifying its suitability for large-scale dynamic MPC tasks.

6. Conclusions

In this paper, we addressed the critical scalability and throughput bottlenecks inherent in Multi-Key Homomorphic Encryption (MKHE) by proposing the Dynamic Multi-Key Block-Binary Ring-Compact Bootstrapping (DMBB-RCB) framework.

By abandoning the conventional bit-wise processing paradigm, DMBB-RCB synergizes three advanced cryptographic optimizations: (1) adopting a Block-Binary distribution ($𝓑_{l, k}$) for secret keys to strictly reduce the dominant blind rotation complexity from $O(p \cdot n)$ to $O(p \cdot n / l)$; (2) integrating an amortized multi-key $\mathsf{PackLWE}$ mechanism to process $ν$ independent messages in parallel, thereby increasing the amortized throughput by orders of magnitude; and (3) implementing a Ring-Compact extraction architecture via Scheme Switching, which outputs Multi-Key RGSW ciphertexts natively to enable depth-unbounded, closed-loop evaluation without interactive LWE-to-RLWE key-switching.

Consequently, DMBB-RCB bridges the gap between the programmable logic flexibility of TFHE and the SIMD efficiency of leveled FHE schemes [12,24]. The dynamic common reference string (CRS) design and interaction-free extraction during the evaluation phase (with interaction restricted solely to the final decryption step) make this framework highly scalable and communication-efficient, positioning it as a robust cryptographic foundation for dynamic Secure Multi-Party Computation (MPC) environments, such as federated learning and collaborative cloud analytics.

7. Limitations and Future Research

While DMBB-RCB achieves substantial computational optimizations, we acknowledge specific trade-offs that motivate our future research directions:

Ciphertext Expansion and Bandwidth: The native output of our Ring-Compact scheme is a Multi-Key RGSW ciphertext, which comprises a matrix of polynomials. This is significantly larger than a scalar LWE sample utilized in standard TFHE, necessitating higher transmission bandwidth during the final result retrieval phase.
Latency vs. Throughput Profile: The initial noise term introduced by polynomial packing grows with the packing factor $ν$ , requiring careful parameter bounds. Furthermore, the overhead of the packing and scheme-switching sub-routines slightly increases the latency of a single bootstrapping execution. Thus, DMBB-RCB is optimized for batched, high-throughput processing rather than ultra-low-latency, real-time control systems.

Building upon these properties, future research will focus on the following trajectories:

Hardware Acceleration: The structured block-wise external products inherent to our block-binary blind rotation are highly amenable to hardware parallelization. Future implementations will explore distributing these operations across FPGAs or GPUs to further minimize absolute latency.
Verifiable MKHE (Malicious Security): To secure the protocol against malicious participants who might supply malformed sparse keys or ciphertexts, integrating Zero-Knowledge Proofs (ZKP) to verify the validity of the $𝓑_{l, k}$ keys and the packing step remains a crucial open problem.
Advanced Packing and Hybrid Architectures: We aim to explore automorphism-based slot permutations within the DMBB-RCB accumulator to evaluate complex linear algebra operations directly inside the bootstrapping loop. Additionally, investigating a hybrid framework that switches between DMBB-RCB (for non-linear Boolean logic) and CKKS (for precision arithmetic) could yield a comprehensive solution for Privacy-Preserving Machine Learning as a Service (PPMLaaS).
Software Engineering and Concrete Implementation: The performance evaluations presented in this manuscript are strictly theoretical projections grounded in empirical baseline metrics. Developing a production-ready, highly optimized multi-party execution framework in low-level languages (such as C++ or Rust) falls outside the mathematical scope of this paper. A full-scale implementation, which must rigorously address memory management, side-channel resistance, and network synchronization, is a substantial independent software engineering undertaking. This C++/Rust implementation, alongside hardware co-design, is our immediate next step toward transitioning DMBB-RCB from a mathematical protocol to a deployable library.

Author Contributions

Conceptualization, Q.X. and R.H.; methodology, Q.X.; validation, Q.X. and R.H.; formal analysis, Q.X.; investigation, Q.X.; writing—original draft preparation, Q.X.; writing—review and editing, Q.X. and R.H.; supervision, R.H.; funding acquisition, R.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation Project of China (No. 62062009) and the Guangxi Key Research and Development Program Project (No. AB24010340).

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Gentry, C. Fully homomorphic encryption using ideal lattices. In Proceedings of the 41st Annual ACM Symposium on Theory of Computing (STOC), Bethesda, MD, USA, 31 May–2 June 2009; Association for Computing Machinery: New York, NY, USA, 2009; pp. 169–178.
2. Chillotti, I.; Gama, N.; Georgieva, M.; Izabachène, M. Faster fully homomorphic encryption: Bootstrapping in less than 0.1 seconds. In Advances in Cryptology—ASIACRYPT 2016: 22nd International Conference on the Theory and Application of Cryptology and Information Security, Hanoi, Vietnam, 4–8 December 2016, Proceedings, Part I; Springer: Berlin/Heidelberg, Germany, 2016; pp. 3–33.
3. Chillotti, I.; Gama, N.; Georgieva, M.; Izabachène, M. TFHE: Fast fully homomorphic encryption over the torus. J. Cryptol. 2020, 33, 34–91.
4. Regev, O. On lattices, learning with errors, random linear codes, and cryptography. J. ACM (JACM) 2009, 56, 1–40.
5. López-Alt, A.; Tromer, E.; Vaikuntanathan, V. On-the-fly multiparty computation on the cloud via multikey fully homomorphic encryption. In Proceedings of the 44th Annual ACM Symposium on Theory of Computing (STOC), New York, NY, USA, 19–22 May 2012; Association for Computing Machinery: New York, NY, USA, 2012; pp. 1219–1234.
6. Clear, M.; McGoldrick, C. Multi-identity and multi-key homomorphic encryption from LWE. In Advances in Cryptology—CRYPTO 2015: 35th Annual Cryptology Conference, Santa Barbara, CA, USA, 16–20 August 2015, Proceedings, Part I; Springer: Berlin/Heidelberg, Germany, 2015; pp. 399–418.
7. Mukherjee, P.; Wichs, D. Two round multiparty computation via multi-key FHE. In Advances in Cryptology—EUROCRYPT 2016: 35th Annual International Conference on the Theory and Applications of Cryptographic Techniques, Vienna, Austria, 8–12 May 2016, Proceedings, Part II; Springer: Berlin/Heidelberg, Germany, 2016; pp. 735–763.
8. Peikert, C.; Shiehian, S. Multi-key FHE from LWE, revisited. In Theory of Cryptography: 14th International Conference, TCC 2016-B, Beijing, China, 31 October–3 November 2016, Proceedings, Part I; Springer: Berlin/Heidelberg, Germany, 2016; pp. 217–238.
9. Chen, H.; Chillotti, I.; Song, Y. Multi-key homomorphic encryption from TFHE. In Advances in Cryptology—ASIACRYPT 2019: 25th International Conference on the Theory and Application of Cryptology and Information Security, Kobe, Japan, 8–12 December 2019, Proceedings, Part II; Springer International Publishing: Cham, Switzerland, 2019; pp. 446–472.
10. Kwak, H.; Min, S.; Song, Y. Towards practical multi-key TFHE: Parallelizable, key-compatible, quasi-linear complexity. In Public-Key Cryptography—PKC 2024: 27th IACR International Conference on Practice and Theory of Public-Key Cryptography, Sydney, NSW, Australia, 15–17 April 2024, Proceedings, Part IV; Springer: Berlin/Heidelberg, Germany, 2024; pp. 354–385.
11. Smart, N.P.; Vercauteren, F. Fully homomorphic SIMD operations. Des. Codes Cryptogr. 2014, 71, 57–81.
12. Brakerski, Z.; Gentry, C.; Vaikuntanathan, V. (Leveled) fully homomorphic encryption without bootstrapping. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (ITCS), Cambridge, MA, USA, 8–10 January 2012; Association for Computing Machinery: New York, NY, USA, 2012; pp. 309–325.
13. Cheon, J.H.; Kim, A.; Kim, M.; Song, Y. Homomorphic encryption for arithmetic of approximate numbers. In Advances in Cryptology—ASIACRYPT 2017: 23rd International Conference on the Theory and Applications of Cryptology and Information Security, Hong Kong, China, 3–7 December 2017, Proceedings, Part I; Springer: Berlin/Heidelberg, Germany, 2017; pp. 409–437.
14. Micciancio, D.; Sorrell, J. Ring packing and amortized FHEW bootstrapping. In Proceedings of the 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018), Prague, Czech Republic, 9–13 July 2018; pp. 111:1–111:14.
15. Lee, C.; Min, S.; Seo, J.; Song, Y. Faster TFHE bootstrapping with block binary keys. In Proceedings of the 2023 ACM Asia Conference on Computer and Communications Security (ASIA CCS ’23), Melbourne, VIC, Australia, 10–14 July 2023.
16. Biswas, C.; Dutta, R. Dynamic multi-key FHE in symmetric key setting from LWE without using common reference matrix. J. Ambient Intell. Humaniz. Comput. 2022, 13, 1241–1254.
17. Xu, K.; Huang, R. Accelerated multi-key homomorphic encryption via automorphism-based circuit bootstrapping. IEEE Access 2025, 13, 1636–1650.
18. Wang, R.; Wen, Y.; Li, Z.; Lu, X.; Wei, B.; Liu, K.; Wang, K. Circuit Bootstrapping: Faster and Smaller. In Advances in Cryptology—EUROCRYPT 2024: 43rd Annual International Conference on the Theory and Applications of Cryptographic Techniques, Zurich, Switzerland, 26–30 May 2024, Proceedings, Part II; Springer Nature: Cham, Switzerland, 2024; pp. 342–372.
19. Lee, S.; Kim, D.; Shin, D.-J. Fast, Compact and Hardware-Friendly Bootstrapping in Less than 3 ms Using Multiple Instruction Multiple Ciphertext. IACR Cryptology ePrint Archive, Paper 2024/1916. 2024. Available online: https://eprint.iacr.org/2024/1916 (accessed on 10 February 2026).
20. Bernard, O.; Joye, M. Bootstrapping (T)FHE Ciphertexts via Automorphisms: Closing the Gap Between Binary and Gaussian Keys. IACR Cryptology ePrint Archive, Paper 2025/163. 2025. Available online: https://eprint.iacr.org/2025/163 (accessed on 10 February 2026).
21. Lyubashevsky, V.; Peikert, C.; Regev, O. On ideal lattices and learning with errors over rings. In Advances in Cryptology—EUROCRYPT 2010: 29th Annual International Conference on the Theory and Applications of Cryptographic Techniques, French Riviera, 30 May–3 June 2010, Proceedings; Springer: Berlin/Heidelberg, Germany, 2010; pp. 1–23.
22. Gentry, C.; Sahai, A.; Waters, B. Homomorphic encryption from learning with errors: Conceptually-simpler, asymptotically-faster, attribute-based. In Advances in Cryptology—CRYPTO 2013: 33rd Annual Cryptology Conference, Santa Barbara, CA, USA, 18–22 August 2013, Proceedings, Part I; Springer: Berlin/Heidelberg, Germany, 2013; pp. 75–92.
23. Asharov, G.; Jain, A.; López-Alt, A.; Tromer, E.; Vaikuntanathan, V.; Wichs, D. Multiparty computation with low communication, computation and interaction via threshold homomorphic encryption. In Advances in Cryptology—EUROCRYPT 2012: 31st Annual International Conference on the Theory and Applications of Cryptographic Techniques, Cambridge, UK, 15–19 April 2012, Proceedings; Springer: Berlin/Heidelberg, Germany, 2012; pp. 483–501.
24. Fan, J.; Vercauteren, F. Somewhat Practical Fully Homomorphic Encryption. IACR Cryptol. Eprint Arch. 2012, 2012, 144. Available online: https://eprint.iacr.org/2012/144 (accessed on 10 February 2026).

Table 1. Asymptotic Complexity and Feature Comparison.

Scheme	Blind Rotation Cost	Amortized Cost (Per Bit)	Post-PBS Key Switching	Dynamic Support
Static MK-TFHE [10]	$p \cdot n \cdot T_{ext}$	$\approx p \cdot n \cdot T_{ext}$	Required ($n \cdot T_{ks}$)	No
Dynamic MK-TFHE [17]	$p \cdot n \cdot T_{ext}$	$\approx p \cdot n \cdot T_{ext}$	Required ($n \cdot T_{ks}$)	Yes
DMBB-RCB (Ours)	$p \cdot \frac{n}{l} \cdot T_{ext}$	$\approx \frac{p \cdot n}{l \cdot ν} \cdot T_{ext}$	Eliminated	Yes

Table 2. Recommended Parameters for 128-bit Security.

Parameter	Symbol	Set-I (Speed)	Set-II (Capacity)
Block Length	$l$	2	4
LWE Dimension	$n$	650	720
Block Count	$k = n / l$	325	180
RLWE Degree	$N$	1024	2048
RLWE Modulus	$Q$	$2^{25}$	$2^{25}$
Packing Factor	$ν$	1	1024
Gadget Base	$(B_{brk}, B_{ks})$	$(2^{6}, 2^{4})$	$(2^{9}, 2^{6})$
Error Standard Deviations	$(α, β)$	$(2^{-15}, 2^{-25})$	$(2^{-16}, 2^{-28})$


