HHL Algorithm for Tensor-Decomposable Matrices

Pilaszewicz, Cezary; Margraf, Marian

doi:10.3390/quantum7030037

Open AccessArticle

HHL Algorithm for Tensor-Decomposable Matrices

by

Cezary Pilaszewicz

^*

and

Marian Margraf

Department of Mathematics and Computer Science, Freie Universität Berlin, Takustr. 9, 14195 Berlin, Germany

^*

Author to whom correspondence should be addressed.

Quantum Rep. 2025, 7(3), 37; https://doi.org/10.3390/quantum7030037

Submission received: 14 July 2025 / Revised: 6 August 2025 / Accepted: 13 August 2025 / Published: 19 August 2025

Download

Browse Figures

Versions Notes

Abstract

We use the HHL algorithm to retrieve a quantum state holding the algebraic normal form (ANF) of a Boolean function. Unlike the standard HHL applications, we do not describe the cipher as an exponentially big system of equations. Rather, we perform a set of small matrix inversions which correspond to the Boolean Möbius transform. This creates a superposition holding information about the ANF in the form

| A_{f} ⟩ = \frac{1}{C} \sum_{I = 0}^{2^{n} - 1} c_{I} | I ⟩

, where

c_{I}

is the coefficient of the ANF and C is a scaling factor. The procedure has a time complexity of

\tilde{O} (n)

for a Boolean function with n-bit input. We also propose two approaches by which some information about the ANF can be extracted from such a state. Next, we use a similar approach, the Dual Boolean Möbius transform, to compute the preimage under the algebraic transition matrix. We show that such a matrix is well-suited for the HHL algorithm when the attacker gets oracle access in the Q2 setting to the Boolean function.

Keywords:

cryptanalysis; quantum computing; algebraic normal form

1. Introduction

The most successful quantum algorithms rely on the quantum computer’s ability to efficiently switch between different representations of a function. Then, a measurement in the corresponding basis delivers information about some structural property of the function. The most prominent example, a measurement in the Fourier basis, can extract information about the periodicity of the function and allows breaking schemes like the RSA. Similarly, the algorithm proposed in [1] uses the Fourier transform over the

{(Z / 2 Z)}^{n}

to find the parity basis representation [2]. In this paper, we want to follow a similar approach, however, in a different setting. We begin with an easy-to-prepare yet exponentially big quantum state. Next, we perform a quantum basis change, which in a classical setting is costly, and retrieve meaningful information about the function.

The HHL algorithm, proposed in 2009 by [3], is a quantum procedure that computes the preimage of a given input. Unlike the classical approaches, it does not linearly depend on the size of the matrix. Rather, its runtime is defined by the sparseness and condition number of the matrix, and only logarithmically by the matrix size. Since most of the modern ciphers can be described as some form of exponentially big equation system, the idea of using HHL to cryptanalyse ciphers came forward [4] and was quickly followed by other publications. However, the common factor was always the abuse of the logarithmic speed-up in the runtime of the HHL algorithm.

In this paper, we will also tackle an exponentially big input

\vec{b}

. In contrast to the standard approaches of HHL-based cryptanalysis, we will consider matrices of a special form. We prove that, if the matrix M can be decomposed into a tensor product

M = M_{1} \otimes M_{2} \otimes \dots \otimes M_{n}

, then computing the pre-image can be significantly sped up. On classical computers, pre-image computation can be done in polynomial time in regard to the vector’s size. Only in the case when the input

\vec{b}

can likewise be decomposed into the following tensor product:

\begin{matrix} \vec{b} = \vec{b_{1}} \otimes \vec{b_{2}} \otimes \dots \otimes \vec{b_{n}}, \end{matrix}

are we able to obtain a subpolynomial runtime. Here, the sizes of

\vec{b_{i}}

’s must match the sizes of corresponding matrices

M_{i}

for each

i \in 1, \dots, n

. If such a decomposition of

\vec{b}

is unknown, or even impossible, the fastest known classical approximation algorithm is the conjugate gradient method, with a runtime of

O (N s \sqrt{κ} {log}_{2} (\frac{1}{ϵ}))

. In our approach, we do not need a vector decomposition to perform the HHL algorithm, allowing a runtime advantage over classical computers. We summarize this observation in the following Theorem:

Theorem 1.

Let

M = ⨂_{i = 1}^{t} M_{i}

be a horizontal decomposition of a Hermitian matrix M. Further, for all

i = 1, \dots, t

, let

M_{i} \in C^{N_{i} \times N_{i}}

be a Hermitian matrix with

κ_{i}

its condition number and

s_{i}

its sparseness. The time complexity of the vertical HHL from Theorem 2 for the matrix M is:

\begin{matrix} \tilde{O} (t \cdot (max_{i} κ_{i} \cdot s_{i} \cdot poly (log N_{i}))) \end{matrix}

Based on the above Theorem, our approach is next applied to a cryptographically significant matrix corresponding to the Boolean Möbius transform. The transform allows transition between the algebraic normal form (ANF) of a Boolean function and its truth table (TT). We decided to decompose this matrix as, in its composed form, it has high sparsity (one full column). Applying the standard HHL algorithm in such a scenario would be counterproductive, as the runtime would be comparable to the gradient approach (

\tilde{O} (N)

, where N is the number of rows in the matrix. Instead, we decompose the Möbius transform into a tensor product of n small sub-matrices (

2 \times 2

before Booleanization) and achieve a runtime of

\tilde{O} (n)

with

N = 2^{n}

.

By capturing the truth table in the superposition and computing its preimage under the Boolean Möbius transform, we manage to encapsulate the algebraic normal form of a Boolean function f under attack in the quantum register. The result is as follows:

\begin{matrix} | A_{f} 〉 = \frac{1}{\sqrt{hw (A_{f})}} \sum_{I = 0}^{2^{n} - 1} c_{I} | I 〉 \end{matrix}

where

c_{I}

is the binary coefficient of the monomial

x^{I}

.

| A_{f} 〉

can be used to both retrieve the ANF of the function, as well as to estimate its Hamming weight. We also use the technique introduced in [5] to extract information about a Boolean function using the algebraic transition matrices (ATMs).

A standard requirement in quantum computation is the unitarity of evolutions applied to the qubits. We employ the HHL algorithm to allow inversion of non-unitary matrices. The HHL algorithm is a well-established framework suitable for this operation. However, in the Boolean Möbius transform setting, the procedure could be adjusted. As the Möbius transform is self-inverse, we do not need to compute the preimage. Instead, computing the image under the transform delivers the same result.

We introduce the notation in Section 1.2. Section 2 focuses on the HHL algorithm. We estimate the runtime and present how to Booleanize a matrix and how the HHL algorithm is used in cryptanalysis. In Section 3, we discuss our new approach to applying HHL to a tensor product. The final section concludes with the proof of Theorem 1. Section 4 shows how to apply the tensor approach to a specific function—the Boolean Möbius transform. We define the Boolean Möbius transform and its tensor decomposition. We explain why it is better to apply the algorithm to decomposed matrices and how to generate the input quantum state. Finally, we describe the result of our algorithm. In Section 5 we elaborate on the cryptographic use-cases of the Boolean Möbius transform. We show how the transform can be used to analyse the algebraic normal form of a Boolean function.

1.1. State-of-the-Art of HHL in Cryptanalysis

The cryptographic appeal of the HHL algorithm is clear. Most modern ciphers are characterized by complex equation systems needed to describe them. The first attempt was introduced in [4], and used the so-called Macaulay matrix—an exponentially big and sparse matrix which can be solved for the Boolean solution of a polynomial system. The next proposal targeted the NTRU system [6] and used the HHL algorithm to solve a polynomial system with noise [7]. Finally, in [8], the Grain-128 and Grain-128a ciphers were cryptanalysed using techniques based on [4,7]. Ref. [9] suggested improvements upon the above papers, and proved a lower-bound on the condition number of the Macaulay matrix approach. In [10], two systems of equations for AES-128, one over

G F (2)

and another over

G F (2^{8})

based on the BES construction introduced in [11], were analysed under the HHL algorithm.

1.2. Notation

We will shortly introduce the common notation used throughout this paper. A function

f : F_{2}^{n} ⟶ F_{2}

is called a Boolean function. It is known that each Boolean function f has a polynomial representation

f (x) \in F_{2} [x_{1}, \dots, x_{n}]

[2]. For an index set

I \subseteq {1, 2, \dots, n}

, we define

x^{I} : = \prod_{i \in I} x_{i}

. We also define

c_{I} \in F_{2}

as the coefficient of the term

x^{I}

, and will sometimes switch to the notation where the index is the decimal number defined by the binary interpretation of I. For a Boolean function

f (x) = \sum_{I \in [n]} c_{I} x^{I}

, the Algebraic Normal Form (ANF) of f is

A_{f} : = (c_{0}, \dots, c_{2^{n} - 1})

. The representation of f in the ANF is unique and defines the function f. Moreover, we can compute the coefficients

c_{I}

’s as follows:

c_{I} = \sum_{supp (x) \subseteq I} f (x),

(1)

where

supp (x) : = {i : x_{i} \neq 0}

[2]. In other words, we can compute the coefficient

c_{I}

by adding (over

F_{2}

) the images of all inputs dominated by I (cf. to precursor sets defined in [5]).

The Hamming weight of a Boolean vector

x \in F_{2}^{n}

is denoted as

hw (x)

, and it is the number of non-zero entries. The degree of a coefficient can be computed using the Hamming weight:

\begin{matrix} d e g (c_{I}) = hw (I) \end{matrix}

The truth table of a Boolean function f is a binary vector

T_{f} \in F_{2}^{2^{n}}

defined as

T_{f} = (f (0), f (1), \dots, f (2^{n} - 1))

.

A vectorial Boolean function is defined as

F : {0, 1}^{n} \to {0, 1}^{m}

for

n, m \in N

. Further, for all

0 \leq i \leq m - 1

,

F_{i} : {0, 1}^{n} \to {0, 1}

is a Boolean function and the projection of F to the ith component. A projection of a string s to its i’th component will be denoted by

{s |}_{i}

.

By

H_{1} = \frac{1}{\sqrt{2}} (\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix})

we denote the Hadamard gate, and by

H_{n} = ⨂_{i = 1}^{n} H_{1}

its n-fold tensor product. The E is an identity matrix. In the cases where the size of the identity matrix is not explicitly stated, the index j indicates the size of the matrix

E_{j} \in C^{2^{j} \times 2^{j}}

.

2. HHL Algorithm

The HHL algorithm, named after the authors Harrow, Hassidim, and Lloyd, is one of the new tools in the quantum toolbox used for cryptanalysis. It was first proposed in [3] as an efficient algorithm to solve exponentially big systems of linear equations, outperforming any classical algorithm. Its cryptographic significance was quickly discovered, and [4] proposed the first attack scenario. Since then, a series of cryptographic publications have considered the HHL setting.

We introduce two problems closely related to the HHL algorithm:

Problem 1.

Given a Hermitian matrix

M \in C^{2^{n} \times 2^{n}}

and an input state

\vec{b} \in C^{2^{n}}

with

\vec{b} = (b_{0}, \dots, b_{2^{n} - 1})

find the state

\vec{x}

such that:

\begin{matrix} M \cdot \vec{x} = \vec{b} \end{matrix}

We can also reframe this into the quantum setting as follows:

Problem 2.

Given a Hermitian matrix

M \in C^{2^{n} \times 2^{n}}

and an input state

\begin{matrix} | b 〉 = \sum_{i = 0}^{2^{n} - 1} b_{i} | i 〉 \end{matrix}

with

\vec{b} = (b_{0}, \dots, b_{2^{n} - 1}) \in C^{2^{n}}

find the state

| x 〉 = \sum_{i = 0}^{2^{n} - 1} x_{i} | i 〉

such that:

\begin{matrix} M \cdot \vec{x} = \vec{b} \end{matrix}

with

\vec{x} = (x_{0}, \dots, x_{2^{n} - 1}) \in C^{2^{n}}

.

The HHL algorithm solves Problem 2. To compute the pre-image of

\vec{b}

, we need to represent the vector as a quantum register

| b 〉 = \sum_{i = 0}^{N - 1} b_{i} | i 〉

. Then, M is transformed into a unitary operator

e^{i M n}

, and the operator is applied to

| b 〉

using the Hamiltonian simulation technique [12]. This is equivalent to decomposing

| b 〉

to an eigenbasis of M and determining the corresponding eigenvalues. The register after this transformation is in the following state:

\begin{matrix} \sum_{j = 0}^{N - 1} β_{j} | u_{j} 〉 | λ_{j} 〉 \end{matrix}

where

| b 〉 = \sum_{j = 0}^{N - 1} β_{j} | u_{j} 〉

,

{u_{j}}_{j = 0, \dots, N - 1}

is the eigenbasis and

λ_{j}

’s are the corresponding eigenvalues. Using an additional ancilla register, we perform a rotation conditioned on the value of

λ_{j}

to obtain the following state:

\begin{matrix} \sum_{j = 0}^{N - 1} β_{j} | u_{j} 〉 | λ_{j} 〉 \overset{ancilla register}{\overset{⏞}{(\sqrt{1 - \frac{C^{2}}{λ_{j}^{2}}} | 0 〉 + \frac{C}{λ_{j}} | 1 〉)}} \end{matrix}

We finish with a measurement of the ancilla register. If

| 1 〉

is observed, the eigenvalues

| λ_{j} 〉

of the matrix M are inverted, and the register ends in the following state:

C \cdot \sum_{j = 0}^{N - 1} β_{j} λ_{j}^{- 1} | u_{j} 〉 = M^{- 1} | b 〉 = | x 〉

(2)

2.1. HHL Runtime

In [3], the authors show how to run the HHL algorithm for a matrix

M \in C^{N \times N}

in time

\tilde{O} (κ^{2} \cdot s^{2} \cdot poly (log N))

, where s is the sparseness of the matrix M,

κ

is its condition number, and N is the size. The

\tilde{O} (\cdot)

notation suppresses all slowly-growing parameters (logarithmic with respect to

s, κ

or

log N

). The algorithm requires that M is either efficiently row-computable, or an access to the oracle which returns the indices of non-zero matrix entries for each row. The phase estimation (PE) step is responsible for a significant portion of the runtime of the HHL. For an s-sparse matrix, the PE takes

\tilde{O} (log N s^{2})

computations. We observe that, after the rotation conditioned on

λ_{j}

, we have only a certain probability of observing the ancilla register in a state

| 1 〉

. However, since the constant C from Equation (2) is of magnitude

O (1 / κ)

, with

O (κ)

applications of the amplitude amplification algorithm [13] we can lower the probability of the undesired measurement to a negligible value. Ref. [14] improved the HHL algorithm by lowering the condition number dependence from quadratic to almost linear. Ref. [15] proposed a further improvement to achieve the runtime as follows:

\begin{matrix} \tilde{O} (κ \cdot s \cdot poly (log N)) \end{matrix}

This will be the complexity that we will use in this paper. Nevertheless, since the values

s, κ

of the matrices we propose are constant, the choice of the implementation should not have much influence on the feasibility of the algorithm we develop.

2.2. HHL over Finite Fields

The HHL algorithm in its initial form is defined for matrices and vectors over

C

. Since most of the cryptographic constructions use finite field arithmetic, the question of whether HHL can be adapted to this setting came forward. In this part, we want to present a set of rules which describe how to transform an equation system M over a finite field

F_{2}

to an equation system over

C

as required by the HHL algorithm. Most of the reductions were proposed in [7]. The result will be an equation system over

C

which shares the solution with the original system. We will call the resulting system the Booleanization of M, and the

C

-matrix corresponding to the system a Booleanized matrix. The total sparseness of a matrix is the number of non-zero entries in all of its rows and columns.

Reduction from Boolean Equations to Equations over $C$

For a given set of Boolean functions

F = {f_{1}, \dots, f_{m}} \subseteq F_{2} [X]

with n variables, we want to find a system

F_{C} = {f_{1}^{'}, \dots, f_{m^{'}}^{'}} \subseteq C [X, Z]

, such that when

(\hat{X}, \hat{Z})

is a solution to

F_{C} = 0

, then

\hat{X}

is a solution of

F = 0

. To do this, we will define a new set of variables

Z = {z_{1}, \dots, z_{m}}

and

m + n

new quadratic equations as follows:

\begin{matrix} f_{i} (x_{1}, \dots, x_{n}) - z_{i} = 0 & \forall i = 1, \dots, m \end{matrix}

(3)

\begin{matrix} z_{i} / 2 \in Z & \forall i = 1, \dots, m \end{matrix}

(4)

\begin{matrix} x_{j} - x_{j}^{2} = 0 & \forall j = 1, \dots, n \end{matrix}

(5)

The first two sets of equations guarantee that the value of

f_{i} (\hat{X}) = 0 mod 2

. This represents the Boolean addition logic in the original system

F

. The last equation guarantees that the only possible values of

x_{1}, \dots, x_{n}

are from

F_{2}

. An important observation here is also that

\forall i = 1, \dots, m 0 \leq z_{i} \leq # f_{i}

.

Next, we need to present

z_{i}

’s in a form usable by the quantum computer. Each

z_{i}

will be represented in its binary form with

log (# f_{i})

bits:

\begin{matrix} z_{i} = \sum_{b = 0}^{log # f_{i}} 2^{b} z_{i_{b}} \end{matrix}

However, since Equation (4) holds, we know

\forall 1 \leq i \leq m : z_{i_{0}} = 0

. This means we actually only consider the following representation:

\begin{matrix} z_{i} = \sum_{b = 1}^{log # f_{i}} 2^{b} z_{i_{b}} \end{matrix}

We need to incorporate this into the equation system mentioned above in the following way:

\begin{matrix} f_{i} (x_{1}, \dots, x_{n}) - \sum_{b = 1}^{log # f_{i}} 2^{b} z_{i_{b}} = 0 & \forall i = 1, \dots, m \\ z_{i_{b}} - z_{i_{b}}^{2} = 0 & \forall i = 1, \dots, m, \forall b = 1, \dots, log # f_{i} \\ x_{j} - x_{j}^{2} = 0 & \forall j = 1, \dots, n \end{matrix}

Similarly, as in (5), the equation

z_{i_{b}} - z_{i_{b}}^{2} = 0

forces the values of

z_{i_{b}} \in F_{2}

. The additional equations mentioned in this section increase the number of needed variables from n to

n + m \cdot log # f_{i}

and the number of equations from m to

m \cdot (1 + log # f_{i}) + n

. Thus, in the

\tilde{O}

notation, the runtime of the HHL remains the same for

n \sim m

.

Finally, the HHL algorithm requires the input matrix to be Hermitian. The resulting Booleanized matrix corresponding to the system

F_{C}

is not. Luckily, we can use the procedure proposed by the original authors of the HHL algorithm to cover this case [3]. The technique uses the property that for an arbitrary matrix M that is not Hermitian, we can construct a new matrix

C_{M}

:

\begin{matrix} C_{M} : = (\begin{matrix} 0 & M \\ M^{†} & 0 \end{matrix}), \end{matrix}

The newly constructed matrix

C_{M}

will be Hermitian and a valid HHL input. Here, the

M^{†}

is the conjugate transpose of M. We finish this section with a Lemma characterizing the setting and the runtime of the HHL algorithm:

Lemma 1.

([3,15]). Let

M \in C^{2^{n} \times 2^{n}}

be a Hermitian matrix and

| b 〉 \in C^{2^{n}}

be a quantum state. We denote the application of the HHL with the matrix M to a state

| b 〉

as:

\begin{matrix} H H L_{M} (| b 〉) = | x 〉 \end{matrix}

The result

| x 〉

is a quantum state which solves Problem 2 and the procedure takes

\tilde{O} (κ \cdot s \cdot p o l y (log (N)))

time.

3. HHL for Tensors of Matrices

Usually, when considering the use-case of HHL, we want to leverage the exponential speed-up in regard to the size of the matrix. This is the natural direction, as that is the obvious runtime difference between the HHL and the classical approach. The hindrance is, however, that, even though the matrix can be exponentially big, it should not have too many entries. In fact, a matrix with a single full column eliminates the potential use of the HHL (cf. Section 2.1).

In this section, we present an approach to overcome the above-mentioned obstacles for a special set of matrices. We prove that if for a matrix M we are given a tensor decomposition

M = M_{1} \otimes M_{2} \otimes \dots \otimes M_{n}

, where each

M_{i}

is again Hermitian and has a small size, applying the HHL approach to each

M_{i}

has a low runtime and delivers the same result. In some cases, our algorithm computes the preimage of a quantum state

| b 〉

under the matrix M exponentially faster than the standard HHL approach. A similar technique is a building block of known quantum algorithms, among others, Shor’s or Simon’s algorithm. For these two algorithms, instead of applying a matrix

H_{n}

to an n-long register

| x 〉

, we apply

H_{1}

to each of n qubits of

| x 〉

.

We start with a simple Lemma saying that applying a vertical/tensor (cf. Figure 1) decomposition of a unitary matrix

U = ⨂_{i = 1}^{t} U_{i}

to quantum sub-registers results in the same state as applying the matrix U to the whole register:

Lemma 2.

Let

U_{1}, \dots, U_{t}

be unitary matrices with

U_{i} \in C^{n_{i} \times n_{i}}

. Further, let

| x_{1} 〉, \dots, | x_{t} 〉

be the corresponding quantum states with

| x_{i} 〉 \in C^{n_{i}}

. Then:

\begin{matrix} (⨂_{i = 1}^{t} U_{i}) \cdot (⨂_{i = 1}^{t} | x_{i} 〉) = ⨂_{i = 1}^{t} (U_{i} \cdot | x_{i} 〉) . \end{matrix}

This result is unsurprising and is the reason for most quantum algorithms’ efficiency. However, we will generalize the argument to show that this equality also holds for non-unitary matrices.

Lemma 3.

Let

M_{1}, \dots, M_{t}

be matrices and

E_{1}, \dots, E_{t}

be identity matrices with

M_{i}, E_{i} \in C^{n_{i} \times n_{i}}

and

M = ⨂_{i = 1}^{t} M_{i}

and

| x 〉 \in C^{\prod_{i = 0}^{t} n_{i}}

. Then:

\begin{matrix} M \cdot | x 〉 = \prod_{j = 1}^{t} (⨂_{i = 1}^{j - 1} E_{i} \otimes M_{j} \otimes ⨂_{i = j + 1}^{t} E_{i}) \cdot | x 〉 \end{matrix}

Proof.

Since both the product and the tensor of two matrices are matrices again, it suffices to show this property for two matrices. The rest follows from the associativity of matrix products. Let

M_{1}, E_{1} \in C^{n_{1} \times n_{1}}

,

M_{2}, E_{2} \in C^{n_{2} \times n_{2}}

. Further, let

M = M_{1} \otimes M_{2}

. Then, by standard tensor properties, we know the following:

\begin{matrix} (M_{1} \otimes E_{2}) \cdot (E_{1} \otimes M_{2}) & = (M_{1} \cdot E_{1}) \otimes (M_{2} \cdot E_{2}) \\ = M_{1} \otimes M_{2} \\ = M \end{matrix}

Since

(M_{1} \otimes E_{2}) \cdot (E_{1} \otimes M_{2})

is indifferent to M, it does not make a difference which one we apply to a vector

| x 〉

. □

Again, this is a simple fact from linear algebra, but it will constitute a major stepping stone for our algorithm. We want to highlight that the vector

M \cdot | x 〉

from Lemma 3 does not have to be a valid quantum state even if

| x 〉

is. Since the matrix M is not unitary, it means the result does not have to be normed.

We will now use the observation from Lemma 3 to build a procedure to solve Problem 2 for a matrix

M \in C^{2^{n} \times 2^{n}}

with the property

M = ⨂_{i = 1}^{t} M_{i}

. Instead of applying HHL to the whole matrix M, we will consider its vertical decomposition and apply HHL to the sub-matrices. We highlight that finding such a decomposition is an NP-hard problem [16], but we assume it is given to us. The idea is depicted in Figure 2. The resulting state will be identical to that of HHL applied to the matrix M.

We begin with a Lemma showing that, for a simple-structured tensor product (a tensor product of mostly diagonal matrices), applying the HHL algorithm on the combined state or the horizontal decomposition delivers the same result and has almost identical runtime.

Lemma 4.

Let

M \in C^{N \times N}

be a Hermitian matrix and

H H L_{M}

be the HHL algorithm from Lemma 1. Further, let M have the following structure:

\begin{matrix} M = ⨂_{i = 1}^{j - 1} E_{i} \otimes M_{j} \otimes ⨂_{i = j + 1}^{t} E_{i} \end{matrix}

for a Hermitian

M_{j}

. Here,

E_{i}

’s are identity matrices of appropriate dimension. Then, for a given input

| b 〉

, the following equality holds:

\begin{matrix} H H L_{M} | b 〉 = ⨂_{i = 1}^{j - 1} E_{i} \otimes H H L_{M_{j}} \otimes ⨂_{i = j + 1}^{t} E_{i} | b 〉 \end{matrix}

Proof.

For a matrix

M = ⨂_{i = 1}^{j - 1} E_{i} \otimes M_{j} \otimes ⨂_{i = j + 1}^{t} E_{i}

, the HHL algorithm first uses the Hamiltonian simulation technique to apply

e^{i M n}

to the register

| b 〉

. Observe that we can rewrite M as follows:

\begin{matrix} M & = ⨂_{i = 1}^{j - 1} E_{i} \otimes M_{j} \otimes ⨂_{i = j + 1}^{t} E_{i} \\ = E_{1} \otimes M_{j} \otimes E_{2} \end{matrix}

for

E_{1} = ⨂_{i = 1}^{j - 1} E_{i}

and

E_{2} = ⨂_{i = j + 1}^{t} E_{i}

. Then, the following equality holds:

\begin{matrix} e^{M} & = e^{E_{1} \otimes M_{j} \otimes E_{2}} \\ = \sum_{j = 0}^{\infty} \frac{{(E_{1} \otimes M_{j} \otimes E_{2})}^{j}}{j!} \\ = {(E_{1} \otimes M_{j} \otimes E_{2})}^{0} + (E_{1} \otimes M_{j} \otimes E_{2}) + \frac{1}{2} {(E_{1} \otimes M_{j} \otimes E_{2})}^{2} + \dots \\ = (E_{1}^{0} \otimes M_{j}^{0} \otimes E_{2}^{0}) + (E_{1} \otimes M_{j} \otimes E_{2}) + \frac{1}{2} (E_{1}^{2} \otimes M_{j}^{2} \otimes E_{2}^{2}) + \dots \\ = (E_{1} \otimes M_{j}^{0} \otimes E_{2}) + (E_{1} \otimes M_{j} \otimes E_{2}) + (E_{1} \otimes \frac{1}{2} M_{j}^{2} \otimes E_{2}) + \dots \\ = E_{1} \otimes (M_{j}^{0} + M_{j} + \frac{1}{2} M_{j}^{2} + \dots) \otimes E_{2} \\ = E_{1} \otimes e^{M_{j}} \otimes E_{2} \end{matrix}

As the remaining steps of the HHL algorithm do not depend on the matrix M, the claim follows. □

Because of the simple structure of M from Lemma 4, the runtime of

H H L_{M}

will be

\tilde{O} (κ_{M_{j}} \cdot s_{M_{j}} \cdot p o l y (log N))

. This accounts for an overhead of

\tilde{O} (log N)

when compared to

H H L_{M_{j}}

(under the assumption that

E_{i}

’s and

M_{j}

are of relatively similar size).

Theorem 2.

Let

M = ⨂_{i = 1}^{t} M_{i}

be a vertical decomposition of a Hermitian M. Given

M_{i}

’s are Hermitian matrices as well, let

H H L_{M}

and

H H L_{M_{i}}

be the unitary HHL algorithm circuit for a matrix M and

M_{i}

, respectively. Then for all quantum states

| b 〉

:

\begin{matrix} H H L_{M} | b 〉 = \prod_{j = 1}^{t} (⨂_{i = 1}^{j - 1} E_{i} \otimes H H L_{M_{j}} \otimes ⨂_{i = j + 1}^{t} E_{i}) | b 〉 \end{matrix}

and the result is a valid quantum state

| x 〉

from Problem 2.

Proof.

Let

H H L_{M}

and

H H L_{M_{i}}

be the unitary circuit which implements the application of the HHL algorithm with input matrix M and

M_{i}

, respectively. Then, by the Lemmas 3 and 4, the following constructions are equal:

\begin{matrix} H H L_{M} & = \prod_{j = 1}^{t} (H H L_{(⨂_{i = 1}^{j - 1} E_{i} \otimes M_{j} \otimes ⨂_{i = j + 1}^{t} E_{i})}) \\ = \prod_{j = 1}^{t} (⨂_{i = 1}^{j - 1} E_{i} \otimes H H L_{M_{j}} \otimes ⨂_{i = j + 1}^{t} E_{i}) \end{matrix}

Further, since

\prod_{j = 1}^{t} (⨂_{i = 1}^{j - 1} E_{i} \otimes H H L_{M_{j}} \otimes ⨂_{i = j + 1}^{t} E_{i})

is a valid quantum circuit, the output of the algorithm is a proper quantum state. Finally, as described in Section 2, the output of the algorithm is a vector

| x 〉

such that the following is true:

\begin{matrix} A \cdot \vec{x} = \vec{b} \end{matrix}

This is exactly the solution to Problem 2. □

As the vertical construction is well-parallelizable, the runtime of the vertical HHL will be upper bound by the complexity of the most time-intensive inversion.

Theorem 3.

Let

M = ⨂_{i = 1}^{t} M_{i}

be a horizontal decomposition of a Hermitian matrix M. Further, for all

i = 1, \dots, t

, let

M_{i} \in C^{N_{i} \times N_{i}}

be a Hermitian matrix with

κ_{i}

its condition number and

s_{i}

its sparseness. The time complexity of the vertical HHL from Theorem 2 for the matrix M is:

\begin{matrix} \tilde{O} (t \cdot (max_{i} κ_{i} \cdot s_{i} \cdot poly (log N_{i}))) \end{matrix}

Proof.

The time complexity of applying each

H H L_{M_{i}}

on a subregister is upper-bound by applying the most cost-intensive one. As presented in Section 2.1, this corresponds to a time complexity as follows:

\begin{matrix} \tilde{O} (max_{i} κ_{i} \cdot s_{i} \cdot poly (log N_{i})) \end{matrix}

(6)

Further, applying

E_{i}

to any subregister is equivalent to not evolving the register in any way. Therefore in each step, we apply

H H L_{M_{i}}

to one of the subregisters and leave the other registers constant. Since we need to invert t sub-matrices, we have t steps, each upper-bound by Equation (6). This results in the runtime as follows:

\begin{matrix} \tilde{O} (t \cdot (max_{i} κ_{i} \cdot s_{i} \cdot poly (log N_{i}))) \end{matrix}

In our analysis, we implicitly hide the fact that, as mentioned in Section 2.1, in the HHL algorithm we need to perform a measurement of the ancilla register which, with a certain probability, might fail. However, using the amplitude amplification we can boost the probability of measuring the correct state to be almost 1 in

O (κ_{i})

steps for each HHL-subroutine. In [13], the authors describe a method how to slow the rotation down, when the amplitude of the correct state is big enough. In fact, for each subroutine, with

O (\max_{i} κ_{i})

steps we can obtain a probability of measuring the incorrect state upper-bound by

\frac{1}{2 t}

. Then, for the state

| x 〉

, which is the result of the vertical HHL algorithm, the probability of measuring at least a single

| 0 〉

can be lower-bound by the following:

\begin{matrix} Pr (\frac{M | x 〉}{〈 x | M^{†} M | x 〉} \neq {| 1 〉}^{\otimes t}) & \leq \sum_{i = 1}^{t} Pr (\frac{M | x 〉}{〈 x | M^{†} M | x 〉} \neq (⨂_{j = 1}^{i - 1} | 1 〉) | 0 〉 (⨂_{j = i + 1}^{t} | 1 〉)) \\ \leq \frac{1}{2 t} + \dots + \frac{1}{2 t} \\ = \frac{1}{2} \end{matrix}

Therefore, with at least

\frac{1}{2}

probability the vertical HHL algorithm will succeed. It should be noted that the ancilla registers indicate when one of the matrix inversions has failed. □

Finally, we want to highlight that, in our considerations, all the subroutines

H H L_{M_{j}}

are not executed at the exact same time. Instead, they are sequentially applied to each of the registers. We believe that simultaneously applying all the routines would not drastically change the result and could lead to a speed-up factor of

\tilde{O} (t)

compared to Theorem 3. We keep it as future research to formally prove the claim.

4. HHL for Möbius Transform

In this section, we introduce the Boolean Möbius transform, a building block of many cryptanalytical tools. Further, we present how the method from Section 3 can be used for the Möbius transform.

4.1. Möbius Transform of Boolean Functions

The Boolean Möbius transform is the mapping

μ : F_{2}^{2^{n}} \to F_{2}^{2^{n}}

. It acts as the bijective transform between the ANF of a Boolean function f, and its truth table

T_{f}

. It is defined in the following way:

\begin{matrix} \forall x \in F_{2}^{n} : f (x) = ⨁_{I \in F_{2}^{n}} μ (f) (I) x^{I}, \end{matrix}

where

μ (f)

is the Boolean function defined through the coefficients [17] (cf. Equation (1)). For a Boolean function f, we can compute its Möbius transform in terms of matrices.

Claim 1.

Let

T_{n}

be a Boolean matrix recursively defined as:

1.: $T_{1} = (\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix})$
2.: $T_{n} = (\begin{matrix} T_{n - 1} & O_{n - 1} \\ T_{n - 1} & T_{n - 1} \end{matrix})$

where

O_{n - 1}

is a

2^{n - 1} \times 2^{n - 1}

0-matrix.

For a Boolean function f,

T_{n}

can serve as the Möbius transform acting on the truth table

T_{f}

and the corresponding ANF as follows:

$μ (A_{f}) = T_{f}$
$μ (T_{f}) = A_{f}$ .

A direct consequence is that

T_{n}

is self-inverse.

The above claim is thoroughly discussed in [18,19]. As a consequence, we can describe the Möbius transform as a tensor product of polynomial terms:

Lemma 5.

We can describe

T_{n}

using the tensor product as:

\begin{matrix} T_{n} = T_{1} \otimes T_{n - 1} \end{matrix}

Proof.

Using the definition of tensor products for matrices, we have the following:

\begin{matrix} T_{1} \otimes T_{n - 1} & = (\begin{matrix} 1 \cdot T_{n - 1} & 0 \cdot T_{n - 1} \\ 1 \cdot T_{n - 1} & 1 \cdot T_{n - 1} \end{matrix}) \\ = (\begin{matrix} T_{n - 1} & O_{n - 1} \\ T_{n - 1} & T_{n - 1} \end{matrix}) \\ = T_{n} \end{matrix}

□

Lemma 6.

Let

T_{f} \in F_{2}^{2^{n}}

be the truth table of a Boolean function f. Then we can compute the algebraic normal form of f as:

\begin{matrix} A_{F} = T_{n} T_{f} = \prod_{j = 1}^{t} (⨂_{i = 1}^{j - 1} E_{1} \otimes T_{1} \otimes ⨂_{i = j + 1}^{t} E_{1}) T_{f} \end{matrix}

Proof.

By Claim 1 and Lemma 5, we know

T_{n}

can be described as follows:

\begin{matrix} T_{n} = ⨂_{i = 1}^{n} T_{1} \end{matrix}

Further, by Lemma 3, we know that the constructions are equivalent:

\begin{matrix} \prod_{j = 1}^{t} (⨂_{i = 1}^{j - 1} E_{1} \otimes T_{1} \otimes ⨂_{i = j + 1}^{t} E_{1}) T_{f} = (⨂_{i = 1}^{n} T_{1}) T_{f} = T_{n} T_{f} \end{matrix}

Finally, by using Claim 1, we know that

T_{n} T_{f} = A_{f}

proving the statement. □

Example 1.

Let

f (X_{1}, X_{2}) = X_{1} \oplus X_{1} X_{2}

. Then, the truth table of f is

T_{f} = (\begin{matrix} 0 \\ 1 \\ 0 \\ 0 \end{matrix})

. We will compute the ANF of f using the procedure mentioned in Lemma 6:

\begin{matrix} T_{2} T_{f} & = (T_{1} \otimes E_{1}) \cdot (E_{1} \otimes T_{1}) T_{f} \\ = (\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \end{matrix}) \cdot (\begin{matrix} 1 & 0 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 1 \end{matrix}) (\begin{matrix} 0 \\ 1 \\ 0 \\ 0 \end{matrix}) \\ = (\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \end{matrix}) (\begin{matrix} 0 \\ 1 \\ 0 \\ 0 \end{matrix}) \\ = (\begin{matrix} 0 \\ 1 \\ 0 \\ 1 \end{matrix}) \\ = A_{f} \end{matrix}

We highlight that the formula for the ANF of f from Lemma 6 resembles the formula mentioned in Theorem 2. We will next show that by using the HHL algorithm we can mimic the application of

T_{n}

and the result will be a quantum state which represents the

A_{f}

.

4.2. Möbius Transform Using HHL

In this section, we show how to tackle a specific initialization of Problem 2. We will compute the preimage of the matrix

T_{n}

describing the Möbius transform.

Problem 3.

Given a matrix

T_{n} \in F_{2}^{2^{n} \times 2^{n}}

defined in Claim 1, and an input state

\begin{matrix} | b 〉 = \sum_{i = 0}^{2^{n} - 1} b_{i} | i 〉 \end{matrix}

with

\vec{b} = (b_{0}, \dots, b_{2^{n} - 1}) \in C^{2^{n}}

, find the state

| x 〉 = \sum_{i = 0}^{2^{n} - 1} x_{i} | i 〉

such that:

\begin{matrix} T_{n} \cdot \vec{x} = \vec{b} \end{matrix}

As we have seen in Section 2, inverting the matrix

T_{n}

would solve Problem 3. However, we cannot simply apply the HHL algorithm for matrix

T_{n}

to an arbitrary register

| b 〉

. First, observe that

T_{n}

is defined as an

F_{2}

-matrix, while HHL operates over

C

. Instead, we need to use the reduction mentioned in Section 2.2 to create a different matrix

Γ_{n}

, which delivers the same result as

T_{n}

over

F_{2}

. We need additional variables and equations to force the solution to be an

F_{2}

-vector and ensure that the addition modulo 2 is correct. It must be noted that this procedure will expand the size of the matrix only polynomially.

Second, the HHL algorithm requires the matrix to be inverted to be Hermitian. The resulting matrix

Γ_{n}

is not. We can use the procedure already mentioned in Section 2 by [3]. The main idea is that, for an arbitrary matrix M that is not Hermitian, we can construct a new matrix

C_{M}

:

\begin{matrix} C_{M} : = (\begin{matrix} 0 & M \\ M^{†} & 0 \end{matrix}), \end{matrix}

and the matrix

C_{M}

will be Hermitian. Since, for our target matrix

Γ_{n}

, all the entries are real, it is clear that

Γ_{n}^{†} = Γ_{n}^{T}

. No conjugation needs to be calculated to provide oracle access to

Γ_{n}^{†}

. Instead, we now also need the column-oracle of the matrix (cf. Section 2.1). The reason for this is the observation that the bottom rows of

C = (\begin{matrix} 0 & Γ_{n} \\ Γ_{n}^{†} & 0 \end{matrix})

are in fact the rows of

Γ_{n}^{†} = Γ_{n}^{T}

, which are actually the columns of

Γ_{n}

. For matrices of exponentially big sparsity, this is an additional requirement.

Further, the input vector must be adjusted. While, for

Γ_{n}

, the input was a vector of the form

\vec{b}

, for the new matrix C we need the vector

\tilde{b} : = (\begin{matrix} \vec{b} \\ 0 \end{matrix})

. Also, the output will have a different form,

\tilde{x} : = (\begin{matrix} 0 \\ \vec{x} \end{matrix})

, due to the following:

\begin{matrix} C \cdot \tilde{x} & = (\begin{matrix} 0 & Γ_{n} \\ Γ_{n}^{†} & 0 \end{matrix}) \cdot (\begin{matrix} 0 \\ \vec{x} \end{matrix}) \\ = (\begin{matrix} Γ_{n} \cdot \vec{x} \\ 0 \end{matrix}) \\ = (\begin{matrix} \vec{b} \\ 0 \end{matrix}) \\ = \tilde{b} \end{matrix}

(7)

There is one problem when we try to apply this approach to the matrix

T_{n}

. The sparsity plays a crucial role in the runtime of the HHL. As the Boolean Möbius transform always has one full column, the sparseness of

T_{n}

is exponential:

\begin{matrix} s (T_{n}) = 2^{n} . \end{matrix}

We will overcome the runtime implications of the high sparsity of

T_{n}

as input to HHL using the decomposition technique.

4.3. Booleanized $T_{n}$ vs. Booleanized $T_{1}$

To overcome the exponentially big runtime of

H H L_{T_{n}}

, we will instead call the HHL with the Boolean Möbius transform for each single qubit. Again, we want to highlight that the HHL algorithm requires a Hermitian matrix, which neither

T_{1}

nor

T_{n}

are. In the construction above, we Booleanized the matrix

T_{n}

to guarantee that the addition is performed modulo 2 and the solution is an

F_{2}

-vector. However, it is not guaranteed that the matrix

Γ_{n}

can be decomposed into a tensor of small matrices.

Instead, for the vertical HHL, we will first decompose the matrix

T_{n}

, and then perform the Booleanization on each sub-matrix

T_{1}

. Refs. [7,8] mentioned a set of additional or alternative equations, which when incorporated into the equation system, allow

F_{2}

-computation. In the example below, we present one candidate for such a matrix.

Example 2.

Using the techniques mentioned in Section 2.2, we contruct a matrix

Γ_{1}^{'} \in C^{5 \times 5}

. For

a, b \in F_{2}

, the preimage under

Γ_{1}^{'}

of the vector

\vec{b} = (\begin{matrix} b_{0} \\ b_{1} \\ 0 \\ 0 \\ 0 \end{matrix})

has the form:

\begin{matrix} \vec{x} = (\begin{matrix} T_{1}^{- 1} (\begin{matrix} b_{0} \\ b_{1} \end{matrix}) |_{0} \\ T_{1}^{- 1} (\begin{matrix} b_{0} \\ b_{1} \end{matrix}) |_{1} \\ * \\ * \\ * \end{matrix}) = (\begin{matrix} T_{1}^{- 1} (\begin{matrix} b_{0} \\ b_{1} \end{matrix}) \\ * \\ * \\ * \end{matrix}) \end{matrix}

We define

Γ_{1}^{'}

as:

\begin{matrix} Γ_{1}^{'} = (\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ - 2 & - 2 & 2 & 1 & 1 \\ 1 & 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 & 1 \\ 0 & 1 & 0 & 0 & - 1 \end{matrix}) \end{matrix}

(8)

such, that:

\begin{matrix} Γ_{1}^{'} (\begin{matrix} 0 \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix}) = (\begin{matrix} 0 \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix}), & Γ_{1}^{'} (\begin{matrix} 1 \\ 0 \\ 1 \\ 1 \\ 0 \end{matrix}) = (\begin{matrix} 1 \\ 1 \\ 0 \\ 0 \\ 0 \end{matrix}), & Γ_{1}^{'} (\begin{matrix} 0 \\ 1 \\ 1 \\ 0 \\ 1 \end{matrix}) = (\begin{matrix} 0 \\ 1 \\ 0 \\ 0 \\ 0 \end{matrix}), & Γ_{1}^{'} (\begin{matrix} 1 \\ 1 \\ 1 \\ 1 \\ 1 \end{matrix}) = (\begin{matrix} 1 \\ 0 \\ 0 \\ 0 \\ 0 \end{matrix}) \end{matrix}

The construction uses a handful of extra qubits for each HHL application, however, the number of needed qubits will stay polynomial in terms of the size of the register holding the state

| b 〉

. The matrix must be Hermitianized before being plugged into the HHL (see Equation (7)). The result will be a matrix

Γ_{1}

. We want to highlight that the sparsity in Example 2 does not depend on the number n:

\begin{matrix} s (Γ_{1}) = 5 \end{matrix}

and the condition number

κ_{Γ_{1}}

is constant.

With the machinery in place, we can solve Problem 3 using the algorithm from Theorem 2. Further, we can estimate the runtime of the algorithm:

Theorem 4.

Let

T_{n} = ⨂_{i = 1}^{n} T_{1}

be a horizontal decomposition of

T_{n}

. Further, let

Γ_{1} \in C^{O (1) \times O (1)}

be the Booleanized version of

T_{1}

. Then, using the vertical HHL algorithm with input

Γ_{1}

, for an arbitrary quantum state

| b 〉

, we can solve Problem 3 in

\tilde{O} (n)

time.

Proof.

As seen in Theorem 1, for a matrix decomposition

M = ⨂_{i = 1}^{t} M_{i}

, the runtime is as follows:

\begin{matrix} \tilde{O} (t \cdot (max_{i} κ_{i} \cdot s_{i} \cdot poly (log N_{i}))) \end{matrix}

In our case, all

M_{i}

’s correspond to the Booleanized version of the

T_{1}

matrix. We can use the techniques mentioned in Section 2.2 to create a Booleanized

Γ_{1}

. All the reduction steps keep the sparsity, the condition number, and the size of the resulting matrix polynomial in terms of the size of

T_{1}

[7]. Let

s_{Γ_{1}}, κ_{Γ_{1}}, N_{Γ_{1}}

be the sparsity, condition number, and size of the matrix

Γ_{1}

, respectively. Plugging the values into the upper-bound, we obtain the collected runtime for the algorithm:

\begin{matrix} \tilde{O} (t \cdot (max_{i} κ_{i} \cdot s_{i} \cdot poly (log N_{i}))) = \tilde{O} (n \cdot (κ_{Γ_{1}} \cdot s_{Γ_{1}} \cdot log (N_{Γ_{1}}))) = \tilde{O} (n) \end{matrix}

Keeping in mind that Problem 3 is an initialization of Problem 2, as well as Theorem 4 is an initialization of Theorem 2, the claim follows. □

4.4. How to Generate Input

We propose two approaches to generate the input

| b 〉

as described in Problem 3. They will consider two different settings and target different encryption scheme types. We first introduce the Q1 model, the more realistic yet less powerful one. In Q1 model, the attacker has access to a quantum computer and some classical oracle. This is the standard model used for Shor’s or Grover’s algorithm. In the Q2 model, we assume a quantum oracle to which the attacker has access. A quantum oracle allows superposition queries. While this attack scenario is based on a very strong assumption, it might still lead to interesting insights into ciphers structure [20,21].

Our Q1 attack will target plaintext-independent constructions, like classical stream ciphers or block ciphers in counter-mode with fixed IVs. Since for a fixed cipher f and a fixed IV value we can build a deterministic classical circuit, we are also able to build its unitary version

U_{f}

[22]. Using

U_{f}

and an equally distributed superposition of all states, we can prepare the following state in polynomial time:

\begin{matrix} | x 〉 | y 〉 = \frac{1}{\sqrt{2^{n}}} \sum_{i = 0}^{2^{n} - 1} | i 〉 | f (i) 〉 . \end{matrix}

Now, when we measure a

| 1 〉

in the second register, only values i for which

f (i) = 1

will remain with a non-zero amplitude in the superposition:

\begin{matrix} | x 〉 | y 〉 = \frac{1}{\sqrt{hw (T_{f})}} \sum_{\begin{matrix} i = 0 \\ f (i) = 1 \end{matrix}}^{2^{n} - 1} | i 〉 | 1 〉 . \end{matrix}

The register

| x 〉

has now exactly the desired form of

| b 〉

from Problem 3, and each

b_{i}

corresponds to the bit output for a given key i. Obviously, the

| 1 〉

measurement only happens with a certain probability. However, as in the cryptographic setting, we mostly work with functions that are almost balanced [2], and we expect the probability to measure the desired state to be fairly high. For a Boolean function f, the said probability is the ratio of the Hamming weight of

T_{f}

and the size of its truth table, as it is for a balanced function

\frac{1}{2}

. We show the exact procedure in Algorithm 1.

Algorithm 1: Q1 model input preparation for Problem 3

Input: Quantum implementation

U_{f}

of a keyed cipher

f : {0, 1}^{n} \to {0, 1}

1 | x 〉 | y 〉 \leftarrow {| 0 〉}^{\otimes n} | 0 〉

;

2 | x 〉 | y 〉 \leftarrow H_{n} | x 〉 | y 〉 = \frac{1}{\sqrt{2^{n}}} \sum_{i = 0}^{2^{n} - 1} | i 〉 | 0 〉

;

3 | x 〉 | y 〉 \leftarrow U_{f} | x 〉 | y 〉 = \frac{1}{\sqrt{2^{n}}} \sum_{i = 0}^{2^{n} - 1} | i 〉 | f (i) 〉

;

4

Measure register

| y 〉

;

5

If

| y 〉 = = | 0 〉

go to step 1;

6

return

| x 〉

;

Output:

| b 〉 = \sum_{i = 0}^{2^{n} - 1} b_{i} | i 〉

, where

b_{i}

is the value of

f (i)

scaled by the factor

\frac{1}{\sqrt{hw (T_{f})}}

to produce a valid quantum state.

In the Q2 model, we can perform a similar trick to retrieve the superposition of all outputs of a function with a fixed key. The superposition, however, encapsulates the possible inputs to an encryption function with a secret key. Especially in the case where there are weak keys, this might allow retrieving some additional information about the key and some encrypted plaintext. The Q2 attack resembles the previous one and is presented in Algorithm 2.

Algorithm 2: Q2 model input preparation for Problem 3

Input: Quantum oracle

O_{f_{k}}

for a cipher

f_{k} : {0, 1}^{n} \to {0, 1}

1 | x 〉 | y 〉 \leftarrow {| 0 〉}^{\otimes n} | 0 〉

;

2 | x 〉 | y 〉 \leftarrow H_{n} | x 〉 | y 〉 = \frac{1}{\sqrt{2^{n}}} \sum_{i = 0}^{2^{n} - 1} | i 〉 | 0 〉

;

3 | x 〉 | y 〉 \leftarrow U_{f_{k}} | x 〉 | y 〉 = \frac{1}{\sqrt{2^{n}}} \sum_{i = 0}^{2^{n} - 1} | i 〉 | f_{k} (i) 〉

;

4

Measure register

| y 〉

;

5

If

| y 〉 = = | 0 〉

go to step 1;

6

return

| x 〉

;

Output:

| b 〉 = \sum_{i = 0}^{2^{n} - 1} b_{i} | i 〉

, where

b_{i}

is the value of

f_{k} (i)

scaled by the factor

\frac{1}{\sqrt{hw (T_{f_{k}})}}

to produce a valid quantum state.

5. Cryptographic Use-Case

In this section, we want to use the HHL algorithm for Möbius transform to perform cryptanalysis of Boolean functions. We begin with an approach that allows creating the state

| A_{f} 〉

, which might be polynomially reconstructible for sparse functions.

5.1. Retrieving the Algebraic Normal Form

In Section 4.2, we showed how to generate the input corresponding to the

T_{f}

for a Boolean function f. The procedure runs in polynomial time and the result is a vector of the following form:

\begin{matrix} | T_{f} 〉 = \frac{1}{\sqrt{hw (T_{f})}} \sum_{\begin{matrix} i = 0 \\ f (i) = 1 \end{matrix}}^{2^{n} - 1} | i 〉 = \frac{1}{\sqrt{hw (T_{f})}} \cdot T_{f} \end{matrix}

We also showed how to construct the matrix input

Γ_{1}

for the HHL algorithm, which corresponds to the Boolean Möbius transform. The matrix is sparse and of low condition number and a constant size. The HHL algorithm would take constant time to run for a single instance of

Γ_{1}

, and

\tilde{O} (n)

time to apply the Möbius transform to the whole register (cf. Theorem 4).

As seen in Section 4.1, applying

T_{n}

to the vector

T_{f}

results in a vector which describes the algebraic normal form of the function f. A similar result is obtained when the vertical HHL algorithm with

Γ_{1}

input is applied to the register

| T_{f} 〉

.

Theorem 5.

Let

H H L_{Γ_{n}}

be the vertical application of

H H L_{Γ_{1}}

to n qubits:

\begin{matrix} H H L_{Γ_{n}} | b 〉 | a n c i l l a 〉 : = \prod_{j = 1}^{n} (⨂_{i = 1}^{j - 1} E_{1} \otimes H H L_{Γ_{1}} \otimes ⨂_{i = j + 1}^{n} E_{1}) | b 〉 | a n c i l l a 〉 \end{matrix}

The ancilla register is needed to cover for the special Booleanized construction of

Γ_{1}

. Then, applying the

H H L_{Γ_{n}}

to the state

| T_{f} 〉

results in the following register:

\begin{matrix} H H L_{Γ_{n}} | T_{f} 〉 | a n c i l l a 〉 = | A_{f} 〉 | a n c i l l a 〉 = (\frac{1}{\sqrt{h w (A_{f})}} \cdot A_{f}) | a n c i l l a 〉 \end{matrix}

With an additional measurement of some helper register we can obtain the pure state

| A_{f} 〉

.

Proof.

As seen in Theorem 4, applying the vertical HHL approach with

Γ_{1}

input allows for computing of the preimage of the input vector

| b 〉

. The result is equivalent to the preimage of

\vec{b}

under the Boolean Möbius transformation

T_{n}

. As seen in Claim 1, the result is a vector

A_{f}

holding the algebraic normal form of f scaled by a factor dependent on the superposition. Finally, we can discard the ancilla register (or perform some additional conditional measurement) and only consider the register

| A_{f} 〉

. □

The resulting register corresponds to the vector description of the ANF of f, scaled by a factor. The important observation is that only the coefficients which are one have a non-zero amplitude.

5.1.1. Sparse ANF Representation

Classically, reconstruction of an unknown Boolean function is a difficult problem. One of the approaches can be obtained from the learning theory and is based on the Fourier spectrum [2]. One can also use the Formula (1) mentioned in Section 1.2 to reveal the ANF. While obtaining the coefficients of low degree coefficients is not problematic, the complexity grows exponentially with the degree of the coefficient. On the other hand, if the degree of the coefficient is too high, it affects a small number of possible outputs and has a low influence on the truth table of the function.

When considering the result of the algorithm described in Theorem 5, we want to highlight that the state

| A_{f} 〉

is an equally distributed superposition of all the base states which contribute to the ANF. Upon measurement, we obtain any of the ANF’s active (non-zero) coefficients with the same probability. The difference, opposing the classical scenario, is that the number of steps we need for each coefficient is independent of the coefficient’s degree. Especially for Boolean functions with sparse representation (polynomial number of coefficients with respect to n) and many high-order coefficients, the ANF reconstruction can be significantly sped up. Simple measurements can be combined with techniques like amplitude amplification to guarantee that the whole ANF is properly recovered in polynomial time [13].

In the Q1 model, retrieving the

A_{f}

can give us information about the secret key that is being used to generate some bit-stream. By combining information from multiple Boolean functions (e.g., multiple one-bit projections generated by the stream cipher) we could build a polynomial-size equation system which can be solved for the secret key. The ANF obtained in the Q2 model does not give us direct information about the key, but could be used to decrypt future ciphertexts.

5.1.2. Estimating the Number of Terms

In some cases, especially when

hw (A_{f})

is super-polynomial, we might consider other properties of f rather than its ANF. The state

| A_{f} 〉

allows us to estimate the number of non-zero coefficients in the algebraic normal form of the Boolean function. We can achieve this using a technique called Quantum Amplitude Estimation [13].

Quantum amplitude estimation deals with a problem connected to the Grover’s algorithm setting. However, instead of searching for an

{x \in X : f (x) = 1}

, we are interested in the size of said set. Similarly, as in Grover’s algorithm, the algorithm divides the space into two subspaces following some indicator function f:

\begin{matrix} | Ψ 〉 = | Ψ_{1} 〉 + | Ψ_{0} 〉 \end{matrix}

We call

| Ψ_{1} 〉

the good components and

| Ψ_{0} 〉

the bad components. We want to estimate the probability of measuring a good component. To achieve it, the algorithm uses the famous Grover’s operator combined with phase estimation. After [13] was published, other potential candidates to solve the same problem were constructed [23,24,25].

We begin with a straightforward observation that the state

| A_{f} 〉

is an equal superposition of all states which correspond to the non-zero coefficients of

A_{f}

. This means that the amplitudes of

| A_{f} 〉

are only zeros and ones scaled by a factor of

\frac{1}{\sqrt{hw (A_{f})}}

. If we were able to estimate the amplitude of one of the active entries, we could determine the Hamming weight

hw (A_{f})

. This is exactly the sparsity of the function f. Further, since all non-zero amplitudes are equal, we just need to find the amplitude of a single element from the ANF (e.g., by measuring

| A_{f} 〉

we find the element from

A_{f}

and subsequently estimate its amplitude). Sometimes, we might not be interested in finding the exact number but rather proving that it is above a certain threshold. Since most of the amplitude estimation algorithms have their runtime dependent on an error threshold, we could verify if the amplitude is only smaller than some value, which guarantees the desired ANF sparseness.

We could iterate the above ideas to prove additional properties. By carefully choosing the function used for amplitude estimation, we could check how many higher-order terms are in the ANF. The good states (as defined in [13]) could be the terms with a degree greater than some threshold. This task would take exponential time in the classical scenario, but could be efficiently performed on a quantum computer.

5.1.3. Comparison to Classical Approaches

Retrieving the algebraic normal form of a Boolean function is a concept closely related to computational learning theory. In the classical scenario, given a set of samples of the form

{(x, f (x))}_{x \in S}

, we want to compute a function

\tilde{f}

such that the following is true:

\begin{matrix} \underset{x \in {0, 1}^{n}}{Pr} (\tilde{f} (x) = f (x)) \geq c \end{matrix}

for some non-negligible constant c. Obviously, reconstructing the original function f would solve this problem. However, since higher-order terms of

A_{f}

only influence a few outputs, ignoring terms with orders bigger than some predetermined value t might be beneficial. How many such high-order terms might be ignored depends on the value of c.

In [2], a learning approach is proposed, which relies on determining the Fourier coefficients of f instead of its ANF. As there are

2^{n}

such coefficients, and computing each one is not a trivial task; for bigger n-values, this is not a manageable approach. The corollary below estimates the runtime of such an approach for a specific set of with total influence bounded by some value t.

Corollary 1

([2]). Let

I (f)

be the total influence of a Boolean function f, defined as:

\begin{matrix} I (f) = \sum_{S \subseteq [n]} | S | \hat{f} {(S)}^{2}, \end{matrix}

where

\hat{f} (S)

are the Fourier coefficients of f.

For

t \geq 1

, let

C = {f : {- 1, 1}^{n} \to {- 1, 1} | I (f) \leq t}

. Then

C

is learnable from random examples with error ϵ in time

n^{O (t / ϵ)}

.

An alternate approach would be learning the actual ANF coefficients of the function f. As mentioned in Section 1.2, the formula for a coefficient of the term

x^{I}

is as follows:

\begin{matrix} c_{I} = \sum_{supp (x) \subseteq I} f (x) . \end{matrix}

So, to compute a coefficient of order

d e g (I) = t

, we need to evaluate the function f at

2^{t}

inputs and add them together. For higher-order terms, the number of computational steps is thus

O (2^{t})

.

In comparison, our approach depends solely on n and the Hamming weight of the algebraic normal form of f. The high-order coefficients are as easy to determine as the low-order terms. Furthermore, we are not aware of any classical algorithm other than trivial counting, which determines the Hamming weight of the ANF of a function.

Historically, certain sparse but high-order functions were used in cryptography. A famous example, the Toyocrypt cipher, was attacked in [26]. Even though the polynomial used there was not secret, some of the currently used ciphers may use functions with sparse polynomial representation.

5.2. Algebraic Transition Matrix

Another approach using a concept closely related to the Boolean Möbius Transform was proposed in [5]. For a vectorial Boolean function

F : {0, 1}^{n} \to {0, 1}^{m}

, ref. [5] define a linear operator mapping from

δ_{x}

to

δ_{F (x)}

, where

δ_{x}

is the Kronecker delta function. The operator is called a transition matrix:

Definition 1

(Transition matrix [5]). Let

K be a field and let F : F_{2}^{n} \to F_{2}^{m}

be a vectorial Boolean function. Define

T^{F} : K^{F_{2}^{n}} \to K^{F_{2}^{m}}

as the unique linear operator that maps

δ_{x}

to

δ_{F (x)}

. The transition matrix of F is the coordinate representation of

T^{F}

with respect to the Kronecker delta bases of

K^{F_{2}^{n}}

and

K^{F_{2}^{m}}

.

The transition matrix has a set of useful properties which allow combining smaller transformations to a bigger, more complex one [5]. However, in this paper, we will not focus on those and rather will show that, in the Q2 model, we can obtain oracle access to the transition matrix.

Example 3

([5]). Consider a function

F : F_{2}^{2} \to F_{2}^{2}

defined as

\begin{matrix} F (x_{1}, x_{2}) = (F_{1} (x_{1}, x_{2}), F_{2} (x_{1}, x_{2})) \mapsto (x_{2} + 1, x_{1} + x_{1} x_{2}) \end{matrix}

The truth table of such a function is as follows:

\begin{matrix} F : & (0, 0) \mapsto (1, 0) \\ (1, 0) \mapsto (1, 1) \\ (0, 1) \mapsto (0, 0) \\ (1, 1) \mapsto (0, 0) \end{matrix}

The transition matrix

T^{F}

is then: as follows

T^{F} = \begin{matrix} \begin{array}{cccc} (0, 0) & (1, 0) & (0, 1) & (1, 1) \end{array} \\ \begin{matrix} (0, 0) \\ (1, 0) \\ (0, 1) \\ (1, 1) \end{matrix} & (\begin{matrix} 0 & 0 & 1 & 1 \\ 1 & 0 & 1 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{matrix}) \end{matrix}

We want to highlight that, given a column index i, the only non-zero entry appears in the

F (i)

’th row. If we allow superposition queries to the function F (as is assumed in the Q2 model), we have a quantum column-oracle for the transition matrix. To obtain the row-oracle which, given a row index, returns the indices of non-zero entries in that row, we additionally need an oracle for the

F^{- 1}

function. For a permutation, the oracle acts as a bijection. However, as seen in the example above, this must not be the case in general. With some assumptions about the maximal number of elements

ν

that map to the same value, we can still solve this problem with a bigger output register. We also want to highlight that such a matrix is well-suited for the HHL algorithm—the sparseness and condition number can be upper-bound by

s = O (ν) = κ

. Especially for F, which is a permutation, the runtime of HHL with

T^{F}

is

\tilde{O} (n + m)

.

While the transition matrix is not necessarily useful in the HHL setting, they are a building block of a more interesting construction. We will first introduce a dual to the Boolean Möbius transform, as defined in [5], and then use it to build an algebraic transition matrix.

Definition 2

(Dual Boolean (binary) Möbius Transformation [5]). The dual Boolean Möbius transformation of a Boolean function

f \in F_{2}^{F_{2}^{n}}

is the Boolean function

P_{n} (f) = \tilde{f} \in F_{2}^{F_{2}^{n}}

, where

\tilde{f} (u)

is the coordinate of f corresponding to

π_{u}

in the precursor basis. The precursor basis vectors

π_{u}

are the indicator function of precursor sets:

\begin{matrix} π_{u} = 1_{{s u p p (x) \subseteq u}} \end{matrix}

The matrix representation of

P_{n}

has the following form:

\begin{matrix} P_{n} = {(\begin{matrix} 1 & 1 \\ 0 & 1 \end{matrix})}^{\otimes n} \end{matrix}

Observe that the dual Boolean Möbius transform

P_{n}

is actually the transpose of

T_{n}

defined in Section 4.1. By techniques similar to the ones mentioned above, we can also Booleanize

P_{n}

. With the definitions above, we can finally introduce the algebraic transition matrices:

Definition 3

(Algebraic Transition Matrix [5]). Let

F : F_{2}^{n} \to F_{2}^{m}

be a vectorial Boolean function. Define

A^{F} : F_{2}^{F_{2}^{n}} \to F_{2}^{F_{2}^{m}}

as the dual Boolean Möbius transformation of

T^{F}

. That is:

\begin{matrix} A^{F} = P_{m} T^{F} P_{n}^{- 1} \end{matrix}

The algebraic transition matrix of F is the coordinate representation of

T^{F}

with respect to the precursor bases of

F_{2}^{F_{2}^{n}}

and

F_{2}^{F_{2}^{m}}

.

The algebraic transition matrix can be computed as a horizontal decomposition of three matrices, two dual Möbius transforms and a transition matrix. We observe that like the Boolean Möbius transform, its dual is self-inverse. So in fact, we can compute

A^{F}

as follows:

\begin{matrix} A^{F} = P_{m} T^{F} P_{n} \end{matrix}

The HHL algorithm for each of those matrices runs in time

O (m + n)

. The two dual transforms can be vertically decomposed (cf. Figure 1b) using the technique from Section 4.2, while

T^{F}

is well-suited for HHL as we argued above. We can combine the three HHL routines by straight concatenation:

\begin{matrix} H H L_{A^{F}} | b 〉 = H H L_{P_{m}} \circ H H L_{T^{F}} \circ H H L_{P_{n}} | b 〉 \end{matrix}

As a result, we can compute the preimage of a quantum state

| b 〉

under the algebraic transition matrix in polynomial time by combining the classical HHL algorithm with the vertical HHL.

5.2.1. Interpretation of ATMs Mappings

The algebraic transition matrix gives us deep insights into the vectorial Boolean function which is being investigated. In [5], the authors show that, for

I = (i_{1}, \dots, i_{m})

,

J = (j_{1}, \dots, j_{n})

, the matrix

A^{F}

following holds:

\begin{matrix} {(A^{F})}_{(I, J)} = 1 \Leftrightarrow A_{(\prod_{k = 1}^{m} F_{k}^{i_{k}})} |_{J} = 1, \end{matrix}

where

A_{(\prod_{k = 1}^{m} F_{k}^{i_{k}})} |_{J}

is the Jth coefficient

c_{J}

of the algebraic normal form of the Boolean function f as defined in Section 1.2.

Using the HHL algorithm does not give us direct access to the algebraic transition matrix. However, ref. [5] comes with a theorem which explains the result of multiplying

A^{F}

by some vector

g \in F_{2}^{F_{2}^{m}}

:

Theorem 6

(Property of Algebraic Transition Matrices [5]). Let

F : F_{2}^{n} \to F_{2}^{m}

be a vectorial Boolean function and

A^{F}

its algebraic transition matrix. Further, let

g : F_{2}^{m} \to F_{2}

be a Boolean function with m inputs. Then:

\begin{matrix} A_{g \circ F} = {(A^{F})}^{T} A_{g} \end{matrix}

In other words, for a given Boolean function of the form

h = g \circ F

, using the HHL algorithm with input matrix

A^{F}

and a vector

| h 〉

, we can find the state

| g 〉

which encapsulates the searched function.

Example 4

([5]). Consider a function

F : F_{2}^{2} \to F_{2}^{2}

defined as follows

\begin{matrix} F (x_{1}, x_{2}) = (F_{1} (x_{1}, x_{2}), F_{2} (x_{1}, x_{2})) \mapsto (x_{2} + 1, x_{1} + x_{1} x_{2}) \end{matrix}

The algebraic transition matrix

A^{F}

is then as follows:

A^{F} = \begin{matrix} \begin{array}{cccc} 1 & x_{1} & x_{2} & x_{1} x_{2} \end{array} \\ \begin{matrix} 1 \\ F_{1} \\ F_{2} \\ F_{1} \cdot F_{2} \end{matrix} & (\begin{matrix} 1 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 1 & 0 & 1 \end{matrix}) \end{matrix}

The rows correspond to products of the projection functions

F_{i}

’s, while the columns indicate whether the coefficient of each

x^{I}

is one or zero.

The algebraic transition matrix also allows quick computation of the Boolean function obtained by composition of F with any Boolean function. For example for the Boolean function

g (x_{1}, x_{2}) = x_{1} + x_{2}

, we can compute

g \circ F

:

\begin{matrix} {(A^{F})}^{T} \cdot A_{g} & = {(\begin{matrix} 1 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 1 & 0 & 1 \end{matrix})}^{T} (\begin{matrix} 0 \\ 1 \\ 1 \\ 0 \end{matrix}) \\ = (\begin{matrix} 1 \\ 1 \\ 1 \\ 1 \end{matrix}) \\ = 1 + x_{1} + x_{2} + x_{1} x_{2} \\ = g \circ F \end{matrix}

A possible attack scenario includes finding a Boolean form to compute the value of each

x_{i}

from the output of the function F. For example, to find the formula for

x_{1}

, the input vector to the HHL algorithm would have to be as follows:

| b ⟩ = (\begin{matrix} 0 \\ 1 \\ 0 \\ ⋮ \\ 0 \\ ⋮ \\ 0 \end{matrix}) \begin{matrix} 1 \\ x_{1} \\ x_{2} \\ x_{i} x_{2} \\ \prod_{i = 1}^{m} x_{i} \end{matrix}

We highlight that the matrix to be inverted is not

A^{F}

, but

{(A^{F})}^{T}

. This is not a problem as we can just switch the row- and column-oracles to obtain the transpose of the matrix

T^{F}

and use the following fact:

\begin{matrix} {(A^{F})}^{T} & = {(P_{m} T^{F} P_{n})}^{T} \\ = P_{n}^{T} \cdot {(T^{F})}^{T} \cdot P_{m}^{T} \\ = P_{n} \cdot {(T^{F})}^{T} \cdot P_{m} \end{matrix}

For each low-order term

x^{I}

, we can use the techniques used in Section 5.1.2 to verify whether the preimage has a small Hamming weight. If such an input is found, there exists a sparse Boolean function that allows the computation of the value of

x^{I}

. We can recover such a function and use it to deduce the values of

x_{i}

for

i \in I

. For example, we can use the fact that if

x^{I} = 1

, then

\forall i \in I

follows

x_{i} = 1

.

5.2.2. Comparison to Classical Approaches

In the classical setting, to find a formula to compute certain bits, the attacker would have to analyze the ciphers algebraic properties. The concept is similar to algebraic cryptanalysis, where the cipher is rewritten as a set of equations for which the solution is the secret key [27].

We are, however, not interested in building a huge equation system, but rather want to find a function which, combined with the original function, leaks secret bits. The idea resembles the concept of annihilators, introduced in [26]. An annihilator of a Boolean function

f (x) : F_{2}^{n} \to F_{2}

is another Boolean function

g (x) : F_{2}^{n} \to F_{2}

such that the following is true:

\begin{matrix} f (x) \cdot g (x) = 0 \forall x \in F_{2}^{n} \end{matrix}

The annihilators allow reducing the degree of algebraic equations and, as a consequence, finding some of the secret key bits used in encryption. In our approach, we are rather interested in analyzing a vectorial Boolean function

F (x) : F_{2}^{n} \to F_{2}^{m}

using a Boolean function

g (x) : F_{2}^{m} \to F_{2}

such that the following is true:

\begin{matrix} g \circ F (x) = x^{I} \forall x \in F_{2}^{n}, \end{matrix}

but the desired outcome is similar.

6. Conclusions

In this paper, we have shown how tensor-decomposable matrices can be used in the HHL setting to improve the runtime of the linear system solving algorithm proposed in [3]. We have proven that a vertical decomposition has a direct influence on the run-time, and in the quantum setting allows preimage computation even if the input to be inverted is not tensor-decomposable. This is a direct improvement over the classical computation.

Next, we have shown a polynomial-time procedure to create a quantum state which describes the algebraic normal form of a Boolean function f. The register has the form

| A_{f} 〉 = \sum_{i = 0}^{2^{n} - 1} \frac{1}{\sqrt{hw (A_{f})}} \cdot A_{f}

, where

A_{f}

represents the coefficient vector of f. We further show how

| A_{f} 〉

could be used to extract some information about the ANF. In most straightforward scenarios, a distinguisher attack could be performed to differentiate a cryptographic function from a random function. The Hamming weight of the ANF of a random function will be roughly

N / 2

. Meanwhile, a cryptographic function is likely to deviate from this value. The attack could also be used as a testing method for new ciphers. It would allow a quicker verification of the sparsity of ANF representations.

We show how a construct called an algebraic transition matrix can be used in the HHL setting to improve the cryptanalysis of Boolean functions. By using the polynomial-time HHL algorithm for all three parts of the horizontal decomposition of

A^{F}

, with specific input, we can find formulas for the values of the key used in the encryption process.

We finally point to the fact that the Boolean Möbius transform

T_{n}

is self-inverse. Therefore, it is not necessary to compute the preimage under the Möbius transformation. We could compute the value

T_{n} \cdot | b 〉

and obtain the same result. We would not have to invert the eigenvalue

λ_{k}

, but use it as a factor. It remains an open question how significantly such an alteration improves the runtime. We believe that, for some use cases, it might pose an interesting question.

Author Contributions

Conceptualisation, methodology, formal analysis, writing—original draft preparation, writing—review and editing C.P., supervision, project administration and proof-reading M.M. All authors have read and agreed to the published version of the manuscript.

Funding

Supported by Open Access funds of Freie Universität Berlin.

Conflicts of Interest

The authors declare no conflict of interest.

References

Hosoyamada, A. Quantum Speed-Up for Multidimensional (Zero Correlation) Linear Distinguishers. In Advances in Cryptology, Proceedings of the ASIACRYPT 2023, Guangzhou, China, 4–8 December 2023; Springer: Singapore, 2023; pp. 311–345. [Google Scholar] [CrossRef]
O’Donnell, R. Analysis of Boolean Functions; Cambridge University Press: Cambridge, UK, 2014. [Google Scholar] [CrossRef]
Harrow, A.W.; Hassidim, A.; Lloyd, S. Quantum Algorithm for Solving Linear Systems of Equations. Phys. Rev. Lett. 2009, 103, 150502. [Google Scholar] [CrossRef] [PubMed]
Chen, Y.A.; Gao, X.S. Quantum Algorithm for Boolean Equation Solving and Quantum Algebraic Attack on Cryptosystems. J. Syst. Sci. Complex. 2022, 35, 373–412. [Google Scholar] [CrossRef]
Beyne, T.; Verbauwhede, M. Integral Cryptanalysis Using Algebraic Transition Matrices. IACR Trans. Symmetric Cryptol. 2023, 2023, 244–269. [Google Scholar] [CrossRef]
Hoffstein, J.; Pipher, J.; Silverman, J.H. NTRU: A Ring-Based Public Key Cryptosystem. In Algorithmic Number Theory; Goos, G., Hartmanis, J., Van Leeuwen, J., Buhler, J.P., Eds.; Springer: Berlin/Heidelberg, Germany, 1998; Volume 1423, pp. 267–288. [Google Scholar] [CrossRef]
Chen, Y.A.; Gao, X.S.; Yuan, C.M. Quantum Algorithm for Optimization and Polynomial System Solving over Finite Field and Application to Cryptanalysis. arXiv 2018, arXiv:1802.03856v2. [Google Scholar] [CrossRef]
Liu, W.; Gao, J. Quantum Security of Grain-128/Grain-128a Stream Cipher against HHL Algorithm. Quantum Inf. Process. 2021, 20, 343. [Google Scholar] [CrossRef]
Ding, J.; Gheorghiu, V.; Gilyén, A.; Hallgren, S.; Li, J. Limitations of the Macaulay Matrix Approach for Using the HHL Algorithm to Solve Multivariate Polynomial Systems. Quantum 2023, 7, 1069. [Google Scholar] [CrossRef]
Gao, J.; Li, H.; Wang, B.; Li, X. AES-128 under HHL Algorithm. Quant. Inf. Comput. 2022, 22, 0209–0240. [Google Scholar] [CrossRef]
Murphy, S.; Robshaw, M.J. Essential Algebraic Structure within the AES. In Advances in Cryptology, Proceedings of the CRYPTO 2002, Santa Barbara, CA, USA, 18–22 August 2022; Goos, G., Hartmanis, J., Van Leeuwen, J., Yung, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2002; Volume 2442, pp. 1–16. [Google Scholar] [CrossRef]
Berry, D.W.; Ahokas, G.; Cleve, R.; Sanders, B.C. Efficient Quantum Algorithms for Simulating Sparse Hamiltonians. Commun. Math. Phys. 2007, 270, 359–371. [Google Scholar] [CrossRef]
Brassard, G.; Hoyer, P.; Mosca, M.; Tapp, A. Quantum Amplitude Amplification and Estimation. AMS Contemp. Math. Ser. 2000, 305. [Google Scholar] [CrossRef]
Ambainis, A. Variable Time Amplitude Amplification and a Faster Quantum Algorithm for Solving Systems of Linear Equations. arXiv 2010, arXiv:1010.4458. [Google Scholar] [CrossRef]
Childs, A.M.; Kothari, R.; Somma, R.D. Quantum Algorithm for Systems of Linear Equations with Exponentially Improved Dependence on Precision. SIAM J. Comput. 2017, 46, 1920–1950. [Google Scholar] [CrossRef]
Gurvits, L. Classical Deterministic Complexity of Edmonds’ Problem and Quantum Entanglement. In Proceedings of the Thirty-Fifth Annual ACM Symposium on Theory of Computing, San Diego, CA, USA, 9–11 June 2003; pp. 10–19. [Google Scholar] [CrossRef]
Barbier, M.; Cheballah, H.; Le Bars, J.M. On the Computation of the Möbius Transform. Theor. Comput. Sci. 2020, 809, 171–188. [Google Scholar] [CrossRef]
Carlet, C. Boolean Functions for Cryptography and Error-Correcting Codes. In Boolean Models and Methods in Mathematics, Computer Science, and Engineering, 1st ed.; Crama, Y., Hammer, P.L., Eds.; Cambridge University Press: Cambridge, UK, 2010; pp. 257–397. [Google Scholar] [CrossRef]
Pieprzyk, J.; Wang, H.; Zhang, X.m. Möbius Transforms, Coincident Boolean Functions and Non-Coincidence Property of Boolean Functions. Int. J. Comput. Math. 2011, 88, 1398–1416. [Google Scholar] [CrossRef]
Kuwakado, H.; Morii, M. Quantum Distinguisher between the 3-Round Feistel Cipher and the Random Permutation. In Proceedings of the 2010 IEEE International Symposium on Information Theory, Austin, TX, USA, 13–18 June 2010; pp. 2682–2685. [Google Scholar] [CrossRef]
Kaplan, M.; Leurent, G.; Leverrier, A.; Naya-Plasencia, M. Breaking Symmetric Cryptosystems Using Quantum Period Finding. In Advances in Cryptology, Proceedings of the CRYPTO 2016, Santa Barbara, CA, USA, 14–18 August 2016; Robshaw, M., Katz, J., Eds.; Springer: Berlin/Heidelberg, Germany, 2016; pp. 207–237. [Google Scholar] [CrossRef]
Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information, 10th ed.; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar] [CrossRef]
Suzuki, Y.; Uno, S.; Raymond, R.; Tanaka, T.; Onodera, T.; Yamamoto, N. Amplitude Estimation without Phase Estimation. Quantum Inf. Process. 2020, 19, 75. [Google Scholar] [CrossRef]
Vazquez, A.C.; Woerner, S. Efficient State Preparation for Quantum Amplitude Estimation. Phys. Rev. Appl. 2021, 15, 034027. [Google Scholar] [CrossRef]
Grinko, D.; Gacon, J.; Zoufal, C.; Woerner, S. Iterative Quantum Amplitude Estimation. Npj Quantum Inf. 2021, 7, 52. [Google Scholar] [CrossRef]
Courtois, N.T.; Meier, W. Algebraic Attacks on Stream Ciphers with Linear Feedback. In Advances in Cryptology, Proceedings of the EUROCRYPT 2003, Warsaw, Poland, 4–8 May 2003; Biham, E., Ed.; Springer: Berlin/Heidelberg, Germany, 2003; pp. 345–359. [Google Scholar] [CrossRef]
Bard, G.V. Algebraic Cryptanalysis; Springer: Boston, MA, USA, 2009. [Google Scholar] [CrossRef]

Figure 1. We distinguish between decompositions of a matrix which are well-parallelizable (b) and the sequential ones (a). To make a clear distinction, we call them vertical and horizontal (regarding how they are stacked in the standard quantum gate-based computation model). (a) Horizontal decomposition. (b) Vertical decomposition.

Figure 2. Vertical HHL scheme. We note that we actually apply a single

H H L_{M_{i}}

at a time in t steps. The register after step 1 is

(H H L_{M_{1}} \otimes ⨂_{i = 2}^{t} E_{i}) | b 〉

. The state in step 2 is

(E_{1} \otimes H H L_{M_{2}} \otimes ⨂_{i = 3}^{t} E_{i}) \circ {(M_{1} \otimes ⨂_{i = 2}^{t} E_{i})}^{- 1} | b 〉

.

Figure 2. Vertical HHL scheme. We note that we actually apply a single

H H L_{M_{i}}

at a time in t steps. The register after step 1 is

(H H L_{M_{1}} \otimes ⨂_{i = 2}^{t} E_{i}) | b 〉

. The state in step 2 is

(E_{1} \otimes H H L_{M_{2}} \otimes ⨂_{i = 3}^{t} E_{i}) \circ {(M_{1} \otimes ⨂_{i = 2}^{t} E_{i})}^{- 1} | b 〉

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pilaszewicz, C.; Margraf, M. HHL Algorithm for Tensor-Decomposable Matrices. Quantum Rep. 2025, 7, 37. https://doi.org/10.3390/quantum7030037

AMA Style

Pilaszewicz C, Margraf M. HHL Algorithm for Tensor-Decomposable Matrices. Quantum Reports. 2025; 7(3):37. https://doi.org/10.3390/quantum7030037

Chicago/Turabian Style

Pilaszewicz, Cezary, and Marian Margraf. 2025. "HHL Algorithm for Tensor-Decomposable Matrices" Quantum Reports 7, no. 3: 37. https://doi.org/10.3390/quantum7030037

APA Style

Pilaszewicz, C., & Margraf, M. (2025). HHL Algorithm for Tensor-Decomposable Matrices. Quantum Reports, 7(3), 37. https://doi.org/10.3390/quantum7030037

Article Menu

HHL Algorithm for Tensor-Decomposable Matrices

Abstract

1. Introduction

1.1. State-of-the-Art of HHL in Cryptanalysis

1.2. Notation

2. HHL Algorithm

2.1. HHL Runtime

2.2. HHL over Finite Fields

Reduction from Boolean Equations to Equations over $C$

3. HHL for Tensors of Matrices

4. HHL for Möbius Transform

4.1. Möbius Transform of Boolean Functions

4.2. Möbius Transform Using HHL

4.3. Booleanized $T_{n}$ vs. Booleanized $T_{1}$

4.4. How to Generate Input

5. Cryptographic Use-Case

5.1. Retrieving the Algebraic Normal Form

5.1.1. Sparse ANF Representation

5.1.2. Estimating the Number of Terms

5.1.3. Comparison to Classical Approaches

5.2. Algebraic Transition Matrix

5.2.1. Interpretation of ATMs Mappings

5.2.2. Comparison to Classical Approaches

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

HHL Algorithm for Tensor-Decomposable Matrices

Abstract

1. Introduction

1.1. State-of-the-Art of HHL in Cryptanalysis

1.2. Notation

2. HHL Algorithm

2.1. HHL Runtime

2.2. HHL over Finite Fields

Reduction from Boolean Equations to Equations over C

3. HHL for Tensors of Matrices

4. HHL for Möbius Transform

4.1. Möbius Transform of Boolean Functions

4.2. Möbius Transform Using HHL

4.3. Booleanized T n vs. Booleanized T 1

4.4. How to Generate Input

5. Cryptographic Use-Case

5.1. Retrieving the Algebraic Normal Form

5.1.1. Sparse ANF Representation

5.1.2. Estimating the Number of Terms

5.1.3. Comparison to Classical Approaches

5.2. Algebraic Transition Matrix

5.2.1. Interpretation of ATMs Mappings

5.2.2. Comparison to Classical Approaches

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Reduction from Boolean Equations to Equations over $C$

4.3. Booleanized $T_{n}$ vs. Booleanized $T_{1}$