1. Introduction
In modern cryptographic systems, identifying the cryptographic algorithm plays a pivotal role in cryptanalysis [1,2]. When plaintext is inaccessible, an adversary typically begins by inferring the cryptographic algorithm underlying the intercepted ciphertext [3,4,5] (e.g., AES, DES, RSA). Once the algorithm is identified, the attacker can launch targeted cryptanalysis by selecting optimized key-recovery strategies, tailoring algebraic or statistical attacks, or leveraging structural vulnerabilities, thereby substantially reducing both the complexity and computational cost of decryption. As a result, cryptographic algorithm identification is a practical and highly valuable capability in real-world cryptanalysis and network offense scenarios.
The rapid progress of artificial intelligence in recent years has made it a key enabler for cryptographic algorithm identification. Specifically, machine learning-based schemes have shown considerable success in identifying various international standard ciphers [6,7]. In such approaches, ciphertexts are first transformed into discriminative statistical representations before being fed into learning models. A common and effective practice is to extract statistical features (e.g., the NIST-15 features) based on the NIST SP 800-22 statistical test suite [8], which quantifies the degree of randomness and structural regularity within ciphertext sequences. The NIST-15 features, such as the Frequency Test and the Runs Test, capture the distinct statistical behaviors exhibited by different cryptographic algorithms due to their internal diffusion and substitution structures. By converting ciphertexts into numerical vectors through these statistical indicators, supervised classifiers (e.g., Support Vector Machines, Random Forests, or Deep Neural Networks) can be trained to automatically learn the mapping between feature distributions and algorithms. This method therefore essentially addresses a multi-class classification problem in machine learning, where each class corresponds to a cryptographic algorithm. For illustration, Figure 1 depicts how legitimate parties Alice and Bob encrypt/decrypt plaintext via a pre-negotiated cipher and key, while adversary Eve passively intercepts the ciphertext and infers its underlying cryptographic algorithm to leverage algorithm-specific vulnerabilities or optimized key recovery. To this end, Eve trains a deep neural network (DNN) with ciphertext samples from known algorithms to automatically predict the type of cryptographic algorithm used in the intercepted ciphertext. To date, DNNs have demonstrated promising performance in cryptographic algorithm identification and have thus emerged as the mainstream choice.
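To make the pipeline concrete, the following minimal sketch trains a feature-based identifier on synthetic stand-in "ciphertexts", using two toy statistics in place of the full NIST-15 suite; all names and parameters are illustrative rather than the configuration evaluated later in this paper.

```python
# Toy sketch of feature-based cipher identification (illustrative only).
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split

def toy_features(bits):
    """Two stand-in statistics: bit balance and (normalized) number of runs."""
    balance = bits.mean()
    runs = 1 + np.count_nonzero(np.diff(bits))
    return np.array([balance, runs / len(bits)])

rng = np.random.default_rng(0)
# Stand-in "ciphertexts": class 0 is balanced, class 1 is slightly biased.
X = [rng.integers(0, 2, 1024) for _ in range(200)] + \
    [(rng.random(1024) < 0.55).astype(int) for _ in range(200)]
y = np.array([0] * 200 + [1] * 200)

feats = np.stack([toy_features(c) for c in X])
Xtr, Xte, ytr, yte = train_test_split(feats, y, random_state=0)
clf = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500, random_state=0)
clf.fit(Xtr, ytr)
print("identification accuracy:", clf.score(Xte, yte))
```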
In this paper, we apply, for the first time, adversarial example techniques to counter cryptographic algorithm identification. Specifically, we generate adversarial ciphertexts by applying controlled, reversible bit-level perturbations designed to mislead algorithm-identification models without compromising legitimate decryption. This approach acts as a defense mechanism that enhances ciphertext unidentifiability. For illustration, Figure 2 depicts how, before transmission, the sender imposes the perturbations on her ciphertext, which the receiver removes prior to normal decryption. The adversary intercepts the perturbed ciphertext but fails to infer the type of cryptographic algorithm, since the statistical characteristics of the ciphertext have been modified to mislead the DNN-based identification model. The receiver, however, can still decrypt correctly after removing the perturbations according to the perturbing method he knows. Compared with traditional countermeasures (multi-layer encryption, algorithm obfuscation, protocol camouflage), this approach is simpler and more practical: it acts directly on ciphertexts without modifying algorithms or key management.
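As a minimal illustration of this reversibility (with arbitrary, hypothetical bit positions), a second XOR with the same mask exactly undoes the perturbation:

```python
# Reversible bit-level perturbation via XOR (illustrative sketch).
import numpy as np

rng = np.random.default_rng(1)
ciphertext = rng.integers(0, 2, 64, dtype=np.uint8)  # stand-in ciphertext bits
mask = np.zeros(64, dtype=np.uint8)
mask[[3, 17, 42]] = 1                                # flip three chosen bits

perturbed = ciphertext ^ mask      # what Eve intercepts on the channel
recovered = perturbed ^ mask       # what Bob computes before decryption
assert np.array_equal(recovered, ciphertext)  # decryptability is preserved
```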
In our method, we implement adversarial perturbations on ciphertexts via bit flipping. Our method employs two strategies to mislead the target classifier: (1) making each class resemble another specific class (i.e., the mimic class) via bit flipping, thereby increasing discrimination difficulty; (2) altering the inherent features of each class through bit flipping, making it less like itself. The key lies in selecting the bit positions to flip. To reduce implementation complexity, we choose the same bit positions for all ciphertexts encrypted with the same algorithm (i.e., class). Hence our method is called the Class-Specific Perturbation Mask Generation algorithm (CSPM), which uses a mask to indicate the bit positions to flip in the ciphertext. To generate such masks, CSPM first trains a deep neural network to emulate the adversary's identification model (called the substitute model). By interacting with the substitute model (i.e., perturbing the ciphertext and sending the results to the substitute model for classification), CSPM scores each bit position, determines the mask based on these scores, and selects the bits with the highest scores for flipping. We conduct extensive experiments to evaluate CSPM across multiple cipher families using the NIST-15 statistical suite as the feature extraction backbone. The results demonstrate that CSPM substantially degrades identification accuracy in all valid feature–cipher combinations: the accuracy decline consistently exceeds 25%, reaching up to 69.50% in the best case (KASUMI under Feature 12; see Section 5). Moreover, CSPM requires flipping only a small fraction of the ciphertext bits while preserving the underlying cryptographic scheme and keying process, thereby offering a stealthy and practical defense. The main contributions of this work are summarized below:
We propose a class-level adversarial perturbation framework that defeats cryptographic algorithm identification without modifying the encryption scheme or key infrastructure. To the best of our knowledge, our method is the first to resist cryptographic algorithm identification with adversarial perturbations.
Our study indicates that the key to resisting cryptographic algorithm identification lies in the selection of bit positions to flip, rather than the number of flipped bits. Our method determines the flipped bit positions based on two criteria: reducing inter-class distance and altering the class’s own characteristics. This provides a design reference and performance comparison baseline for similar studies.
We conduct extensive evaluation across multiple cryptographic algorithms and NIST-15 statistical features, achieving over 25% accuracy reduction under all settings, demonstrating the effectiveness and efficiency of the proposed method.
2. Preliminary
2.1. Cryptographic Randomness Testing Metrics
Randomness testing of ciphertext is a critical approach to evaluating the security of cryptographic systems. The National Institute of Standards and Technology (NIST) [8] defines 15 randomness tests in SP 800-22 to comprehensively assess the global and local randomness properties of binary sequences. These tests effectively determine whether a sequence exhibits statistical characteristics consistent with true randomness. The 15 NIST randomness tests are described below (a minimal sketch of two of these tests follows the list):
(1) Cumulative Sums Test: examines whether the partial sums of the sequence significantly deviate from the expected value. Excessively large or small sums suggest non-randomness.
(2) Linear Complexity Test: uses the Berlekamp-Massey algorithm [9,10] to compute the linear complexity of the sequence and compares it to the expected complexity of a true random sequence. Significant deviations indicate potential non-randomness.
(3) Longest Run of Ones Test: assesses whether the length of the longest run of consecutive 1s in the sequence aligns with the expected distribution of a true random sequence. Excessively long or short runs may indicate non-randomness.
(4) Overlapping Template Matching Test: counts the occurrences of specific overlapping patterns of consecutive 1s in the sequence and compares them to the expected distribution in a true random sequence. Large deviations suggest non-randomness.
(5) Random Excursions Test: evaluates whether the number of visits to specific states (e.g., cumulative sums) in the sequence significantly deviates from the expected behavior in a true random sequence. Large deviations indicate potential non-randomness.
(6) Random Excursions Variant Test: assesses whether the frequency of specific states in a random walk deviates significantly from the expected behavior in a true random sequence. Significant deviations suggest non-randomness.
(7) Approximate Entropy Test: compares the frequency of m-bit and (m + 1)-bit subsequences in the sequence and evaluates deviations from the expected distribution of a true random sequence to determine randomness.
(8) Maurer’s Universal Statistical Test: determines whether the sequence can be significantly compressed. A sequence that resists compression is typically considered random.
(9) Discrete Fourier Transform Test: analyzes the periodicity of the sequence through its frequency spectrum and compares it to that of a true random sequence. Significant deviations in periodicity indicate potential non-randomness.
(10) Serial Test: examines whether the frequency of all m-bit subsequences in the sequence is consistent with that of a true random sequence. Uneven frequency distributions may indicate non-randomness.
(11) Binary Matrix Rank Test: constructs matrices from the sequence and evaluates the linear dependence of fixed-length subsequences. Strong linear dependence suggests non-randomness.
(12) Non-overlapping Template Matching Test: counts the occurrences of specific non-overlapping patterns in the sequence and compares them to the expected distribution in a true random sequence. Large deviations suggest non-randomness.
(13) Block Frequency Test: divides the sequence into non-overlapping M-bit blocks and checks whether the distribution of 0s and 1s within each block is consistent with randomness. Significant deviations suggest non-randomness.
(14) Frequency Test: examines whether the proportion of 0s and 1s in a binary sequence is approximately equal, as expected in an ideal random sequence.
(15) Runs Test: evaluates whether the distribution of runs (consecutive sequences of identical bits) in the sequence aligns with the expected distribution of a random sequence. Anomalous run distributions may indicate non-randomness.
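For concreteness, the sketch below implements two of the fifteen tests (the Frequency Test and the Runs Test) following the SP 800-22 statistics; it is illustrative and not a substitute for the official test suite.

```python
# Minimal SP 800-22-style Frequency (monobit) and Runs tests.
import numpy as np
from math import erfc, sqrt

def frequency_test(bits):
    """p-value for the balance of 0s and 1s (monobit test)."""
    n = len(bits)
    s = np.sum(2 * bits.astype(int) - 1)   # map {0,1} -> {-1,+1} and sum
    return erfc(abs(s) / sqrt(n) / sqrt(2))

def runs_test(bits):
    """p-value for the total number of runs of identical bits."""
    n = len(bits)
    pi = bits.mean()
    if abs(pi - 0.5) >= 2 / sqrt(n):       # SP 800-22 prerequisite check
        return 0.0
    v = 1 + int(np.count_nonzero(np.diff(bits)))  # observed number of runs
    return erfc(abs(v - 2 * n * pi * (1 - pi)) /
                (2 * sqrt(2 * n) * pi * (1 - pi)))

bits = np.random.default_rng(2).integers(0, 2, 4096)
print(frequency_test(bits), runs_test(bits))  # high p-values for random input
```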
2.2. Adversarial Examples
Adversarial examples are inputs generated by intentionally adding small perturbations [11,12,13,14]. Such inputs are perceptually indistinguishable from legitimate inputs to humans, yet they can mislead machine learning models. Based on the attacker's knowledge of the target model, adversarial attacks can be categorized into three types: in white-box attacks, attackers have full knowledge of the target model [11,12,13,14]; in gray-box attacks, attackers have only partial knowledge of the model [15,16,17]; and in black-box attacks, attackers have no knowledge of the model at all [18,19,20]. Currently, research on adversarial examples has extended to numerous fields [21,22,23,24]. For instance, in the field of Android malware detection [21,22,23], attackers generate adversarial applications by modifying features such as permissions and API calls, evading detection while retaining malicious functionality.
3. Problem Definition and Threat Model
3.1. Problem Definition
In a typical secure communication scenario, let Alice be the sender and Bob be the legitimate receiver. Alice encrypts a plaintext $p$ using a cryptographic algorithm $E$ with secret key $k$, obtaining a ciphertext sequence $c = E_k(p) \in \{0,1\}^n$, where $\{0,1\}^n$ denotes the set of binary sequences of length $n$. The ciphertext is transmitted over an open channel to Bob, who can recover the plaintext by applying the corresponding decryption algorithm $D_k$. However, an adversary, Eve, intercepts the ciphertext during transmission. Although Eve cannot directly decrypt $c$ without the key, she possesses a deep neural network classifier $f_\theta$ trained on ciphertext samples from multiple cryptographic algorithms $\mathcal{A} = \{A_1, A_2, \ldots, A_k\}$. By extracting statistical or structural features $F(c)$ from the ciphertext and inputting them into the classifier, Eve can predict the most likely cryptographic algorithm as $\hat{A} = f_\theta(F(c))$. Once the cryptographic algorithm is identified, Eve can apply targeted cryptanalysis or side-channel attacks to recover the plaintext more efficiently.
To defend against such model-based ciphertext identification, we propose to introduce an adversarial perturbation at the ciphertext level. Specifically, given an original ciphertext $c \in \{0,1\}^n$, we define an adversarial perturbation mask $m \in \{0,1\}^n$, which flips a subset of bits according to the rule $c' = c \oplus m$, where $\oplus$ denotes the bitwise XOR operation. The perturbed ciphertext $c'$ should preserve decryptability for Bob (since Bob, knowing $m$, can simply recover the original ciphertext as $c = c' \oplus m$ before decryption), but it should simultaneously mislead Eve's classifier so that $f_\theta(F(c'))$ outputs an incorrect cryptographic algorithm label. Formally, the defense objective is to find an optimal perturbation mask $m^*$ that minimizes the classifier's confidence on the true algorithm label while maintaining ciphertext reversibility for legitimate communication. In this setting, the adversarial perturbation acts as a protective layer on top of the ciphertext, directly deceiving model-based algorithm identification and thereby strengthening the robustness of cryptographic systems against such attacks.
3.2. Threat Model
We consider a realistic ciphertext interception scenario where an adversary, Eve, passively eavesdrops on the communication channel between Alice and Bob. Eve can collect a large number of ciphertext samples encrypted by the different algorithms in $\mathcal{A}$, but she does not have access to the corresponding plaintexts or secret keys. Instead of attempting direct decryption, Eve employs a trained deep neural network classifier $f_\theta$ to infer the underlying cryptographic algorithm used for each ciphertext. This classifier takes as input a set of statistical or transformed features $F(c)$ extracted from the ciphertext sequence $c$, and outputs a predicted algorithm label $\hat{A} = f_\theta(F(c))$. The goal of Eve is to correctly identify the cryptographic algorithm, which significantly reduces the search space for subsequent cryptanalytic attacks.
We assume that Eve’s classifier is trained offline on a large-scale, labeled dataset of ciphertexts generated from known cryptographic algorithms, so that it generalizes effectively to unseen ciphertexts. The defenders, i.e., Alice and Bob, are aware that such model-based attackers may intercept and analyze ciphertexts; their primary objective is therefore to mislead Eve’s classifier into predicting an incorrect cryptographic-algorithm label for transmitted ciphertexts.
To counteract this threat, Alice and Bob introduce a reversible adversarial perturbation into the transmitted ciphertext, producing a perturbed ciphertext $c' = c \oplus m$. The perturbation $m$ is designed to satisfy two requirements: (1) Bob can efficiently recover the original ciphertext (e.g., $c = c' \oplus m$), and (2) the classifier is misled, i.e., $f_\theta(F(c')) \neq A_{\text{true}}$, where $F(\cdot)$ denotes the feature extraction process used by the classifier and $A_{\text{true}}$ is the algorithm actually used.
This setting models a black-box attack scenario from the defender's perspective: the defender does not have access to Eve's classifier parameters $\theta$, but can generate perturbations by observing the classifier's predicted confidence scores or by using a substitute model trained on similar data. The design objective is thus to construct an adversarial perturbation that remains effective under model uncertainty, degrading the accuracy of cryptographic-algorithm identification while maintaining communication integrity between Alice and Bob.
4. Methodology
4.1. Overview
The CSPM method is designed to generate perturbation masks for binary ciphertext samples grouped into k classes, aiming to reduce the classification confidence of a pre-trained deep neural network (DNN) classifier while adhering to a perturbation budget. The perturbation mask is used to flip the chosen bits of the ciphertext with the XOR operation, thereby misleading the classifier. Instead of randomly selecting the bits to be flipped, CSPM employs the following two ideas for optimally choosing and flipping these bits.
1. Mimicry for reducing inter-class distance. This idea is to make samples of a certain class (target class) more similar to those of another class (mimic class) through bit flipping, thereby making it more difficult for the cryptographic algorithm classifier to identify them. By comparing the differences between the ciphertexts produced by different cryptographic algorithms, CSPM identifies the most similar cryptographic algorithm (i.e., mimic class) for each cryptographic algorithm (i.e., target class). It then selects the positions of the bits to be flipped based on the degree of difference between the two algorithms at each bit position. Specifically, bits at positions with greater differences are flipped first, making the modified ciphertext more similar to the one generated by the mimic cryptographic algorithm. In this way, CSPM can mislead the cryptographic algorithm classifier of the adversary.
2. Distortion for altering class features. The idea is to alter the features of samples in each class through bit flipping, thereby making it more difficult for the cryptographic algorithm classifier to identify them. In general, if bit 0 or 1 appears more frequently at certain bit positions of the ciphertext (instead of occurring randomly with equal probability), this can be regarded as a typical feature of the class (i.e., cryptographic algorithm). Hence, CSPM evaluates the randomness of the ciphertext generated by each cryptographic algorithm at every bit position, identifies bit positions where the probability difference between the occurrence of 0s and 1s is significant, and flips those bits. This alters the inherent features of the class, making it harder to be accurately identified by the adversary.
It is worth noting that the first step in implementing mimicry and distortion is to represent the feature of every class, i.e., cryptographic algorithm. We use the "average" sample (referred to as the prototype) of each class to accomplish this task. In summary, Figure 3 depicts our method, which operates in three stages: (1) train the substitute model and find the mimic class; (2) compute the per-bit mimicry score and specificity score; (3) greedy construction of the perturbation mask.
Formally, let $\mathcal{A} = \{A_1, A_2, \ldots, A_k\}$ denote the set of $k$ cryptographic algorithms or classes. Each class has the same number of samples. For any class, say $A_j$, its binary ciphertext samples are of length $n_j$ (note that ciphertext lengths obtained by different cryptographic algorithms for plaintexts of the same length may vary), and are split into a training subset $T_j$ and a test subset $V_j$. A feature extractor $F$ maps binary ciphertext samples to a feature space, and a pre-trained DNN classifier $f$ predicts the probability that a ciphertext sample belongs to each class. Furthermore, positive weight coefficients $\alpha$ and $\beta$ are introduced to balance the impacts of the mimicry and distortion methods in the evaluation of bit positions. CSPM aims to produce a perturbation mask $m_j \in \{0,1\}^{n_j}$ for each class that minimizes the classifier's confidence on the target class. To achieve this goal, CSPM performs the following three main steps.
4.2. Computing the Prototype for Every Class
As mentioned earlier, generating a prototype for each class is a prerequisite for mimicry and distortion. To this end, we prepare a sample subset for each class
, denoted as
, which can usually be the training subset
. For a class
, its prototype
is calculated as the average of the binary samples in
. Then, the
t-th bit of
is given by:
where
is the number of samples in
, and
is the
t-th bit of sample
c. The prototype
represents the empirical probability that bit
t is 1 in class
, providing a compact summary of the class’s bit distribution.
4.3. Evaluating Bit Positions for Misleading Classifiers
In the second step, CSPM evaluates the potential of each bit position to mislead the classifier for every class, serving as the foundation for the third step (i.e., perturbation mask generation). More specifically, CSPM analyzes each bit position according to two criteria, mimicry and specificity, and derives the score of each bit position by combining them.
First, CSPM identifies the mimic class, and then computes a score for each bit position based on the difference between the target class and the mimic class. The identification of mimic classes is based on the similarity between the prototypes of two classes. Note that the lengths of samples (ciphertexts) from different classes may differ. For the sake of fairness, when evaluating the similarity between the prototypes of two classes, only the first $L = \min(n_i, n_j)$ bits of the two prototypes are considered. For a target class $A_j$, the mimic class $A_{j^*}$ is selected as the class whose prototype is most similar to $p_j$, measured by cosine similarity:

$$j^* = \arg\max_{i \neq j} \frac{\langle p_j, p_i \rangle}{\lVert p_j \rVert \, \lVert p_i \rVert}.$$

In the remainder of this paper, $q$ is used to denote the prototype of the mimic class. Then we can use the mimicry score $S^{\mathrm{mim}}_t$ to evaluate the $t$-th bit position of the target class prototype. Here the mimicry score reflects the change in distance between the target and mimic prototypes when bit $t$ is flipped (flipping bit $t$ in every sample changes the prototype value from $p_j[t]$ to $1 - p_j[t]$), which can be calculated as:

$$S^{\mathrm{mim}}_t = \big(p_j[t] - q[t]\big)^2 - \big((1 - p_j[t]) - q[t]\big)^2.$$

According to the above equation, a positive $S^{\mathrm{mim}}_t$ indicates that flipping bit $t$ reduces the distance to the mimic class.
Second, CSPM computes a score for each bit position of every class, based on the probabilities of bits 0 and 1 occurring at that position. This score, which we refer to as the specificity score, characterizes the degree of difference between the occurrence probabilities of bits 0 and 1 at a given bit position. Compared with the purely random scenario, where bits 0 and 1 occur with equal probability, the greater the difference between their occurrence probabilities, the more distinct the specificity of this bit position, making it easier to use for identifying the target class (i.e., cryptographic algorithm). In CSPM, the specificity score of the $t$-th bit position is thus calculated as:

$$S^{\mathrm{spec}}_t = \big| p_j[t] - 0.5 \big|.$$

Here, $S^{\mathrm{spec}}_t$ quantifies the deviation of the target probability from the uniform value 0.5. It is worth noting that this metric is symmetric: a low probability (e.g., $p_j[t] = 0.1$) yields the same specificity score as a high probability (e.g., $p_j[t] = 0.9$). This is intentional, as both cases indicate a strong structural bias (towards 0 or 1, respectively) that distinguishes the ciphertext from random noise, whereas a value close to 0.5 implies high uncertainty and low distinguishability.
According to the above equation, a higher specificity score indicates that bit 0 or bit 1 appears more frequently at the corresponding position. This phenomenon is likely to help adversaries identify the cryptographic algorithm.
Finally, we combine the mimicry score and the specificity score to derive a final score for each bit position:

$$S_t = \alpha \, S^{\mathrm{mim}}_t + \beta \, S^{\mathrm{spec}}_t,$$

where $\alpha$ and $\beta$ control the trade-off between the two criteria, i.e., mimicry and specificity. The purpose of this score is to comprehensively evaluate the potential of bit flipping at each position to mislead the adversary's classifier, thereby providing a basis for the implementation of adversarial perturbations, as discussed in the following subsection.
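The scoring step can be sketched as follows; mimic_class and bit_scores are illustrative helpers mirroring the equations above, with prototypes truncated to a common length for the similarity computation:

```python
# Bit-position scoring: mimic-class selection plus combined score (sketch).
import numpy as np

def mimic_class(prototypes, j):
    """Index of the class whose prototype is most cosine-similar to class j's."""
    L = min(len(p) for p in prototypes)
    pj = prototypes[j][:L]
    best, best_sim = -1, -np.inf
    for i, pi in enumerate(prototypes):
        if i == j:
            continue
        qi = pi[:L]
        sim = np.dot(pj, qi) / (np.linalg.norm(pj) * np.linalg.norm(qi))
        if sim > best_sim:
            best, best_sim = i, sim
    return best

def bit_scores(p, q, alpha, beta):
    """Combined score S_t; p and q are assumed aligned to a common length."""
    mim = (p - q) ** 2 - ((1 - p) - q) ** 2  # > 0 iff a flip moves p toward q
    spec = np.abs(p - 0.5)                   # deviation from the uniform 0.5
    return alpha * mim + beta * spec
```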
4.4. Sorting Bit Positions and Generating Perturbation Masks
In the final step, CSPM sorts the bit positions of every class in descending order of their final scores, and this order guides the subsequent bit-flipping operations so as to mislead the cryptographic algorithm classifier as much as possible. The main idea is as follows. CSPM sequentially tries flipping the bit at each position in the aforementioned order and observes the misleading effect of this operation on the classifier. If the flip reduces the classifier's confidence in the target class, the position is retained and recorded in the perturbation mask; otherwise, the position is discarded. For simplicity, we adopt a greedy strategy that iterates over the sorted positions, sequentially identifying and recording the bit positions worth perturbing, and finally forming a perturbation mask for every class. This perturbation mask is then used to perturb other samples of the same class, helping to protect them from having their cryptographic algorithm type accurately identified by the classifier. The detailed steps are described as follows.
Let $m_j \in \{0,1\}^{n_j}$ denote the perturbation mask of the $j$-th class, initialized as the all-zero mask $m_j = 0^{n_j}$. In descending order of final scores, CSPM sequentially selects bit positions, performs the corresponding bit-flipping operation on all samples in the perturbation set $P_j$, and calculates the average value of the classifier's output confidence for the target class (i.e., class $j$). The average confidence is calculated with the EvalAvgConfidence function, i.e.,

$$\mathrm{EvalAvgConfidence}(P_j, m_j, j) = \frac{1}{|P_j|} \sum_{c \in P_j} f\big(F(c \oplus m_j)\big)[j],$$

where $\oplus$ represents the XOR operation, $c \oplus m_j$ denotes the perturbed sample, $F(c \oplus m_j)$ is the feature vector extracted from the perturbed sample, and $f(\cdot)[j]$ is the probability predicted by the classifier that the perturbed sample belongs to class $j$. For ease of understanding, we have incorporated the aforementioned process (along with the previous steps) into Algorithm 1.
Algorithm 1 CSPM

Require: Binary ciphertext samples grouped into $k$ classes $A_1, \ldots, A_k$, each sample of class $A_j$ of length $n_j$; for each class $A_j$: perturbation subset $P_j$ (typically the training subset $T_j$) and a test subset; feature extractor $F$; pre-trained DNN classifier $f$; weight coefficients $\alpha, \beta$.
Ensure: Perturbation mask $m_j$ for each class $A_j$.

1:  for $j = 1$ to $k$ do ▹ Compute prototypes
2:      $p_j[t] \leftarrow \frac{1}{|P_j|} \sum_{c \in P_j} c[t]$ for all $t$
3:  end for
4:  for $j = 1$ to $k$ do
5:      for $i = 1$ to $k$, $i \neq j$ do ▹ Identify mimic class
6:          $sim[i] \leftarrow \mathrm{CosSim}(p_j, p_i)$
7:      end for
8:      $j^* \leftarrow \arg\max_{i \neq j} sim[i]$;  $q \leftarrow p_{j^*}$
9:      for $t = 1$ to $n_j$ do ▹ Compute combined score for each bit
10:         $S^{\mathrm{mim}}_t \leftarrow (p_j[t] - q[t])^2 - ((1 - p_j[t]) - q[t])^2$
11:         $S^{\mathrm{spec}}_t \leftarrow |p_j[t] - 0.5|$
12:         $S_t \leftarrow \alpha S^{\mathrm{mim}}_t + \beta S^{\mathrm{spec}}_t$
13:     end for
14:     $indices \leftarrow$ bit positions sorted by $S_t$ in descending order
15:     $m \leftarrow 0^{n_j}$
16:     $best \leftarrow \mathrm{EvalAvgConfidence}(P_j, m, j)$
17:     for $idx$ in $indices$ do ▹ Greedy mask construction
18:         $m' \leftarrow m$ with bit $idx$ flipped
19:         $conf \leftarrow \mathrm{EvalAvgConfidence}(P_j, m', j)$
20:         if $conf < best$ then
21:             $m \leftarrow m'$;  $best \leftarrow conf$
22:         end if
23:     end for
24:     Store $m$ as the perturbation mask $m_j$ for class $A_j$
25: end for

1:  function EvalAvgConfidence($P$, $m$, $j$)
2:      $total \leftarrow 0$
3:      for each $c \in P$ do
4:          $c' \leftarrow c \oplus m$
5:          $x \leftarrow F(c')$
6:          $total \leftarrow total + f(x)[j]$
7:      end for
8:      return $total / |P|$
9:  end function
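The greedy stage of Algorithm 1 can be sketched in Python as follows; extract and clf stand in for the feature extractor $F$ and the pre-trained classifier $f$ (assumed here to expose a scikit-learn-style predict_proba), and are placeholders rather than the exact implementation used in our experiments.

```python
# Greedy construction of a class-level perturbation mask (illustrative sketch).
import numpy as np

def eval_avg_confidence(samples, mask, j, extract, clf):
    """Average predicted probability of target class j after applying the mask."""
    perturbed = samples ^ mask                       # XOR-flip the chosen bits
    feats = np.stack([extract(c) for c in perturbed])
    return clf.predict_proba(feats)[:, j].mean()

def greedy_mask(samples, scores, j, extract, clf):
    """Scan bits in descending score order; keep flips that lower confidence."""
    n = samples.shape[1]
    mask = np.zeros(n, dtype=samples.dtype)
    best = eval_avg_confidence(samples, mask, j, extract, clf)
    for idx in np.argsort(scores)[::-1]:             # highest final score first
        trial = mask.copy()
        trial[idx] ^= 1
        conf = eval_avg_confidence(samples, trial, j, extract, clf)
        if conf < best:                              # retain only helpful flips
            mask, best = trial, conf
    return mask
```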
5. Evaluation
In this section, we assess CSPM through the following four research questions (RQs):
RQ1 (Effectiveness): How effectively can CSPM reduce the identification accuracy of cipher-classification models?
RQ2 (Efficiency): What is the computational overhead of CSPM in terms of perturbation generation time and perturbation magnitude?
RQ3 (Mechanistic Insight): How does the perturbation strength relate to attack effectiveness, and can the adversarial behavior of CSPM be explained in terms of its underlying mechanisms?
RQ4 (Ablation Study): How do the individual components of CSPM contribute to its overall performance?
5.1. Experiment Setup
All experiments are conducted on a computing platform equipped with an Intel Core i5-13600 CPU and an NVIDIA GeForce RTX 3060 GPU. Unless otherwise specified, the weight coefficients $\alpha$ and $\beta$ are kept fixed throughout all experiments.
Dataset: The plaintext corpus used in our experiments is sourced from the Open American National Corpus (OANC). The original text is first segmented into 500,000 fixed-length fragments, from which 10,000 fragments are randomly selected to form the plaintext samples for our evaluation. Each plaintext sample is then encrypted under seven representative international cryptographic algorithms using randomly generated keys, producing a total of 70,000 binary ciphertext sequences (7 classes × 10,000 samples per class).
Identification Model: The cryptographic algorithm identification model is implemented as a three-layer fully connected multi-layer perceptron (MLP). Specifically, the network consists of an input layer matching the ciphertext feature dimension, followed by two hidden layers of 512 and 256 neurons, respectively, each activated by the ReLU function. A final softmax output layer produces the probability distribution over the seven cipher classes. The model is trained using the Adam optimizer with a learning rate of 0.001 and a batch size of 128 for 50 epochs. Cross-entropy loss is employed as the objective function to optimize classification accuracy.
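A minimal PyTorch sketch consistent with this description is given below; the layer sizes, optimizer, and loss follow the text, while the data-loading names are placeholders.

```python
# Identification model: 512/256 ReLU MLP over ciphertext features (sketch).
import torch
import torch.nn as nn

class CipherMLP(nn.Module):
    def __init__(self, feat_dim: int, num_classes: int = 7):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim, 512), nn.ReLU(),
            nn.Linear(512, 256), nn.ReLU(),
            nn.Linear(256, num_classes),  # softmax is folded into the loss
        )

    def forward(self, x):
        return self.net(x)

model = CipherMLP(feat_dim=15)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()  # applies log-softmax internally
# Training loop over a placeholder DataLoader of (features, labels):
# loader = torch.utils.data.DataLoader(dataset, batch_size=128, shuffle=True)
# for epoch in range(50):
#     for xb, yb in loader:
#         optimizer.zero_grad()
#         loss = criterion(model(xb), yb)
#         loss.backward()
#         optimizer.step()
```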
Cryptographic Algorithms for Experiments: To ensure that our study exhibits broad applicability and practical relevance, we incorporate seven widely used international standard ciphers representing three major categories of modern cryptography: block ciphers, stream ciphers, and public-key ciphers. These algorithms collectively cover diverse design paradigms and operational structures, providing a representative benchmark for evaluating the universality and robustness of the proposed method.
DES [3]: is a symmetric-key block cipher standardized by NIST in 1977. It operates on 64-bit plaintext blocks with a 56-bit effective key length and adopts a 16-round Feistel network structure. DES is historically significant as one of the earliest widely deployed encryption standards in commercial applications.
AES-128 [4]: is a symmetric-key substitution–permutation network (SPN) block cipher standardized by NIST in 2001. It encrypts 128-bit plaintext blocks using a 128-bit key across 10 transformation rounds. AES is currently the de facto international standard for government, banking, and embedded security systems.
KASUMI [25]: is a symmetric 64-bit block cipher standardized by 3GPP for use in 2G and 3G mobile communication security. It adopts a Feistel-like structure derived from the MISTY1 cipher, with design optimizations for efficient hardware implementation in mobile network baseband environments.
Grain-128 [26]: is a lightweight stream cipher designed for constrained devices. It employs a combination of a Linear Feedback Shift Register (LFSR) and a Nonlinear Feedback Shift Register (NLFSR) to generate keystream bits. It is widely adopted in low-power wireless security and sensor networks.
RSA [5]: is a public-key cryptosystem based on the hardness of integer factorization. Unlike symmetric ciphers, RSA uses a pair of asymmetric keys (public/private) and operates on large integers instead of fixed-size blocks. It is commonly used for key exchange, authentication, and digital signatures.
PRESENT [27]: is an ultra-lightweight block cipher commonly used in RFID tags and IoT devices. It encrypts 64-bit blocks with either an 80-bit or a 128-bit key, using an SPN structure optimized for low-area hardware deployment.
Camellia [28]: is a symmetric-key block cipher developed by NTT and Mitsubishi Electric and later standardized by ISO/IEC. It uses a 128-bit block size and supports key lengths of 128, 192, and 256 bits. Its structure resembles AES but includes additional Feistel-type layers, providing both performance efficiency and strong security guarantees across hardware and software platforms.
Owing to the structural differences among cryptographic algorithms, the ciphertext samples do not share a uniform length, even though the plaintext samples are partitioned into fixed-size segments. The ciphertext length for each class is summarized in Table 1 (in bits). In addition, we identified the corresponding mimic class for each target class through experiments; the details are given in Table 2, where the original algorithm indicates the target class and the mimic algorithm corresponds to the mimic class.
5.2. RQ1: Effectiveness
Goal and Setup. To comprehensively assess the effectiveness of the proposed CSPM method, we first perform a global greedy search across all cipher classes to measure the maximum achievable degradation in classification accuracy. The experiments cover all seven cryptographic algorithms under fifteen NIST feature configurations. For fairness and interpretability, cryptographic algorithms whose baseline classification accuracy falls below 50% under certain feature settings are excluded from subsequent evaluations. A classifier with such low baseline accuracy lacks sufficient discriminative capability, making it difficult to meaningfully assess the degradation effect introduced by CSPM. As a result, algorithms such as AES and PRESENT, which consistently exhibit weak baseline performance across all feature configurations, are omitted from the reported perturbation results.
Result and Analysis. Table 3 summarizes the effectiveness of CSPM on five cryptographic algorithms under the 15 NIST feature configurations. Here, Fe denotes the feature extraction method, whose definition is provided in Section 2; O refers to the original classification accuracy (%) before perturbation; F denotes the final accuracy (%) after applying the adversarial mask; and D indicates the accuracy decline (i.e., O − F), which quantifies the effectiveness of CSPM in misleading the classifier. For the remaining algorithms, cases where the baseline accuracy falls below 50% are considered non-informative and are marked as 'x'.
As shown in Table 3, CSPM consistently achieves substantial degradation, exceeding 25% in all valid attack scenarios, demonstrating a strong capability to mislead the classifier. The most significant drop is observed for KASUMI under Feature 12, where the accuracy decreases from 71.82% to 2.32%, yielding a decline of 69.50%. Furthermore, for a given cryptographic algorithm, the perturbation effectiveness varies with the feature representation. For example, KASUMI experiences a 58.09% decline under Feature 3 (Linear Complexity Test) but only 44.06% under Feature 7 (Approximate Entropy Test). This discrepancy arises from the distinct sensitivity of different statistical features to adversarial perturbations: linear complexity (Feature 3) captures structural regularities that are more susceptible to perturbations, while approximate entropy (Feature 7) measures higher-order randomness, which is inherently more robust.
5.3. RQ2: Efficiency
Goal and Setup. To further assess the practicality of CSPM, we evaluate the perturbation cost, measured as the number of flipped bits. This number is a critical indicator because it reflects both the visibility and the feasibility of CSPM in real-world scenarios: fewer flipped bits correspond to lower modification overhead and stronger stealthiness. In addition, to quantify the computational burden of the proposed method, we measure the execution time required for each round of perturbation mask generation. This analysis allows us to examine whether CSPM can construct perturbation masks within a reasonable time budget.
Result and Analysis. Table 4 reports the performance of CSPM on five cryptographic algorithms across the 15 feature settings, along with the corresponding perturbation ratios. Here, D denotes the accuracy decline, while P represents the perturbation percentage (i.e., the number of flipped bits relative to the total ciphertext length). Since ciphertext lengths differ among cryptographic algorithms, the absolute number of flipped bits is not directly comparable; perturbation percentages therefore provide a normalized and fair metric. As shown in the table, the average perturbation percentage is 13.06% and the lowest is only 7.94%, indicating that CSPM can cause substantial classifier degradation with only a small number of bit flips, demonstrating both high stealthiness and efficiency.
Moreover, the general trend suggests a positive correlation between the performance drop and the perturbation percentage, i.e., larger perturbation percentages tend to induce stronger degradation. However, some cases deviate from this tendency. For example, for RSA, the attack under Feature 5 (Random Excursions Test) causes a 26.35% drop with a 9.38% perturbation percentage, whereas Feature 10 (Serial Test) results in a slightly larger drop of 27.14% while using only an 8.46% perturbation percentage. This discrepancy highlights that perturbation effectiveness also depends on feature sensitivity: certain features are more responsive to local bit flips and can be effectively disrupted with fewer modifications, while others require more substantial perturbation to trigger misclassification. Thus, CSPM does not merely rely on perturbation magnitude; it exploits structurally "critical" bit positions to maximize its misleading effect, demonstrating both precision and efficiency.
We measured the per-iteration time of CSPM's greedy search procedure, where each trial flip requires applying the candidate mask to the test samples (via bitwise XOR), extracting features through $F$, and performing inference with the pre-trained DNN. Because each of these steps, especially feature extraction, has variable computational cost, the per-iteration latency depends strongly on the chosen NIST feature. In our experiments across the 15 NIST feature extractors, the per-iteration evaluation time ranged from approximately 25 s to 85 s on the experimental platform. It is important to emphasize that this timing reflects an offline mask-generation cost, not an online or real-time requirement. CSPM produces class-level masks that are computed once (or periodically) and then applied to many ciphertexts, so the one-time generation cost is amortized over large volumes of traffic. Consequently, the per-iteration expense of generating an individual mask becomes negligible when distributed over large numbers of ciphertexts.
5.4. RQ3: Mechanistic Insight
Goal and Setup. To further investigate how the perturbation strength impacts attack performance, we analyze the dynamic variation of model confidence and classification accuracy during the bit-flipping process. Two representative cases are selected under the Feature 12 configuration: KASUMI (which achieves the most significant degradation, with accuracy dropping to 2.32%) and RSA (which exhibits strong initial identification performance under the same setting).
Analysis and Result. Figure 4 and Figure 5 illustrate the change in average confidence and accuracy on the test set with respect to the number of bit flips. For both algorithms, the trends of confidence decay and accuracy decline are largely consistent. Since CSPM selects perturbation positions based on confidence reduction estimated from the training set, minor fluctuations appear on the test curves; yet the overall trend is a monotonic improvement in attack effectiveness as the flipping process proceeds.
Specifically, for KASUMI, the degradation is relatively mild at the beginning and end of the flipping sequence, while the middle stage exhibits a noticeably steeper decline, indicating that certain structural regions of the cipher are more sensitive to perturbations. In contrast, RSA shows a more uniform and nearly linear degradation trend, suggesting a more stable relationship between perturbation scale and identification failure. These results confirm that the evolution of attack effectiveness is inherently linked to the structural characteristics of different cipher designs.
5.5. RQ4: Ablation Study
Goal and Setup. To further investigate the effectiveness of each component in our ranking-based greedy search, we conduct an ablation study on Feature 4 for four representative ciphers: Camellia, DES, KASUMI, and RSA. Feature 4 is selected because it provides the largest number of valid algorithms under consistent experimental conditions, allowing us to perform comparative analysis across multiple ciphers within the same feature configuration. Specifically, we examine how each scoring mechanism contributes to the construction of the mask by individually isolating the mimicry score and the specificity score, and comparing them with a random flipping strategy and our complete CSPM method.
For each feature, we design four experimental settings: (1) Sorting and flipping based solely on the mimicry score (without using the specificity score); (2) Sorting and flipping based solely on the specificity score (without using the mimicry score); (3) Random flipping as an uninformed baseline; (4) Full CSPM, where both scores are jointly used for ranking.
To ensure a fair comparison, all four experiments use the same number of flipped bits, which is equal to the final number of flipped positions determined by the full CSPM in that scenario. The random flipping strategy is also constrained to this same perturbation budget.
Result and Analysis. As shown in Table 5, CSPM consistently achieves the best performance across all evaluated feature settings. Compared with the random flipping baseline, the accuracy drop is improved by more than 15% in all cases, with the maximum improvement reaching 38.92% (59.30% − 20.38%). This demonstrates that the effectiveness of misleading the classifier primarily depends on which bits are flipped rather than how many bits are perturbed, emphasizing the importance of perturbation position selection.
Moreover, CSPM also outperforms the two single-score variants, i.e., mimicry-score-only and specificity-score-only flipping. This indicates a complementary relationship between the two ranking mechanisms: the mimicry score tends to identify bits that directly steer the ciphertext samples toward the decision boundary of the classifier, while the specificity score captures the bits that globally distort the extracted feature distribution.
Additionally, the relative improvement observed across different ciphers suggests that the internal redundancy and diffusion properties of cryptographic algorithms also influence their susceptibility to adversarial perturbations. For example, block ciphers with higher diffusion depth (e.g., Camellia, DES) exhibit moderate degradation margins, indicating partial resilience to localized bit manipulations, whereas algorithms with lower diffusion complexity (e.g., KASUMI) are more easily disrupted due to their stronger dependence on low-order statistical structures.
Overall, the ablation study validates that both the dual-score ranking mechanism and the cipher-dependent feature sensitivity are critical to the success of CSPM, confirming its robustness and adaptability across heterogeneous cryptographic algorithms.
6. Discussion and Future Work
6.1. Discussion
Dependence on statistical features. CSPM is primarily designed to interfere with feature-based cipher identification models that rely on handcrafted statistical descriptors such as the NIST-15 features, which reflect observable bit-level regularities in ciphertexts. If the adversary instead employs deep representation learning or end-to-end neural fingerprinting models, the mapping between perturbations and extracted features may become highly nonlinear, thereby diminishing the transferability and effectiveness of the generated perturbations.
Limited transferability. The perturbation is generated at the class level but assumes distributional consistency between the samples in the perturbation subset and those in the test set. In real-world heterogeneous deployments (e.g., different implementations or channel noise), prototype drift may occur, reducing the transferability of the generated masks.
Generalizability to End-to-End Deep Learning Models. While the current implementation of CSPM primarily targets feature-based identifiers relying on handcrafted descriptors (e.g., NIST-15), its core optimization mechanism—black-box greedy search—is inherently model-agnostic. This framework can be readily extended to end-to-end deep learning models, such as Convolutional Neural Networks (CNNs) or Recurrent Neural Networks (RNNs), which learn representations directly from raw ciphertext bits.
Practical Deployment in Encryption Workflows. CSPM is designed for seamless integration into practical secure communication systems such as TLS or VPN workflows. The key advantage lies in the offline capability of mask generation. In a real-world deployment, the adversarial mask does not need to be computed per packet. Instead, it can be pre-generated and negotiated during the initial handshake phase, similar to a session key or initialization vector (IV). During runtime transmission, the sender applies the mask via a lightweight bitwise XOR operation immediately after encryption. This decoupling of generation and application ensures that CSPM introduces negligible computational overhead and zero latency penalties, acting as a transparent privacy layer compatible with existing protocols.
6.2. Future Work
In this work, we validated the efficacy of CSPM primarily against feature-based classifiers implemented via Multi-Layer Perceptrons (MLP). Recognizing the rapid advancement of deep learning in cryptanalysis, our future work will focus on evaluating the robustness of CSPM against end-to-end deep learning architectures, such as Convolutional Neural Networks (CNNs) and Residual Networks (ResNets). Specifically, we aim to investigate whether the perturbations generated based on statistical features can maintain their adversarial effectiveness when transferred to models that learn representations directly from raw ciphertext bits. Additionally, we plan to explore the integration of CSPM into a real-time feedback loop to adaptively counter evolving deep-learning-based traffic analysis models.