1. Introduction
With the rapid development and wide application of the Internet, the threat of phishing software has become increasingly serious. Phishing software, essentially a kind of Trojan horse or backdoor, refers to software written for malicious purposes. Applications classified as phishing software [1,2,3] can carry out a variety of malicious activities, such as stealing personal privacy and destabilizing the system, launching alongside normal and legitimate programs without the user’s knowledge. Even when relying entirely on widely used open-source software [4,5] or running programs on closed mobile devices [6,7,8], it can still be difficult to avoid malware hidden in the compiled program. In the past, phishing software was often spread through emails. In recent years, with the continuous advancement and evolution of network attack technologies, phishing software has become increasingly sophisticated and stealthy. This evolution presents significant challenges to maintaining network security.
Over the past few decades, researchers have developed numerous methods and technologies for detecting phishing software. The detection of malicious content in images is a highly scrutinized area of research. Attackers embed harmful data, such as malicious scripts or code, within image files, which are then disseminated through image distribution channels to carry out malevolent actions [9]. Distinct from traditional malware, malicious payloads concealed within superfluous image data exhibit advanced levels of stealth and deceit, which significantly complicates their detection through established malware identification methodologies. For example, the code injected into images is usually well-known malicious shellcode in the form of a binary string. Exploiting this kind of hidden payload in practice usually requires an application with a known or unknown vulnerability (such as a 0-day vulnerability) to open such images. By constructing ROP (return-oriented programming) chains, the malicious information hidden in the image is extracted and executed as binary code in the memory of the application, even though, from the application’s perspective, it is merely the binary data of the cover image. Considering that malicious code constitutes the majority of such concealed content, this paper focuses on methodologies for identifying and analyzing malicious code embedded within redundant image data.
Currently, research on the detection of malicious code in images focuses on two aspects. On one hand, researchers are committed to developing new detection algorithms and techniques to improve the accuracy and efficiency of detecting malicious code in images. For example, several researchers have developed detection methods based on deep learning; by employing deep networks for feature extraction and classification of images, these methods have achieved notable detection performance [10,11,12]. On the other hand, researchers have also paid attention to the propagation routes and behavioral characteristics of malicious code in images to better understand and respond to the threat. For example, some researchers have proposed effective defense strategies by analyzing the propagation pathways and behavioral characteristics of malicious code in images [13,14].
However, current detection methods for malicious code in images still face several problems and challenges. Firstly, due to the covert and deceptive nature of images carrying malicious code, traditional malicious code detection methods often have difficulty detecting and recognizing it [15]. Secondly, the propagation routes and behavioral characteristics of malicious code in images keep changing and evolving, which brings further challenges to detection [16]. Therefore, effectively detecting malicious code in images has become an important issue in current research.
To address the above problems and challenges, this paper proposes a phishing software detection method based on R-tree and the analysis of the Edge of Stability (EOS) phenomenon. The method achieves fast and accurate detection of malicious code in images by constructing an R-tree index and extracting image steganography features. Specifically, we first segment the image by using the R-tree index to improve the detection efficiency. Subsequently, we extract the steganography features of the image and employ machine learning algorithms for classification and determination. Experimental results show that our method has obvious advantages in both detection accuracy and efficiency.
To better illustrate this work, the two main contributions of this paper are listed as follows.
This paper investigates various methodologies for concealing malicious information. Following a detailed exposition of multiple techniques for embedding malicious content, it evaluates which methods are most likely to be utilized for such concealment.
This paper introduces a phishing software detection approach based on R-tree and the analysis of the EOS phenomenon. It delves into the exploration of neural network layers, network width, activation functions, and loss functions. Furthermore, it examines the relationship between the EOS phenomenon and learning rate to minimize loss.
The rest of this paper is structured as follows.
Section 2 reviews related work and covers foundational knowledge, presenting the latest research related to image steganography, detection of malicious information in images, and the EOS phenomenon. Subsequently, the concepts related to R-trees, neural networks, and the EOS phenomenon are analyzed.
Section 3 discusses methods of embedding malicious information, detailing techniques for malicious steganography.
Section 4 proposes a phishing software detection method based on R-trees and the analysis of the EOS phenomenon. We first introduce the structure of the R-tree and then propose a phishing software detection method based on the R-tree.
Section 5 presents the experimental setup, including parameter tuning and a discussion on issues related to the EOS phenomenon.
Section 6 concludes the paper with an overall review and conclusions. Possible future works are also proposed in this section.
2. Background
In this section, we delve into the realm of image steganography, initially presenting an overview of the pertinent techniques and concepts related to RGB (Red, Green, Blue) color models and LSB (Least Significant Bit) methods. Subsequently, we explore cutting-edge technologies in the detection of malicious information embedded within digital media and discuss the frontier research issues in deep learning, particularly focusing on the stability characteristics of high learning rates at the boundary of algorithmic performance. Furthermore, we will provide some preliminary knowledge. We will first introduce the R-tree [17,18], which is widely used in explainable algorithms and high-dimensional problems. Subsequently, we will describe the general neural networks and the Edge of Stability (EOS) phenomenon.
2.1. Image Steganography
Steganography [19,20], a term derived from the Greek words “Stegos”, meaning “cover”, and “Grafia”, denoting “writing”, is defined as “covered writing”. It embodies the art and science of concealing secret information within digital communication objects, aiming to obscure the existence of the message itself [21]. Cryptography [22] is dedicated to the protection of message content via encryption, transforming it into a format that is inaccessible to unauthorized individuals. In contrast, steganography primarily aims to conceal the very existence of the message, thereby maintaining its confidentiality in a subtle manner that evades detection [1].
The process of embedding data within digital images by adjusting pixel values in the spatial domain stands as a rudimentary yet highly effective method [23]. Among these techniques, Least Significant Bit (LSB) modification is notably simple, positioning it as a prevalent method in spatial domain image steganography [23]. The least significant bits (LSBs) of an image encode minimal information, rendering any minor alterations to these bits undetectable by the human visual system. Spatial domain techniques based on LSB modification embed secret bits directly into the cover image by altering its least significant bits, thereby preserving the visual quality of the cover image. In 2016, Dadgostar and Afsari [24] pioneered a novel approach to image steganography, melding interval-valued intuitionistic fuzzy edge detection with refined LSB techniques, marking a notable advancement in the field. This method employs edge detection to identify the image’s edge information and embeds steganographic data into these regions using a modified LSB approach, optimizing the balance between invisibility and payload capacity.
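The plain LSB embedding described above can be sketched in a few lines. This is a minimal illustration on toy 8-bit grayscale pixel values, not the edge-detection-guided variant of [24]; the function names are our own:

```python
def embed_lsb(pixels, bits):
    """Write each secret bit into the least significant bit of a pixel value."""
    out = list(pixels)
    for i, b in enumerate(bits):
        out[i] = (out[i] & ~1) | b   # clear the LSB, then set it to the secret bit
    return out

def extract_lsb(pixels, n):
    """Recover the first n embedded bits."""
    return [p & 1 for p in pixels[:n]]

cover = [52, 55, 61, 66, 70, 61, 64, 73]   # toy 8-bit grayscale pixel values
secret = [1, 0, 1, 1]
stego = embed_lsb(cover, secret)
assert extract_lsb(stego, len(secret)) == secret
assert all(abs(a - b) <= 1 for a, b in zip(cover, stego))   # at most 1 level of distortion
```

Because each pixel value changes by at most one intensity level, the stego image is visually indistinguishable from the cover image, which is precisely why LSB methods are so prevalent.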
2.2. Malicious Information Detection Technology
The concept of malicious information detection has garnered widespread attention in recent research, owing to its critical impact on the efficiency and reliability of cybersecurity. This process involves the identification and filtration of harmful content within the digital ecosystem, including phishing emails, malicious domains, URLs, and image files embedded with malicious code. Achieving proficiency in this detection is vital for enhancing the effectiveness and efficiency of cybersecurity defenses. During the detection process, the challenge lies in striking an optimal balance between swiftly identifying malicious content and minimizing false positives, which is essential for improving overall security measures. Maintaining this equilibrium represents a significant challenge in the field of cybersecurity, which necessitates precise adjustments in detection algorithms, selection of training data, and other related areas.
In 2010, Zhang et al. [25] delved into anomaly behavior detection and countermeasures within wireless sensor networks. Fast forward to 2022, Aljabri et al. [26] conducted a comprehensive analysis of machine learning applications in the identification of malicious links within networks. Their study extensively explored diverse machine learning algorithms and feature extraction strategies, evaluating their utility and limitations in real-world applications and outlining potential directions for subsequent research endeavors. Furthermore, Chang et al. [27] demonstrated the application of natural language processing (NLP) technologies in developing a financial fraud detection model, specifically focusing on the design of an anti-fraud chatbot. This work highlights the evolving use of NLP in enhancing security measures against sophisticated financial fraud schemes.
In 2023, Atawneh and Aljehani [28] demonstrated the efficacy of deep learning techniques in the identification of phishing emails, underscoring their capability to parse complex email content effectively. They developed and assessed a deep learning-based model tailored for phishing email detection, capitalizing on the advanced feature extraction capabilities of deep learning to analyze and identify malicious behaviors and fraud indicators within emails. By training the model to recognize unique linguistic patterns, formats, and other key characteristics of phishing emails, their research illustrated that deep learning models can proficiently distinguish between phishing and legitimate emails. Differing from convolutional neural network (CNN) approaches, some scholars [29] used graph neural networks to train their deep learning models. Concurrently, Thein et al. [30] employed a decision tree model to detect malicious domains, highlighting its rapid and accurate classification prowess. Their research highlighted the effectiveness and utility of decision tree models for tackling cybersecurity issues, especially in detecting and neutralizing malicious domains. Additionally, it provided a structured methodology for the assessment and identification of potential cyber threats.
Lin et al. [31] proposed an interpretable architecture for emerging network systems, which enhances the interpretability of malicious traffic detection from both input and output perspectives, aiming to understand network traffic data and improve the reliability of results. Zhao et al. [32] explored attack strategies to poison models to evade detection by maintaining label consistency and proposed defense mechanisms to mitigate such threats, which helps to understand backdoor risks in natural language processing (NLP) systems and provides insights for robust defense methods to enhance model security. Li et al. [33] proposed a malicious code detection strategy combining CNNs and Transformers, which improved accuracy and surpassed the state-of-the-art malicious code detection techniques.
Liu et al. [34] proposed a pre-trained language model-guided multi-level feature attention network, named PMANet, to advance the detection of malicious URLs. PMANet outperforms previous state-of-the-art pre-trained models and traditional deep learning models in challenging real-world scenarios such as small-scale data, class imbalance, cross-datasets, adversarial attacks, and active malicious URL case studies. Zhao et al. [35] explored the threat of clean-label backdoor attacks in language models and systematically analyzed the attack methods, their effectiveness, and potential defense strategies to enhance the robustness of the model. Fang et al. [36] proposed a deep learning model combining BERT and BiLSTM, which showed excellent performance in processing unbalanced data and capturing the deep semantic features of malicious comments, providing an effective technical means for social media content review and network environment purification.
2.3. EOS
The concept of EOS in neural networks has recently garnered significant attention due to its direct impact on the performance and reliability of deep learning models. The EOS phenomenon within neural networks signifies the equilibrium achieved in updating model parameters throughout the training phase, circumventing instability induced by disproportionately large step sizes, as well as averting diminished learning efficacy resulting from excessively small step sizes. Achieving this equilibrium is crucial for enhancing the training efficiency and generalization capability of models. In a state of EOS, models are able to find an optimal balance between rapid learning and avoiding excessive adjustments, thereby contributing to improved training outcomes. Maintaining this stability presents a key challenge in deep learning, involving meticulous adjustments of training parameters such as learning rates and weight initialization.
In 2022, Chen and Bruna [37] delved into the convergence properties of gradient descent beyond the traditional boundaries of stability. This study challenged prevailing perspectives on the behavior of gradient descent, introducing a novel viewpoint that neural networks can achieve effective convergence and learning even beyond conventional stability limits. In the same year, Zhu and colleagues elucidated the dynamics of EOS during the training process through a simplified example [38]. This research clarified the complexities of training dynamics, offering a clearer understanding of neural network behavior at the brink of stability and the factors that could either advance or retreat from this threshold. Additionally, Wang et al. examined the variations in sharpness along the gradient descent trajectory, especially noting the phenomenon of “progressive sharpening” observed near the EOS [39]. Their work established a mathematical framework to comprehend how sharpness evolves during training and its implications for model performance and stability. In 2025, Qiu et al. [40] also briefly discussed the EOS phenomenon in their convex optimization work. This investigation delved into the foundational mechanisms underlying neural network stability, underscoring the nuanced equilibrium between model complexity and generalization capacity. This research offered insights into leveraging EOS to enhance the robustness of deep learning models.
2.4. R-Tree
The R-tree is a spatial indexing structure designed for the efficient storage and querying of spatial data within databases. It mirrors binary trees with a balanced structure, comprising nodes that each depict a rectangle containing one or more objects. Unlike binary trees, R-trees are adept at addressing search problems in higher dimensions [41,42,43]. They optimize the search operations for spatial data by dividing space into smaller regions and organizing objects based on spatial proximity. This method enables efficient retrieval of objects overlapping or proximate to a designated query area.
To construct an R-tree from coordinates and offsets, it is essential to understand the concepts of Minimum Boundary Rectangles (MBRs) and Minimum Boundary Boxes (MBBs) [44]. An MBB encompasses a set of identical objects or points within a rectangle, defined by the maximum and minimum coordinates of its internal objects or points across each dimension, which can extend beyond two in higher-dimensional spaces. An MBB can serve as the internal object of another MBB as long as the larger MBB fully encompasses the smaller one, making it a common construct for delineating enclosed spaces in 3D environments. Within 2D spaces, the MBR serves a geometric function akin to the MBB, finding application across computer science and spatial data analysis domains. It represents a rectangular shape that encloses a set of points or objects within 2D space, defined by the extremal coordinates of its internal elements along the x and y axes. In specific contexts, the transformation between MBBs and MBRs is essential [45,46], underscoring their critical contribution to spatial indexing and analysis.
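The MBR computations just described reduce to taking per-axis extrema, both for raw points and for merging child rectangles into a parent. A minimal 2D sketch (the function names `mbr` and `merge_mbrs` are our own):

```python
def mbr(points):
    """Minimum bounding rectangle (x_min, y_min, x_max, y_max) of 2-D points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))

def merge_mbrs(rects):
    """MBR enclosing a set of child MBRs, as used when building parent nodes."""
    return (min(r[0] for r in rects), min(r[1] for r in rects),
            max(r[2] for r in rects), max(r[3] for r in rects))

assert mbr([(1, 2), (4, 0), (3, 5)]) == (1, 0, 4, 5)
assert merge_mbrs([(0, 0, 2, 2), (1, 1, 5, 3)]) == (0, 0, 5, 3)
```

The same min/max pattern extends to three or more dimensions, which is exactly the relationship between 2D MBRs and higher-dimensional MBBs.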
In this discussion, the term “distance” refers to the Euclidean distance in space. The organization of fundamental Minimum Boundary Rectangles (MBRs) ought to follow established principles, encompassing all coordinates [47]. The construction of an R-tree begins with these fundamental MBRs, which cover individual objects or coordinates. These are then grouped into nodes based on a specified maximum capacity. Throughout this grouping phase, new parent nodes are established to house the emerging child nodes. Each parent node is assigned a unique ID and an MBR that collectively encompasses the MBRs of its offspring. This grouping and parent node creation continues until all MBRs are allocated to a node, culminating in the root of the R-tree. The leaf nodes, which represent the zeroth level of the R-tree, contain only the basic MBRs. To ensure a more balanced structure, adjustments are made to the last node of each level if its entry count falls below the minimum capacity. Building or adjusting R-tree nodes from bottom to top necessitates indexing for efficient spatial queries. The MBR calculations shown in Algorithm 1 can be finalized after the R-tree construction to minimize the need for recalculations during node adjustments or balancing, thereby reducing the overall computational load.
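The bottom-up grouping described above can be sketched as follows. This simplified bulk-loading sketch fills nodes to a fixed capacity and omits the minimum-capacity rebalancing of the last node in each level; the names `enclosing` and `bulk_load` are our own:

```python
def enclosing(rects):
    """MBR enclosing a list of (x_min, y_min, x_max, y_max) rectangles."""
    return (min(r[0] for r in rects), min(r[1] for r in rects),
            max(r[2] for r in rects), max(r[3] for r in rects))

def bulk_load(mbrs, max_children=20):
    """Group MBRs bottom-up into parent nodes until a single root remains."""
    level = [{"mbr": r, "children": []} for r in mbrs]
    while len(level) > 1:
        level = [
            {"mbr": enclosing([n["mbr"] for n in level[i:i + max_children]]),
             "children": level[i:i + max_children]}
            for i in range(0, len(level), max_children)
        ]
    return level[0]

# 45 unit squares along the diagonal, grouped 4 at a time: 45 -> 12 -> 3 -> 1 (root)
root = bulk_load([(i, i, i + 1, i + 1) for i in range(45)], max_children=4)
assert root["mbr"] == (0, 0, 45, 45)
```

Each pass over `level` creates one new tree level, so the construction cost is linear in the number of MBRs, and every parent MBR is computed exactly once.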
Algorithm 1 details the MBR calculation process, requiring inputs such as the root of the R-tree and all coordinates. The R-tree can be dumped to memory or indexed by ID for later retrieval and reconstruction using the same data structure. Moreover,
Figure 1 illustrates the R-tree architecture, detailing the interconnections between leaf nodes, non-leaf nodes, foundational MBRs, and points (coordinates) to clarify the constructed R-tree.
| Algorithm 1 The MBR computing process [17,18] |

 1: function computeFundamentalMBR(coordinates)
 2:     x_min ← min(p.x for p in coordinates)
 3:     x_max ← max(p.x for p in coordinates)
 4:     y_min ← min(p.y for p in coordinates)
 5:     y_max ← max(p.y for p in coordinates)
 6:     return (x_min, x_max, y_min, y_max)
 7: end function
 8: function computeNodeMBR(nodes)
 9:     x_min ← min(n.MBR.x_min for n in nodes)
10:     x_max ← max(n.MBR.x_max for n in nodes)
11:     y_min ← min(n.MBR.y_min for n in nodes)
12:     y_max ← max(n.MBR.y_max for n in nodes)
13:     return (x_min, x_max, y_min, y_max)
14: end function
15: function computeRTreeMBR(node)
16:     if node is a fundamental MBR then
17:         node.MBR ← computeFundamentalMBR(node.coordinates)
18:     else
19:         for entry in node.entries do
20:             if entry.MBR is not available then
21:                 computeRTreeMBR(entry)
22:             end if
23:         end for
24:         node.MBR ← computeNodeMBR(node.entries)
25:     end if
26: end function
27: computeRTreeMBR(root)

Note: x_min, x_max, y_min, and y_max represent the min value of the x-axis, the max value of the x-axis, the min value of the y-axis, and the max value of the y-axis in the designated MBR, respectively.
2.5. Neural Network
Neural network layers, neural network width, activation functions, and loss functions constitute critical elements within neural networks [48]. The selection and optimization of these components are crucial for the network’s performance and efficacy. During the design and training phases of a neural network, it is essential to make judicious choices based on the specific requirements of the task and data characteristics, adjusting various parameters to achieve optimal learning and predictive outcomes.
2.5.1. Neural Network Layer
Neural network architecture comprises several layers, including input, hidden, and output layers. The input layer serves as the initial receptor of raw data, hidden layers undertake the tasks of feature extraction and learning representations, and the output layer finalizes and presents the prediction. The number of hidden layers and the number of neurons within each can be tailored to the complexity of the task and dataset, facilitating adaptation to diverse learning requirements.
2.5.2. Neural Network Width
Neural network width refers to the number of neurons within each layer. Expanding the network’s width enhances the model’s capacity to fit complex patterns inherent in the data. However, an overly broad network might cause overfitting, marked by high training set performance but inadequate generalization to new data. While a broader network increases representational and learning capabilities, it also escalates computational and storage demands. Therefore, it is crucial to balance the model’s performance against computational resource constraints to strike an optimal balance when deciding on network width. In the case of simpler datasets, adopting a narrower width not only conserves resources but also lowers the likelihood of overfitting. Facing complex data, appropriately broadening the network’s width, and integrating methods like regularization can circumvent overfitting.
2.5.3. Activation Function
Normalization layers, such as batch normalization or layer normalization, play a pivotal role in stabilizing and accelerating the training process. Theoretical analyses of these layers have shown their capability to lower the network’s Lipschitz constant, enhancing convergence performance [49]. By mitigating the effects of gradient vanishing or explosion, normalization layers enhance generalization and enable the use of higher learning rates. Activation functions enable the intricacies of data patterns to be learned, which is essential to artificial neural networks. Drawing parallels to the neuron-based models of the human brain, activation functions determine the signal transmitted to the subsequent neuron. In artificial neural networks, the activation function at a node defines its output for given inputs [50]. Standard computer chip circuits can be viewed as digital activation functions that produce on (1) or off (0) outputs based on the input. Thus, activation functions embody mathematical equations that dictate the output of neural networks, introducing non-linear transformations that empower the networks to learn and model more intricate patterns and relationships. The choice of activation function typically does not directly affect the number of layers or the width of the neural network. Prominent activation functions include the Sigmoid, Rectified Linear Unit (ReLU), and Tanh functions [50,51,52], with their suitability varying according to task specifics and network requirements. For instance, the Sigmoid function is suited for binary classification, whereas the ReLU function is favored for its ability to accelerate learning and foster sparse representations.
Table 1 presents common activation functions and their formulas.
In neural network architectures, activation functions such as ReLU and tanh play a pivotal role in modulating the activation state of individual neurons: ReLU nullifies negative inputs, while tanh maps inputs to the [−1, 1] interval. These functions are instrumental in enhancing the network’s capacity to capture nonlinear relationships, thereby augmenting its performance and expressiveness. However, the efficacy of normalization layers is contingent upon the specific architecture and dataset in use. The choice of activation function can indirectly impact the network’s training dynamics and convergence rate due to the distinct derivative properties inherent to each function. For instance, since ReLU has a derivative of 1 for positive inputs and 0 for negative inputs, neurons that settle into the negative regime receive no gradient and may stop updating, the so-called “dying ReLU” problem. This situation underscores the critical need for careful selection of activation functions, which should be tailored to complement the network’s architectural dimensions and depth. Such a targeted approach is essential for optimizing both performance and the efficiency of convergence.
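For reference, the three activation functions discussed above can be implemented directly from their formulas (a plain-Python sketch; deep learning frameworks provide vectorized equivalents):

```python
import math

def sigmoid(x):
    """Maps any real input to (0, 1); suited to binary classification outputs."""
    return 1.0 / (1.0 + math.exp(-x))

def relu(x):
    """Zero for negative inputs, identity for positive inputs."""
    return max(0.0, x)

def tanh(x):
    """Maps any real input to (-1, 1)."""
    return math.tanh(x)

assert relu(-2.0) == 0.0 and relu(3.0) == 3.0
assert abs(sigmoid(0.0) - 0.5) < 1e-12          # sigmoid is centered at 0.5
assert -1.0 < tanh(-5.0) < 0.0 < tanh(5.0) < 1.0
```

The assertions illustrate the derivative behavior discussed above: ReLU is exactly zero (flat) on the negative half-line, while sigmoid and tanh saturate smoothly at their bounds.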
2.5.4. Loss Function
Loss functions quantify the discrepancy between predicted outcomes and true labels, serving as the objective function for neural network optimization. By minimizing the loss function, neural networks enhance the accuracy of their predictions. Predominantly utilized during the training phase, the loss function is employed after each batch of training data is processed by the model. This procedure initiates with the generation of predictions through forward propagation, followed by an assessment of the deviation between predicted outputs and actual values, culminating in the calculation of loss. Upon determining the loss, the model engages in backpropagation to adjust its parameters, aiming to reduce the discrepancy between predicted and actual values and thereby enhance the model’s learning efficacy. Common loss functions include Mean Squared Error (MSE) and Cross-Entropy Loss (CE), among others [53,54].
Table 2 presents common loss functions and their respective expressions. The selection of an appropriate loss function is contingent upon the nature of the task and the characteristics of the data. For instance, MSE is typically applied to regression problems while Cross-Entropy Loss is favored for classification challenges.
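The two loss functions named above can be computed as follows (a plain-Python sketch; the `eps` clamp guarding `log(0)` is our own numerical safeguard):

```python
import math

def mse(y_true, y_pred):
    """Mean Squared Error, typically used for regression."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def cross_entropy(y_true, y_prob, eps=1e-12):
    """Cross-entropy for a one-hot target and predicted class probabilities."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(y_true, y_prob))

assert mse([1.0, 2.0], [1.0, 4.0]) == 2.0                      # ((0)^2 + (2)^2) / 2
assert abs(cross_entropy([0, 1, 0], [0.1, 0.8, 0.1])
           - (-math.log(0.8))) < 1e-9
```

With a one-hot target, cross-entropy reduces to the negative log probability assigned to the true class, which is why confident correct predictions drive the loss toward zero.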
2.6. EOS
In the realm of classical optimization theory, the learning rate η must be small relative to the reciprocal of the smoothness parameter (the sharpness L) to ensure convergence, typically adhering to the condition η < 2/L [55]. Nonetheless, larger learning rates are frequently adopted in practice, with the smoothness parameter typically not taken into account during the initial phases of training. Recent observations reveal that employing a fixed learning rate η for gradient descent in neural network training results in the maximal eigenvalue of the training loss Hessian oscillating above 2/η. Concurrently, training loss exhibits non-monotonic behavior on short timescales, yet demonstrates a consistent decrease over longer periods. This phenomenon is termed the Edge of Stability (EOS) dynamic [56]. The sharpness metric increases with gradient descent and ultimately stabilizes just above 2/η.
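The classical η < 2/L threshold can be verified on the simplest possible example: a one-dimensional quadratic with sharpness L, where the gradient descent update contracts if and only if |1 − ηL| < 1. (Neural networks at the EOS behave differently because the loss is non-quadratic and the sharpness itself adapts; this sketch only illustrates the classical bound.)

```python
def gd_quadratic(L, eta, x0=1.0, steps=100):
    """Gradient descent on f(x) = (L/2) * x**2, whose gradient is L * x."""
    x = x0
    for _ in range(steps):
        x = x - eta * L * x      # equivalently x <- (1 - eta * L) * x
    return abs(x)

L = 4.0                          # sharpness: the (only) Hessian eigenvalue
assert gd_quadratic(L, eta=0.4) < 1e-6    # eta < 2/L = 0.5: |1 - eta*L| = 0.6, converges
assert gd_quadratic(L, eta=0.6) > 1e6     # eta > 2/L: |1 - eta*L| = 1.4, diverges
```

At exactly η = 2/L the iterates oscillate between ±x0 without shrinking, which is the boundary that the EOS literature finds neural network sharpness hovering just above.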
4. Methodology
In this section, we introduce a phishing software detection method based on R-tree structures and the Edge of Stability (EOS) phenomenon. Initially, we elaborate on the principles and procedures of image segmentation utilizing R-trees. Subsequently, we detail the process of extracting steganographic features and classifying them with deep learning algorithms.
4.1. Image Segmentation Based on the R-Tree
The R-tree is a multidimensional indexing structure that facilitates the segmentation of a target into multiple layers and segments. In this work, it is employed to segment images and construct a hierarchical index, thereby enabling deep learning models to achieve rapid retrieval and access. The following sections will delve into the principles and methodologies involved in using R-tree indexing for image segmentation and index construction. Firstly, image preprocessing, including scaling and normalization, is essential to ensure the input image data maintains uniform dimensions and format. This step helps mitigate the impact of varying image sizes on the effectiveness of the R-tree indexing process. Subsequently, to form the foundational Minimum Bounding Rectangles (MBRs), preprocessed images are segmented into multiple smaller blocks using Z-order. These foundational MBRs combine to form upper-layer MBRs, which are designated as the leaf nodes of the R-tree. Each foundational MBR comprises multiple pixels.
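The Z-order segmentation mentioned above orders image blocks by interleaving the bits of their (x, y) block coordinates into a single Morton key, so that blocks adjacent in the image tend to be adjacent in the resulting sequence. A minimal sketch (the function name `morton` is our own):

```python
def morton(x, y, bits=16):
    """Interleave the bits of (x, y) into a single Z-order (Morton) key."""
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)        # x bits go to even positions
        key |= ((y >> i) & 1) << (2 * i + 1)    # y bits go to odd positions
    return key

# Sorting a 4x4 grid of block coordinates by Morton key traverses it in Z-shaped quads.
blocks = [(x, y) for y in range(4) for x in range(4)]
blocks.sort(key=lambda b: morton(*b))
assert blocks[:4] == [(0, 0), (1, 0), (0, 1), (1, 1)]
assert morton(3, 5) == 39   # x = 011b, y = 101b interleave to 100111b
```

Grouping consecutive runs of this sequence into foundational MBRs keeps spatially close blocks inside the same rectangle, which is what makes the subsequent R-tree nodes compact.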
The construction of the R-tree index follows a bottom-up approach, bundling lower-level Minimum Bounding Rectangles (MBRs) into upper-level ones and ultimately forming a root node. In our study, the maximum branching factor is set to 20 and the minimum to 8; the root node alone may hold fewer than 8 children. This entails allocating leaf nodes to appropriate non-leaf nodes across successive layers until the root node is established. The selection of a segmentation strategy is tailored to the image’s characteristics and needs, employing methods such as the greedy algorithm or minimum area enlargement. To expedite the construction of the R-tree for faster pinpointing of pixels embedded with malicious information within deep learning models, our approach directly bundles 20 lower-level MBRs into an upper-level one, ensuring each non-leaf node encapsulates similar blocks to boost search efficiency. When the last bottom-level MBR contains fewer than 8 foundational MBRs, foundational MBRs are moved from the preceding bottom-level MBR so that the final one contains exactly 8. This ensures the structured efficiency of the indexing mechanism.
During the construction of the R-tree, we introduced several optimization strategies, employing an adaptive splitting technique that selects the optimal division point based on the specific characteristics and distribution of each image. For certain special images, we accelerated the R-tree construction process using approximate query techniques. Employing Locality Sensitive Hashing (LSH) when a local pixel’s RGB values show minimal variation aids in this process. Given that the R-tree index highlights pixels adjacent to those embedded with malicious content, it facilitates convolutional kernels in sidestepping pixel blocks devoid of such data, thereby streamlining the search and analysis endeavors. Following construction, each non-leaf node within the R-tree holds information on several blocks, enabling image retrieval and access by traversing the R-tree. For a specified query image, the R-tree’s indexing structure enables rapid pinpointing of relevant leaf nodes for further processing and analysis.
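One simple way to realize the near-duplicate matching described above is to quantize RGB values so that nearly identical colors collide in the same bucket. This is a coarse stand-in for a full LSH scheme, shown only to illustrate the idea; the function name `lsh_bucket` and the cell size are our own choices:

```python
def lsh_bucket(rgb, cell=16):
    """Hash an RGB triple so that nearly identical colors land in the same bucket."""
    return tuple(c // cell for c in rgb)   # quantize each channel to a 16-wide cell

# Near-duplicates collide, clearly different colors do not.
assert lsh_bucket((100, 150, 200)) == lsh_bucket((103, 149, 205))
assert lsh_bucket((100, 150, 200)) != lsh_bucket((10, 15, 20))
```

Blocks whose pixels all fall into one bucket can share a single representative during R-tree construction, which is the source of the speedup for low-variation regions. Note that, like real LSH, this bucketing has boundary effects: two colors that straddle a cell edge may hash apart despite being close.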
4.2. Accelerated Learning Based on EOS
EOS, which has emerged as a notable topic in the deep learning domain in recent years, offers new insights into model training and recognition processes. It reveals that these processes can be significantly accelerated without sacrificing accuracy, provided the learning rate is sufficiently high. This phenomenon occurs beyond the traditionally theorized bounds of deep learning. The acceleration comes at the cost of training stability: model loss fluctuates while decreasing across iterations rather than decreasing monotonically, and accuracy similarly fluctuates while increasing. However, excessive learning rates can lead to training failure. Consequently, we can harness EOS to accelerate the training and recognition of deep learning models, but it is imperative to regulate the learning rate to ensure that training proceeds successfully. The following details the principles and steps for utilizing EOS to speed up training and recognition in deep learning models.
We employed two open-source datasets to generate our image datasets, injecting them with various categories of malicious information for training the deep learning model in classification, referring to [64] for checking convergence. We opted for a Convolutional Neural Network (CNN) for training. Before training commenced, images were preprocessed with the R-tree-based segmentation technique designed to accelerate the convolutional kernel's navigation across image blocks; comprehensive details are provided in Section 4.1.
Given the uncertainty regarding which CNN architectures and parameters would best adapt to the two datasets injected with malicious information, we independently explored model architectures and parameters. This exploration encompassed the design and connectivity of network layers, including variations in the number of neural network layers, network widths, normalization layers (activation functions), and loss functions. All of these variations were directed toward resolving the scientific issue presented in this research and were followed by an assessment of their efficacy.
After completing the training, the trained model was utilized to extract steganography features and perform classification. For each image sample, the image was input into the model via the forward propagation algorithm to yield the model’s output. This output takes the form of a probability vector, showing the distribution across various categories. Based on this distribution, the category corresponding to the highest probability was selected as the classification result for the image. Additionally, the confidence level was returned when calling third-party application programming interfaces (APIs).
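The decision step described above (forward pass output, probability vector, highest-probability category with a confidence value) can be sketched as follows, assuming numpy; `classify` and the logit values are hypothetical, and the CNN producing the logits is assumed elsewhere.

```python
import numpy as np

def classify(logits):
    """Convert raw network outputs (logits) into a probability vector with a
    softmax, then return (predicted_category, confidence)."""
    z = np.asarray(logits, dtype=np.float64)
    z = z - z.max()                         # stabilise the exponentials
    probs = np.exp(z) / np.exp(z).sum()
    k = int(probs.argmax())                 # highest-probability category
    return k, float(probs[k])

category, confidence = classify([0.1, 2.3, -1.0, 0.4])  # hypothetical logits
print(category, round(confidence, 2))  # 1 0.77
```

The returned confidence is what a wrapper would expose when the detector is called through a third-party API.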
To expedite model training and the recognition of malicious information, we experimented with higher learning rates for quicker training and identification while ensuring model robustness so that training did not explode. Feature extraction was conducted at one or several layers, with the outputs of convolutional layers serving as feature vectors. During the comparison, fully connected layers appended after the framework's final layer improved the utilization of comprehensive feature information, and pooling layers inserted between convolutional layers summarized and abstracted the feature information.
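As a minimal illustration of how an inserted pooling layer summarizes feature information, here is a 2 x 2 average-pooling sketch in numpy; it is a toy stand-in for the framework's pooling layers, with the feature map values invented for the example.

```python
import numpy as np

def avg_pool2d(x, k=2):
    """k x k average pooling over a (H, W) feature map: each output cell
    summarises one k x k patch (H and W assumed divisible by k)."""
    h, w = x.shape
    return x.reshape(h // k, k, w // k, k).mean(axis=(1, 3))

fmap = np.arange(16, dtype=np.float64).reshape(4, 4)  # toy feature map
pooled = avg_pool2d(fmap)
print(pooled.shape)  # (2, 2): each cell abstracts a 2x2 patch
```

Each output cell replaces four inputs with their mean, which is the summarizing-and-abstracting role the text assigns to the pooling layers.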
Finally, we determine the presence of malicious content in images by analyzing classification outcomes and identifying the steganography type with the highest probability from the results to extract the concealed information. If an image is identified as containing malicious information, appropriate interventions and protective actions can be implemented.
5. Experiments and Results
This section begins by detailing our experimental setup and datasets before delving into the exploration and discussion of the efficacy of steganographic methods. Finally, experiments and evaluations were conducted on Convolutional Neural Network (CNN) models that integrate R-tree and steganography features.
5.1. Experimental Environment
The experiment was conducted on a system featuring Windows 10 Pro 22H2 x64 (manufactured by ASUS in China), an 11th Gen Intel(R) Core(TM) i7-11800H CPU (manufactured by Intel Corporation in Santa Clara, CA, USA) at 2.30 GHz with 8 cores, an NVIDIA GeForce RTX 3060 laptop GPU (manufactured by NVIDIA Corporation in Santa Clara, CA, USA), 24 GB RAM, a 512 GB SSD, and a 1024 GB HDD. The operating system was installed on the SSD while the code and datasets resided on the HDD. It is important to note that runtimes for the program vary with each execution due to the state of the machine and the resources available, which differ across machines. However, the trends and ratios of runtime should align with those presented in this study.
5.2. Datasets
For this paper, the CIFAR-10 dataset (“cifar10-5k” for details) was selected as the primary experimental dataset for training due to its classic and stable performance in model training among similar datasets. The Edge of Stability (EOS) phenomenon is reflected in the poisoned version of this dataset. This version of the CIFAR-10 dataset includes 1000 color images at a resolution of 32 × 32. It spans one original category and 9 poisoned categories, where each poisoned category contains 100 images corresponding to one of the nine steganographic methods proposed above. Malicious data, represented as binary strings, were embedded into the images using several information hiding techniques. Specifically, we employed additional embedding strategies, including row–column and block-based embedding methods that use random selection and minimal entropy techniques. Since the carrier image is square, square pixel blocks were used as the block structure, and each block had a portion of the binary string embedded based on the entropy minimization method, ensuring minimally perceptible changes in the image.
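The entropy-minimization embedding described above can be sketched as follows. This simplified version, operating on a greyscale image, choosing a single lowest-entropy block, and overwriting least significant bits (LSBs), is an assumption-laden illustration rather than the exact embedding pipeline used to build the dataset.

```python
import numpy as np

def block_entropy(block):
    """Shannon entropy of a pixel block's grey values."""
    _, counts = np.unique(block, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

def embed_min_entropy(image, bits, block=4):
    """Embed a bit string into the LSBs of the lowest-entropy block of a
    greyscale image, following the entropy-minimisation strategy."""
    h, w = image.shape
    tiles = [(block_entropy(image[y:y + block, x:x + block]), y, x)
             for y in range(0, h - block + 1, block)
             for x in range(0, w - block + 1, block)]
    _, y, x = min(tiles)                       # least-informative block
    out = image.copy()
    flat = out[y:y + block, x:x + block].ravel()
    for i, b in enumerate(bits[:flat.size]):
        flat[i] = (flat[i] & 0xFE) | int(b)    # overwrite the LSB
    out[y:y + block, x:x + block] = flat.reshape(block, block)
    return out, (y, x)

img = np.random.default_rng(1).integers(0, 256, (8, 8), dtype=np.uint8)
img[0:4, 0:4] = 200                            # uniform block: entropy 0
stego, pos = embed_min_entropy(img, "1011")
print(pos)  # (0, 0) -- the zero-entropy block is chosen
```

Modifying the block that carries the least information is precisely why such changes remain minimally perceptible.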
While CIFAR-10 is a classic dataset, the Stego260 dataset [65] is also selected for the experiments. Different from CIFAR-10, the Stego260 dataset, consisting of color images specifically designed for steganalysis research, is used to compute the testing accuracy after poisoning. These images have been manipulated to include hidden information using various steganographic techniques, making them ideal for testing the accuracy of our detection methods. The poisoned version of the Stego260 dataset contains 10 categories, each with 8000 color images. We divide the dataset into training and testing sets at a ratio of 8:2. By checking the performance of the machine learning model on the Stego260 dataset, we can evaluate its ability to accurately detect malicious data hidden within images across different datasets and scenarios. This comparative analysis provides valuable insight into the robustness and generalizability of our detection system.
5.3. Steganography Results and Evaluation
The steganography outcomes shown in Figure 3 (10 bytes), Figure 4 (50 bytes), and Figure 5 (90 bytes) demonstrate the effectiveness of hiding shellcodes of different lengths; a typical shellcode averages about 20 bytes. Figure 3 demonstrates that the nine mainstream steganography methods evaluated in this study successfully embed malicious information without discernible differences observable to the naked eye on mobile devices. Although the malicious payloads are minimal, 20-byte shellcodes are sufficient for attackers to execute remote commands. A minority of users might notice slight abnormalities in the images of Figure 4 and Figure 5, specifically in (a), (e), (f), (g), and (h), upon close inspection. However, differences are not discernible in the images produced using the remaining four methods.
Here, we explore nine mainstream methods of concealing malicious information on mobile platforms. The evaluation of concealment hinges on two primary criteria: obfuscation and the ease of extraction. Storing malicious information consecutively is easily detected by users, because large amounts of data stored in consecutive pixels can produce noticeable horizontal or vertical irregular lines within the image; methods with a random storage approach avoid this. Extractability is reflected in the difficulty of locating and reassembling the hidden malicious information. Methods (a), (d), (e), (f), and (g), which store malicious information continuously at specific locations, facilitate the easiest extraction. For methods (h) and (i), attackers must acquire the specific column numbers of the image to successfully retrieve the hidden content, making extraction relatively more challenging. Methods (b) and (c) require the construction of ROP chains for extraction and execution. Consequently, method (i) emerges as the most suitable for embedding malicious information within images. Methods (d), (e), (f), and (g) are suitable when extractability is the priority. Conversely, if stealth is of utmost importance, method (b) is a viable option. Our experiments validate these conclusions.
5.4. Results and Evaluation of the EOS
This subsection will present the experimental results from different aspects while keeping other parameters unchanged. Subsequently, the EOS phenomenon of the training will be analyzed.
5.4.1. Neural Network Layers
Within each architectural framework, models were built by combining different numbers of layers to examine the influence of neural network depth on the training process, with ReLU used as the activation function. The experimental findings in Figure 6 show the training outcomes using fully connected layers (abbreviated as FC), training without supplementary layers (referred to as CNN), and training incorporating pooling layers, with a focus on average pooling (notated as AvgPool), offering a comparative analysis of layer-specific effects on model performance.
As depicted in Figure 6, all training loss and accuracy curves exhibit slight, though not significant, fluctuations. Generally, the impact of neural network depth on model training appears negligible. No marked difference was observed in the training outcomes across various layer counts at the same learning rate. Furthermore, for identical layer depths, the variance in both the training loss and accuracy curves remained consistently minimal.
5.4.2. Neural Network Width
In this study, the neural network's architecture was varied to test different network widths. The tanh activation function in fully connected layers was selected to explore the relationship between model training and neural network width. When adjusting the width of the neural network, all other parameters were held constant, with network widths set to 200, 400, 800, and 1600. The outcomes of these experiments are illustrated in Figure 7, demonstrating how varying network widths influence model performance and learning dynamics.
Similar to variations in the number of neural network layers, different network widths do not significantly impact model training. As illustrated in Figure 7, no notable differences were observed across varying widths at the same learning rate. For a given network width, as the learning rate increases, both the training loss and accuracy curves exhibit earlier and more pronounced fluctuations.
It was observed that when the neural network width was set to 200 with a learning rate of 0.04, model training experienced an explosive failure shortly after initiation. This phenomenon can be attributed to the potential for gradient explosion when overly high learning rates are applied to neural networks with smaller widths. This issue particularly affects networks characterized by a smaller number of neurons and more constricted pathways for information transmission. During the transmission process, gradients are prone to accumulating, leading to excessively large gradient values. Such accumulation can severely impede the updating process of neural network parameters. Consequently, when both the width and depth of a neural network are limited, it is advisable to opt for a smaller initial learning rate.
5.4.3. Standardized Layer (Activation Function)
In this analysis, Rectified Linear Unit (ReLU), Exponential Linear Unit (ELU), Hyperbolic Tangent (tanh), Hard Hyperbolic Tangent (hardtanh), and Softplus are evaluated as activation functions for training within a fully connected neural network framework, which has demonstrated optimal performance on our test dataset. Drawing on the findings of Cohen et al. [56], Mean Squared Error (MSE) is adopted over Cross-Entropy (CE) as the loss function for model training, based on observations that training with MSE yields superior results. This experimental setup facilitates an investigation into the impact of normalization layers on training effectiveness. The outcomes of these experiments are documented in Figure 8.
As depicted in Figure 8, the curves exhibit significant fluctuations for all activation functions at a learning rate of 0.01, excluding ReLU. This phenomenon can be attributed to several factors. The input range of the ReLU activation function within normalization layers is broader than for other functions, spanning all real numbers. Because its gradient is 1 over the positive interval, ReLU avoids the issue of vanishing gradients and introduces sparse activation. In normalization layers, ReLU sets a portion of input values to zero, fostering a sparser representation within the network; this sparsity contributes to the enhanced generalization capability and robustness of neural networks. Moreover, the ReLU activation function is computationally efficient, since it is a simple threshold function: for each input value, it performs only a single comparison. The per-element computational complexity of the ReLU function is therefore O(1), independent of the size of the input. Consequently, ReLU's distinct behavior suggests that lower complexity in normalization layers correlates with more stable training. Thus, when initializing neural network models with high learning rates, it is advisable to employ activation functions that offer strong robustness.
As the learning rate escalates, training accuracy curves demonstrate significantly increased fluctuations across all activation functions except Softplus. This trend holds true even for models utilizing the ReLU activation function. The number of iterations before the onset of this jitter decreases. On one hand, excessively high learning rates can cause parameters to overshoot the optimum solution point in the gradient direction, thereby increasing the value of the loss function. On the other hand, an excessive increase in the magnitude of parameter adjustments may precipitate oscillatory movements along the gradient trajectory, engendering fluctuations of the loss function in proximity to the optimal solution. For Softplus, given its continuous differentiability and absence of discontinuities, it can also demonstrate oscillatory tendencies at lower learning rates. This issue arises because the slope of the Softplus function becomes very small near zero. When the learning rate exceeds the optimal value, the network’s convergence speed might increase, potentially leading to excessively large weight updates. In both scenarios, network parameters may oscillate.
It has been conclusively demonstrated that normalization layers significantly influence model training. The phenomenon of EOS manifests regardless of the activation function employed when initializing with a larger learning rate. It is advisable to opt for activation functions with lower computational complexity and enhanced robustness. Furthermore, adhering to the conventional stability rule for the initial learning rate, η < 2/λmax (where λmax is the largest eigenvalue of the training loss Hessian), is recommended to ensure training stability and interpretability.
5.4.4. Loss Function
In this experiment, ReLU was selected as the activation function due to its effectiveness. Fully connected layers were employed to capitalize on their potential for superior performance. Comparative experiments were conducted using Cross-Entropy (CE) and Mean Squared Error (MSE) as the loss functions to evaluate their impact on model outcomes.
As illustrated in Figure 9, the use of different loss functions does not affect the trend of the training loss and accuracy curves; however, training concludes at different iterations depending on the loss function. Specifically, training tends to finish earlier when employing the Cross-Entropy (CE) loss function. It is therefore advisable to select an appropriate loss function when training neural network models, as an unsuitable choice may lead to slow training or even divergence.
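The two loss functions compared in this experiment can be written down directly. The sketch below, using numpy and a hypothetical 3-class sample, also illustrates why CE penalizes the same imperfect prediction more sharply than MSE.

```python
import numpy as np

def mse_loss(probs, onehot):
    """Mean Squared Error between predicted probabilities and the one-hot label."""
    return float(((probs - onehot) ** 2).mean())

def ce_loss(probs, onehot):
    """Cross-Entropy: negative log-probability assigned to the true class."""
    return float(-(onehot * np.log(probs)).sum())

# Hypothetical 3-class prediction whose true class is 0.
probs  = np.array([0.7, 0.2, 0.1])
onehot = np.array([1.0, 0.0, 0.0])
print(round(ce_loss(probs, onehot), 3), round(mse_loss(probs, onehot), 3))  # 0.357 0.047
```

For the same prediction, CE is roughly an order of magnitude larger than MSE here, so its gradients push harder early in training, which is consistent with the earlier completion observed for CE.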
5.5. Results and Evaluation of the Peak Signal-to-Noise Ratio (PSNR)
PSNR is an important indicator for measuring image compression quality, especially in the fields of image compression and image reconstruction. The calculation of PSNR relies on the Mean Squared Error (MSE) between the original image and the processed (e.g., compressed or denoised) image. PSNR is computed as shown in Equation (1):

$$\mathrm{PSNR} = 10 \cdot \log_{10}\!\left(\frac{\mathit{MAX}_I^{\,2}}{\mathrm{MSE}}\right) \tag{1}$$

where $\mathit{MAX}_I$ is the maximum possible pixel value of the image (usually 255 for 8-bit images), and MSE is the Mean Squared Error between the original image and the compressed image, given by Equation (2):

$$\mathrm{MSE} = \frac{1}{mn}\sum_{x=0}^{m-1}\sum_{y=0}^{n-1}\bigl[I(x,y) - K(x,y)\bigr]^2 \tag{2}$$

where $I(x,y)$ and $K(x,y)$ are the pixel values of the original image and the compressed image at position $(x, y)$, respectively, and $m$ and $n$ are the dimensions (width and height) of the image.
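Equations (1) and (2) translate directly into code; the following is a minimal numpy sketch, assuming 8-bit images, with the example arrays invented for illustration.

```python
import numpy as np

def psnr(original, processed, max_i=255.0):
    """PSNR in dB per Equations (1) and (2): 10 * log10(MAX_I^2 / MSE)."""
    diff = original.astype(np.float64) - processed.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")         # identical images
    return 10.0 * np.log10(max_i ** 2 / mse)

a = np.full((4, 4), 100, dtype=np.uint8)
b = a.copy()
b[0, 0] += 1                        # a single LSB-style change
print(round(psnr(a, b), 2))  # 60.17
```

A one-level change to a single pixel of a 4 x 4 image yields a PSNR above 60 dB, which is why LSB-style embeddings remain visually imperceptible.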
The average PSNR results for each image hiding method are shown in Table 3. According to the PSNR data for the nine steganographic methods, the noise levels introduced when hiding information differ between methods. The data show that the fifth steganographic method has the highest PSNR, while the eighth and seventh methods have the lowest values (see Table 3). This difference reflects the fact that different steganographic methods affect the image to different degrees during information hiding; some methods introduce more noise and thereby have a greater impact on image quality. In addition, it can be inferred that image information hiding based on minimum information entropy deceives the human eye more effectively, since the pixels carrying the least information are the ones modified. Therefore, when choosing a suitable steganographic method, it is necessary to weigh the PSNR against other performance indicators to ensure that image quality is maintained to the greatest extent while hiding the information.
5.6. Results and Evaluation of the Testing Results
While Section 5.4 and Section 5.5 focus on the performance of the training procedures, here we discuss the performance of the testing procedures. After gathering the experimental results, the confusion matrix is summarized in Figure 10. The values are concentrated mainly on the diagonal, and the misclassifications that do occur fall into the categories of other steganographic methods that embed at similar positions, which meets our expectations. After the calculation, the trained machine learning model achieves a high accuracy.
6. Conclusions
This study introduces a phishing software detection method that leverages R-trees and an analysis of the Edge of Stability (EOS) phenomenon. It first examines the effectiveness of various information hiding techniques for embedding data within the images used by mobile phishing software. The paper then presents a methodology for detecting malicious content hidden in the images of mobile applications, fine-tunes the relevant parameters, and discusses issues related to EOS. Empirical evidence demonstrates that the proposed detection method achieves high accuracy within a reasonable time. This not only enhances the speed of training and recognition but also ensures stable performance while reducing the computational overhead on mobile devices.
Our research also encountered certain limitations. The diversity in information hiding techniques introduces a significant challenge due to their varying effectiveness. This challenge necessitates the development of an algorithm that can automatically select the optimal hiding method, taking into account the characteristics of both the malicious information and the image carrier. Moreover, although the phishing software detection method based on R-trees and EOS offers a general solution, crafting a method with enhanced generalization capabilities to detect malicious information hidden within images remains a goal. Moving forward, our efforts will be dedicated to addressing these challenges, aiming for solutions that further refine and extend the capabilities of our detection methodologies.