Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU

Kumar, Sumit; Nitin,; Yadav, Mitul

doi:10.3390/app131910569

Open AccessArticle

Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU

by

Sumit Kumar

^1,*

,

Nitin

² and

Mitul Yadav

^3,*

¹

Department of Computer Science and Engineering, Veer Madho Singh Bhandari Uttarakhand Technical University, Dehradun 248007, India

²

Department of Electrical Engineering and Computer Science, University of Cincinnati, Cincinnati, OH 45221, USA

³

Department of Computer Science and Engineering, Dev Bhoomi Institute of Technology, Dehradun 248007, India

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2023, 13(19), 10569; https://doi.org/10.3390/app131910569

Submission received: 21 August 2023 / Revised: 15 September 2023 / Accepted: 19 September 2023 / Published: 22 September 2023

Download

Browse Figures

Versions Notes

Abstract

:

To deliver user-friendly experiences, modern software applications rely heavily on graphical user interfaces (GUIs). However, it is paramount to ensure the quality of these GUIs through effective testing. This paper proposes a novel “Finite state testing for GUI with test case prioritization using ZScore-Bald Eagle Search (Z-BES) and Gini Kernel-Gated recurrent unit (GK-GRU)” approach to enhance GUI testing accuracy and efficiency. First, historical project data is collected. Subsequently, by utilizing the Z-BES algorithm, test cases are prioritized, aiding in improving GUI testing. Attributes are then extracted from prioritized test cases, which contain crucial details. Additionally, a state transition diagram (STD) is generated to visualize system behavior. The state activity score (SAS) is then computed to quantify state importance using reinforcement learning (RL). Next, GUI components are identified, and their text values are extracted. Similarity scores between GUI text values and test case attributes are computed. Grounded on similarity scores and SAS, a fuzzy algorithm labels the test cases. Data representation is enhanced by word embedding using GS-BERT. Finally, the test case outcomes are predicted by the GK-GRU, validating the GUI performance. The proposed work attains 98% accuracy, precision, recall, f-measure, and sensitivity, and low FPR and FNR error rates of 14.2 and 7.5, demonstrating the reliability of the model. The proposed Z-BES requires only 5587 ms to prioritize the test cases, retaining less time complexity. Meanwhile, the GK-GRU technique requires 38945 ms to train the neurons, thus enhancing the computational efficiency of the system. In conclusion, experimental outcomes demonstrate that, compared with the prevailing approaches, the proposed technique attains superior performance.

Keywords:

GUI testing; Software Testing Automation (STA); user requirements; State Transition Diagram (STD); ZScore-Bald Eagle Search (Z-BES); Gaussian sinusoid—bidirectional encoder representations from transformers; Test Cases (TC); word embedding; Gini Kernel-Gated Recurrent Unit (GK-GRU); State Activity Score (SAS)

1. Introduction

Over the past decade, the graphical user interface (GUI) has proven to be the most promising component in the software development lifecycle due to the user-friendly interactions and experiences it provides. In addition, it is an essential component of most of today’s software programs [1]. However, it is crucial to test graphical user interfaces (GUIs) to assure system dependability, which is essential for maintaining the operation of software products and the satisfaction of end users [2]. GUI testing entails performing a methodical analysis of the user interface’s graphical elements, interactive components, and visual design to verify that it satisfies the criteria and functionalities outlined as intended [3]. Commonly, various techniques, such as manual-based, record-and-replay, and model-based, are employed for GUI testing [4]. However, the traditional manual testing methodologies for GUIs exhibit certain limitations regarding efficiency, coverage, and scalability as the complexity of the software system increases [5]. To overcome these issues, significant efforts have been made to integrate machine learning (ML) techniques, including decision tree (DC), random forest (RF), support vector machine (SVM), etc., with GUI testing. This integration has the capacity to enhance testing methodologies, improve accuracy, and hasten the identification of defects, thus improving the GUI’s performance [6].

The creation of automated testing processes, such as watir, jmeter, selenium, etc., that can learn from historical data, adapt to changing software environments, and provide valuable insights to developers and testers is enabled through ML algorithms [7]. One of the primary benefits of using ML in GUI testing is that it improves the testers’ ability to effectively manage large-scale and complicated GUI designs [8]. However, conventional testing approaches cannot readily accommodate the extensive design space and diverse user interactions that modern applications demand [9,10]. The recurring patterns of user behavior and interactions might be automatically taught by the machine learning algorithms, which would facilitate more complete testing coverage without the need for operator involvement [11,12]. Additionally, the use of ML makes it possible for GUI testing to develop dynamically with the program, reacting to changes in the GUI as well as the associated functionality [13]. However, ML algorithms depend heavily on sufficient and representative training data; insufficient data can cause suboptimal testing outcomes that limit the system’s consistency [14]. To resolve this issue, this study proposes a novel framework called “Finite state testing for GUI with test case prioritization using ZScore-Bald Eagle Search (Z-BES) and Gini Kernel-Gated recurrent unit (GK-GRU)” that effectively satisfies the user requirements.

1.1. Problem Statement

The prevailing limitations of this research include:

Conventional methods often lack the adaptability to handle complex software, resulting in insufficient coverage of high-risk areas.
Prevailing approaches uncover the hidden flaws and unintended behaviors in complex software systems.
When dealing with intricate and extensive GUI designs, existing approaches exhibit limited scalability.

1.2. Objectives

The Z-BES algorithm prioritizes the test cases, focusing on critical areas and efficiently allocating resources for better issue resolution.
The STD visualization provides a comprehensive understanding of system behavior, helping identify gaps and ensuring alignment with expectations.
The utilization of RL and fuzzy logic enables accurate labeling of test case outcomes, improving overall evaluation precision.

2. Related Literature Survey

A previously proposed automatic simulation-based testing approach [15] combined domain expert knowledge and autonomous systems’ operating and environmental parameters. During the testing procedure, type-2 fuzzy logic was applied to facilitate the robust management of data uncertainty. The efficacy of this procedure was demonstrated experimentally. However, this strategy necessitated a significant time investment and resources and required specialized knowledge.

A testing technique for Android apps was also proposed that encompassed an app classifier, test scripting language, state graph modeler, activity classifier, and test adapter [16]. The framework automated the customization of test scripts for different apps and activities. The empirical evaluation provided evidence that the classifiers’ performance was significantly above average. However, the usefulness of the framework was constrained to applications with intricate user interfaces.

The YOLOv5 algorithm that enhanced the mobile app interface element recognition significantly improved the analysis accuracy and identification of minute components [17]. The experimental findings confirmed its superiority in GUI element identification, demonstrating promise for future development in robot testing automation for mobile applications. However, due to the automated nature of the system, a potential for bias existed.

A test prioritization system grounded in the RL method for user interface testing has also been developed [18]. The associated study sought to maximize the number of detected test defects while simultaneously lowering the necessary testing quantity. The findings demonstrated that the method successfully gained insightful knowledge of the test cases and the linkages between them. Despite this, the system had greater computing expenses, particularly when dealing with applications on a large scale.

A distributed state model inference approach for the GUI testing tool was proposed [19]. The methodology that allowed for the inference of a centralized model made use of a distributed architectural framework. The experiment demonstrated the feasibility of using a model with a distributed approach and the lower time requirement compared with models that are not distributed. However, the system required a large amount of memory and computing power, leading to potential scalability restrictions. The systems, namely, interactive event-flow graphs, GUI-level guidance, and enhanced crowd testers’ coverage, have been previously described. The interactive event-flow graphs consolidated the interactions of the testers into a single directed graph, facilitating the visual evaluation of previously tested scenarios. In spite of this, the information that was provided through the interactive event-flow graphs was difficult to read and evaluate.

A deep learning (DL)-centric end-to-end trainable model for GUI similarity and isomorphic GUI identification was established [20]. Visually identifying GUI items while using a DL was required to complete the task. Relative entropy was utilized to allow the measurement of variations in GUI. In addition, the comparison results demonstrated the efficiency of the method. However, the model’s performance was relatively sensitive to differences in the GUI design.

A distributed state model inference for scriptless GUI testing has been developed previosly. To conduct effective GUI tool testing, the procedure included empirical evaluation and an approach that did not require scripts. Therefore, the distributed system performed significantly better than the GUI validation test. The performance of the system was hindered by the exclusion of non-deterministic components in the model.

A previous study incorporated GUI modeling and test coverage analysis [21]. In this work, the unified modeling language (UML) was utilized to structure the GUI components. The test cases were then automatically generated from the UML models and underwent test coverage analysis, efficiently achieving GUI testing based on user perspectives. However, this model required extensive testing time.

An automated GUI functional test was presented based on Simplified Swarm Optimization (SSO) [22]. SSO established the relevant test cases to implement the GUI testing. Moreover, an event-interaction graph (EIG) was created to determine the optimal testing for the GUI tools. This system retained the effective performance of GUI testing but exhibited local optimization issues.

Thus, previous research reveals that GUI testing is generally performed via ML and DL models. Models tend to sustain better performance in GUI testing and fault identification to attract users. However, many traditional models face certain challenges, including a lack of data analytics, inefficiency due to variations, massive memory requirements, loss of power, high computational complexity, and limited resources. These types of drawbacks mainly affect the testing outcomes and cause poor performance. Hence, the proposed system focuses on test case prioritizing-based GUI testing, which can more efficiently address the common limitations of traditional models. The proposed work effectively satisfies user requirements by utilizing well-organized frameworks, such as GK-GRU and Z-BES.

3. Proposed Methodology

The proposed technique aims to identify potential mismatches between GUI designs and test cases, ultimately determining whether the GUI meets the requirements which is shown in Figure 1.

3.1. Historical Projects

The historical project dataset was gathered, comprising the GUI files, test cases, and logs of past testing activities.

3.1.1. Test Cases

Test cases (Tc), which outline the steps and expected results when testing the software, are predefined scenarios. Test cases (Tc) are represented as follows:

T c = R q + S + S t + E_{o u t}

(1)

where Rq specifies the requirements, S signifies scenarios, St exemplifies outline steps, and E_out implies the expected outcomes.

3.1.2. Test Case Prioritization

The Tc are prioritized based on their importance and potential impact on the project. By prioritizing Tc, testing efforts are concentrated on areas most likely to impact the project’s success. The Tc is prioritized by Z-BES. The BES algorithm achieves optimum solutions with a low number of repetitions. However, due to the influence of mean calculations, the BES algorithm cannot readily determine the optimal areas. The utilization of the mean may not be appropriate for distributions that exhibit significant skewness, which might result in delayed convergence. To mitigate this, the ZScore technique is introduced.

Selecting Stage

The Bald Eagle test population (test cases

(T c))

is represented as:

T c_{z} = {T c_{1}, T c_{2}, T c_{3}, \dots \dots, T c_{z \max}}

(2)

The fitness function (high prioritized

(T c)

[H (T c)])

is then defined as follows:

F t = H (T c)

(3)

During hunting, the

(T c)

selects the optimal spot within a chosen search area, which is expressed as follows:

β = β_{B e s t} + l * R d (β_{Z - s c o r e} - β_{i})

(4)

Z - s c o r e = \frac{(T c - μ)}{σ}

(5)

where

R d

specifies a random number from 0 to 1,

l

is the parameter that controls the position changes,

β

implies the new position,

β_{B e s t}

symbolizes the best location,

β_{Z - s c o r e}

represents the

Z - s c o r e

position of all

(T c)

,

β_{i}

delineates the current position of the

(T c)

,

μ

is the mean of the

(T c)

, and

σ

is the standard deviation of the

(T c)

.

Searching Stage

T c

assesses prey within the chosen search area during this phase. The optimal position

(β_{i, N e w})

is defined as follows:

β_{i, N e w} = β_{i} + P (i) * (β_{i} - β_{i + 1}) + Q (i) * (β_{i} - β_{Z - S c o r e})

(6)

where

β_{i + 1}

implies the next position of the

(T c)

, and

P (i)

and

Q (i)

signify scaling factors.

Swooping Stage

The

(T c)

moves toward its target prey from the optimal position in the search space:

β_{i, N e w} = R a n d * β_{B e s t} + P_{1} (i) * (β_{i} - c_{1} * β_{Z - S c o r e}) + Q_{1} (i) * (β_{i} - c_{2} * β_{B e s t})

(7)

here,

c_{1}

and

c_{2}

are the controlling parameters. Therefore, the

H (T c)

is represented as:

H {(T c)}_{ς} = {H {(T c)}_{1}, H {(T c)}_{2}, H {(T c)}_{3}, \dots \dots, H {(T c)}_{ς \max}} ζ = 1, 2, \dots, ζ \max

(8)

The pseudo-code for Z-BEO is presented in Algorithm 1 as follows:

Algorithms 1 Pseudo-code for Z-BEO

Input: Test cases

(T c)

Output: High Prioritized Test Case

H (T c)

Begin
Initialize

the optimization parameters β_{i}

, {M a x}_{I}

Calculate

the fitness function F s

For i = 1 to Max_i do
Select the search space using

β = β_{B e s t} + l * R d (β_{Z - s c o r e} - β_{i})

Search the prey in the search space using

β_{i, N e w} = β_{i} + P (i) * (β_{i} - β_{i + 1}) + Q (i) * (β_{i} - β_{Z - S c o r e})

Swoop the prey with

β_{i, N e w} = R a n d * β_{B e s t} + P_{1} (i) * (β_{i} - c_{1} * β_{Z - S c o r e}) + Q_{1} (i) * (β_{i} - c_{2} * β_{B e s t})

If

F s = = S a t i s f i e d

Return

H (T c)

Else
i = i + 1
End If
End For
End

3.1.3. Attribute Extraction

The important attributes, namely test case ID, test date, version, prerequisites, form name, test data, test scenario, testcase description, step details, expected result, actual result, and customer assigned priority, are extracted from the

H (T c)

. The extracted attributes

(A_{b})

provide context and information about each test case and are defined as:

A_{b} = {A_{1}, A_{2}, A_{3}, \dots \dots \dots, A_{B}}

(9)

where

B

denotes the maximum number of attributes.

3.1.4. State Transition Diagram

The STD is generated using

(A_{b})

, which provides a clear overview of how the system behaves and transitions between different states. By visualizing all possible state transitions, the diagram can reveal missing or unintended transitions that could result in errors. The STD has been presented in Figure 2.

3.1.5. State Activity Score

Based on the STD parameters, the SAS is computed to quantify the relative importance of each state within the system’s behavior. For the score assignment, the RL is utilized. The RL offers advantages in assigning SAS owing to its adaptability to dynamic systems, enabling the algorithm to learn and optimize score assignments centered on interactions and outcomes.

Learning Environment: The RL agent

(A g)

interacts with an environment via different states

(S t)

and takes action

(A c)

to maximize cumulative rewards

(R w)

.

Z-value Learning: The

(A g)

uses expected cumulative rewards (

Z

-values) to make decisions. Primarily,

Z

-values are often initialized randomly.

Reward Feedback: The

(A g)

receives

(R w)

from the environment for each

(A c)

taken. These

(R w)

guide the

(A g)

toward desirable outcomes.

Updating Z-Values: After each

(A c)

, the

Z

-values are updated using Equation (10):

Z (S t, A c) = (1 - κ) * Z (S t, A c) + κ * (R w + φ * M a x (Z (\bar{S} t, \bar{A} c)))

(10)

where the learning rate is implied as

κ

, the discount factor is notated as

φ

, and

\bar{S} t

and

\bar{A} c

specify the next state and action, respectively.

SAS: The SAS is derived from the

Z

-values of each state. Higher

Z

-values specify more active or valuable states. The SAS is notated as

(δ)

.

3.2. GUI

The GUI is based on historical projects. GUIs make software user-friendly by providing a visual and intuitive way for users to interact with software.

3.2.1. GUI Components

The different GUI components

(G_{C})

, namely buttons, labels, checkboxes, and radio buttons, are identified. This information guides further analysis and testing strategies. The different GUI components

(G_{C})

are mathematically termed as:

G_{C} = {G_{1}, G_{2}, G_{2}, \dots \dots, G_{M a x C}}

(11)

3.2.2. Text Value Extraction

The next step involves extracting the text content associated with each component. This comprises extracting labels, instructions, options, messages, and any other textual information presented to users. The text values of the GUI

E (T x)

are represented as follows:

E (T x) = \sum (T x {(G U I)}_{j}) F o r j = 1 t o n

(12)

where

\sum (T x {(G U I)}_{j})

signifies the sum of text content from all GUI elements and

n

implies the maximum text values.

3.2.3. Similarity Score

Here, by comparing

E (T x)

with the

(A_{b})

, a similarity score

(α_{s})

is determined. This comparison helps assess how closely the textual content in the GUI aligns with the expected behavior as specified in the test cases. The

α_{s}

is computed by the Ratcliff/Obershelp similarity technique. The

α_{s}

is estimated by:

α_{s} = \frac{2 * L (l c s (E (T x), A_{b})}{L (E (T x)) + L (A_{b})}

(13)

where

L (l c s (E (t x), A_{b})

specifies the length of the longest common subsequence

(E (T x), A_{b})

,

L (E (t x))

signifies the length of

(E (t x))

, and

L (A_{b})

implies the length of

(A_{b})

.

If

α_{s}

is high, the text content aligns closely, suggesting that the requirement has not changed and the GUI design is likely accurate. Contrarily, if

α_{s}

is low, a significant difference exists in the text values, indicating a potential mismatch between the GUI design and the requirement.

3.3. Labelling

Subsequently,

α_{s}

and

(δ)

are inputted into a fuzzy algorithm to determine a label

(G)

. If the similarity score and SAS are higher, the test case result is designated as “pass.” However, if either or both scores are low, the test case result is designated as “fail.” The proposed work uses the fuzzy algorithm to establish an interpretation of pass or fail based on the combination of these two significant scores. This is presented as:

I F (α_{s} = H i g h) A N D (δ = H i g h) T H E N T c_{Result} = P A S S

(14)

I F (α_{s} = H i g h) A N D (δ = L o w) T H E N T c_{Result} = F A I L

(15)

I F (α_{s} = L o w) A N D (δ = H i g h) T H E N T c_{Result} = F A I L

(16)

I F (α_{s} = L o w) A N D (δ = L o w) T H E N T c_{Result} = F A I L

(17)

3.4. Word Embedding

Here, word embedding is performed by the text values of the GUI components

E (T x)

and the test case attributes

(A_{b})

. Word embedding converts the textual information into numeric vectors, enabling the GK-GRU to process and analyze the data more effectively. The process of word embedding is executed using the GS-BERT algorithm. Although BERT demonstrates proficiency in understanding natural language, it may encounter challenges in appropriately recognizing word order and the impact of word positions. To address this concern, the Gaussian sinusoid encoding method is incorporated into BERT. This entails the incorporation of sinusoidal functions into the positional embeddings (PE) utilized by BERT, enhancing the model’s capacity to accurately determine the relative position of words in a given sequence. Consequently, the utilization of GS-BERT results in enhanced numerical embeddings of textual data.

Primarily, the input text

(I n)

, which is the combination of

E (T x)

and

(A_{b})

, is broken into sub-words. Each token is then represented as a word embedding vector

(W E)

presented as follows:

W E (T k) = χ_{f} (T k)

(18)

where

T k

delineates tokens and

χ_{f}

is the word-to-vector function.

PEs are added to the word embeddings to convey sequence order. Here, by using the GS function, the PE is performed and represented as follows:

G S (P s, v) = Sin (P s / 10000^{(2 * (v / D i m))}) * E x p (- ((P s - M a x_{P s} / 2) / (0.1 * M a x_{P s}))^{2})

(19)

where

P s

implies the position,

D i m

specifies the dimension at index

v

, and

M a x_{P s}

symbolizes the maximum position.

Thereafter, the multi-head self-attention

(X)

computes weights, which indicate the importance of each word’s relation to others. This output is then linearly transformed to produce the final attention representation:

X (y_{1}, y_{2}, y_{3}) = ζ (y_{1} y_{2}^{T p} / \sqrt{D i m_{y_{2}}}) * y_{3}

(20)

where

y_{1}, y_{2}

, and

y_{3}

are query, key, and value matrices, respectively, and

\sqrt{D i m_{y_{2}}}

is the dimension of keys.

A stack of transformer encoder layers

(E)

captures contextual relationships and produces the numeric vectors

(N u)

:

E (W E (I n)) = X (I n) + R c (γ (I n))

(21)

where

X (I n)

computes attention-based representations of the

(I n)

,

γ (I n)

is a neural network for enhancing the attention output, and

R c ()

converts

I n

to the transformed output

(N u)

.

3.5. Classification

Finally,

(N u)

and their corresponding labels

(G)

, collectively designated

(Y)

, are fed into the GK-GRU, which accurately predicts whether the test cases pass or fail.

GRU, which is faster than LSTM, utilizes less memory; however, GRU models may encounter challenges, such as slow learning efficiency and extended training times. Hence, the GK function is introduced to address these concerns. This function is designed to enhance the learning process within the GRU architecture, aiming to mitigate issues associated with prolonged training durations and optimize model performance. The GK-GRU architecture is presented in Figure 3.

The

Y

is inputted to the GK-GRU, represented by Equation (22):

Y_{i t r} = {Y_{1}, Y_{2}, Y_{3}, \dots \dots, Y_{i t r_{M a x}}}

(22)

Update gate

(λ (t))

: The

λ (t)

controls how much of the previous memory to retain and how much of the new information to incorporate:

λ (t) = G K (w t_{λ} * [H (t - 1), Y (t)])

(23)

Here,

G K

is the Gini kernel activation function, represented as:

G K (λ) = \frac{1}{2} (1 + E r (\frac{λ}{\sqrt{2}}))

(24)

where

λ

is the error function, which maps

(λ)

to the range between 0 and 1.

Reset Gate

ℜ (t)

: The

ℜ (t)

is computed to determine how much of the previous hidden state

(H)

to forget:

ℜ (t) = G K (w t_{ℜ} * [H (t - 1), Y (t)])

(25)

The candidate activation

\tilde{H} (t)

is computed as per Equation (26), representing the new information to be added to the memory cell:

\tilde{H} (t) = G K (w t_{\tilde{H}} * [ℜ (t) * H (t - 1), Y (t)])

(26)

(H (t))

and memory cell

(M (t))

are updated using the

(λ (t))

and

\tilde{H} (t)

.

H (t) = (1 - λ (t)) * H (t - 1) + λ (t) * \tilde{H} (t)

(27)

M (t) = H (t)

(28)

where

t

signifies the time step, while

w t_{λ}

,

w t_{ℜ}

, and

w t_{H}

are weight matrices, and

(H (t))

determines whether the testcase is designated as a pass or fail.

4. Results and Discussion

Here, the experiments conducted on the working platform of PYTHON are presented.

Performance Analysis

This phase validates the proposed technique’s performance. The performance of the proposed GK-GRU and prevailing GRU, long short-term memory (LSTM), recurrent neural network (RNN), and deep neural network (DNN) is elucidated in Figure 4. The proposed GK-GRU achieved remarkable results with a precision of 98.85%, recall of 98.64%, F-measure of 98.95%, accuracy of 98.15%, sensitivity of 98.65%, and specificity of 98.46%, while the other remaining classifiers obtained approximate rates of precision, recall, F-measure, accuracy, sensitivity, and specificity of 93%, 95%, 94%, 93%, 95%, and 91%, respectively. Figure 4 suggests that the GK-GRU model exhibits superior performance to existing models due to its capacity to optimize the learning process, leading to enhanced overall performance.

Figure 5 compares the performance metrics, including the true negative rate (TNR), false negative rate (FNR), true positive rate (TPR), false positive rate (FPR), and positive predictive value (PPV) for the proposed GK-GRU and the existing models. The GK-GRU mitigates the challenges related to slow learning efficiency and extended training times. Thus, the GK-GRU model exhibits higher TPR (92.25%) and TNR (85.12%) along with lower FPR (14.25) and FNR (7.54) compared to the other models.

Fitness values for the proposed Z-BES and existing Bald Eagle Search (BES), Galactic Swarm Optimization (GSO), Cockroach Swarm Optimization (CSO), and Bacterial Foraging Optimization (BFO) with various iterations (10, 20, 30, 40, and 50) are presented in Figure 6. The proposed Z-BES algorithm attains increased fitness over iterations (5236–9451) as it enhances convergence by providing a more suitable measure for area selection during optimization.

The prioritization time for the proposed Z-BES and the existing techniques is presented in Table 1. While the existing technique achieves a prioritization time of 8279 ms, the proposed Z-BES attains the shortest prioritization time of 5587 ms. BES can make more informed decisions regarding the direction and magnitude of changes in search areas using the ZScore, leading to improved convergence with limited time.

The receiver operating characteristic (ROC) curve for the proposed GK-GRU and the existing techniques is depicted in Figure 7. A higher area under the ROC curve indicates that the GK-GRU model has a better ability to correctly classify positive cases while minimizing false positives, reflecting its strong discriminatory power and efficiency in evaluating test cases.

Figure 8 compares the efficiency of the proposed GK-GRU and the existing techniques. The Z-BES prioritizes methodology and facilitates the concentration of testing efforts on crucial areas, effectively distributing resources and promptly addressing significant concerns. Additionally, the GK-GRU model improves the learning capabilities inside the system, addressing issues such as reduced efficiency and prolonged training durations. This adaptation contributes to enhanced overall efficiency. The proposed model has an efficiency rate of 98%, whereas that of all other associated techniques is 90%. Thus, the proposed system retains better performance than the other state-of-the-art techniques.

5. Conclusions

By combining Z-BES prioritization and the GK-GRU model, the proposed “finite state testing for GUI with test case prioritization using Z-BES and GK-GRU” framework tackles GUI testing challenges. The proposed technique’s performance has been validated by experimentation analyses. The developed Z-BES gains a minimum prioritization time of 5587 at the 10th iteration, which improves the GUI testing process. Likewise, the proposed GK-GRU demonstrates impressive performance metrics, including 98.85% precision, 98.64% recall, 98.95% F-measure, 98.15% accuracy, 98.65% sensitivity, and 98.46% specificity. Moreover, the proposed GK-GRU requires an average of 38,945 ms for the training process, which reduces the time requirements. Furthermore, the proposed technique exhibits low error values and a 98% efficiency rate. Overall, the proposed technique outperforms the prevailing systems and is more reliable and robust. In this work, GUI testing was performed based on the similarity between GUI component text values and test case attribute values, along with state transition. Although this framework performs well for GUI testing, it has small error rates due to the missing GUI appearance and activity attributes that are not well-structured or follow unconventional design patterns. In the future, GUI segmentation might be applied to distinguish the GUI components (e.g., shapes, colors, visual layouts, and activity diagrams) to improve the performance of GUI testing.

Author Contributions

Validation, M.Y.; Writing—original draft, S.K.; Writing—review & editing, N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained in the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kilincceker, O.; Silistre, A.; Belli, F.; Challenger, M. Model-Based Ideal Testing of GUI Programs-Approach and Case Studies. IEEE Access 2021, 9, 68966–68984. [Google Scholar] [CrossRef]
Eskonen, J.; Kahles, J.; Reijonen, J. Automating GUI testing with image-based deep reinforcement learning. In Proceedings of the 2020 IEEE International Conference on Autonomic Computing and Self-Organizing Systems, ACSOS 2020, Online, 17–21 August 2020; pp. 160–167. [Google Scholar] [CrossRef]
Jeong, J.W.; Kim, N.H.; In, H.P. GUI information-based interaction logging and visualization for asynchronous usability testing. Expert Syst. Appl. 2020, 151, 113289. [Google Scholar] [CrossRef]
Bons, A.; Marín, B.; Aho, P.; Vos, T.E. Scripted and scriptless GUI testing for web applications: An industrial case. Inf. Softw. Technol. 2023, 158, 107172. [Google Scholar] [CrossRef]
Jung, S.K. AniLength: GUI-based automatic worm length measurement software using image processing and deep neural network. SoftwareX 2021, 15, 100795. [Google Scholar] [CrossRef]
Prazina, I.; Becirovic, S.; Cogo, E.; Okanovic, V. Methods for Automatic Web Page Layout Testing and Analysis: A Review. IEEE Access 2023, 11, 13948–13964. [Google Scholar] [CrossRef]
Yan, J.; Zhou, H.; Deng, X.; Wang, P.; Yan, R.; Yan, J.; Zhang, J. Efficient testing of GUI applications by event sequence reduction. Sci. Comput. Program. 2021, 201, 102522. [Google Scholar] [CrossRef]
Xie, M.; Feng, S.; Xing, Z.; Chen, J.; Chen, C. UIED: A hybrid tool for GUI element detection. In Proceedings of the 28th ACM Joint Meeting European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual, 8–13 November 2020; pp. 1655–1659. [Google Scholar] [CrossRef]
Broer Bahaweres, R.; Oktaviani, E.; Kesuma Wardhani, L.; Hermadi, I.; Suroso, A.I.; PermanaSolihin, I.; Arkeman, Y. Behavior-driven development (BDD) Cucumber Katalon for Automation GUI testing case CURA and Swag Labs. In Proceedings of the 2nd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2020, Jakarta, Indonesia, 11–19 November 2020; pp. 87–92. [Google Scholar] [CrossRef]
Samad, A.; Nafis, T.; Rahmani, S.; Sohail, S.S. A Cognitive Approach in Software Automation Testing. SSRN Electron. J. 2021, 1–6. [Google Scholar] [CrossRef]
Jaganeshwari, K.; Djodilatchoumy, S. An Automated Testing Tool Based on Graphical User Interface with Exploratory Behavioural Analysis. J. Theor. Appl. Inf. Technol. 2022, 100, 6657–6666. [Google Scholar]
Zhu, P.; Li, Y.; Li, T.; Yang, W.; Xu, Y. GUI Widget Detection and Intent Generation via Image Understanding. IEEE Access 2021, 9, 160697–160707. [Google Scholar] [CrossRef]
Vos, T.E.J.; Aho, P.; Pastor Ricos, F.; Rodriguez-Valdes, O.; Mulders, A. Testar—Scriptless Testing Through Graphical User Interface. Softw. Test. Verif. Reliab. 2021, 31, e1771. [Google Scholar] [CrossRef]
Ionescu, T.B.; Frohlich, J.; Lachenmayr, M. Improving Safeguards and Functionality in Industrial Collaborative Robot HMIs through GUI Automation. In Proceedings of the IEEE International Conference on Emerging Technologies and Factory Automation, ETFA 2020, Vienna, Austria, 8–11 September 2020; pp. 557–564. [Google Scholar] [CrossRef]
Karimoddini, A.; Khan, M.A.; Gebreyohannes, S.; Heiges, M.; Trewhitt, E.; Homaifar, A. Automatic Test and Evaluation of Autonomous Systems. IEEE Access 2022, 10, 72227–72238. [Google Scholar] [CrossRef]
Ardito, L.; Coppola, R.; Leonardi, S.; Morisio, M.; Buy, U. Automated Test Selection for Android Apps Based on APK and Activity Classification. IEEE Access 2020, 8, 187648–187670. [Google Scholar] [CrossRef]
Cheng, J.; Tan, D.; Zhang, T.; Wei, A.; Chen, J. YOLOv5-MGC: GUI Element Identification for Mobile Applications Based on Improved YOLOv5. Mob. Inf. Syst. 2022, 2022, 8900734. [Google Scholar] [CrossRef]
Nguyen, V.; Le, B. RLTCP: A reinforcement learning approach to prioritizing automated user interface tests. Inf. Softw. Technol. 2021, 136, 106574. [Google Scholar] [CrossRef]
Pastor Ricos, F.; Slomp, A.; Marin, B.; Aho, P.; Vos, T.E.J. Distributed state model inference for scriptless GUI testing. J. Syst. Softw. 2023, 200, 111645. [Google Scholar] [CrossRef]
Zhang, T.; Liu, Y.; Gao, J.; Gao, L.P.; Cheng, J. Deep Learning-Based Mobile Application Isomorphic GUI Identification for Automated Robotic Testing. IEEE Softw. 2020, 37, 67–74. [Google Scholar] [CrossRef]
Paiva, A.C.; Faria, J.C.; Vidal, R.F. Towards the integration of visual and formal models for GUI testing. Electron. Notes Theor. Comput. Sci. 2007, 190, 99–111. [Google Scholar] [CrossRef]
Ahmed, B.S.; Sahib, M.A.; Potrus, M.Y. Generating combinatorial test cases using Simplified Swarm Optimization (SSO) algorithm for automated GUI functional testing. Eng. Sci. Technol. Int. J. 2014, 17, 218–226. [Google Scholar] [CrossRef]

Figure 1. Proposed architecture.

Figure 2. STD architecture.

Figure 3. GK-GRU architecture.

Figure 4. Performance comparison.

Figure 5. Comparative analysis of the proposed GK-GRU.

Figure 6. Fitness versus iteration.

Figure 7. ROC curve.

Figure 8. Efficiency comparison [15,17,19,20].

Table 1. Prioritization time evaluation.

Techniques	Prioritization Time (ms)
Proposed Z-BES	5587
BEA	6847
GSO	7659
CSO	8765
BFO	9845

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kumar, S.; Nitin; Yadav, M. Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU. Appl. Sci. 2023, 13, 10569. https://doi.org/10.3390/app131910569

AMA Style

Kumar S, Nitin, Yadav M. Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU. Applied Sciences. 2023; 13(19):10569. https://doi.org/10.3390/app131910569

Chicago/Turabian Style

Kumar, Sumit, Nitin, and Mitul Yadav. 2023. "Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU" Applied Sciences 13, no. 19: 10569. https://doi.org/10.3390/app131910569

APA Style

Kumar, S., Nitin, & Yadav, M. (2023). Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU. Applied Sciences, 13(19), 10569. https://doi.org/10.3390/app131910569

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Finite State GUI Testing with Test Case Prioritization Using Z-BES and GK-GRU

Abstract

1. Introduction

1.1. Problem Statement

1.2. Objectives

2. Related Literature Survey

3. Proposed Methodology

3.1. Historical Projects

3.1.1. Test Cases

3.1.2. Test Case Prioritization

Selecting Stage

Searching Stage

Swooping Stage

3.1.3. Attribute Extraction

3.1.4. State Transition Diagram

3.1.5. State Activity Score

3.2. GUI

3.2.1. GUI Components

3.2.2. Text Value Extraction

3.2.3. Similarity Score

3.3. Labelling

3.4. Word Embedding

3.5. Classification

4. Results and Discussion

Performance Analysis

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI