On the Applicability of Quantum Machine Learning

In this article, we investigate the applicability of quantum machine learning for classification tasks using two quantum classifiers from the Qiskit Python environment: the variational quantum circuit and the quantum kernel estimator (QKE). We provide a first evaluation on the performance of these classifiers when using a hyperparameter search on six widely known and publicly available benchmark datasets and analyze how their performance varies with the number of samples on two artificially generated test classification datasets. As quantum machine learning is based on unitary transformations, this paper explores data structures and application fields that could be particularly suitable for quantum advantages. Hereby, this paper introduces a novel dataset based on concepts from quantum mechanics using the exponential map of a Lie algebra. This dataset will be made publicly available and contributes a novel contribution to the empirical evaluation of quantum supremacy. We further compared the performance of VQC and QKE on six widely applicable datasets to contextualize our results. Our results demonstrate that the VQC and QKE perform better than basic machine learning algorithms, such as advanced linear regression models (Ridge and Lasso). They do not match the accuracy and runtime performance of sophisticated modern boosting classifiers such as XGBoost, LightGBM, or CatBoost. Therefore, we conclude that while quantum machine learning algorithms have the potential to surpass classical machine learning methods in the future, especially when physical quantum infrastructure becomes widely available, they currently lag behind classical approaches. Our investigations also show that classical machine learning approaches have superior performance classifying datasets based on group structures, compared to quantum approaches that particularly use unitary processes. Furthermore, our findings highlight the significant impact of different quantum simulators, feature maps, and quantum circuits on the performance of the employed quantum estimators. This observation emphasizes the need for researchers to provide detailed explanations of their hyperparameter choices for quantum machine learning algorithms, as this aspect is currently overlooked in many studies within the field. To facilitate further research in this area and ensure the transparency of our study, we have made the complete code available in a linked GitHub repository.


Introduction
Quantum computing has recently gained significant attention due to its potential to solve complex computational problems exponentially faster than classical computers [1]. Quantum machine learning (QML) is an emerging field that combines the power of quantum computing with traditional machine learning techniques to solve real-world problems more efficiently [2,3]. Various QML algorithms have been proposed, such as quantum kernel estimator [4] and variational quantum circuit [5,6], which have shown promising results in diverse applications, including pattern recognition and classification tasks [7][8][9].
In this study, we aim to compare QKE (quantum kernel estimator) and VQC (variational quantum circuit) with powerful classical machine learning methods such as XGBoost [10], Ridge [11], Lasso [12], LightGBM [13], CatBoost [14], and MLP (multilayer perceptron) [15] on six benchmark datasets partially available in the Scikit-learn library [16] as well as artificially generated datasets. To ensure a fair comparison on the benchmark datasets, we perform a randomized search to optimize hyperparameters for each algorithm, thereby providing a comprehensive statistical comparison of their performance. Furthermore, we provide the full program code in a GitHub repository [17] to make our results reproducible and boost research that can potentially build on our approach.
Since quantum machines are not readily accessible, we can only compare these algorithms' performance on simulated quantum circuits. Although this approach does not reveal the full potential of quantum machine learning, it does highlight how the discussed quantum machine learning methods handle different levels of complexity inherent in the datasets. For this reason, we also developed a method to generate artificial datasets based on quantum mechanical concepts to provide a prototype for a particularly well-suited dataset for quantum machine learning. This will estimate the possible improvements that quantum machine learning algorithms can offer over classical methods in terms of accuracy and efficiency, considering the computational resources needed to simulate quantum circuits.
In this study, we address and partially answer the following research questions: 1.
How do QKE and VQC algorithms compare to classical machine learning methods such as XGBoost, Ridge, Lasso, LightGBM, CatBoost, and MLP regarding accuracy and efficiency on simulated quantum circuits? 2.
To what extent can a randomized search to find a suitable set of hyperparameters make the performance of quantum machine learning algorithms comparable to classical approaches? 3.
What are the limitations and challenges associated with the current state of quantum machine learning, and how can future research address these challenges to unlock the full potential of quantum computing in machine learning applications? 4.
Do quantum machine learning algorithms outperform regular machine learning algorithms on datasets constrained by the rules of quantum mechanics? Thus, do they provide a quantum advantage for datasets that exhibit strong symmetry properties in terms of adhering to Lie algebras?
The research presented in this article is partially inspired by the work of Zeguendry et al. [18], which offers an excellent review and introduction to quantum machine learning. However, their article does not delve into the tuning of hyperparameters for the quantum machine learning models employed, nor does it provide ideas on creating best-suited data for quantum machine learning classification tasks. We aim to expand the toolbox of quantum machine learning, first by discussing the space of Hyperparameters and second by providing a prototype for generating "quantum data". Furthermore, this analysis will help determine the current state of quantum machine learning performance and whether researchers should employ these algorithms in their studies.
We provide the entire program code of our experiments and all the results in a GitHub repository, ensuring the integrity of our findings, fostering research in this field, and offering a comprehensive code for researchers to test quantum machine learning on their classification problems. Thereby, a key contribution of our research is not only the provision of a single implementation of a quantum machine learning algorithm, but also the execution of a randomized search for potential hyperparameters of both classical and quantum machine learning models and a novel approach for generating artificial classification problems based on concepts inherent to quantum mechanics, i.e., Lie groups and algebras.
This article is structured as follows: Section 2 discusses relevant and related work. In Section 3, we describe, reference, and, to some degree, derive all employed techniques. We will not discuss the mathematical details of all employed algorithms here, but rather refer the interested reader to the referenced sources. Section 4 describes our performed experiments in detail, followed by the obtained results in Section 5, which also features a discussion of our findings. Finally, we conclude our findings in Section 6.

Related Work
Considerable research was conducted in recent years to advance quantum machine learning environments and their application field. This starts in the data encoding process, in which Schuld and Killoran [3] investigated quantum machine learning in feature Hilbert spaces theoretically. They proposed a framework for constructing quantum embeddings of classical data to enable quantum algorithms that learn and classify data in quantum feature spaces.
Further research was conducted on introducing novel architectural frameworks. For this, Mitarai et al. [19] presented a method called quantum circuit learning (QCL), which uses parameterized quantum circuits to approximate classical functions. QCL can be applied to supervised and unsupervised learning tasks, as well as reinforcement learning.
Havlíček et al. [4] introduced a quantum-enhanced feature space approach using quantum circuits. This work demonstrated that quantum computers can effectively process classical data with quantum kernel methods, offering the potential for exponential speedup in certain applications.
Furthermore, Farhi and Neven [20] explored the use of quantum neural networks for classification tasks on near-term quantum processors. They showed that quantum neural networks can achieve good classification performance with shallow circuits, making them suitable for noisy intermediate-scale quantum (NISQ) devices.
Other research focused on the advancement of applying quantum fundamentals on classical machine learning applications. Hereby, Rebentrost et al. [21] introduced the concept of a quantum support vector machine for big data classification. They showed that the quantum version of the algorithm can offer exponential speedup compared to its classical counterpart, specifically in the kernel evaluation stage.
To advance the application field of quantum machine learning, Liu and Rebentrost [22] proposed a quantum machine learning approach for quantum anomaly detection. They demonstrated that their method can efficiently solve classification problems, even when the data have a high degree of entanglement.
In this regard, it is worth mentioning the work of Broughton et al. [23] introduced TensorFlow Quantum, an open-source library for the rapid prototyping of hybrid quantumclassical models for classical or quantum data. They demonstrated various applications of TensorFlow Quantum, including supervised learning for quantum classification, quantum control, simulating noisy quantum circuits, and quantum approximate optimization. Moreover, they showcased how TensorFlow Quantum can be applied to advanced quantum learning tasks such as meta-learning, layer-wise learning, Hamiltonian learning, sampling thermal states, variational quantum eigensolvers, classification of quantum phase transitions, generative adversarial networks, and reinforcement learning.
In the review paper by Zeguendry et al. [18], the authors present a comprehensive overview of quantum machine learning from the perspective of conventional machine learning techniques. The paper starts by exploring the background of quantum computing, its architecture, and an introduction to quantum algorithms. It then delves into several fundamental algorithms for QML, which form the basis of more complex QML algorithms and can potentially offer performance improvements over classical machine learning algorithms. In the study, the authors implement three machine learning algorithms: quanvolutional neural networks, quantum support vector machines, and variational quantum circuit. They compare the performance of these quantum algorithms with their classical counterparts on various datasets. Specifically, they implement quanvolutional neural networks on a quantum computer to recognize handwritten digits and compare its performance to convolutional neural networks, stating the performance improvements by quantum machine learning.
Despite these advancements, it is important to note that some of the discussed papers may not have used randomized search CV from Scikit-learn to optimize the classical machine learning algorithms, thereby overstating the significance of quantum supremacy. Nevertheless, the above-mentioned works present a comprehensive overview of the state of the art in quantum machine learning for classification, highlighting the potential benefits of using quantum algorithms in various forms and applications.

Methodology
This section presents our methodology for comparing the performance of classical and quantum machine learning techniques for classification tasks. Our approach is designed to provide a blueprint for future experiments in this area of research. We employ the Scikitlearn library, focusing on the inbuilt functions to select a good set of hyperparameters, i.e., RandomizedSearchCV to compare classical and quantum machine learning models. We also utilize the Qiskit library to incorporate quantum machine learning techniques into our experiments, [24]. The selected datasets for our study include both real-world and synthetic data, enabling a comprehensive evaluation of the classifiers' performance.

Supervised Machine Learning
Supervised machine learning is a subfield of artificial intelligence that focuses on developing algorithms and models to learn patterns and make decisions or predictions based on data [25,26]. The main goal of supervised learning is to predict labels or outputs of new, unseen data given a set of known input-output pairs (training data). This section briefly introduces several classical machine learning techniques used for classification tasks, specifically in the context of supervised learning. These techniques serve as a baseline to evaluate the applicability of quantum machine learning approaches, which are the focus of this paper. Furthermore, we will then introduce the employed quantum machine learning algorithms.
One of the essential aspects of supervised machine learning is the ability to predict/classify data. The models are trained using a labeled dataset, and then the performance of the models is evaluated based on their accuracy in predicting the labels of previously unseen test samples [27]. This evaluation is crucial to estimate the model's ability to generalize the learned information when making predictions on new, real-world data.
Various techniques, such as cross-validation and train-test splits, are often used to obtain reliable performance estimates of the models [28]. By comparing the performance of different models, researchers and practitioners can determine which model or algorithm is better suited for a specific problem domain.

Classical Supervised Machine Learning Techniques
The following list describes the employed algorithms that serve as a baseline for the afterwards described and later tested quantum machine learning algorithms.
• Lasso and Ridge Regression/Classification: Lasso (least absolute shrinkage and selection operator) and Ridge Regression are linear regression techniques that incorporate regularization to prevent overfitting and improve model generalization [11,12]. Lasso uses L1 regularization, which tends to produce sparse solutions, while Ridge Regression uses L2 regularization, which prevents coefficients from becoming too large. Both of these regression algorithms can also be used for classification tasks. • Multilayer Perceptron: MLP is a type of feedforward artificial neural network with multiple layers of neurons, including input, hidden, and output layers [15]. MLPs are capable of modeling complex non-linear relationships and can be trained using backpropagation. • Support Vector Machines (SVM): SVMs are supervised learning models used for classification and regression tasks [29]. They work by finding the optimal hyperplane that separates the data into different classes, maximizing the margin between the classes. • Gradient Boosting Machines: Gradient boosting machines are an ensemble learning method that builds a series of weak learners, typically decision trees, to form a strong learner [30]. The weak learners are combined by iteratively adding them to the model while minimizing a loss function. Notable gradient boosting machines for classification tasks include XGBoost [10], CatBoost [14], and LightGBM [13]. These three algorithms have introduced various improvements and optimizations to the original gradient boosting framework, such as efficient tree learning algorithms, handling categorical features, and reducing memory usage.

Quantum Machine Learning
Quantum machine learning is an emerging interdisciplinary field that leverages the principles of quantum mechanics and quantum computing to improve or develop novel algorithms for machine learning tasks [2]. This section introduces two key quantum machine learning techniques, Variational Quantum Circuit and Quantum Kernel Estimator, and discusses their connections to classical machine learning techniques. Additionally, we briefly introduce Qiskit Machine Learning, a Python package developed by IBM for implementing quantum machine learning algorithms. Furthermore, we want to mention the work done by [18] for a review of quantum machine learning algorithms and a more detailed discussion of the employed algorithms.

Variational Quantum Circuit (VQC)
VQC is a hybrid quantum-classical algorithm that can be viewed as a quantum analog of classical neural networks, specifically the multilayer perceptron [5,6]. VQC employs a parametrized quantum circuit, which is trained using classical optimization techniques to find the optimal parameters for classification tasks. The learned quantum circuit can then be used to classify new data points. Figure 1 illustrates the schematic depiction of the variational quantum circuit, which involves preprocessing the data, encoding it onto qubits using a feature map, processing it through a variational quantum circuit (Ansatz), measuring the final qubit states, and optimizing the circuit parameters θ, Thus, the main building blocks of the VQC are as follows: 1.
Preprocessing: The data are prepared and preprocessed before being encoded onto qubits.

2.
Feature map encoding (yellow in the figure): The preprocessed data are encoded onto qubits using a feature map.

3.
Variational quantum circuit (Ansatz) (steel-blue in the figure): The encoded data undergo processing through the variational quantum circuit, also known as the Ansatz, which consists of a series of quantum gates and operations.

4.
Measurement (orange in the figure): The final state of the qubits is measured, providing probabilities for the different quantum states.

5.
Parameter optimization (Optimizer): The variational quantum circuit is optimized by adjusting the parameters θ, such as the rotations of specific quantum gates, to improve the outcome/classification. We colored the steps that are similar to classical neural networks in light blue and the other steps in yellow, steel-blue, and orange.

Quantum Kernel Estimator
QKE is a technique that leverages the quantum computation of kernel functions to enhance the performance of classical kernel methods, such as support vector machines [4,31]. By computing the kernel matrix using quantum circuits, QKE can capture complex data relationships that may be challenging for classical kernel methods to exploit.
The main building blocks for the employed QKE, which are depicted in Figure 2 are as follows: 1.
Data preprocessing: The input data are preprocessed, which may include tasks such as data cleaning, feature scaling, or feature extraction. This step ensures that the data are in an appropriate format for the following quantum feature maps.

2.
Feature map encoding (yellow in the figure): The preprocessed data are encoded onto qubits using a feature map.

3.
Kernel computation (steel-blue in the figure): Instead of directly computing the kernel matrix from the original data, a kernel function is precomputed using the quantum computing capabilities, meaning that the inner product of two quantum states is estimated on a quantum simulator/circuit. This kernel function captures the similarity between pairs of data points in a high-dimensional feature space. 4.
SVM training: The precomputed kernel function is then used as input to the SVM algorithm for model training. The SVM aims to find an optimal hyperplane that separates the data points into different classes with the maximum margin.
Here, we need to mention that in the documentation of Qiskit machine learning, the developers provided a full QKE implementation without the need to use, e.g., Scikit-learn's SVM-implementation. However, as of the writing of this article, this estimator is no longer available in Qiskit machine learning. Thus, one needs to use a support vector machine implementation from other sources after precomputing the kernel on a quantum simulator.
1 Figure 2. Schematic depiction of the quantum kernel estimator. The QKE consists of several steps. We colored the steps that are similar to classical support vector machines in light blue and the other steps in yellow and steel-blue. The employed QKE algorithm consists of a support vector machine algorithm with precomputed kernel, i.e., a classical machine learning method that leverages the power of quantum computing to efficiently compute the kernel matrix.

Qiskit Machine Learning
Qiskit Machine Learning is an open-source Python package developed by IBM for implementing quantum machine learning algorithms [24]. This package enables researchers and practitioners to develop and test quantum machine learning algorithms, including VQC and QKE, using IBM's quantum computing platform. It provides tools for building and simulating quantum circuits, as well as interfaces to classical optimization and machine learning libraries. Thus, we used this environment and the corresponding quantum simulators described in Appendix A for our experiments.

Accuracy Score for Classification
The accuracy score is a standard metric used to evaluate the performance of classification algorithms. We employed the accuracy score to evaluate all presented experiments.
It is defined as the ratio of correct predictions to the total number of predictions. The formula for the accuracy score is defined as follows: Accuracy = Number of correct predictions Total number of predictions (1) In Scikit-learn, the accuracy score can be computed using the accuracy_score function from the 'sklearn.metrics' module [16]. For more information on the accuracy score and its interpretation, refer to the Scikit-learn documentation [16].

Datasets
In this study, we used six classification datasets from various sources. Two datasets are part of the Scikit-learn library, while the remaining four are obtained/fetched from OpenML. The datasets are described below:

1.
Iris Dataset: A widely known dataset consisting of 150 samples of iris flowers, each with four features (sepal length, sepal width, petal length, and petal width) and one of three species labels (Iris Setosa, Iris Versicolor, or Iris Virginica). This dataset is included in the Scikit-learn library [16].

2.
Wine Dataset: A popular dataset for wine classification, which consists of 178 samples of wine, each with 13 features (such as alcohol content, color intensity, and hue) and one of three class labels (class 1, class 2, or class 3). This dataset is also available in the Scikit-learn library [16].

3.
Indian Liver Patient Dataset (LPD): This dataset contains 583 records, with 416 liver patient records and 167 non-liver patient records [32]. The dataset includes ten variables: age, gender, total bilirubin, direct bilirubin, total proteins, albumin, A/G ratio, SGPT, SGOT, and Alkphos. The primary task is to classify patients into liver or non-liver patient groups.

4.
Breast Cancer Coimbra Dataset: This dataset consists of 10 quantitative predictors and a binary dependent variable, indicating the presence or absence of breast cancer [33,34]. The predictors are anthropometric data and parameters obtainable from routine blood analysis. Accurate prediction models based on these predictors can potentially serve as a biomarker for breast cancer.

5.
Teaching Assistant Evaluation Dataset: This dataset includes 151 instances of teaching-assistant (TA) assignments from the Statistics Department at the University of Wisconsin-Madison, with evaluations of their teaching performance over three regular semesters and two summer semesters [35,36]. The class variable is divided into three roughly equal-sized categories ("low", "medium", and "high"). There are six attributes, including whether the TA is a native English speaker, the course instructor, the course, the semester type (summer or regular), and the class size.

6.
Impedance Spectrum of Breast Tissue Dataset: This dataset contains impedance measurements of freshly excised breast tissue at the following frequencies: 15.625, 31.25, 62.5, 125, 250, 500, and 1000 KHz [37,38]. The primary task is to predict the classification of either the original six classes or four classes by merging the fibro-adenoma, mastopathy, and glandular classes whose discrimination is not crucial.
These datasets were selected for their diverse domains and varied classification tasks, providing a robust testing ground for the quantum classifiers we employed in our experiments. Furthermore, we used artificially generated datasets to control the number of samples. Here, Scikit-learn provides a valuable function called make_classification to generate synthetic classification datasets. This function creates a random n-class classification problem, initially creating clusters of points normally distributed about vertices of an n-informative-dimensional hypercube, and assigns an equal number of clusters to each class [16]. It introduces interdependence between features and adds further noise to the data. The generated data are highly customizable, with options for specifying the number of samples, features, informative features, redundant features, repeated features, classes, clusters per class, and more. For more details on the make_classification function and its parameters, refer to the Scikit-learn documentation available on scikit-learn.org (accessed on 25 June 2023).

Data Obtained from Lie-Algebras
We construct another artificial dataset final dataset for our final evaluation; however, this time, we do this by using tools from the theory of Lie groups. The reason for employing these concepts is that we want to produce data that resembles the complexity inherent to the Qubit-Vectorspace of quantum machine learning and that, furthermore, is generated by applying transformations on vectors that are similar to the manipulations present in quantum machine learning algorithms, e.g., for the VQC, rotations of/around the Blochsphere. Thus, overall, we aim to provide random data for a classification task to show a case where the authors assume quantum machine learning algorithms can, because of their inherent structure, outperform classical machine learning algorithms, and thus, provide a prototype on the type of data specifically tailored to address the inherent structure of quantum machine learning. The theoretical foundations of this section are obtained from [39], and thus, the interested reader is referred to this book for a profound introduction to Lie groups. To further explain the employed ideas, we start by introducing the concept of a Lie group G and the corresponding Lie-algebra g.
A Lie group is a mathematical structure that captures the essence of continuous symmetry. Named after the Norwegian mathematician Sophus Lie, Lie groups are ubiquitous in many areas of mathematics and physics, including the study of differential equations, geometry, and quantum mechanics.
A Lie group is a set G that has the structure of both a smooth manifold and a group in such a way that the group operations (multiplication and inversion) are smooth. That is, a Lie group is a group that is also a differentiable manifold, such that the group operations are compatible with the smooth structure.
Thus, a Lie group is a set G equipped with a group structure (i.e., a binary operation G × G → G, (g, h) → gh that is associative, an identity element e ∈ G, and an inversion operation G → G, g → g −1 ) and a smooth manifold structure such that the following conditions are satisfied:
Lie algebra is associated with each Lie group, a vector space equipped with a binary operation called the Lie bracket. The Lie algebra captures the local structure of the Lie group near the identity element, meaning that the Lie algebra of a Lie group G is the tangent space at the identity, denoted T e G, equipped with the Lie bracket operation. The Lie bracket is defined in terms of the group operation and the differential.
There is a map from the Lie algebra to the Lie group called the exponential map, denoted exp : T e G → G. The exponential map provides a way to generate new group elements from elements of the Lie algebra. In particular, given an element X of the Lie algebra, exp(X) is a group element close to the identity if X is 'small'. We will exploit this concept to generate random data associated with a specific group: We start with a set of generators T a contained within the Lie-algebra g of a Lie group G, where a = 1, 2, . . . d g , i.e., the dimension of the Lie-algebra. We can then create elements g ∈ G by employing: We used the condition for our θ a -values without loss of generality due to the periodicity of the exponential function. To generate our random data, we randomly choose our θ a and create an element of our group. We then apply this element to a corresponding base vector of our vector space. Specifically, in our example, we use the Lie-group SU (2). The special unitary group of degree 2, denoted as SU (2), is a Lie group of 2 × 2 unitary matrices with determinant 1.
The corresponding Lie algebra, su(2), consists of 2 × 2 Hermitian traceless matrices, i.e., the Pauli matrices: The commutation relations of the Pauli matrices form the structure of the su(2) Lie algebra: where [·, ·] denotes the commutator and ε ijk is the Levi-Civita symbol.
To generate a classification dataset from this algebra, we use the following procedure: 1.
Find a set of random parameters θ ∈ (0; π], φ ∈ (0; 2π], λ ∈ (0; 2π]; 2. We then create an element U of SU(2) using these these randomly set parameters: Next, we take one of the basevectors from C 2 , denoted asv to create a new complex vector v using the previously obtained matrix U such that: This vector is then separated into four features F j such that: where v 1 and v 2 denotes the individual components of the vector v, and Re[. . . ] and Im[. . . ] denote their respective real and imaginary parts; 5.
Finally, we assign a class label C to this collection of features such that: and collect the features and the class label into one sample [F 1 , F 2 , F 3 , F 4 , C]. We repeat this process N S times, starting with 1, where N S is the number of samples that we want for our dataset.
Note that this approach can be extended to arbitrary Lie groups, given that one can construct or obtain a Lie group's generators.

Experimental Design
In this section, we describe our experimental design, which aims to provide a fair and comprehensive comparison of the performance of classical machine learning (ML) and quantum machine learning techniques, as discussed in Sections 3.2 and 3.3. Our experiments involve two main components: Firstly, assessing the algorithms' performance on artificially generated datasets with varying parametrizations, and secondly, evaluating the algorithms' performance on benchmark datasets using randomized search to optimize hyperparameters, ensuring a fair comparison. By carefully selecting our experimental setup, we avoid the issue of "cherry-picking" only a favorable subset of results, a common problem in machine learning, leading to heavily biased conclusions.

Artificially Generated Scikit Datasets
To generate the synthetic classification dataset, we utilized Scikit-learn's make_classification function. We employed two features and two classes while varying the number of samples to obtain a performance curve illustrating how the chosen algorithms' performance changes depending on the sample size.
We partitioned each dataset such that 20% of the original data were reserved as a test set to evaluate the trained algorithm, producing the accuracy score used for our assessment. Furthermore, each dataset was normalized such that all features are within the unit interval [0, 1].
As a baseline, we employed the seven classical machine learning algorithms described in Section 3.2, namely Lasso, Ridge, MLP, SVM, XGBoost, LightGBM, and CatBoost. We used two different parameterizations for the classical machine learning algorithms for our comparisons. Firstly, we applied the out-of-the-box implementation without any hyperparameter optimization. Secondly, we used an optimized version of each algorithm found through Scikit-learn's RandomizedSearchCV by testing 20 different models.
We then examined 20 distinct parameter configurations, each for the VQC and QKE classifiers, randomly selected from a predefined parameter distribution. Appendix A discusses the parameter grids for all utilized algorithms and all experiments.

Artificially Generated SU(2) Datasets
For our synthetic SU(2) classification dataset, we used the concepts previously discussed in Section 3.5.1. We employed two complex features, i.e., resulting in four continuous real features, and two classes while varying the number of samples to obtain a performance curve illustrating how the chosen algorithms' performance changes depending on the sample size.
We partitioned each dataset such that 20% of the original data were reserved as a test set to evaluate the trained algorithm, producing the accuracy score used for our assessment. Furthermore, each dataset was normalized such that all features are within the unit interval [0, 1].
As a baseline, we employed the seven classical machine learning algorithms described in Section 3.2, namely Lasso, Ridge, MLP, SVM, XGBoost, LightGBM, and CatBoost. We used two different parameterizations for the classical machine learning algorithms for our comparisons. Firstly, we applied the out-of-the-box implementation without any hyperparameter optimization. Secondly, we used an optimized version of each algorithm found through Scikit-learn's RandomizedSearchCV by testing 20 different models.
We then examined 20 distinct parameter configurations, each for the VQC and QKE classifiers, randomly selected from a predefined parameter distribution. Appendix A discusses the parameter grids for all utilized algorithms and all experiments.

Benchmark Datasets and Hyperparameter Optimization
Our last experiment was to test the two employed quantum machine learning algorithms against the classical machine learning algorithms on six benchmark datasets (Section 3.5). For this reason, we employed Scikit-learn's RandomizedSearchCV to test 20 randomly parameterized models for each algorithm to report the best of these tests. Again, we used a train-test-split to keep 20% of the original data to test the trained algorithm. Furthermore, each dataset was normalized such that all features are within the unit interval [0, 1].

Results
In this section, we present the results of our experiments, comparing the performance of classical machine learning and quantum machine learning techniques on both artificially generated datasets and benchmark datasets (Section 3.5). By analyzing the results, we aim to draw meaningful insights into the strengths and weaknesses of each approach and provide a blueprint for future studies in the area. Everything was calculated on a Lenovo ThinkCentre machine using an Intel(R) Core(TM) i7-4770 CPU 3.40GHz and 16GB RAM and Linux 20.04. We used python 3.6 and the included packages are the following: •

Performance on Artificially Generated Scikit Datasets
In this section, we compare the performance of quantum machine learning algorithms and classical machine learning algorithms on artificially generated classification datasets. The comprehensive experimental setup can be found in Section 4.1.
Regarding accuracy and runtime, our findings are presented in Tables 1 and 2 and Figures 3-5. The measured runtime includes hyperparameter tuning via randomized search and five-fold cross-validating, training, and testing the model.
While QML algorithms perform reasonably well, we observe that they are not a match for properly trained and/or sophisticated state-of-the-art classifiers. Even out-of-the-box implementations of state-of-the-art ML algorithms outperform QML algorithms on these artificially generated classification datasets.
The accuracy of the algorithms varies depending on the dataset size, with larger datasets posing more challenges. CatBoost performed best in our experiments, both outof-the-box and when optimized in terms of high accuracy over all experiments. The quantum kernel estimator is the fifth-best algorithm overall in terms of accuracy, though it outperforms CatBoost regarding the runtime for CatBoost's optimized version. XGBoost and support vector classification (SVC) follow closely, with competitive performances in terms of accuracy. However, variational quantum circuit struggles to achieve high accuracy compared to sophisticated boosting classifiers or support vector machines. Furthermore, we observe the best performance in terms of runtime for the two linear models, Lasso and Ridge. We need to point out that Lasso and Ridge both feature increased runtimes for the datasets of size 50; this is most likely due to the optimizer needing an increased number of iterations due to the small number of samples and their relatively scattered distribution of data points.
Other algorithms, such as multilayer perceptron, Ridge regression, Lasso regression, and LightGBM, exhibit varying performances depending on dataset size and optimization. Despite some reasonable results from QKE, we conclude that classical ML algorithms, particularly sophisticated boosting classifiers, should be chosen to tackle similar problems due to their ease of implementation, better runtime, and overall superior performance.
In summary, while QML algorithms have shown some promise, they cannot yet compete with state-of-the-art classical ML algorithms on artificially generated classification datasets in terms of accuracy and runtime.       datasets. The upper part illustrates the behavior of the accuracies, while the lower part demonstrates how the runtimes change with the increasing size of the test datasets. The right part contains the legend, indicating which algorithms were used, and more specifically, the different parametrizations of the employed quantum machine learning algorithms. Furthermore, the legend is sorted in decreasing order of the average accuracy of the employed algorithms. The parametrization for the QKE is as follows: QKE, feature map, quantum simulator, C-Value for the SVM algorithm. The parametrization for the VQC is as follows: VQC, feature map, Ansatz, optimizer, quantum simulator.

Performance on Artificially Generated SU2 Datasets
In this section, we compare the performance of quantum machine learning algorithms and classical machine learning algorithms on artificially generated classification datasets based on Lie group structures. The detailed experimental setup can be found in Section 4.2.
Regarding accuracy and runtime, our findings are presented in Tables 3 and 4 and Figures 6-8. While QML algorithms perform reasonably well, we observe that they are not a match for properly trained and/or sophisticated state-of-the-art classifiers. Even out-ofthe-box implementations of state-of-the-art ML algorithms outperform QML algorithms on artificially generated classification datasets that are particularly suited for QML.
The accuracy of the algorithms varies depending on the dataset size, with larger datasets providing increased accuracy for most algorithms. CatBoost performed best in our experiments, both out-of-the-box and when optimized in terms of high accuracy over all experiments. The quantum kernel estimator is the fifth-best algorithm overall in terms of accuracy. However, we observe that, on average, CatBoost with improved hyperparameters performs best over all experiments, but is outperformed by the best QKE implementation for 100 and 500 data points. Thus, we conclude that quantum kernel estimators can capture the complexity of this SU(2)-generated dataset, but overall, one is better off with an out-of-the-box CatBoost implementation. This means that we do not observe a quantum advantage for this type of data, but rather that the employed quantum kernel estimator behaves similarly to classical machine learning algorithms, i.e., it exhibits reasonable performance but does not perform best for all datasets, even the ones created by exploiting quantum symmetry properties.
Other algorithms, such as multilayer perceptron, Ridge regression, Lasso regression, and LightGBM, exhibit varying performances depending on dataset size and optimization. Despite some reasonable results from QKE, we conclude that classical ML algorithms, particularly sophisticated boosting classifiers, should be chosen to tackle similar problems due to their ease of implementation, better runtime, and overall superior performance. Furthermore, we again observe the best performance in terms of runtime for the two linear models, Lasso and Ridge. Moreover, again, we observe that Lasso and Ridge both feature increased runtimes for the datasets of size 50.
In summary, while QML algorithms have shown some promise, they cannot yet compete with state-of-the-art classical ML algorithms even on these SU(2)-datasets, where the authors intended to provide evidence for the quantum advantage for datasets generated from symmetry properties inherent to quantum mechanics. Table 3. This table presents the scores/accuracies of our experiments conducted on classification datasets generated via SU(2) generators of varying sizes, e.g., 50 and 100. Given these different dataset sizes, this table is sorted in decreasing order of the average accuracy over all different sample sizes of each algorithm. The parametrization for the QKE is as follows: QKE, feature map, quantum simulator, C-Value for the SVM algorithm. The parametrization for the VQC is as follows: VQC, feature map, Ansatz, optimizer, quantum simulator. For the classical machine learning algorithms, OutOfTheBox means that we did not tune the hyperparameters of the employed algorithm and RandomSearch refers to hyperparameter optimization via a randomized search.

Results on Benchmark Datasets
In this section, we discuss the performance of quantum machine learning and classical machine learning algorithms on six benchmark datasets described in Section 3.5. We include results for the quantum classifiers detailed in Section 3.3 and the classical machine learning classifiers discussed in Section 3.2. The scores/accuracies were obtained using randomized search cross-validation from Scikit-learn with 20 models and five-fold cross-validation.
Our results, shown in Table 5, display the best five-fold cross-validation scores (upper table) and the scores of the best model evaluated on an unseen test subset of the original data (lower table), which makes up 20% of the original data. We observe varying performances of the algorithms on these benchmark datasets. Table 5. These tables present the scores/accuracies of our experiments conducted on publicly available classification datasets. The upper table displays the best five-fold cross-validation scores, obtained using randomized search cross-validation from Scikit-learn, which were employed to identify the optimal model. The lower table shows the scores of the best model evaluated on an unseen test subset of the original data. We include results for the six datasets described in Section 3.5, the quantum classifiers detailed in Section 3.3, and the classical machine learning classifiers discussed in Section 3.2. Notably, both the variational quantum circuit and the quantum kernel estimator classifier show competitive performance on several datasets but do not consistently outperform classical ML algorithms. In particular, QKE achieves a perfect score on the Iris dataset, but its performance varies across the other datasets.

Classifier\Dataset
Classical ML algorithms, such as multilayer perceptron, support vector machines, XGBoost, LightGBM, and CatBoost, exhibit strong performance across all datasets, with some algorithms achieving perfect scores on multiple datasets. CatBoost consistently performs well, ranking as the top-performing algorithm on three of the six datasets. Ridge and Lasso regression show high accuracy on Iris and Wine datasets but perform poorly on the others.
When comparing the runtimes of the experiments, as presented in Table 6, it becomes evident that QML algorithms take substantially longer to execute than their classical counterparts. For instance, the VQC and QKE classifiers take hours to days to complete on various datasets, whereas classical ML algorithms such as Ridge, Lasso, MLP, SVM, XGBoost, LightGBM, and CatBoost typically take seconds to minutes. This significant difference in runtimes could be attributed to the inherent complexity and resource requirements of QML algorithms, which generally demand specialized quantum hardware and simulators. On the other hand, classical ML algorithms are optimized for execution on conventional hardware, making them more efficient and faster to run.
In conclusion, while QML algorithms such as VQC and QKE demonstrate potential in achieving competitive performance on certain datasets, their relatively longer runtimes and less consistent performance across the benchmark datasets may limit their practical applicability compared to classical ML algorithms. Classical ML algorithms, such as CatBoost, XGBoost, and LightGBM, continue to offer superior and more consistent performance with faster execution times, solidifying their place as reliable and powerful tools for classification tasks. Table 6. This table presents the combined runtimes of our experiments conducted on well-known and publicly available classification datasets. The runtimes include both the five-fold randomized search cross-validation process from Scikit-learn, which was employed to identify the optimal model, and the evaluation of the best model on an unseen test subset of the original data. We include results for the six datasets described in Section 3.5, the quantum classifiers detailed in Section 3.3, and the classical machine learning classifiers discussed in Section 3.2.

Comparison and Discussion
In this study, we have compared the performance of quantum machine learning and classical machine learning algorithms on six benchmark datasets and two types of artificially generated classification datasets. We included results for quantum classifiers, such as variational quantum circuit and quantum kernel estimator, and classical machine learning classifiers, such as CatBoost, XGBoost, and LightGBM. Our experiments showed that while QML algorithms demonstrate potential in achieving competitive performance on certain datasets, they do not consistently outperform classical ML algorithms. Additionally, their longer runtimes for the whole process, i.e., hyperparameter tuning via randomized search and five-fold cross-validation, the corresponding training and testing, and less consistent performance across the benchmark datasets, may limit their practical applicability compared to classical ML algorithms, which continue to offer superior and more consistent performance with faster execution times. Furthermore, we constructed artificial datasets with the structure and rulings of quantum Mechanics in mind, i.e., we used symmetry properties and unitary transformations to generate a classification dataset from SU(2)-matrices in order to demonstrate an advantage of quantum machine learning algorithms to tackle problems with an inherent structure relatable to that of quantum circuits and quantum mechanics overall. However, also for these datasets, the employed quantum machine learning algorithms performed reasonably but did not outperform sophisticated boost classifiers. Thus, we cannot conclude a quantum advantage for these datasets.
It is essential to highlight that the QML algorithms' performance in our experiments was based on simulated quantum infrastructures. This is a significant limitation to consider, as the specific constraints and characteristics of the simulated hardware may influence the performance of these algorithms. Furthermore, given the rapid advancement of quantum technologies and hardware, this constraint might be obsolete in the near future.
The impact of quantum simulators, feature maps, and quantum circuits on the performance of quantum estimators stems from the fact that these components play crucial roles in shaping the behavior and capabilities of quantum machine learning algorithms. Quantum simulators, which emulate quantum systems on classical computers, introduce various levels of approximation and noise, leading to deviations from ideal quantum behavior. Different simulators may employ distinct algorithms and techniques, resulting in variations in performance.
Feature maps, responsible for encoding classical data into quantum states, determine how effectively the quantum system can capture and process information. The choice of feature map can greatly influence the ability of quantum algorithms to extract meaningful features and represent the data in a quantum-mechanical space.
Similarly, quantum circuits, composed of quantum gates and operations, define the computational steps performed on the encoded data. Different circuit designs and configurations can affect the expressiveness and depth of the quantum computation, potentially impacting the accuracy and efficiency of the quantum estimators.
Considering the diverse options for quantum simulators, feature maps, and quantum circuits, it becomes essential for researchers to provide detailed explanations of their hyperparameter choices. This entails clarifying the rationale behind selecting a specific simulator, feature map, or circuit design, as well as the associated parameters and their values. By providing such explanations, researchers can enhance the reproducibility and comparability of results, enabling the scientific community to better understand the strengths and limitations of different quantum machine learning algorithms.
Unfortunately, the current state of the field often overlooks the thorough discussion of hyperparameter choices in many studies. This omission restricts the transparency and interpretability of research outcomes and hinders the advancement of quantum machine learning. To address this issue, researchers should embrace a culture of providing comprehensive documentation regarding hyperparameter selection, sharing insights into the decision-making process, and discussing the potential implications of different choices.
By encouraging researchers to provide detailed explanations of hyperparameter choices and corresponding code, we can foster a more robust and transparent research environment in quantum machine learning. This approach enables the replication and comparison of results, promotes knowledge sharing, and ultimately contributes to the development of reliable and effective quantum machine learning algorithms. Additionally, our program code serves as introductory material, providing easy-to-use implementations and a foundation for comparing quantum machine learning and classical machine learning (CML) algorithms.
One possible direction for future research is exploring quantum ensemble classifiers and, consequently, quantum boosting classifiers, as suggested by Schuld et al. [40]. This approach might help in improving the capabilities of QML algorithms and make them more competitive with state-of-the-art classical ML algorithms in terms of high accuracies.
Finally, the relatively lower performance of the employed quantum machine learning algorithms compared to, for example, the employed boosting classifiers might be attributed to quantum machine learning, being constrained by specific rules of quantum mechanics.
In the authors' opinion, quantum machine learning might be constrained by the unitary transformations inherent in, for example, the variational quantum circuits. These transformations are part of the unitary group U(n). Thus, all transformations are constrained by symmetry properties. Classical machine learning models are not constrained by these limitations, meaning that, for instance, different activation functions in neural networks do not preserve certain distance metrics or probabilities when processing data. However, expanding the set of transformations of quantum machine learning and getting rid of possible constraints might improve the capabilities of quantum machine learning models such that these algorithms might be better capable of capturing the information of more complex data. However, this needs to be discussed in the context of quantum computers such that one determines what all possible transformations on a quantum computer are. This means that future research needs to consider the applicability of advanced mathematical frameworks for quantum machine learning regarding the formal requirements of quantum computers.
Furthermore, another constraint of quantum machine learning is that it, and quantum mechanics in general, relies on Hermitian matrices, e.g., to provide real-valued eigenvalues of observables. However, breaking this constraint might be another way to broaden the capabilities of quantum machine learning to better capture complexity, e.g., by using non-Hermitian kernels in a quantum kernel estimator. Here, we want to mention the book by Moiseyev [41], which introduces non-Hermitian quantum mechanics. Furthermore, quantum computers, in general, might provide a testing ground for non-Hermitian quantum mechanics in comparison to Hermitian quantum mechanics. However, at this point, this is rather speculative, but given that natural data are nearly always corrupted by noise and symmetries are never truly perfect in nature, breaking constraints and symmetries might be ideas to expand the capabilities of QML.

Conclusions
In this research, we have explored the applicability of quantum machine learning for classification tasks by examining the performance of variational quantum circuit and quantum kernel estimator algorithms. Our comparison of these quantum classifiers with classical machine learning algorithms, such as XGBoost, Ridge, Lasso, LightGBM, CatBoost, and MLP, on six benchmark datasets and artificially generated classification datasets demonstrated that QML algorithms can achieve competitive performance on certain datasets. However, they do not consistently outperform their classical ML counterparts, particularly with regard to runtime performance and accuracy. Quite the contrary, classical machine learning algorithms still demonstrate superior performance, especially in terms of increased accuracy, in most of our experiments. Furthermore, we cannot conclude a quantum advantage even for artificial data built by data manipulations inherent to quantum mechanics.
As our study's performance comparison relied on simulated quantum circuits, it is important to consider the limitations and characteristics of simulated hardware, which may affect the true potential of quantum machine learning. Given the rapid advancement of quantum technologies and hardware, these constraints may become less relevant in the future.
Quantum simulators, feature maps, and quantum circuits significantly influence quantum estimator performance; hence, a detailed discussion of the chosen hyperparameters is essential. The absence of such a discussion in current research limits the interpretation and replication of experiments. Thus, we aim to encourage transparency in decision-making processes to promote a robust research environment, aiding in knowledge sharing and the creation of reliable quantum machine learning algorithms.
Despite the current limitations, this study has shed light on the potential and challenges of quantum machine learning compared to classical approaches. Thus, by providing our complete code in a GitHub repository, we hope to foster transparency, encourage further research in this field, and offer a foundation for other researchers to build upon as they explore the world of quantum machine learning. Furthermore, the developed SU(2)-data creation might serve as a quantum data prototype for future experiments, and both quantum and regular machine learning algorithms can be tested for their accuracy on datasets like these.
Future research should also consider exploring quantum ensemble classifiers and quantum boosting classifiers, as well as addressing the limitations imposed by the specific rules of quantum mechanics. By breaking constraints and symmetries and expanding the set of transformations in quantum machine learning, researchers may be able to unlock its full potential.