Predicting Partner’s Digital Transformation Based on Artiﬁcial Intelligence

: Partner’s digital transformation is one of the most important metrics for businesses, particularly for businesses in the subscription world. Hence, how to predict partner transformation is a consistent focus in the industry. In this paper, we use an AI (Artiﬁcial Intelligence) relevant algorithm to analyze partner’s digital transformation issues and propose a novel method, named the hybrid VKR (VAE, K-means, and random forest) algorithm, to predict partner transformation. We apply our algorithm to partner transformation issues. First, we show the prediction of about 5980 partners from 25689 partners, who are transformed and sorted according to important indicators. Secondly, we recap the tremendous effort that was required by the company to obtain high-quality results for economic change when a partner is transforming along with one or many of the transformation dimensions. Finally, we identify unethical behavior by looking through deal transaction data. Overall, our work sheds light on several potential problems in partner transformation and calls for improved scientiﬁc practices in this area.


Introduction
Today, more and more attention has been attracted to the phenomenon of partners' digital transformation. Many companies, such as AWS (Amazon Web Services) [1], Microsoft [2], and NetApp [3] have been launched by the partner transformation revolution, which demonstrates the importantance of partner digital's transformation. Our research industry, which is based on subscription production, is also largely influenced by partner transformation, so we are glad to take on this challenge to contribute to the strategy of our company. The company's partners account [4] for more than 80% of the total revenue and serve as key routes to the market. As organizations are digitally evolving, there is a pressing need for a company's partners to also transform [5,6] the way they sell to different customers. Partner's digital transformation [6][7][8] is a multifaceted and fast-moving phenomenon that has significant impacts, including impacts on the business processes and models of firms. The pace of technology uptake will depend on factors such as the type of sector, market, among others While no single indicator is able to reflect the pace of development, adoption, and diffusion, combining indicators can provide insight into the relative positioning of the different players (segmentation by industry, vertical, geo, market, sectors, size, etc.). Assessing the digital transformation of organizations (whether it be business partners, customers, or competitors) is a common question facing the industry today. What makes this evaluation intrinsically challenging is the lack of clearly defined "labels" defining the transformation level of an organization. Currently, little research exists that has been introduced to deal with a company's digital transformation, including the data-driven method [9], the six Ts of transformation model [10], and the building blocks of successful digital transformation method [11]. However, these methods lack a useful guide of information in the practice of advanced technology [12], such as AI (Artificial intelligence) and machine-learning technology. In this paper, we propose a VKR (VAE, K-means, and random forest) algorithm based on AI technology, which is more efficient and more easily predicts a partner's digital transformation.
The challenge regarding a partner's digital transformation [13][14][15] that exists in a company is described below: (1) How to identify partners who are transforming or are ready to transform across one or many of the transformation dimensions (such as New Buying Centers, Platform-Based Outcomes, and Customer Experience Life cycle); (2) When a partner is transforming along one or many of the transformation dimensions, how do their economics change; (3) Can we spot any unethical behavior by looking through the deal transaction data, which could put all of this at risk?
With these challenges in mind, the paper aims are to identify which partners are transferring or finishing transformation, which will help the company enlarge its business and gain much more revenue. The main contributions of this paper are described below: (1) Generalize useful data into a new database from raw data and denoise data; (2) Propose an efficient machine-learning model to predict partner transformation; (3) Analyze a transformation partner regarding their economic change; (4) Introduce a novel algorithm based on the model to identify partners with unethical behavior.
The rest of this paper is organized as follows: related works about partner transformation are described in Section 2; an introduction to our method about partner transformation is in Section 3 and shows the steps for data analysis; Section 4 provides the methodology for evaluation using the model selection process; the results and discussion are reported in Section 5; Section 6 offers a conclusion and expresses the outlook for future work.

Related Work
In recent years, little research has been introduced to deal with the digital transformation of a company, including the data-driven method [9], the six Ts of transformation model [10], and the building blocks of successful digital transformation method [11]. In this section, we will introduce the related work of digital transformation, including the three aforementioned methods.

Data-Driven Method
One method regarding partner transformation that utilizes a data-driven strategy for digital transformation is the data-driven method [9]. The Data-driven method is mainly used in the telecom industry, which has faced massive structural changes. Many aspects of this industry, including the customers, channels, content, and competitors, are all becoming digital, and these aspects are data-driven. The data-driven method can be characterized by three main points: (1) Grow wireless revenues, which relies on growing subscribers, growing average revenue per user, and retaining high-value subscribers; (2) grow strategic revenues, which rely on deployed software-defined platforms, rolling out new IP solutions, and having an extended footprint; (3) reduce cost structure, which relies on a streamlined organization, increased operational efficiencies, and reduced cycle time.

The Six Ts of Transformation Model
The Six Ts of Transformation model [10] is a model that could become necessary for use by educators to guide preservice school librarians in their successful transformation from teacher to school librarian leader. Using the constructs of this model in preparation programs, educators will help preservice school librarians understand their place within the school and will provide them with the tools and strategies needed to overcome barriers, such as insufficiencies in time, funding, and staffing, to enact their role as leaders.

Building Blocks of the Successful Digital Transformation Method
In general, the building blocks of a successful digital transformation method [11] is a conceptual method, which first identifies building blocks, then analyzes the technologyrelated factors, and, finally, analyzes the market-related factors. After analyzing all of the factors, conclusions are made to revise a firm's business model.
Our vision is to go beyond the above three partners' digital transformation methods (data-driven method [9], six Ts of transformation model [10], and building blocks of successful digital transformation method), as they are just provide a theoretical approach to guide firms, but they lack useful information in practice and advanced technology, such as AI (Artificial intelligence) and machine-learning technology. As such, in this paper, we propose a VKR algorithm based on AI technology to address a partner's digital transformation, which is a more efficient and easier was to predict a partner's digital transformation.

Methodology
In order to account for partner's transformation, we have to select some useful features to measure that transformation [7,[16][17][18]. First, we check the data set and clean the data reference description. Second, we design our machine-learning algorithm system to generate a novel model to predict partner transformation. Lastly, we give detailed implementation information about the key algorithm.

Data Preprocess and Feature Engineering
The raw datasets were provided by the company, which contains 3 databases and 14 tables. The data includes some extraneous information, which could act as noise and interfere with our determination of the issues mentioned in Section 1. As such, we need to extract the most important features before we construct a machine-learning model. We analyze key indicators of partner transformation regarding new buying centers, platformenabled outcomes, and customer experience. These indicators are described in more detail in Figure 1. When a partner is transforming along one or many of the transformation dimensions, how do their economics change? We deeply mined the data and generated relevant indicators, which are described in Figure 1, to define transformation dimensions. The first column stands for detail indicators while the other column stands for the attributes of DB (Database) tables. Based on the analysis of the confusion matrix, the highest misclassification rate observed is between the Early Majority and Late Majority. We also notice that, although the class distribution has become more uniform after using SMOTE (Synthetic Minority Oversampling Technique), the misclassification rate and accuracy are still sensitive to the accuracy of ground truth labels. This may be due to the transformation scorecard that was computed being from a final select set of measures based on data availability, significance, and the business assumption that all measures are weighted on an equal footing for the overall score, the detailed information about feature importance values is described in Figure 2.

Machine-Learning Model Solution
In order to show our detailed work about model generation, we describe the main process of model generation. The process of our solution mainly includes data collection, data preprocessing, feature extraction [16], model training, and prediction. Figure 3 illustrates the detailed workflow of the machine-learning system solution architecture.

VKR Algorithm Analysis
Here we use VAE [5,19], K-means, and random forest algorithms as our algorithm conductor that is known as the hybrid VKR described as Algorithm 1. From there, the data is fed into the main training loop of the VAE. An encoder and decoder artificial neural network (ANN) are defined with three hidden layers and a 2D mean layer. The mean VAE layer is the feature embedding layer of a VAE and is used with representation learning to visualize the generated latent space. The VAE was trained in 80 epochs with a batch size of 128 data samples. The Adam optimizer and learning-rate scheduler have been used to train the encoder and decoder network. The VAE optimization objective (loss function) is the sum of the reconstruction loss calculated with binary cross-entropy (BCE) [20,21] and the KL (Kullback-Leibler) divergence as a measure of how the data distribution is different from a VAE latent prior distribution (Gaussian). At the end of the training process, we get a trained VAE model which is stored on a disk for easy loading and usage. Algorithm 1: Hybrid VKR (VAE, K-means, random forest) algorithm Input: Data matrix X ∈ R m * n , there m is total data number and n is the dimension of each data 1. Deal with missing values(Sparse coding) 2. Use normalization data to restructure the input data matrix X

Experiment
We conducted three experiments: (1) predict whether partner transformation will occur; (2) economic changes upon partner transformation; and (3) identifying partners with unethical behavior. Every experiment starts with a hyperparameter cluster, and once the optimal parameter values are found, they are used for the final condition to give the result.

Dataset
We have used the dataset provided from the company's private data; this dataset contains 3 databases and 14 tables. Some of the data is presented in Table 1. We generateed our dataset from the raw dataset and trained our model from it. The attributes of the training dataset are described in Table 2.

Experimental Method
The main process of the experiment is first inputting clean data into the VAE model to get a latent vector, and then for the prediction of whether a partner transformation will occur using the random forest algorithm, amd finally using the K-means algorithm to identify any unethical issues.
From there, the data is fed into the main training loop of the VAE. An encoder and decoder artificial neural network (ANN) are defined with three hidden layers and a 2-D mean layer. The mean VAE layer is the feature embedding layer of a VAE and is used with representation learning to visualize the generated latent space. The VAE was trained in 80 epochs with a batch size of 128 data samples. The Adam optimizer and learning rate scheduler have been used to train the encoder and decoder network. The VAE optimization objective (loss function) is the sum of the reconstruction loss calculated with binary crossentropy (BCE) and the KL divergence as a measure of how the data distribution is different from a VAE latent prior distribution (Gaussian). At the end of the training process, we get a trained VAE model which is stored on a disk for easy loading and usage.  The colored areas in the latent space are mapped back to the preprocessed input data set. The corresponding x-y-coordinates of the mean vector have been determined by looking at the latent space plot. The coordinates are used with a Pandas slice operation in order to extract a subset of the data points for each transformation dimension. These subsets of the input data are then used for the analysis of the output variables (bookings, revenue, gross margin, deal size, and deal velocity) across the four transformation dimensions. Each data point represents a partner (Partner_site_party_key) or an aggregated set of such keys in the case of multi-partner domains such as New_Buying_Center. After finishing the VAE part, there will be an output of the latent vector; next, the latent vector will be fed into a random forest algorithm to identify partner transformation, and will also be used with a K-means algorithm to identify unethical issues.

Results and Discussion
After the experiment, we got the results regarding the questions stated in Section 1, which will be described in detail below.

How Do We Identify Partners Who Are Transforming or Are Ready to Transform across One or Many of the Transformation Dimensions
We used the same data as dealt with in Section 3. As previously mentioned, from the data generated dimensions including buying centers, platform-based outcomes, and customer experience lifecycle [22][23][24].

The Result of Prediction Transformation Partners
We predicted about 5980 partners who are transforming, as described in Table 3.

Discussion
From the above analysis, we predict that about 5980 partners are transforming based on the important dimensions of the partner transformation, such as buying centers, platformbased outcomes, and customer experience lifecycle. We identified that the most important indicators are the rate of revenue new type and the rate of hardware.

When a Partner Is Transforming along One or Many of the Transformation Dimensions, How
Do Their Economics Change?

Relevant Data Collection
First, we needed to pick the useful attribute columns. According to the title, we have selected some useful attributes regarding these aspects and arranged them as Table 4. For most of the above attributes, we find the overall performance of each partner by finding the sum of a group of partners in these aspects. For deal velocity, the mean value of the difference between two columns is obtained to reflect the order processing speed. The result is a large table with partner_site_party_key as the primary attribute, as shown in the figure below. This table has 7316 rows and 14 attributes.
In order to have a holistic view of the data, we obtain individual statistics for each column attribute, as shown in Table 5. From Table 5, it can be observed that some points are deviating from the mean value in some columns. In order to avoid the influence of some abnormal conditions on the result, we have removed the partner belonging to the largest two points and the smallest two points in each column. Next, we compare the changes and differences between partners who are transforming and those who are not. According to the classification results of Section 4.1, we divided the table into two classes and obtained the transformed tables with 2567 rows and untransformed tables with 4693 rows. We take the mean of each column to compare the two types of data, with the results being shown in Figure 6. The top is the partner that is not transforming, and the bottom is the partner that is transforming. From Tables 2 and 3, we can see that the booking, revenue, and gross margin of the partners that are transforming are all larger than the mean value of the partners that are not transforming. The deal size of the partners that are not transforming is larger than the mean value of the partners that are transforming. However, there is not much difference between them in deal velocity.

Data Comparison Diagram
Finally, a comparison diagram is given for the two partner types for each attribute in Figure 7a-h. The blue bars represent the untransformed partner, the yellow bars represent is the transformed partner, and the vertical coordinate is the specific value of the attribute.

Data Comparison Diagram
Finally, a comparison diagram is given for the two partner types for each attribute in Figure 7a-h. The blue bars represent the untransformed partner, the yellow bars represent is the transformed partner, and the vertical coordinate is the specific value of the attribute.

Discussion
Recently, some research has been introduced to deal with the digital transformation of a company, including the data-driven method [9] and the six Ts of transformation model [10]. However, these methods lack the use of advanced technology, such as AI (Artificial intelligence) and machine-learning technology. In this paper, we add the use of AI technology to solve partner's transformation issues and call for improved scientific practices in this area. In order to identify when a partner is transforming along one or many of the transformation dimensions, how do their economics change? First, we collected relevant data which impacts the partner's transformation. Second, we analyzed the data and obtained the transformed tables with 2567 rows and untransformed tables with 4693 rows. Finally, we compared the two partner types for each attribute in Figure 7.

Anomaly Detection
Regarding the anomaly detection [8,25,26] issue, we use data mining techniques to find the main indicators of the discount, and then we use the discount relevant feature to identify any unethical behavior of the partner [27][28][29]. After the analysis, an important indicator of unethical behavior is the discount rate, so we chose two attributes in the table bookings_f.
Using pre_discount_credit_sum and rebate_bookings_sum as the two-dimensional data to create the horizontal and vertical coordinates, we can get a whole to scatter distribution, as shown in Figure 8. It can be seen from the figure that most of the points are concentrated together. The x-coordinate is concentrated in the range 0 to 0.1 × 10 8 , and the y-coordinate is concentrated in the range 0 to 0.5 × 10 9 . However, a small part of the point distribution is very discrete, which indicates that the function of anomaly detection can be realized through these two-dimensional features. Next, we conducted a further quantitative analysis on the data, and obtained the mean, maximum, and minimum value, as well as the median, 3/4 value, and 1/4 value of each column of data, as shown in Figure 9. As can be seen from Figure 8, the discrete points of the anomaly mainly exist in the center of the aggregation that is larger than the point. Therefore, the partner_site_party_key with the two-dimensional feature values greater than 75% are taken as the partners of possible abnormal behaviors [30][31][32][33][34]. At last, we get the final result displayed in Figure 10, partner_site_party_key in the figure is the partner that may have unethical behavior. Figure 10. Abnormal behaviors data.

Discussion
Recently, some research has been introduced to deal with the digital transformation of a company, including the data-driven method [9] and the six Ts of transformation model [10]. However, these methods lack the use of advanced technology, such as AI (Artificial intelligence) and machine-learning technology. In this paper, we add the use of AI technology to solve partner's transformation issues and call for improved scientific practices in this area. From the analysis in this section, we can spot any unethical behavior by looking through our deal transaction data, which puts partner transformation at risk. First, we used data mining techniques to find the main indicators regarding discount. Second, using "pre_discount_credit_sum" and "rebate_bookings_sum" as the two-dimensional data to create the horizontal and vertical coordinates, we can get a whole to scatter distribution, which is shown in Figure 8. Finally, we identify the unethical behavior and display the detailed results in Figure 10.

Conclusions and Prospect
This paper analyzes the challenge of partners'digital transformation, which includes the following issues: (1) How do we identify partners who are transforming or are ready to transform across one or many of the transformation dimensions (such as new buying centers, platform-based outcomes, and customer experience lifecycle); (2) when a partner is transforming along one or many of the transformation dimensions, how do their economics change; and (3) can we spot any unethical behavior by looking through our deal transaction data, which could put the partner transformation at risk?
To solve the above issues, we first analyzed the data, then designed the VKR algorithm, generalizing our machine-learning model to predict partner's digital transformation, subsequent economic change, and the identification of any unethical behavior. Finally, we found the results to answer the partner's digital transformation issues.
In order to research partner's digital transformation more deeply in the future, there are several directions for further research. The first direction is to add more data from the company, such as combining many more features. Another interesting direction is to relax the strict positive definiteness constraint and also support compact, dense feature representations, which can be achieved with the help of fast symmetric factorization techniques.