1. Introduction
The video game industry has seen exponential growth in recent years, with the global market valued at nearly USD 300 billion in 2024 [1]. In such a competitive landscape, user retention is critical to the success of game companies. A key challenge is user churn, which is commonly defined as a player ceasing to engage with the game for an extended period (e.g., four consecutive weeks), although this threshold may vary by game type or platform. Even a slight increase in churn rate can result in significant revenue loss, making churn prediction a high priority for developers and marketers.
Churn prediction involves identifying which players are likely to churn and, in some cases, when this churn may occur, often by analyzing behavioral patterns such as play frequency, in-game actions, purchases, and social interactions. Machine learning (ML) models, especially supervised classifiers, are widely used for this task. However, their performance depends heavily on the quality of input features and how well the data is preprocessed.
A crucial yet underexplored aspect of data preprocessing is how non-login periods are handled. Most existing studies either ignore these periods or treat all inactivity as equivalent to disengagement. This oversimplifies player behavior and may lead to inaccurate predictions, since inactivity can result from various factors. To address this gap, we propose a novel approach that distinguishes between different types of inactivity based on observable behavioral features and the timestamp at which a user first becomes active. We differentiate between the following:
- (1)
Inactivity before the first recorded login, which may indicate new users, dormant returnees, or temporary inaccessibility due to external factors;
- (2)
Inactivity after a user has already played, which may reflect a genuine loss of interest or temporary inaccessibility due to external factors.
This classification is rule-based, not based on ML models or statistical inference. It enables us to apply more appropriate imputation strategies depending on the likely cause of inactivity. To explore the impact of our proposed approach, we apply a range of imputation techniques, including minimum value substitution, mean, mode, linear interpolation, and multiple imputation by chained equations (MICE). These methods vary in complexity and underlying assumptions; simple methods offer speed and interpretability, while MICE better preserves inter-feature relationships by iteratively modeling variable dependencies.
Our approach is evaluated using gameplay data from the MMORPG Blade & Soul, provided by NCSoft [2]. The dataset is high-dimensional and includes a wide range of behavioral features, such as activity logs, payment transactions, and social interactions. To address this dimensionality, we apply principal component analysis (PCA) to reduce the feature space, preserve structural patterns, and minimize the risk of overfitting. The imputed data are then fed to a random forest (RF) classifier for churn prediction, chosen for its interpretability and robustness to missing or noisy data. In addition, classifier chains are employed to capture label dependencies and further enhance performance. The main contributions of this study are as follows:
- (1)
Proposing a rule-based method to classify non-login periods and apply appropriate imputation techniques.
- (2)
Evaluating the impact of various imputation methods on churn prediction performance.
- (3)
Validating our approach using real-world MMORPG data and comparing it with baseline methods that treat inactivity uniformly.
Experimental results demonstrate that our approach improves prediction accuracy and enables a more nuanced interpretation of player inactivity.
The remainder of this paper is structured as follows:
Section 2 reviews related literature.
Section 3 describes our dataset and churn labeling procedure.
Section 4 outlines the proposed framework.
Section 5 presents the results and analysis.
Section 6 concludes with key findings and future research directions.
2. Background and Related Work
2.1. User Churn Prediction
Churn prediction continues to be a major concern across sectors such as telecommunications, retail, and finance, where identifying users likely to disengage is essential for sustaining long-term customer relationships. Recent studies have employed advanced approaches that draw on behavioral patterns, usage histories, and time-based activity trends to enhance churn detection and guide retention strategies [3,4,5,6].
In the gaming industry, churn prediction has emerged as a critical research area, driven by the industry's rapid growth and the substantial revenue impact of churn [7]. Online games generate rich behavioral data, such as session frequency, in-game spending, and achievement progression, which can be used to detect early signs of disengagement. For instance, Kim et al. [8] demonstrated that a small set of log-derived features could match the performance of more complex models, highlighting the importance of feature selection in churn prediction. Similarly, Lee et al. [9] showed that combining loyalty metrics, gameplay variety, and social indicators could improve prediction accuracy.
Beyond gameplay metrics, several studies have explored the role of social and psychological factors in user retention. Park et al. [10] and Kawale et al. [11] emphasized the role of social engagement and achievement-based features in long-term retention, particularly in MMORPGs. Borbora et al. [12] and Lee [13] further identified usability, intrinsic motivation, and community belonging as key determinants of player longevity, showing that churn is influenced by a combination of social dynamics and psychological factors. These psychosocial elements often interact with behavioral signals and evolve over time, reinforcing the importance of temporal modeling in churn analysis.
Efforts to model churn over time have led to the use of temporal analytics. For example, Hadiji et al. [14] quantified the dynamics of engagement over time in free-to-play games. Drachen et al. [15] and Runge et al. [16] developed frameworks to analyze longitudinal user behavior across game lifecycles. Tamassia et al. [17] used hidden Markov models to detect player state transitions from session sequences, and Milošević et al. [18] examined how churn risk can be inferred from longitudinal activity data. Beyond understanding whether a user will churn, recent studies have focused on when churn is likely to happen. Periáñez et al. [19] proposed the conditional inference survival ensemble (CISE) to estimate churn timing across user segments, especially high-value players. Bertens et al. [20] built on this with a scalable survival ensemble model that provides precise churn timing predictions, enabling targeted retention interventions before churn occurs. However, while these models handle sequential data effectively, they often assume that all inactivity reflects disengagement, which may not always be the case.
Recent work across diverse online and mobile games has consistently demonstrated the effectiveness of churn prediction models. Performance is typically measured using the area under the receiver operating characteristic curve (AUC-ROC), accuracy, and F1 score, particularly for handling class imbalance and assessing overall prediction accuracy. Perisic et al. [21] investigated churn in a free-to-play casual game using cluster analysis and conditional RF, reporting an AUC of 0.71 and accuracy of 72%. Hossain et al. [22] studied World of Warcraft and identified RF as the best-performing model (AUC of 0.98, F1 score of 0.95, and accuracy of 97%). Hoang et al. [23] introduced a feature tokenizer transformer and an imputation strategy using gradient-boosted regression trees to predict churn in freemium mobile games, reaching an AUC of 0.95 and accuracy of 86.8%. Dontireddy et al. [24] analyzed over 40 million MMORPG character creation logs and found that XGBoost performed best, with an AUC of 0.92 and accuracy of 90%. Mulla et al. [25] compared logistic regression and RF for churn prediction in Candy Crush, reporting RF as the stronger model with 97% accuracy.
Several studies have focused on churn prediction using the Blade & Soul MMORPG datasets from 2017 and 2018. Guitart et al. [26], the winning team of the 2017 Game Data Mining competition, applied tree-based ensemble models and long short-term memory (LSTM) networks for binary churn classification, achieving an overall F1 score of 0.62. For the 2018 dataset, Sin and Paik [27] employed binary logistic regression and a neural network, reporting accuracies of 83.3% and 86.7%, respectively. More recently, Jin et al. [28] applied a graph neural network (GNN) that integrates player behavior and social relationships via a churn graph structure, achieving an F1 score of 0.93 and accuracy of 89.6%. While these studies demonstrate strong predictive performance, they primarily frame churn prediction as a binary classification task that determines only whether a user will churn. In contrast, our study formulates churn prediction as a multi-class classification problem that simultaneously estimates whether and when a user will churn, using discrete time-based labels. Moreover, we address a key limitation overlooked in prior work, which treated non-login periods as indicators of churn and did not apply imputation methods to reconstruct missing behavioral data.
2.2. Handling User Inactivity as Missing Data
In churn prediction, periods of user inactivity, often defined as the absence of login activity, are commonly treated as signs of disengagement. Most prior studies rely heavily on login frequency to construct predictive features, assuming that a prolonged absence implies declining interest. For instance, Runge et al. [16] defined churn based on a fixed inactivity window, while Castro et al. [29], Tamassia et al. [17], and Borbora et al. [12] derived features from raw login data without differentiating the causes of inactivity or addressing unobserved periods. This uniform treatment may result in misclassification when user absence is due to temporary external factors (e.g., work, travel, or technical issues) rather than a loss of interest.
In contrast, our study treats certain types of inactivity as missing data rather than definitive signs of churn. By framing these periods as a data quality issue, we apply structured imputation to reconstruct user timelines and preserve behavioral patterns. Prior research has shown that restoring continuity in behavioral data through imputation can enhance model accuracy [30].
We evaluate several imputation techniques to fill in the missing periods. Traditional methods, such as mean and mode imputation, are simple but often fail to preserve behavioral continuity [31]. Linear interpolation provides temporal smoothness but assumes consistent trends that may not be present in player activity data. Multiple imputation by chained equations (MICE) offers a more robust alternative by iteratively modeling the relationships among variables, thereby preserving the multivariate structure and temporal dependencies [32]. MICE has demonstrated strong performance in restoring data integrity and improving prediction accuracy across various domains, including streamflow analysis [33] and elastic well logs [34]. Although rarely applied to churn prediction, its ability to capture inter-variable relationships and temporal patterns makes it a promising method for imputing missing behavioral data in gaming contexts.
Our approach applies these imputation strategies based on inferred causes of inactivity. By aligning imputation methods with behavioral context, we aim to improve the completeness of input features and, in turn, the accuracy of churn prediction. This structured link between problem framing, preprocessing, and predictive modeling represents a key methodological contribution of our work.
3. Characterizing User Churn
Churn in online games differs from other domains, where membership cancellation is often used to define churn. In gaming, users rarely cancel their accounts even after prolonged disengagement. For example, Lee et al. [35] found that fewer than 1% of inactive users formally canceled their membership, making inactivity-based definitions more appropriate. In this study, we adopt the churn criterion defined in the Blade & Soul dataset from NCSoft: a user is considered churned if they remain inactive for four consecutive weeks during a 12-week churn determination period. This threshold is standard in MMORPGs, where players tend to have longer but less frequent sessions, unlike mobile games, where churn is often defined as 7–14 days of inactivity [8].
The dataset contains weekly activity logs for 100,000 users over an 8-week observation period, followed by a 12-week window for determining churn according to the NCSoft competition definition above. The churn occurrence point is the last active week observed before the qualifying four-week inactivity begins, as shown in Figure 1. Based on when this point falls relative to the end of the observation window, users are assigned to one of four churn classes, as formally defined in Table 1.
The distribution of users across the four labels is uniform, with each label accounting for 25% of users. Note that this reflects the dataset provided by NCSoft and was not artificially sampled by the authors. The following examples illustrate the labeling process.
User A: Active in weeks 2–5 → churn occurrence point is week 5 (inactive in weeks 6–12) → labeled “2 Months.”
User D: Active in weeks 2, 8, and 12 → the first 4-week inactivity starts after week 2 (inactive in weeks 3–7) → labeled “Month.”
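To make the labeling rule concrete, the following minimal Python sketch locates the churn occurrence point for a user given their set of active weeks. It is an illustrative reimplementation of the rule described above, not NCSoft's reference code; the 12-week horizon and 4-week gap follow the competition definition, while the week indexing inside the determination window is a simplifying assumption.

```python
def churn_occurrence_point(active_weeks, horizon=12, gap=4):
    """Return the last active week before the first `gap`-week run of
    inactivity within the churn determination window, or None if no
    such run occurs (i.e., the user is not labeled as churned)."""
    active = set(active_weeks)
    last_active, run = 0, 0
    for week in range(1, horizon + 1):
        if week in active:
            last_active, run = week, 0
        else:
            run += 1
            if run == gap:
                return last_active
    return None

# Examples from the text:
print(churn_occurrence_point([2, 3, 4, 5]))  # User A -> 5 (inactive from week 6)
print(churn_occurrence_point([2, 8, 12]))    # User D -> 2 (inactive weeks 3-6)
```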
4. Methodology
This section outlines our methodology. The overall workflow consists of the following key steps:
- (1)
Dataset preprocessing;
- (2)
Dataset structure;
- (3)
Inactivity treatment;
- (4)
Data structure transformation;
- (5)
Imputation methods.
4.1. Dataset Preprocessing
The Blade & Soul dataset consists of four files, as summarized in
Table 2. All features are numerical variables and cover user activity (e.g., playtime, login frequency), social interaction (e.g., party participation), and payment behavior. All 100,000 users were linked across files using unique hashed IDs, resulting in a merged dataset that included eight weeks of behavioral activity, followed by the assigned churn label. All numerical features were standardized using z-score normalization.
The activity dataset includes 36 numerical features capturing gameplay behavior, such as combat, progression, and communication.
Table 3 summarizes some key variables used for churn modeling.
Beyond gameplay activity, we also include payment behavior and party-based social interaction, both of which have been shown to correlate with churn [36,37]. Because the party dataset includes nearly 7 million records, we aggregated it into weekly summary features. Table 4 lists the derived party- and payment-related variables.
The final dataset includes 44 features and 99,485 users after excluding inconsistent records (e.g., mismatched party logs). The churn label distribution remained balanced across the four classes: 24,942, 24,693, 24,863, and 24,987 users.
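The sketch below illustrates the preprocessing flow described in this subsection under stated assumptions: the file names and column names (acc_id, week, party_id, duration) are hypothetical placeholders, since the actual schemas follow Table 2 and the NCSoft release; only the overall pattern of weekly aggregation of party logs, ID-based merging, and z-score normalization mirrors the steps above.

```python
import pandas as pd

# Hypothetical file and column names; the real schemas follow Table 2.
activity = pd.read_csv("activity.csv")   # weekly gameplay features per user
payment  = pd.read_csv("payment.csv")    # weekly payment amounts per user
party    = pd.read_csv("party.csv")      # one row per party participation event

# Aggregate the ~7M party records into weekly summary features per user.
party_weekly = (party.groupby(["acc_id", "week"])
                     .agg(party_count=("party_id", "count"),
                          party_time=("duration", "sum"))
                     .reset_index())

# Merge all sources on the hashed user ID and week.
df = (activity.merge(payment, on=["acc_id", "week"], how="left")
              .merge(party_weekly, on=["acc_id", "week"], how="left"))

# z-score normalization of the numerical features.
feature_cols = [c for c in df.columns if c not in ("acc_id", "week")]
df[feature_cols] = (df[feature_cols] - df[feature_cols].mean()) / df[feature_cols].std()
```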
4.2. Dataset Structure
The dataset used in this study is structured as panel data, recording the weekly average activity for each unique user ID. Each user has one row per week of data, with user IDs linking observations across time, as shown in Figure 2, where ID represents the unique user identifier, f denotes the features, and d the recorded data values. Due to varied engagement patterns, users have different numbers of active weeks. To standardize the input for machine learning, each user’s timeline is padded to 8 weeks using NaN (not a number) values for inactive weeks, as shown in Figure 3.
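A minimal sketch of the padding step is shown below; it assumes the merged panel frame from Section 4.1 with acc_id and week columns, and reindexes each user onto a full 8-week grid so that inactive weeks appear as rows of NaN, as in Figure 3.

```python
import pandas as pd

def pad_user_timelines(df, n_weeks=8):
    """Reindex every user onto a full 1..n_weeks grid; weeks without any
    recorded activity become rows of NaN (the padding shown in Figure 3)."""
    full_index = pd.MultiIndex.from_product(
        [df["acc_id"].unique(), range(1, n_weeks + 1)],
        names=["acc_id", "week"])
    return (df.set_index(["acc_id", "week"])
              .reindex(full_index)
              .reset_index())
```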
4.3. Inactivity Treatment
In this study, we analyze periods of user inactivity (NaN values), their timing, and their potential causes to improve the accuracy of churn prediction. To do so, we define First_Active_Week (FAW) as the first week in which a user shows activity. NaN values occurring before FAW may indicate a new or dormant user (N/D) or temporary absence, while NaNs after FAW may reflect either decreased interest (DI) or external factors (EFs), such as travel or technical issues.
Figure 4 illustrates an example timeline, and Table 5 shows the distribution of NaNs across users by FAW. The table reveals that a substantial share of data points are NaN, indicating that many users experience intermittent periods of inactivity.
To reflect behavioral intent, we classify NaNs based on FAW:
- (1)
NaN data before FAW: May indicate a new/dormant user (N/D) or temporary inactivity due to external factors (EFs).
- (2)
NaN data after FAW: Typically indicates temporary inactivity, which may be from decreased interest (DI) or external factors (EFs).
We distinguish between non-existent data (from DI or N/D) and missing at random (MAR) data (from EF). The former is imputed using minimum values; the latter is estimated using imputation methods.
Table 6 outlines the three assumptions we adopt when imputing NaN values, based on whether they occur before or after the user’s FAW.
Method 1 assumes that all inactivity reflects a lack of engagement: NaNs before FAW are treated as non-existent (either new or dormant users), and NaNs after FAW are attributed to decreased interest. Method 2 distinguishes between the two, treating pre-FAW NaNs as non-existent and post-FAW NaNs as MAR. Method 3 assumes that all inactivity is temporary and externally caused, treating all NaNs as MAR due to EF. This behavior-based categorization supports more accurate imputation and improves churn prediction.
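A minimal sketch of this rule-based treatment is given below. It assumes the padded panel frame from Section 4.2 (columns acc_id, week, and the behavioral features): non-existent weeks are filled with the per-feature minimum, while MAR weeks are left as NaN for the imputation step in Section 4.5.

```python
import pandas as pd

def tag_and_fill(padded, method, feature_cols):
    """Rule-based treatment of NaN weeks relative to each user's FAW.

    method 1: all NaN weeks are treated as non-existent (disengagement)
    method 2: pre-FAW NaNs are non-existent; post-FAW NaNs are MAR
    method 3: every NaN week is MAR and is left for the imputation step
    Non-existent weeks are filled with each feature's minimum value."""
    df = padded.copy()
    has_activity = df[feature_cols].notna().any(axis=1)
    # First_Active_Week (FAW): earliest week with any observed value per user.
    df["faw"] = (df["week"].where(has_activity)
                   .groupby(df["acc_id"]).transform("min"))

    pre_faw = ~has_activity & (df["week"] < df["faw"])
    post_faw = ~has_activity & (df["week"] >= df["faw"])

    mins = df[feature_cols].min()
    for col in feature_cols:
        if method == 1:
            df.loc[pre_faw | post_faw, col] = mins[col]
        elif method == 2:
            df.loc[pre_faw, col] = mins[col]
        # method 3: leave every NaN in place for the imputer
    return df.drop(columns="faw")
```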
4.4. Data Structure Transformation
After padding each user’s timeline with NaN values, the dataset is structured as a three-dimensional array with dimensions N (number of users), T (number of weeks), and D (number of features). For imputation, we reshape the data into either a wide or a long format, as shown in Figure 5. The choice of structure can influence imputation results, as each format emphasizes different aspects of the data, such as temporal patterns or inter-feature relationships, which may affect how missing values are estimated.
- (1)
Wide format (N, T × D): All weekly activity features for each user are concatenated into a single row, offering a holistic user-level view.
- (2)
Long format (N × D, T): The dataset organizes weekly activity information by feature, with time as the primary axis.
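Under these dimensional conventions, the two reshapes can be expressed as simple NumPy operations; the sketch below assumes the padded data are already stored as an (N, T, D) array.

```python
import numpy as np

# X has shape (N, T, D): users x weeks x features, with NaN for inactive weeks.
def to_wide(X):
    """Wide format (N, T*D): one row per user, weekly features concatenated."""
    n, t, d = X.shape
    return X.reshape(n, t * d)

def to_long(X):
    """Long format (N*D, T): one row per user-feature pair, columns are weeks."""
    n, t, d = X.shape
    return X.transpose(0, 2, 1).reshape(n * d, t)
```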
4.5. Imputation Methods
We apply four imputation methods to handle missing values in user activity logs. Other imputation approaches, such as KNN and MissForest, were also explored during preliminary experiments. However, due to memory limitations and excessive runtime on the full dataset, their results are not included in the final analysis.
- (1)
Mean imputation: Replaces missing values with the mean of each variable.
- (2)
Mode imputation: Fills missing entries using the most frequent value.
- (3)
Linear interpolation: Estimates missing values by assuming a linear trend between observed values.
- (4)
MICE: A more advanced method that models each variable with missing data as a function of the others, preserving multivariate relationships. The MICE process follows five steps:
- (a)
Perform simple imputation (e.g., mean imputation) for all missing values in the dataset. These imputed values serve as placeholders.
- (b)
Set placeholders for one variable back to missing values.
- (c)
Treat the variable with NaN values as the dependent variable and predict these missing values using regression models based on the remaining variables.
- (d)
Repeat (b)–(c) for all variables with missing data. One complete iteration, known as a cycle, ensures all missing values are replaced based on inter-variable relationships.
- (e)
Conduct imputations for multiple cycles until the estimates stabilize, ensuring consistent relationships among variables.
In this study, we use a single imputed dataset from MICE rather than full multiple imputation with pooled estimates: instead of sampling from the posterior distribution, each missing value is imputed with the posterior mean produced by Bayesian ridge regression. Full multiple imputation would also require training and evaluating separate models on each independently imputed dataset, and because our focus is on predictive performance rather than parameter inference, a single deterministic imputation is sufficient and more consistent for comparing imputation strategies.
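A minimal sketch of this single-dataset MICE variant, using scikit-learn's IterativeImputer with Bayesian ridge regression, is shown below. The max_iter and initial_strategy values here are placeholders for illustration; the settings actually used are those listed in Table 8, and X_wide stands for an imputation-ready feature matrix from Section 4.4.

```python
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer
from sklearn.linear_model import BayesianRidge

imputer = IterativeImputer(
    estimator=BayesianRidge(),   # posterior mean used as the imputed value
    max_iter=10,                 # number of chained-equation cycles (placeholder)
    initial_strategy="mean",     # step (a): simple placeholder imputation
    random_state=42,
)
X_wide_imputed = imputer.fit_transform(X_wide)  # X_wide: (N, T*D) with NaNs
```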
Imputation methods are applied to both the wide and long dataset formats, following the NaN treatment strategies defined in Section 4.3; the results therefore also depend on how NaN values are interpreted relative to the user’s FAW. We apply each method to the datasets generated under the three NaN treatment strategies (see Figure 6), resulting in several versions of imputed datasets, as listed in Table 7.
To evaluate the impact of different imputation strategies, each imputed dataset is used to train churn prediction models. Model performance is evaluated using standard classification metrics, including AUC-ROC and F1 score (see
Section 5.3). This enables a direct comparison of how different imputation methods and assumptions for handling NaNs affect predictive accuracy. The analysis reveals the extent to which imputation quality influences the reliability of churn classification.
5. Experiments and Analysis
We evaluate the impact of different imputation strategies on churn prediction performance. Following the preprocessing steps described earlier, experiments were conducted in the Google Colab Python 3 environment, using a fixed random seed of 42 for reproducibility.
5.1. Data Imputation and Evaluation
To handle missing values arising from user inactivity, we applied several imputation methods as detailed in Section 4.5. Among these, MICE was implemented using the IterativeImputer from Python’s scikit-learn library [38], with Bayesian ridge regression [39]. This setup reduces overfitting and captures parameter uncertainty, contributing to more stable predictions. Table 8 lists the MICE parameters used.
To verify that imputation preserved essential dataset characteristics, we evaluated structural and temporal consistency:
- (1)
Structural Consistency: We computed feature correlation matrices before and after imputation and quantified their changes using the mean correlation difference (MCD); a sketch of this computation is given after this list. If the relationships among features were significantly altered by imputation, the validity of subsequent analyses could be compromised. As shown in Table 9, all datasets exhibited MCD values below 0.1, indicating minimal structural changes. According to Cohen’s criteria [40], these values correspond to only a small difference, supporting the reliability of the imputation methods used.
- (2)
Temporal Consistency: We also examined whether imputation preserved weekly activity patterns by comparing the original dataset against three imputed datasets: linear_nd_t, mode_nd_t, and mice_all_n_td.
Figure 7 shows average play_time and payment_amount trends by churn label across 8 weeks. For example, Label 1’s spike in week 7 and Label 3’s consistently low values are preserved across all imputed versions. These results confirm that key temporal dynamics remained intact after imputation, further supporting the validity of the datasets for modeling.
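The MCD check referenced in item (1) can be computed as in the sketch below. The exact definition used here (mean absolute difference over off-diagonal correlation entries) is an illustrative assumption and may differ in minor details from the computation behind Table 9.

```python
import numpy as np
import pandas as pd

def mean_correlation_difference(original, imputed):
    """Mean absolute difference between the feature correlation matrices of
    the original data (pairwise-complete) and the imputed data. Values close
    to zero indicate that imputation left feature relationships intact."""
    c0 = pd.DataFrame(original).corr().to_numpy()
    c1 = pd.DataFrame(imputed).corr().to_numpy()
    mask = ~np.eye(c0.shape[0], dtype=bool)   # ignore the diagonal
    return np.nanmean(np.abs(c0[mask] - c1[mask]))
```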
5.2. Churn Prediction
With the structural and temporal integrity of the imputed datasets confirmed, we proceed to evaluate their impact on churn prediction. Churn prediction is formulated as a four-class classification task, where each user is categorized according to the churn labels defined in
Table 1. RF was selected for its strong performance in handling noisy or incomplete data. Its interpretability and low risk of overfitting make it a practical and robust baseline model for this study, especially given the imputed, multi-class dataset. The RF configuration is listed in
Table 10. For comparison, we also employed PCA and classifier chains:
PCA reduces feature dimensionality while retaining at least 95% of the total variance, for which 24 components were used (see Section 5.4).
Classifier chains model label dependencies by predicting the four churn labels in sequence: [0, 1, 3, 2].
All models were trained and evaluated using an 80:20 train–test split of the dataset.
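The sketch below outlines this modeling setup with scikit-learn. The RF hyperparameters shown are library defaults rather than the Table 10 configuration, and X_imputed and y are placeholder names for the imputed feature matrix and the four-class churn labels.

```python
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.multioutput import ClassifierChain
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import label_binarize

# 80:20 train-test split of the imputed features and churn labels.
X_tr, X_te, y_tr, y_te = train_test_split(
    X_imputed, y, test_size=0.2, random_state=42, stratify=y)

# PCA retaining at least 95% of the variance (24 components in our setting).
pca = PCA(n_components=24).fit(X_tr)
X_tr_p, X_te_p = pca.transform(X_tr), pca.transform(X_te)

# Plain multi-class RF baseline.
rf = RandomForestClassifier(random_state=42).fit(X_tr_p, y_tr)

# Classifier chain over one-hot churn labels, predicted in order [0, 1, 3, 2].
Y_tr = label_binarize(y_tr, classes=[0, 1, 2, 3])
chain = ClassifierChain(RandomForestClassifier(random_state=42),
                        order=[0, 1, 3, 2], random_state=42)
chain.fit(X_tr_p, Y_tr)
Y_pred = chain.predict(X_te_p)  # (n_samples, 4) binary label matrix
```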
5.3. Performance Evaluation Metrics
Model performance was assessed using a weighted F1 score and a micro-averaged AUC-ROC, computed on the 20% test set. These metrics were selected based on their effectiveness in handling imbalanced multi-class classification tasks, as discussed in
Section 2.1.
AUC-ROC measures the model’s ability to distinguish between positive and negative labels by measuring the area under the ROC curve. In this multi-class setting, we use micro-averaged AUC-ROC, which aggregates performance across all classes by pooling the individual decisions into a single binary classification task [38,41]. It is computed as follows:
$$\mathrm{AUC}_{\text{micro}} = \frac{\sum_{c=1}^{C} \sum_{i:\, y_i = c} \;\sum_{j:\, y_j \neq c} \mathbb{1}\!\left[\hat{p}_{i,c} > \hat{p}_{j,c}\right]}{\sum_{c=1}^{C} N_c \,(N - N_c)}$$
where $C$ is the number of labels, $N$ is the total number of instances, $\hat{p}_{i,c}$ is the predicted probability of instance $i$ belonging to label $c$, $\mathbb{1}[\cdot]$ is the indicator function that returns 1 if the condition is true and 0 otherwise, and $N_c$ is the number of instances in label $c$.
The weighted F1 score averages the harmonic mean of precision and recall across labels, weighting each label by its support. This metric accounts for label imbalance and is well suited to multi-class evaluation. It is defined as follows:
$$F1_{\text{weighted}} = \sum_{c=1}^{C} w_c \cdot \frac{2 \cdot \mathrm{Precision}_c \cdot \mathrm{Recall}_c}{\mathrm{Precision}_c + \mathrm{Recall}_c}$$
where $w_c$ is the proportion of samples in label $c$, and $C$ is the number of labels.
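Both metrics map directly onto scikit-learn utilities, as in the sketch below, which continues the placeholder variables from the previous sketch; it mirrors the definitions above rather than reproducing the exact evaluation script.

```python
from sklearn.metrics import f1_score, roc_auc_score
from sklearn.preprocessing import label_binarize

# Micro-averaged AUC-ROC: pool the one-vs-rest decisions of the four labels
# into a single binary task before computing the area under the curve.
Y_te = label_binarize(y_te, classes=[0, 1, 2, 3])
proba = rf.predict_proba(X_te_p)                      # shape (n_samples, 4)
auc_micro = roc_auc_score(Y_te, proba, average="micro")

# Weighted F1: per-label F1 scores weighted by each label's support w_c.
f1_weighted = f1_score(y_te, rf.predict(X_te_p), average="weighted")
```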
5.4. Results and Discussions
We evaluate the impact of different imputation methods on churn prediction using the RF model as a baseline.
Table 11 presents micro-averaged AUC scores across the four churn labels, and
Figure 8 shows the corresponding ROC curves. Among all methods, the MICE-imputed dataset (mice_all_n_td) consistently outperforms others, achieving AUCs of 0.9423 and 0.9734 for labels 0 and 1, and 0.8549 and 0.8773 for labels 2 and 3, respectively.
To further improve classification, we applied classifier chains [42], which sequentially predict labels while incorporating earlier predictions as input features. A chain order of [0, 1, 3, 2] was used, corresponding to the four churn categories. Additionally, PCA was used to reduce feature dimensionality while preserving variance. Although 23 components capture 95% of the variance, 24 components were empirically found to yield the best balance between dimensionality and predictive performance.
Table 12 presents weighted F1 scores across datasets and model configurations (RF, RF + PCA, classifier chains, and classifier chains + PCA), using 5-fold cross-validation. Results are reported as means and standard deviations.
Method 1, represented by ori_data, serves as the baseline. Method 2, which imputes only NaNs occurring after FAW, yields limited improvements, likely because relatively few missing values occur after FAW (see Table 5). In contrast, Method 3 assumes that all NaNs, both before and after FAW, are MAR and applies full imputation, consistently yielding the best performance. The MICE-imputed datasets (mice_all_n_td and mice_all_nd_t) achieved the highest weighted F1 scores under the RF model. Classifier chains further improve performance, especially when combined with PCA. The best result was observed for mice_all_n_td using classifier chains + PCA (F1 = 0.7065 ± 0.0057), followed by mice_all_nd_t using classifier chains alone. Across all configurations, low standard deviations (generally below 0.01) indicate consistent performance across data partitions.
To confirm the statistical significance of these results, we conducted Friedman tests on both the per-label AUC scores (Table 11) and the weighted F1 scores (Table 12). All tests yielded χ² statistics greater than 30, indicating that the differences in model performance across imputation strategies are statistically significant. These findings confirm that the choice of imputation method has a substantial impact on both class-wise discriminative ability and overall predictive performance.
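The Friedman test itself is a single SciPy call, as sketched below; scores_by_dataset is a hypothetical list holding one per-fold score sequence (e.g., weighted F1) for each imputed dataset, with folds aligned across datasets.

```python
from scipy.stats import friedmanchisquare

# scores_by_dataset: e.g., [[f1_fold1, ..., f1_fold5], ...] with one sequence
# per imputation strategy and the same fold order in every sequence.
stat, p_value = friedmanchisquare(*scores_by_dataset)
print(f"Friedman chi-square = {stat:.2f}, p = {p_value:.4g}")
```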
Finally, we analyzed feature importance to identify behavioral attributes contributing most to churn prediction. Across all datasets, playtime consistently ranked highest, followed by login frequency, combat and reward metrics, and social interaction features. Regarding computational efficiency, most imputation methods completed within minutes. However, MICE, when applied under Method 3—where all non-login periods are treated as MAR—required approximately three hours due to the high dimensionality of the dataset. Still, given that churn prediction is typically conducted on a weekly or monthly basis, this runtime remains feasible for practical applications.
6. Conclusions and Future Work
User churn in online games can arise from various forms of inactivity, including new or dormant users, loss of interest, or external factors limiting game access. Existing models often treat all non-login periods uniformly or disregard them entirely, resulting in inaccurate behavioral interpretations and compromised predictive performance.
To address this limitation, we proposed a novel approach that differentiates between types of inactivity and handles them as either non-existent data or missing values. We applied multiple imputation strategies: minimum value substitution, mean, mode, linear interpolation, and MICE. Their impact on churn prediction was evaluated using RF, with PCA and classifier chains incorporated to enhance performance.
Our experimental results demonstrated substantial and statistically significant improvements across the different dataset treatments. The Method 3 datasets (mice_all_n_td and mice_all_nd_t), which treat all non-login periods as missing at random, achieved up to a 3% improvement in F1 score and approximately 1% in AUC compared to the baseline. These gains were validated by Friedman tests and supported by low standard deviations (generally below 0.01), confirming performance stability across cross-validation folds. The best result (F1 = 0.7065 ± 0.0057) was achieved using classifier chains with PCA on mice_all_n_td, demonstrating the value of combining robust imputation with label dependency modeling and dimensionality reduction.
These findings highlight the effectiveness of MICE in preserving data integrity and enhancing prediction, while reinforcing the importance of accounting for external factors when modeling churn. Ignoring such behavioral nuances may lead to suboptimal retention strategies and missed intervention opportunities.
6.1. Limitations and Future Research Directions
While this study makes meaningful contributions to churn prediction in online games, several limitations remain, offering areas for future investigation.
First, user inactivity was categorized into the following three types: new or dormant (N/D) users, decreased interest (DI), and external factors (EFs), based primarily on behavioral patterns and the timing relative to the user’s first active week (FAW). In reality, inactivity may result from a broader range of causes, such as the release of competing games or the use of multiple accounts. Expanding this classification to reflect richer behavioral signals could enhance both imputation quality and churn prediction performance, better capturing the complexity of user behavior.
Second, while MICE provided reliable imputation results, its computational demands increased substantially with dataset size and dimensionality. This limited its feasibility for real-time analysis or large-scale game environments. Future work could explore more scalable imputation methods, such as matrix factorization or neural-based approaches, to enable faster processing without sacrificing accuracy.
Third, our experiments were conducted using a random forest model in combination with PCA and classifier chains. While the results demonstrate improvements in prediction, it remains unclear whether these improvements would generalize to other models, such as gradient boosting or deep learning approaches. Future work should investigate the robustness of the imputation strategies across a broader range of prediction models.
Lastly, this study relied on data collected in 2018. Given the rapid evolution of the gaming industry, validating the findings on more recent and diverse datasets is essential. This would help assess whether the proposed methods remain effective under modern player dynamics, engagement systems, and churn behaviors. In addition to dataset generalization, future work could also compare the effectiveness of alternative classifiers and evaluate whether the performance of imputed data remains consistent across different modeling approaches.
6.2. Practical Implications
This study enhances churn prediction accuracy by distinguishing temporary inactivity from actual churn risk, enabling game developers to implement targeted retention strategies, such as personalized rewards, re-engagement campaigns, and timely notifications, to reduce permanent churn. A deeper understanding of inactivity patterns also supports better design decisions, including difficulty balancing, social feature enhancements, and content scheduling, all of which help sustain long-term player engagement and revenue.
Beyond gaming, the proposed methodology applies to industries where user engagement and retention are critical, such as streaming services, e-commerce, and subscription platforms. By interpreting inactivity as a form of missing data and applying appropriate imputation techniques, organizations can improve predictive accuracy and tailor retention efforts more effectively. While domain-specific adjustments may be needed, the overall approach offers broad applicability for enhancing customer lifecycle management.