The Impact of Cash Holding Decisions on Firm Performance in the IT Industry

Jaeseong Lim; Bong Keun Jeong

doi:10.3390/jrfm18110625


¹ D. Wyatt Henderson Department of Accounting, E. Craig Wall Sr. College of Business Administration, Coastal Carolina University, Conway, SC 29528, USA

² Department of Management and Decision Sciences, E. Craig Wall Sr. College of Business Administration, Coastal Carolina University, Conway, SC 29528, USA

* Author to whom correspondence should be addressed.

J. Risk Financial Manag. 2025, 18(11), 625; https://doi.org/10.3390/jrfm18110625

This article belongs to the Section Business and Entrepreneurship


Abstract

This study examines the relationship between corporate cash holdings and firm performance within the IT industry, which is characterized by intense competition and rapid technological advancements. We propose an integrated framework that combines principal component analysis (PCA), machine learning (ML) algorithms, and Shapley additive explanation (SHAP) values to estimate and interpret model outcomes. Based on 21,051 firm-year observations of corporate financial statement data from 2004 to 2023, the empirical evidence supports an inverted U-shaped relationship between cash holdings and profitability, suggesting that holding either too little or too much cash is suboptimal. Among the tested models, the random forest demonstrates the highest explanatory power (R²) and the lowest prediction errors (RMSE), outperforming the traditional ordinary least squares (OLS) regression by explaining 47% more variance. Our findings provide practical implications for researchers and stakeholders interested in enhancing corporate risk management and performance.

Keywords: cash holdings; IT firm performance; master proxy; machine learning; SHAP (SHapley Additive exPlanations)

1. Introduction

The global business environment has undergone significant transformations due to the COVID-19 pandemic and the rapid adoption of artificial intelligence (AI) technologies. Advancements in technology have significantly lowered geographic obstacles, facilitating business competition through digital platforms. Firms must strategically navigate uncertainty and enhance their performance in response to these developments. Among the various factors influencing corporate success, cash holding decisions have gained significant attention, as they play a critical role in ensuring operational flexibility, supporting investment opportunities, and mitigating financial risks. However, the relationship between cash holdings and firm performance remains subject to debate, with studies suggesting positive (; ), negative (; ), or inverted U-shaped associations (). These mixed findings suggest that contextual factors—particularly industry characteristics—can influence how cash balances affect firm performance (; ).

Prior research on trade-off theory has shown that firms can benefit from maintaining an optimal capital structure. While increased borrowing allows firms to benefit from the tax deductibility of interest expenses, it also exposes them to a higher risk of bankruptcy. Taken together, the trade-off theory suggests that an optimal debt-to-equity ratio exists that maximizes firm value (; ; ). Despite this evidence, it remains underexplored whether an optimal level of cash holdings exists for information technology (IT) firms, and this study aims to fill this gap.

This paper focuses on the information technology (IT) industry, which occupies a pivotal role in driving technological innovation and economic growth. IT firms face unique challenges, including substantial investments in research and development (R&D) as well as high market volatility caused by rapid innovation cycles and competitive pressures. As such, cash holding strategies have become particularly vital for IT firms, serving as a buffer against uncertainty while enabling investment in growth opportunities (). Pecking order theory argues that managers prefer to use internal funds, such as cash reserves, for investment because they are less costly and less affected by information asymmetry than external sources (; ; ; ). Despite the importance of liquidity management, very few studies specifically examine the IT sector to determine how cash management decisions translate into performance outcomes in the technology-driven environment (). To fill this gap, our study undertakes an in-depth analysis of cash holding levels and their impact on firm performance within the IT sector, accounting for key firm-level control variables such as investment, growth, size, leverage, and R&D intensity.

By leveraging firm-level data and employing robust econometric techniques, we aim to identify effective cash management practices for IT companies. This study finds empirical evidence of an inverted U-shaped relationship between cash holdings and firm performance, using 21,051 firm-year observations from U.S.-listed IT firms from 2004 to 2023. The results suggest that an optimal level of cash holdings exists that can maximize IT firms’ profitability.

Furthermore, this study incorporates machine learning techniques to enhance the effectiveness of the estimation approach. The application of machine learning in finance and accounting has become increasingly popular, with growing scholarly and practical interest in leveraging data-driven algorithms to enhance predictive accuracy and decision-making (; ; ; ). Principal component analysis (PCA) is employed to construct master proxy composite variables for key constructs (profitability, investment, growth, etc.), which mitigates proxy selection bias and enhances the robustness and generalizability of our results. Ordinary least squares (OLS) regression is a commonly used method for analyzing panel data, although interest in applying ML techniques for estimation tasks has been growing (). Accordingly, we employ OLS regression as a benchmark method and compare its performance against machine learning models, including Decision Tree, Random Forest, Multilayer Perceptron, and Support Vector Machine. In addition to model comparison, we introduce SHAP (SHapley Additive exPlanations) values to interpret the machine learning models’ predictions. SHAP values explain how each feature contributes to the model prediction, an alternative to the traditional coefficient-based interpretation in regression models (; ). SHAP values allow for comparison with the variable importance derived from OLS coefficients. This integrated framework can provide higher predictive power and interpretability.

This study offers several theoretical and practical contributions. It provides evidence that the relationship between cash holdings and performance in the IT industry follows an inverted U-shape. In other words, holding cash contributes positively to performance up to a certain point but becomes counterproductive beyond the optimal level. This pattern is consistent across both OLS and ML-based analyses, supporting the existence of an optimal cash balance that maximizes IT firm profitability. The findings underscore the significance of industry context in corporate liquidity research, bridging the literature on cash management, risk mitigation, and firm performance.

Our results have implications for stakeholders. Stakeholder theory emphasizes the responsibility of firms to account for diverse stakeholder expectations within their business processes and strategies (; ). The IT industry is a high-stakes environment, and IT firm leaders tend to be highly ambitious and often exhibit risk-seeking behavior such as new market entry (), which might result in insufficient attention to effective risk management. The observed inverted U-shaped relationship suggests that cash should be treated as a strategic resource and must be managed actively to balance growth opportunities and risk exposure. Exploring corporate cash holding decisions is critical because they can contribute to improved risk management for IT firms. This study contributes to the research on corporate risk management and firm performance by utilizing machine learning techniques within the context of the IT industry. As stakeholders become increasingly aware of optimal cash holding decisions, they are better able to call for corporate actions that enhance corporate risk management and drive improved performance.

The rest of the paper is organized as follows. Section 2 develops our research hypothesis. Section 3 explains the regression model and machine learning techniques. Section 4 reports the regression results. Section 5 presents additional empirical results. Section 6 and Section 7 cover the discussion, limitations, and future research opportunities.

2. Hypothesis Development

Based on the resource-based perspective (; ; ; ), firms can achieve superior performance by strategically identifying and acquiring critical resources that enable developing products and services aligned with market demand. Cash is a strategically valuable resource, as it is the most liquid asset and serves as a buffer against future uncertainties (; ; ; ; ; ). According to pecking order theory, managers prefer to utilize internal funds to finance investments because they are less expensive and involve fewer information asymmetry problems than external sources (; ; ; ).

Cash management strategies are especially important for IT firms because strategic cash reserves ensure they can invest without delay and thus remain competitive (; ). An IT firm with insufficient cash may struggle to make timely investments given that external financing typically involves higher costs than using internal funds (; ). Firms anticipating strong future growth opportunities are likely to hold higher cash reserves (). Prior evidence shows that U.S.-listed firms with large cash reserves tend to achieve higher market values (). Indeed, successful IT firms such as Apple, Alphabet, and Microsoft hold high levels of cash reserves (). It has also been argued that cash-rich companies can use their cash reserves to acquire key competitors and expand their market share (). Cash-rich IT firms can also leverage their cash reserves to support innovation, which enhances corporate value in competitive, research-driven industries.

However, when an IT firm’s cash holdings are excessive, they may be used inefficiently. One critical concern with high cash holdings is moral hazard, where managers could use the cash reserves to serve their own self-interests rather than the shareholders’ interests (). Thus, firms with more experienced board members tend to hold smaller cash reserves (). High levels of cash holdings are often associated with negative consequences, including diminished shareholder value (), poor earnings quality (), lower accruals quality (), decreased financial statement comparability (), and greater engagement in aggressive real earnings management (). In this sense, excessive cash holdings in IT firms could impair their performance.

In practice, IT firm leaders tend to be highly ambitious and often exhibit risk-seeking behaviors, considering the higher failure rates associated with IT investments (; ; ). Given their substantial investment in R&D and exposure to volatile markets, driven by constant intense competition and pressure to innovate, IT firms should adopt effective cash management strategies to navigate uncertainty while seizing growth opportunities in a timely manner. Thus, we propose that there is an optimal level of cash holdings that maximizes performance in IT firms, and we present the following hypothesis.

Hypothesis.

There is an inverted U-shaped relationship between cash holdings and performance in the IT industry.

3. Empirical Models

3.1. Regression Model

To test whether firms in the IT sector have optimal levels of cash holdings, we estimate the following regression model:

$$\mathit{PROFIT}_{i,t} = \alpha + \beta_{1}\,\mathit{CASH}_{i,t} + \beta_{2}\,\mathit{CASH\_SQUARED}_{i,t} + \lambda\,\mathit{Controls}_{i,t} + \mathit{Year} + \varepsilon_{i,t} \qquad (1)$$

where, for firm i and year t, PROFIT is Earnings Before Interest and Taxes (EBIT) divided by total assets. CASH is a cash ratio, measured as cash and marketable securities divided by the book value of total assets. CASH_SQUARED is the squared value of the cash ratio. We expect the coefficient (β₁) of CASH to be significantly positive and the coefficient (β₂) of CASH_SQUARED to be significantly negative, indicating an inverted U-shaped relationship between cash holdings and profitability within the IT sector. Control variables (Controls) include current assets divided by current liabilities (LIQUIDITY), tangible fixed assets divided by total assets (INVESTMENT), sales growth (GROWTH), total assets (SIZE), total debts divided by total assets (LEVERAGE), and R&D expenditure divided by Total Sales (R&D INTENSITY). We control for year fixed effects (Year) in our regression. Standard errors are robust to both firm-level clustering and heteroscedasticity.
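As a minimal sketch of how the quadratic specification in Equation (1) can be estimated, the following Python snippet regresses PROFIT on CASH and CASH_SQUARED by solving the normal equations. It is an illustration under stated assumptions rather than the estimation code used in this study: the data are synthetic, the coefficient values (0.8 and −1.0) are hypothetical, and the controls, year fixed effects, and clustered standard errors are omitted for brevity.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_quadratic(cash, profit):
    """OLS of profit on [1, cash, cash^2] via the normal equations X'X b = X'y."""
    rows = [[1.0, c, c * c] for c in cash]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, profit)) for i in range(3)]
    return solve_linear_system(xtx, xty)


# Synthetic, noise-free data generated from hypothetical coefficients.
cash = [i / 20 for i in range(1, 20)]            # cash ratios 0.05 .. 0.95
profit = [0.02 + 0.8 * c - 1.0 * c * c for c in cash]

alpha, beta1, beta2 = fit_quadratic(cash, profit)
inverted_u = beta1 > 0 and beta2 < 0             # sign pattern the hypothesis predicts
```

A positive coefficient on CASH combined with a negative coefficient on CASH_SQUARED is the sign pattern that Equation (1) is designed to test.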

3.2. Machine Learning Techniques

Ordinary Least Squares (OLS) regression is a widely used method for analyzing panel data due to its simplicity, interpretability, and established theoretical framework. However, OLS has its limitations, particularly in modeling nonlinear relationships, handling high-dimensional data, or capturing complex interactions among variables (; ).

Machine learning (ML) algorithms offer a flexible and powerful alternative, especially in scenarios where traditional linear methods may underperform. ML techniques are well suited for handling multicollinearity and can better model nonlinear patterns (; ; ). Because the performance of ML algorithms can vary by dataset and context, it is important to compare a range of models. In this study, we evaluate four widely adopted regression-based ML models: Decision Tree, Random Forest, Multilayer Perceptron, and Support Vector Machine.

Decision Tree is a supervised machine learning technique that constructs a tree-like structure to predict discrete or continuous outcomes. It recursively partitions the dataset based on feature values that minimize prediction error at each node. Although intuitive, easy to interpret, and computationally efficient, Decision Trees may suffer from overfitting, particularly in the presence of noisy data (; ). We employ the Reduced Error Pruning Tree (REPTree) variant in this study, as it is known for fast training on regression tasks and reasonable accuracy ().
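To make the partitioning step concrete, the sketch below searches for the single threshold on one feature that minimizes the summed squared error of the two resulting groups. It illustrates the generic splitting idea only; it is not REPTree, it omits pruning and recursion, and the toy data are hypothetical.

```python
def best_split(x, y):
    """Return (threshold, sse) of the single split on x that minimises squared error."""
    def sse(values):
        if not values:
            return 0.0
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values)

    pairs = sorted(zip(x, y))
    best = (None, float("inf"))
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate identical feature values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [v for _, v in pairs[:i]]
        right = [v for _, v in pairs[i:]]
        total = sse(left) + sse(right)
        if total < best[1]:
            best = (threshold, total)
    return best


# Hypothetical toy data: the outcome jumps once the feature exceeds 0.5.
x = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
y = [1.0, 1.1, 0.9, 1.0, 3.0, 3.1, 2.9, 3.0]
threshold, error = best_split(x, y)
```

A full tree would apply the same search recursively within each group and across all features.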

Random Forest is an ensemble technique that aggregates predictions from multiple decision trees to enhance overall model accuracy and stability (). For regression, each tree is trained on a bootstrap sample of the data and the trees' predictions are averaged, which reduces variance and dampens the influence of noise and outliers (). Although more computationally intensive than a single decision tree, it performs well on datasets with complex, nonlinear relationships and interactions.

Multilayer Perceptron is a type of artificial neural network consisting of an input layer, one or more hidden layers, and an output layer. The network is fully connected, and it learns by adjusting connection weights between neurons to minimize the difference between its predictions and the actual outcomes, a process carried out through backpropagation (; ). While Multilayer Perceptrons require careful tuning of parameters (e.g., number of layers, neurons, learning rate) and significant computational resources, they are highly flexible and effective in modeling complex nonlinear structures ().

Support Vector Machine is a supervised learning algorithm used for both classification and regression problems. It identifies a hyperplane or decision function that minimizes prediction error while maximizing generalization capacity (; ). Support Vector Machine is particularly well suited for high-dimensional data and can capture nonlinear relationships. However, it is computationally intensive for large datasets and requires meticulous parameter tuning (; ).

Before training and validation, we preprocess the dataset by removing missing values and outliers. To reduce variance and prevent overfitting, we apply 10-fold cross-validation, a popular technique to evaluate model performance (). In this method, the dataset is divided into 10 equal subsets. The model is trained on nine folds and validated on the remaining fold, rotating the validation set across all folds. Although the dataset is structured as a firm-year panel, our primary objective is not to perform forecasting across time but rather to assess the relative explanatory power of different models (e.g., OLS versus machine-learning algorithms) using firm characteristics. Moreover, the use of 10-fold cross-validation is consistent with prior research applying machine-learning models to panel data in corporate finance and accounting (e.g., ; ). To verify robustness, we repeated the random fold assignment using multiple random seeds and obtained consistent results. We believe that the standard 10-fold cross-validation provides a reasonable and widely accepted approach for evaluating model performance in this study without introducing bias from the panel structure of the data.
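The fold rotation described above can be sketched as follows. This is a minimal illustration assuming random, row-level fold assignment; the function name, the seed value, and the use of plain index lists are assumptions for exposition, not details taken from the study.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle row indices and yield (train, validation) index lists for each fold."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]    # k near-equal folds
    for i in range(k):
        validation = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation


# 21,051 is the sample size reported in the text; the seed is arbitrary.
splits = list(k_fold_indices(n_samples=21051, k=10, seed=42))
```

Each observation is used for validation exactly once, and changing `seed` reproduces the kind of repeated random fold assignment mentioned above. Because assignment is at the row level, observations from the same firm can appear in both the training and validation folds.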

Finally, we assess model performance using three standard regression metrics: R² (Coefficient of Determination), Mean Absolute Error (MAE), and Root Mean Square Error (RMSE). These metrics are widely used to assess the predictive accuracy of regression models. The hyperparameter configurations employed in this analysis are provided in Appendix A.
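For reference, the three metrics can be computed from paired actual and predicted values as in the sketch below. The input values are hypothetical and serve only to illustrate the definitions.

```python
import math


def regression_metrics(actual, predicted):
    """Return R-squared, MAE and RMSE for paired actual and predicted values."""
    n = len(actual)
    mean_actual = sum(actual) / n
    residuals = [a - p for a, p in zip(actual, predicted)]
    ss_res = sum(r * r for r in residuals)                 # residual sum of squares
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)   # total sum of squares
    return {
        "r2": 1.0 - ss_res / ss_tot,
        "mae": sum(abs(r) for r in residuals) / n,
        "rmse": math.sqrt(ss_res / n),
    }


# Hypothetical values, for illustration only.
metrics = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.5, 2.0, 2.5, 4.0])
```

RMSE penalizes large errors more heavily than MAE because residuals are squared before averaging.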

4. Empirical Results

4.1. Data and Descriptive Statistics

Corporate financial statement data were obtained from the Compustat annual database. The North American Industry Classification System (NAICS) is the standard employed by federal agencies in the US to classify industries according to their primary business activities (). Table 1 presents IT industries selected based on the NAICS classification codes, whereby we identified the following: computer and electronic product manufacturing (NAICS 334); software publishers (NAICS 51121 and 51321); telecommunications (NAICS 517); data processing, hosting, and related services (NAICS 518); other information services (NAICS 519); and computer systems design and related services (NAICS 54151). Our sample comprises 21,051 firm-year observations with non-missing data across all variables from 2004 to 2023.

Table 1. Selected IT Industries Based on NAICS Classification.

Table 2 presents descriptive statistics for the variables used in our analysis. For each proxy, one representative variable is selected, defined as follows. PROFIT is EBIT divided by total assets. CASH is a cash ratio, calculated as cash and marketable securities divided by the book value of total assets. CASH_SQUARED is the squared value of the cash ratio (i.e., CASH × CASH). LIQUIDITY is current assets divided by current liabilities. INVESTMENT is tangible fixed assets divided by total assets. GROWTH is the annual sales growth rate. SIZE is total assets, expressed in millions. LEVERAGE is total debts divided by total assets. R&D INTENSITY is research and development expenditure divided by Total Sales.

Table 2. Descriptive Statistics.

Table 3 presents the Pearson correlation coefficients for the variables included in the regression model. The correlations between the cash ratio variables (CASH and CASH_SQUARED) and profitability (PROFIT) are found to be statistically insignificant. However, it is important to note that the Pearson correlation coefficient does not capture nonlinear relationships (; ).

Table 3. Pearson Correlation Matrix.

4.2. Multivariate Results

Table 4 reports the results from pooled OLS regression analysis, based on Equation (1), with profitability (PROFIT) as a dependent variable. The key explanatory variables are the cash ratio (CASH) and its squared term (CASH_SQUARED). We found that (i) the coefficient of CASH is significantly positive and (ii) the coefficient of CASH_SQUARED is significantly negative. These findings support the presence of an inverted U-shaped relationship between cash holdings and profitability in the IT sector, which is consistent with our hypothesis.

Table 4. Cash Holdings and Profitability.

To provide a clearer understanding of economic significance, we calculate the turning point, −β₁/(2β₂), of the quadratic cash–performance relationship, which is the cash ratio at which the marginal effect β₁ + 2β₂ × CASH equals zero. As indicated in Equation (1), β₁ and β₂ correspond to the coefficients of CASH and CASH_SQUARED, respectively. Using the coefficients reported in Table 4, the turning point is calculated as −1.7240/(2 × (−2.4247)), yielding an approximate value of 0.356. Given that the 95% confidence interval for the estimate ranges from 0.264 to 0.447, the true turning point is highly likely to be located within this range. These empirical results carry managerial implications, suggesting that cash holdings outside this range are likely to be suboptimal for IT firms. Specifically, IT firms with cash and marketable securities below 26.4% of total assets need to consider increasing them, whereas IT firms exceeding 44.7% need to consider reducing them.
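The turning-point arithmetic can be checked with a short sketch. The coefficients and interval bounds are those quoted in the text for Table 4; the helper names and the three example cash ratios are hypothetical.

```python
def turning_point(beta1, beta2):
    """Vertex of beta1*x + beta2*x^2; it is a maximum only when beta2 < 0."""
    if beta2 >= 0:
        raise ValueError("an inverted U requires a negative squared-term coefficient")
    return -beta1 / (2.0 * beta2)


def position(cash_ratio, lower, upper):
    """Classify a cash ratio relative to a confidence band for the turning point."""
    if cash_ratio < lower:
        return "below band"
    if cash_ratio > upper:
        return "above band"
    return "within band"


# Coefficients and 95% interval as quoted in the text for Table 4.
tp = turning_point(1.7240, -2.4247)
labels = [position(c, 0.264, 0.447) for c in (0.10, 0.35, 0.60)]
```

The same function applies to the turning points reported for the additional tests, given the corresponding coefficients.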

5. Additional Tests

5.1. One-Year-Forward Dependent Variable

As an endogeneity remedy, we use a one-year-forward dependent variable for the regression described in Equation (1). Table 5 shows that the coefficient of CASH remains significantly positive, while the coefficient of CASH_SQUARED remains significantly negative, suggesting that the effect of cash holding decisions on subsequent profitability is significant over the next year. Our results collectively suggest an inverted U-shaped relationship between cash holdings and subsequent profitability, thereby supporting our hypothesis.
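Constructing the one-year-forward dependent variable amounts to pairing each firm-year's regressors with the same firm's profitability in the following year. The sketch below shows one way to do this, assuming a list of records with hypothetical field names and values; firm-years without a next-year observation are dropped.

```python
def pair_with_next_year(rows):
    """Match each firm-year's regressors with the same firm's outcome one year later."""
    outcome = {(r["firm"], r["year"]): r["profit"] for r in rows}
    paired = []
    for r in rows:
        key = (r["firm"], r["year"] + 1)
        if key in outcome:                       # drop firm-years without a next-year outcome
            paired.append({"firm": r["firm"], "year": r["year"],
                           "cash": r["cash"], "profit_next": outcome[key]})
    return paired


# Hypothetical panel: firm B has no 2006 record, so its 2005 row is dropped.
panel = [
    {"firm": "A", "year": 2004, "cash": 0.30, "profit": 0.05},
    {"firm": "A", "year": 2005, "cash": 0.32, "profit": 0.06},
    {"firm": "A", "year": 2006, "cash": 0.35, "profit": 0.07},
    {"firm": "B", "year": 2005, "cash": 0.10, "profit": 0.01},
    {"firm": "B", "year": 2007, "cash": 0.12, "profit": 0.02},
]
paired = pair_with_next_year(panel)
```

Matching on the exact next calendar year, rather than on the next available row, avoids treating a gap in the panel as a one-year lead.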

Table 5. One-Year-Forward Dependent Variable.

The estimated turning point is around 0.317, with the 95% confidence interval spanning from 0.264 to 0.371. This implies that IT firms holding less than 26.4% of total assets in cash and marketable securities need to consider accumulating more, whereas those above 37.1% need to consider trimming them to improve next year’s performance.

5.2. Subsample Analysis: Big IT Firms

As a robustness test, we perform a subsample analysis using a firm-fixed effects regression. First, we rank all observations into four groups based on the magnitude of firm size in each year. Table 6 presents the empirical results for firms in the top quartile of size in each year. Our empirical analyses—using a fixed effects regression model that controls for all time-invariant variables—reaffirm that the coefficient for CASH is positive and significant, while the coefficient for CASH_SQUARED is negative and significant. Interestingly, the inverted U-shaped relationship becomes insignificant for firms in the bottom quartile of size each year, although the results are not tabulated. Our findings suggest that large IT firms can enhance their performance by managing cash holdings at an optimal level.
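A firm-fixed effects regression is commonly implemented through the within transformation, in which each variable is expressed as a deviation from its firm-specific mean so that time-invariant firm characteristics drop out. The sketch below illustrates that transformation under this assumption; the text does not state how the fixed effects were implemented, and the records and field names are hypothetical.

```python
from collections import defaultdict


def demean_by_firm(rows, columns):
    """Subtract each firm's own mean from the listed columns (within transformation)."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(int)
    for r in rows:
        counts[r["firm"]] += 1
        for col in columns:
            sums[r["firm"]][col] += r[col]
    return [
        {**r, **{col: r[col] - sums[r["firm"]][col] / counts[r["firm"]] for col in columns}}
        for r in rows
    ]


# Hypothetical records, for illustration only.
rows = [
    {"firm": "A", "cash": 0.2, "profit": 0.04},
    {"firm": "A", "cash": 0.4, "profit": 0.08},
    {"firm": "B", "cash": 0.6, "profit": 0.01},
    {"firm": "B", "cash": 0.8, "profit": 0.03},
]
within = demean_by_firm(rows, ["cash", "profit"])
```

Running OLS on the demeaned variables then identifies the coefficients from within-firm variation over time only.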

Table 6. Subsample Analysis: Big IT Firms.

The turning point is estimated at 0.470, with a 95% confidence interval from 0.188 to 0.752. Big IT firms with cash and marketable securities below 18.8% of total assets may improve future performance by increasing cash reserves, while those above 75.2% may need to decrease them.

5.3. Effect of Recessionary Periods

Firms faced significant economic shocks during recessionary periods such as the global financial crisis in 2007–2010 and the COVID-19 pandemic in 2020. To reinforce the robustness of the empirical results, we perform additional analyses focusing on these recession periods, all of which lie within our sample period. The findings reported in Table 7 reinforce the consistency of our main results. Table 7 presents a positive and significant coefficient for CASH and a negative and significant coefficient for CASH_SQUARED, providing evidence that is consistent with our hypothesis.

Table 7. Effect of Recessionary Periods.

Our results indicate a turning point of 0.297, with a 95% confidence interval ranging from 0.212 to 0.383. This suggests that during economic downturns, maintaining cash and marketable securities between 21.2% and 38.3% of total assets may help IT firms optimize performance. Overall, the empirical results remain robust even during periods of economic shock.

5.4. Principal Component Analysis

The existing literature offers multiple proxy measures for each variable of interest. For instance, in the regression test above, we use earnings before interest and taxes (EBIT) divided by total assets as a proxy for PROFIT. However, other profitability metrics exist, such as net income divided by equity or gross profit divided by sales (see Appendix B). Prior studies show that the selection of proxy can substantially influence empirical results, meaning that findings derived from one proxy do not guarantee the same results when a different proxy is used to represent the same underlying construct (). As such, testing multiple proxies is essential for robustness, but analyzing each one separately would be complex and time-consuming.

To address this issue, we adopt a master proxy approach using Principal Component Analysis (PCA), following the method introduced by (). PCA is a statistical technique used to reduce the dimensionality of a dataset while retaining as much of its original variance as possible. It transforms raw data into a new set of uncorrelated variables, called principal components (PCs). The first principal component (PC1) captures the greatest variance, the second (PC2) captures the next highest variance, and so forth. We applied PCA separately to the proxies for each conceptual variable to extract principal components. Because PCs capture the shared variance among all proxies, this method enhances the generalizability of our results compared to using any single proxy. In subsequent analyses, we use the first principal component (PC1) as the master proxy, as it captures the largest proportion of variance and offers a comprehensive summary of the underlying construct.
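The construction of a master proxy can be sketched as follows: standardize the proxies for one construct, compute their correlation matrix, and extract the first principal component. This is a minimal illustration, not the procedure used in the study. It assumes standardized inputs, uses power iteration in place of a full eigendecomposition, and runs on synthetic proxies generated from a hypothetical latent variable.

```python
import math
import random


def standardize(columns):
    """Scale each proxy to mean 0 and (population) standard deviation 1."""
    out = []
    for col in columns:
        mean = sum(col) / len(col)
        sd = math.sqrt(sum((v - mean) ** 2 for v in col) / len(col))
        out.append([(v - mean) / sd for v in col])
    return out


def first_principal_component(columns, iterations=500):
    """Return (loadings, scores, variance share) of PC1 via power iteration."""
    z = standardize(columns)
    p, n = len(z), len(z[0])
    corr = [[sum(a * b for a, b in zip(z[i], z[j])) / n for j in range(p)] for i in range(p)]
    vec = [1.0] * p                              # start vector; must not be orthogonal to PC1
    for _ in range(iterations):
        nxt = [sum(corr[i][j] * vec[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(v * v for v in nxt))
        vec = [v / norm for v in nxt]
    eigenvalue = sum(vec[i] * sum(corr[i][j] * vec[j] for j in range(p)) for i in range(p))
    scores = [sum(vec[i] * z[i][k] for i in range(p)) for k in range(n)]
    return vec, scores, eigenvalue / p           # trace of a correlation matrix equals p


# Hypothetical proxies: three noisy measurements of one latent construct.
rng = random.Random(0)
latent = [rng.gauss(0, 1) for _ in range(200)]
proxies = [[v + rng.gauss(0, 0.5) for v in latent] for _ in range(3)]
loadings, master_proxy, share = first_principal_component(proxies)
```

Here `master_proxy` holds one PC1 score per observation, and `share` is the proportion of total variance that PC1 explains.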

Table 8 presents the proportion of variance explained by each master proxy. For example, the PC1 derived from ten profitability proxies explains approximately 40 percent of the total variance—equivalent to capturing the information in four distinct proxies. Similarly, PC1s for investment, growth, size, and leverage explain 51.1 percent, 36.4 percent, 67.6 percent, and 48.8 percent of the variance in their respective proxies. These values suggest that the master proxies capture a substantial amount of information, providing a reliable foundation for our empirical analyses. Appendix C presents the scree plots and principal component loadings used in the analysis.

Table 8. Master Proxy Variances.

5.5. Cash Holdings and Profitability Using Master Proxies

Using the master proxies, we estimate the following regression model:

$$\mathit{PROFIT\_M}_{i,t} = \alpha + \beta_{1}\,\mathit{CASH}_{i,t} + \beta_{2}\,\mathit{CASH\_SQUARED}_{i,t} + \lambda\,\mathit{Controls}_{i,t} + \mathit{Year} + \varepsilon_{i,t} \qquad (2)$$

where, for firm i and year t, PROFIT_M represents the master proxy derived from ten profitability variables. CASH denotes the cash ratio, calculated as cash and marketable securities divided by the book value of total assets. CASH_SQUARED is the square of the cash ratio. We expect the coefficient (β₁) of CASH to be significantly positive and the coefficient (β₂) of CASH_SQUARED to be significantly negative, indicating an inverted U-shaped relationship between cash holdings and profitability in the IT industry. Control variables (Controls) include the master proxies for two liquidity variables (LIQUIDITY_M), two fixed asset investment variables (INVESTMENT_M), three growth variables (GROWTH_M), five size variables (SIZE_M), three financial leverage variables (LEVERAGE_M), and research and development intensity (R&D INTENSITY). Full definitions of these variables are provided in Appendix D. In addition, we control for year fixed effects (Year) in our regression. Standard errors are robust to both clustering at the firm level and heteroscedasticity.

Table 9 presents the empirical results from Equation (2), employing master proxies. After controlling for these variables, the coefficient for CASH remains significantly positive, and the coefficient for CASH_SQUARED remains significantly negative. These results align with our earlier findings and further confirm the robustness of the hypothesized inverted U-shaped relationship.

Table 9. Cash Holdings and Profitability Using Master Proxies.

5.6. Machine Learning Model Results

Next, we compare the predictive performance of OLS regression with several machine learning models. A higher R² indicates that the model explains more variance, while lower MAE (Mean Absolute Error) and RMSE (Root Mean Square Error) values suggest closer alignment between predicted and actual values. As shown in Table 10, the random forest model outperforms all others, achieving the highest R² and the lowest MAE and RMSE, indicating superior predictive performance. Compared to the traditional OLS model, the random forest explains 47 percent more variance, highlighting a substantial improvement in predictive performance.

Table 10. Comparison of Model Performances.

While OLS regression provides coefficients to interpret the direction and magnitude of each variable’s effect, tree-based and other nonlinear machine learning models such as Decision Tree, Random Forest, and Neural Networks do not yield interpretable coefficients. To address this, we compute SHAP (SHapley Additive exPlanations) values, which offer a theoretically grounded and interpretable measure of each feature’s contribution to the model’s predictions (; ; ; ; ; ). SHAP values not only reflect the importance of each feature but also indicate whether the feature has a positive or negative impact on the predictions ().

SHAP values are computed for all individual instances in the Random Forest model, and the mean values are used for global interpretation of the model. Table 11 shows that the mean SHAP value for CASH is positive and for CASH_SQUARED is negative, once again supporting the presence of an inverted U-shaped relationship between cash holdings and profitability. In Appendix E, we present a SHAP dependency plot with additional discussion in support of an inverted U-shaped relationship.
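The idea behind SHAP values can be illustrated with an exact Shapley computation on a small model. The sketch below enumerates all feature coalitions and replaces features outside a coalition with baseline values. It is a conceptual illustration only: the model, inputs, and baseline are hypothetical, and practical SHAP implementations for tree ensembles use faster, model-specific algorithms rather than this brute-force enumeration.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley values; features outside a coalition are set to the baseline."""
    names = list(x)
    n = len(names)

    def value(coalition):
        return model({k: (x[k] if k in coalition else baseline[k]) for k in names})

    phi = {}
    for name in names:
        others = [k for k in names if k != name]
        total = 0.0
        for size in range(n):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                total += weight * (value(set(subset) | {name}) - value(set(subset)))
        phi[name] = total
    return phi


# Hypothetical fitted model and inputs, for illustration only.
def model(f):
    return 0.02 + 0.8 * f["CASH"] - 1.0 * f["CASH_SQUARED"] + 0.1 * f["LIQUIDITY"]


x = {"CASH": 0.6, "CASH_SQUARED": 0.36, "LIQUIDITY": 2.0}
baseline = {"CASH": 0.3, "CASH_SQUARED": 0.09, "LIQUIDITY": 1.5}
phi = shapley_values(model, x, baseline)
```

The contributions sum to the difference between the prediction at `x` and the prediction at the baseline, which is the additivity property that makes SHAP values comparable across features. Averaging such per-instance values over all observations gives the kind of mean SHAP value reported for each feature.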

Table 11. Mean SHAP values in Random Forest Model.

6. Discussion

Our study explores the role of cash-holding decisions in improving IT firm performance. The IT sector is distinct due to the prevalence of risk-taking behaviors such as new market entry and high investment failure rates. Insufficient cash holding can constrain research and development (R&D) or acquisitions, exposing IT firms to the risk of financial distress and competitive disadvantages, whereas excessive cash may signal poor management, which could result in unnecessary expenditures or reduced shareholder value. Given competitive pressures and market volatility shaped by rapid technological innovation, IT firms should adopt efficient cash holding strategies to manage uncertainty and seize growth opportunities. We examine the impact of cash holdings on firm performance in the IT industry, using a dataset of 21,051 firm-year observations from US-listed IT firms between 2004 and 2023. To address limitations of conventional ordinary least squares (OLS) regression, we incorporate machine learning (ML) techniques and apply principal component analysis (PCA) to construct master proxies for key variables, thereby improving robustness and generalizability.

Our research is inspired by trade-off theory, which posits that firms can maximize value by maintaining an optimal debt-to-equity ratio. Likewise, our empirical analyses identify an inverted U-shaped relationship between cash holdings and profitability in IT firms, indicating that maintaining an optimal cash balance enhances firm performance in the IT sector. This relationship is valid for both conventional OLS regressions and machine learning models that use master proxies. In both specifications, the findings reveal an inverted U-shaped relationship between cash holdings and firm performance, indicating that while cash serves as a buffer against uncertainty and supports innovative projects, excessive cash retention may be associated with inefficient resource allocation or moral hazard. Even after controlling for key firm characteristics such as liquidity, investment, growth, size, leverage, and R&D intensity, the results consistently point to diminishing returns to cash. Overall, the evidence suggests that IT firm managers should pursue balanced cash-management strategies to optimize performance.

On the one hand, having insufficient cash can hinder IT firms’ ability to innovate and compete. Pecking order theory suggests that internal funds—such as cash holdings—are prioritized resources for investment because they are less costly than external sources. Without adequate internal funds, essential R&D projects might be delayed, and strategic investments such as product development or timely acquisitions of startups can lose momentum. In fast-moving technology markets, opportunities appear quickly and disappear just as quickly, and firms without sufficient cash might miss critical growth opportunities. Leading IT companies including Apple, Google, and Microsoft have demonstrated the value of maintaining cash buffers to remain agile and resilient. Thus, for IT firms on the left side of the inverted U-curve, increasing cash from low to moderate levels can yield substantial performance improvements.

On the other hand, excessive cash holdings can be detrimental for IT firms. Once liquidity exceeds the level needed to fund operations and key projects, additional cash might introduce inefficiencies and governance risks. According to agency theory, excess cash reserves often encourage managerial overconfidence or moral hazard, prompting executives to invest in risky projects or pursue questionable acquisitions simply because the funds are available. Empirical evidence also suggests that excessive cash can reduce shareholder value and compromise earnings quality (; ; ). Therefore, for IT firms on the right side of the inverted U-shaped curve, each extra dollar of idle liquidity imposes growing costs, in terms of both forgone opportunities and the agency problems that accompany financial slack. IT firms must thus treat cash as a strategic resource that requires active management. Conducting regular assessments of cash levels against industry peers and internal benchmarks can help firms make informed adjustments to their cash policies in a timely manner.

A second key finding is the superior performance of ML techniques over traditional OLS regression. Among the models tested, the random forest demonstrates the strongest predictive accuracy and best overall fit, outperforming OLS regression by explaining 47% more variance. This result highlights the ability of ML methods to capture interactions and nonlinearities that linear models often overlook. While ML models often provide higher predictive accuracy than traditional methods such as OLS, they tend to be less interpretable. OLS provides clear, coefficient-based insights into how each variable influences the outcome, whereas complex ML models typically function as black boxes, making it difficult to understand how predictions are generated. Our framework offers a more balanced approach by combining the predictive power of ML with interpretable techniques such as SHAP values, which help clarify the contribution of each variable and enhance the model’s overall explainability. The SHAP analysis reinforces our findings: the influence of cash holdings on firm performance follows an inverted U-shape, indicating that moderate cash levels enhance performance, whereas excessive holdings diminish it.
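SHAP values attribute a single prediction to individual features using Shapley values from cooperative game theory. The sketch below computes exact Shapley values in pure Python for a toy prediction function, replacing "absent" features with a fixed baseline value. The toy model, the baseline, and all names are assumptions for illustration; this is neither the study's fitted random forest nor the API of the shap library, which averages over a background dataset rather than using a single baseline point.

```python
from itertools import combinations
from math import factorial


def toy_model(x):
    """Hypothetical predictor: inverted U in cash (x[0]) plus a linear size effect (x[1])."""
    cash, size = x
    return 0.8 * cash - 1.0 * cash ** 2 + 0.05 * size


def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline)."""
    n = len(x)

    def value(subset):
        # Features in `subset` take the instance's value; the rest take the baseline.
        z = [x[i] if i in subset else baseline[i] for i in range(n)]
        return model(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for s in combinations(others, k):
                phi[i] += weight * (value(set(s) | {i}) - value(set(s)))
    return phi


baseline = [0.4, 1.0]  # assumed reference firm: cash ratio at the toy model's peak
results = {}
for cash in (0.1, 0.4, 0.9):
    results[cash] = shapley_values(toy_model, [cash, 2.0], baseline)
    print(f"cash={cash:.1f}  phi_cash={results[cash][0]:+.3f}  phi_size={results[cash][1]:+.3f}")
```

In this toy setting the contribution of cash is zero at the baseline peak and negative on either side of it, which is the pattern an inverted U-shape produces in a SHAP dependence plot. Exact enumeration scales exponentially in the number of features, which is why practical SHAP implementations rely on approximations or model-specific algorithms.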

Finally, the PCA-based master proxy approach addresses proxy selection bias (; ). For example, one master proxy constructed from ten profitability indicators captures nearly 40 percent of the total variance, equivalent to the explanatory power of four distinct proxies. We demonstrate that the master proxies capture a substantial amount of information, providing a reliable foundation for our empirical analyses.
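To make the master proxy idea concrete, the sketch below assumes that a master proxy is the first principal component of the standardized individual proxies, which is a common construction; the study's exact procedure may differ. It uses power iteration on the correlation matrix in pure Python and synthetic data, so the three simulated proxies, the function names, and the resulting variance share are illustrative assumptions only.

```python
import random
from statistics import mean, pstdev


def standardize(columns):
    """Z-score each proxy (one list per proxy) so that units do not drive the PCA."""
    out = []
    for col in columns:
        m, s = mean(col), pstdev(col)
        out.append([(v - m) / s for v in col])
    return out


def correlation_matrix(z):
    """Correlation matrix of already standardized columns."""
    n, p = len(z[0]), len(z)
    return [[sum(a * b for a, b in zip(z[i], z[j])) / n for j in range(p)] for i in range(p)]


def first_component(corr, iters=500):
    """Leading eigenvector and eigenvalue of a symmetric matrix via power iteration."""
    p = len(corr)
    v = [1.0 / p ** 0.5] * p
    for _ in range(iters):
        w = [sum(corr[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = sum(c * c for c in w) ** 0.5
        v = [c / norm for c in w]
    eig = sum(v[i] * sum(corr[i][j] * v[j] for j in range(p)) for i in range(p))
    return v, eig


def master_proxy(columns):
    """Return (scores, loadings, share of total variance) for the first component."""
    z = standardize(columns)
    loadings, eig = first_component(correlation_matrix(z))
    n_obs = len(z[0])
    scores = [sum(loadings[j] * z[j][i] for j in range(len(z))) for i in range(n_obs)]
    return scores, loadings, eig / len(z)


# Synthetic illustration: three noisy proxies of one latent construct.
rng = random.Random(1)
latent = [rng.gauss(0.0, 1.0) for _ in range(500)]
proxies = [[f + 0.5 * rng.gauss(0.0, 1.0) for f in latent] for _ in range(3)]

scores, loadings, share = master_proxy(proxies)
print(f"loadings={[round(l, 3) for l in loadings]}, variance share={share:.3f}")
```

Because the eigenvalues of a correlation matrix sum to the number of proxies, a variance share of s for p proxies corresponds to roughly s·p proxies' worth of information, which is the sense in which a share near 40 percent of ten indicators is comparable to four distinct proxies.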

Upon further analysis, we find that our empirical evidence holds during recessionary periods. This result implies that the inverted U-shaped relationship in IT firms can be leveraged as a strategic approach to risk management and performance, even under recessionary conditions. Interestingly, while the inverted U-shaped relationship between cash holdings and profitability becomes insignificant among IT firms in the bottom size quartile each year, it is significant for those in the top quartile. This implies that as an IT firm grows and increases in size, cash holding strategies become more critical to its performance, highlighting the need for managers in larger IT firms to pay closer attention to effective cash holding decisions.

Our research contributes to the literature on stakeholder theory by providing insights into how firms manage cash reserves within an optimal range. For example, our main findings suggest that IT firms with cash and marketable securities below 26.4% of total assets would benefit from increasing them, whereas firms above 44.7% may need to reduce them to optimize performance. As stakeholders gain greater awareness of optimal cash holding decisions, they are better positioned to influence corporate actions that boost risk management and overall performance. These insights are valuable not only for stakeholders but also for managers who design cash management strategies with future investment opportunities in mind. Our findings underscore the need for regulators to acknowledge optimal cash levels and enforce transparent cash disclosure policies, which can enhance trust and communication between corporate managers and stakeholders.

7. Limitations and Future Research

Our findings must be interpreted cautiously in light of several limitations. Although we establish a general inverted U-shaped relationship using a combination of quadratic regression, PCA-based master proxies, and comparative ML models, future studies should explore other methodological approaches to provide firm-specific guidance for managers and regulators on optimal cash thresholds and disclosure policies.

While we utilize data across IT sectors using NAICS classifications, future work could explore whether the relationship varies across distinct industry contexts. For instance, the optimal level of cash holdings in physical capital-intensive industries may differ from that in human capital-intensive industries, where cash is primarily allocated to talent development. Future studies could extend the analysis to additional industry contexts to assess whether the cash–performance relationship displays heterogeneous inverted U-shaped patterns or even alternative forms of nonlinear relationships.

It would also be an interesting avenue for future research to examine firms’ geographical location or cultural and social factors. For example, firms in countries with stronger uncertainty avoidance may hold higher levels of cash due to precautionary motives. Exploring how geographic, cultural, and social factors influence deviations from the optimal range of cash holdings could provide deeper insights into cross-country heterogeneity in corporate cash management practices.

Author Contributions

Conceptualization, J.L.; methodology, J.L. and B.K.J.; software, J.L. and B.K.J.; validation, J.L. and B.K.J.; formal analysis, J.L. and B.K.J.; investigation, J.L. and B.K.J.; resources, J.L. and B.K.J.; data curation, J.L. and B.K.J.; writing—original draft preparation, J.L. and B.K.J.; writing—review and editing, J.L. and B.K.J.; visualization, J.L. and B.K.J.; supervision, J.L.; project administration, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The financial statement data used in this study were obtained from the Compustat database.

Acknowledgments

We thank the participants of the 27th Southern Association for Information Systems (SAIS) Conference and the seminar at Coastal Carolina University for their helpful comments and suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Hyperparameter Configurations

Random Forest
batchSize = 100—The number of samples processed before updating model parameters during training.
numTrees = 100—Number of trees to generate in the forest.
maxDepth = 0—Unlimited depth (nodes expand until pure or minimum size).
numFeatures = 0—Uses the default: $\sqrt{\text{number of attributes}}$ for regression.
seed = 1—Random seed for reproducibility.
breakTiesRandomly = false—Ties not broken randomly.
minNum = 1.0—Minimum number of instances per leaf.
computeAttributeImportances = false—Variable importance not computed unless specified.
Decision Tree
batchSize = 100—The number of samples processed before updating model parameters during training.
minNum = 2.0—Minimum total weight of instances per leaf.
maxDepth = −1—Unlimited tree depth.
numFolds = 3—Folds used for reduced-error pruning.
seed = 1—Random seed for cross-validation pruning.
noPruning = false—Pruning enabled (i.e., reduced-error pruning).
maxOptimizationRuns = 5—Optimization iterations for pruning.
Support Vector Machine
batchSize = 100—The number of samples processed before updating model parameters during training.
kernel = PolyKernel—Polynomial kernel function.
filterType = Normalize training data—Preprocessing method for input features.
C = 1.0—Regularization parameter controlling margin/penalty tradeoff.
seed = 1—Random seed.
regOptimizer = RegSMOImproved—Optimization algorithm for SVM regression.
Multilayer Perceptron
batchSize = 100—The number of samples processed before updating model parameters during training.
hiddenLayers = “a”—One hidden layer with (attributes + classes)/2 neurons.
learningRate = 0.3—Step size for weight updates.
momentum = 0.2—Momentum for smoothing weight changes.
trainingTime = 500—Number of epochs (iterations).
validationSetSize = 0—No internal validation set.
seed = 0—Random seed for weight initialization.
normalizeAttributes = true—Input attributes normalized.
normalizeNumericClass = true—Output class normalized (for regression).
validationThreshold = 20—Number of epochs with no improvement before early stopping.

Appendix B. Proxies of Variables

Variables	Proxies	Sources
Profitability	EBIT (Earnings Before Interest and Taxes)/Total Assets	()
	Net Income/Total Assets	()
	Operating Income/Total Assets	()
	EBIT/the sum of Equity and Long-term Liabilities	()
	Cost of Goods Sold/Total Assets	()
	Cost of Goods Sold/Sales	()
	EBIT/Sales	()
	Net Income/Sales	()
	Net Income/Equity	()
	EBIT/Capital Employed	()
Liquidity	Current Assets/Current Liabilities	()
	Current Assets/Sales	()
Investment	Tangible Fixed Assets/Total Assets	()
	The difference in Fixed Assets	()
Growth	Sales Growth	()
	Operating Profit Growth	()
	Fixed Assets Growth	()
Size	Total Assets	()
	Logarithm of Total Assets	()
	Logarithm of Sales	()
	Logarithm of Sales/Total Assets	()
	Logarithm of Fixed Assets	()
Leverage	Total Debts/Total Assets	()
	Long-term Debts/Total Assets	()
	Total Debts/Capital Employed	()
Note: All variables defined above are based on book values.

Appendix C. Scree Plots and PCA Loadings

[Figure panels: scree plots and PCA loadings for Profitability, Liquidity, Investment, Growth, Size, and Leverage.]

Appendix D. Variable Definitions

Variables	Definitions
PROFIT_M	Master proxy for ten profitability variables.
CASH	Cash ratio, calculated as cash and marketable securities divided by the book value of total assets.
CASH_SQUARED	Squared value of cash ratio, equal to cash ratio × cash ratio.
LIQUIDITY_M	Master proxy for two liquidity variables.
INVESTMENT_M	Master proxy for two fixed asset investment variables.
GROWTH_M	Master proxy for three growth variables.
SIZE_M	Master proxy for five size variables.
LEVERAGE_M	Master proxy for three financial leverage variables.
R&D INTENSITY	Research and Development (R&D) intensity, calculated as R&D expenditure divided by total sales.
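As a brief illustration of the ratio definitions above, the following Python sketch computes CASH, CASH_SQUARED, and R&D INTENSITY from raw financial statement items. The function name and argument names are hypothetical and do not correspond to Compustat field names; the master proxies are not covered here because they require PCA over several indicators.

```python
def cash_variables(cash_and_securities, total_assets, rd_expense, sales):
    """Compute the ratio variables defined in Appendix D from raw statement items.

    All argument names are hypothetical placeholders for illustration.
    """
    if total_assets <= 0 or sales <= 0:
        raise ValueError("total_assets and sales must be positive")
    cash = cash_and_securities / total_assets  # CASH: cash ratio on book assets
    return {
        "CASH": cash,
        "CASH_SQUARED": cash * cash,           # squared term for the quadratic specification
        "R&D INTENSITY": rd_expense / sales,   # R&D expenditure over total sales
    }


example = cash_variables(cash_and_securities=300.0, total_assets=1000.0,
                         rd_expense=50.0, sales=500.0)
print(example)
```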

Appendix E. SHAP Dependency Plot

A SHAP dependency plot (n = 250) shows that as CASH increases (moving from left to right), the SHAP values for CASH slightly increase from negative to around zero or positive. This trend suggests that higher CASH levels have a mild positive impact on the model’s prediction, indicating a weak yet positive relationship. The colors represent CASH_SQUARED values (red for high and blue for low). When CASH_SQUARED is high (red), most SHAP values remain around zero or slightly negative, suggesting that high CASH_SQUARED reduces or stabilizes the influence of CASH. Overall, these patterns indicate that CASH initially contributes positively to the prediction, although its marginal effect subsequently diminishes, consistent with an inverted U-shaped relationship.

References

Abedin, M. Z., Guotai, C., Moula, F. E., Azad, A. S. M. S., & Khan, M. S. U. (2019). Topological applications of multilayer perceptrons and support vector machines in financial decision support systems. International Journal of Finance & Economics, 24(1), 474–507.
Abel, A. B. (2018). Optimal debt and profitability in the trade-off theory. The Journal of Finance, 73(1), 95–143.
Abuzayed, B. (2012). Working capital management and firms’ performance in emerging markets: The case of Jordan. International Journal of Managerial Finance, 8(2), 155–179.
Acosta-Jiménez, S., Mendoza-Mendoza, M. M., Galván-Tejada, C. E., Galván-Tejada, J. I., Celaya-Padilla, J. M., García-Domínguez, A., Gamboa-Rosales, H., & Solís-Robles, R. (2024). Detection of ovarian cancer using a methodology with feature extraction and selection with genetic algorithms and machine learning. Network Modeling Analysis in Health Informatics and Bioinformatics, 14(1), 3.
Afrifa, G. A. (2016). Net working capital, cash flow and performance of UK SMEs. Review of Accounting and Finance, 15(1), 21–44.
Amir, M., Azhar, Z., Kishan, A., & Krishnen, L. (2024). From the implementation of environmental management accounting to organizational sustainability: Does stakeholder integration strengthen it? Pakistan Journal of Commerce and Social Sciences, 18(4), 1065–1089.
Baños-Caballero, S., García-Teruel, P. J., & Martínez-Solano, P. (2010). Working capital management in SMEs. Accounting & Finance, 50(3), 511–527.
Beasley, M., Bradford, M., & Dehning, B. (2009). The value impact of strategic intent on firms engaged in information systems outsourcing. International Journal of Accounting Information Systems, 10(2), 79–96.
Benaroch, M., & Chernobai, A. (2017). Operational IT failures, IT value destruction, and board-level IT governance changes. MIS Quarterly, 41(3), 729–762.
Boot, A., & Vladimirov, V. (2019). (Non-)Precautionary cash hoarding and the evolution of growth firms. Management Science, 65(11), 5290–5307.
Chambers, N., & Cifter, A. (2022). Working capital management and firm performance in the hospitality and tourism industry. International Journal of Hospitality Management, 102, 103144.
Chan, T., Tan, C.-E., & Tagkopoulos, I. (2022). Audit lead selection and yield prediction from historical tax data using artificial neural networks. PLoS ONE, 17(11), e0278121.
Chang, C.-C., Kao, L.-H., & Chen, H.-Y. (2018). How does real earnings management affect the value of cash holdings? Comparisons between information and agency perspectives. Pacific-Basin Finance Journal, 51, 47–64.
Chen, S., De Simone, L., Hanlon, M., & Lester, R. (2023). The effect of innovation box regimes on investment and employment activity. The Accounting Review, 98(5), 187–214.
Chen, Y., Smith, A. L., Cao, J., & Xia, W. (2014). Information technology capability, internal control effectiveness, and audit fees and delays. Journal of Information Systems, 28(2), 149–180.
Chen, Y.-R., & Chuang, W.-T. (2009). Alignment or entrenchment? Corporate governance and cash holdings in growing firms. Journal of Business Research, 62(11), 1200–1206.
Danso, A., Lartey, T., Fosu, S., Owusu-Agyei, S., & Uddin, M. (2019). Leverage and firm investment: The role of information asymmetry and growth. International Journal of Accounting and Information Management, 27(1), 56–73.
Das, S. (2015). Cash management in IT sector—A study. Journal of Commerce and Accounting Research, 4, 27–39.
Deb, P., David, P., & O’Brien, J. (2017). When is cash good or bad for firm performance? Strategic Management Journal, 38(2), 436–454.
Demers, E., Gaertner, F. B., Kausar, A., Li, H., & Steele, L. B. (2024). Aggregate tone and gross domestic product. Contemporary Accounting Research, 41(4), 2574–2599.
Dewan, S., & Ren, F. (2011). Information technology and firm boundaries: Impact on firm risk and return performance. Information Systems Research, 22(2), 369–388.
Dittmar, A., Mahrt-Smith, J., & Servaes, H. (2003). International corporate governance and corporate cash holdings. The Journal of Financial and Quantitative Analysis, 38(1), 111–133.
Doshi, H., Kumar, P., & Yerramilli, V. (2018). Uncertainty, capital investment, and risk management. Management Science, 64(12), 5769–5786.
Eger, R. J., III, & Hermis, J. M. (2025). Capital structure of special-purpose governments. Journal of Public Budgeting, Accounting & Financial Management, 37(3), 393–414.
Elayan, F. A., Li, J., & Meyer, T. O. (2008). Accounting irregularities, management compensation structure and information asymmetry. Accounting & Finance, 48(5), 741–760.
Farinha, J., Mateus, C., & Soares, N. (2018). Cash holdings and earnings quality: Evidence from the main and alternative UK markets. International Review of Financial Analysis, 56, 238–252.
Faysal, S. (2024). The analysis of capital structure theories in emerging markets. International Journal of Management, Accounting & Economics, 11(2), 148–160.
Frésard, L., & Salva, C. (2010). The value of excess cash and corporate governance: Evidence from US cross-listings. Journal of Financial Economics, 98(2), 359–384.
García-Teruel, P. J., Martínez-Solano, P., & Sánchez-Ballesta, J. P. (2009). Accruals quality and corporate cash holdings. Accounting & Finance, 49(1), 95–115.
Gholampoor, H., & Asadi, M. (2024). Risk analysis of bankruptcy in the U.S. healthcare industries based on financial ratios: A Machine learning analysis. Journal of Theoretical and Applied Electronic Commerce Research, 19(2), 1303–1320.
Gow, I. D., Larcker, D. F., & Zakolyukina, A. A. (2023). How important is corporate governance? Evidence from machine learning. Chicago booth research paper no. 22-16, no. 2022-137. University of Chicago.
Greiner, A. J. (2017). An examination of real activities management and corporate cash holdings. Advances in Accounting, 39, 79–90.
Gu, Z., & Gao, L. (2000). A multivariate model for predicting business failures of hospitality firms. Tourism and Hospitality Research, 2(1), 37–49.
Habib, A., Monzur Hasan, M., & Al-Hadi, A. (2017). Financial statement comparability and corporate cash holdings. Journal of Contemporary Accounting & Economics, 13(3), 304–321.
Harford, J., Mansi, S. A., & Maxwell, W. F. (2008). Corporate governance and firm cash holdings in the US. Journal of Financial Economics, 87(3), 535–555.
Hennessy, C. A., & Whited, T. M. (2007). How costly is external financing? Evidence from a structural estimation. The Journal of Finance, 62(4), 1705–1745.
Hirth, S., & Uhrig-Homburg, M. (2010). Investment timing when external financing is costly. Journal of Business Finance & Accounting, 37(7–8), 929–949.
Howe, J. S., & Jain, R. (2010). Testing the trade-off theory of capital structure. Review of Business, 31(1), 54–67.
Hunt, J. O., Myers, J. N., & Myers, L. A. (2022). Improving earnings predictions and abnormal returns with machine learning. Accounting Horizons, 36(1), 131–149.
Iliyas, M., & Barca, M. (2025). A chronological review of resource-based theory and future research directions. Journal of Management, 4, 41–53.
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 374(2065), 20150202.
Jones, S., Johnstone, D., & Wilson, R. (2017). Predicting corporate bankruptcy: An evaluation of alternative statistical frameworks. Journal of Business Finance & Accounting, 44(1–2), 3–34.
Kayakus, M., Tutcu, B., Terzioglu, M., Talaş, H., & Ünal Uyar, G. F. (2023). ROA and ROE forecasting in iron and steel industry using machine learning techniques for sustainable profitability. Sustainability, 15(9), 7389.
Kim, C., & Bettis, R. A. (2014). Cash is surprisingly valuable as a strategic asset. Strategic Management Journal, 35(13), 2053–2063.
Kim, J. H., Lim, J., Ahn, J. S., & Kim, Y. (2023). Cash holdings and firm performance in the restaurant industry. Journal of Applied Business and Economics, 25(3), 193–202.
Kori, A., & Gadagin, N. (2024). Interpretable financial risk models: Leveraging gradient boosting and feature importance analysis. International Research Journal of Modernization in Engineering Technology and Science, 6(11), 3347–3366.
La Rocca, M., & Cambrea, D. R. (2019). The effect of cash holdings on firm performance in large Italian companies. Journal of International Financial Management & Accounting, 30(1), 30–59.
Lee, E., & Powell, R. (2011). Excess cash holdings and shareholder value. Accounting & Finance, 51(2), 549–574.
Li, M., Sun, H., Huang, Y., & Chen, H. (2024). Shapley value: From cooperative game to explainable artificial intelligence. Autonomous Intelligent Systems, 4(1), 2.
Li, N. (2010). Negotiated measurement rules in debt contracts. Journal of Accounting Research, 48(5), 1103–1144.
Lim, J. (2023). Organization capital and corporate governance. Journal of Risk and Financial Management, 16(9), 384.
Liu, J., Xiong, X., Gao, Y., & Zhang, J. (2023). The impact of institutional investors on ESG: Evidence from China. Accounting & Finance, 63(S2), 2801–2826.
Louis, H., Sun, A. X., & Urcan, O. (2012). Value of cash holdings and accounting conservatism. Contemporary Accounting Research, 29(4), 1249–1271.
Lozano, M. B., & Yaman, S. (2020). The European financial crisis and firms’ cash holding policy: An analysis of the precautionary motive. Global Policy, 11(S1), 84–94.
Lundberg, S. M., & Lee, S.-I. (2017, December 4–9). A unified approach to interpreting model predictions. The 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA.
Mahmood, F., Ahmed, Z., Hussain, N., & Ben-Zaied, Y. (2025). Working capital financing and firm performance: A machine learning approach. Review of Quantitative Finance and Accounting, 65, 71–106.
Melo, M. C., Bernardi, R. C., De La Fuente-Nunez, C., & Luthey-Schulten, Z. (2020). Generalized correlation-based dynamical network analysis: A new high-performance approach for identifying allosteric communications in molecular dynamics trajectories. The Journal of Chemical Physics, 153(13), 134104.
Meng, Y., Yang, N., Qian, Z., & Zhang, G. (2021). What makes an online review more helpful: An interpretation framework using XGBoost and SHAP values. Journal of Theoretical and Applied Electronic Commerce Research, 16(3), 466–490.
MengYun, W., Um-e-Habiba, Husnain, M., Sarwar, B., & Ali, W. (2021). Board financial expertise and corporate cash holdings: Moderating role of multiple large shareholders in emerging family firms. Complexity, 2021(1), 6397515.
Menike, L., Dunusinghe, P., & Ranasinghe, A. (2015). Macroeconomic and firm specific determinants of stock returns: A comparative analysis of stock markets in Sri Lanka and in the United Kingdom. Journal of Finance and Accounting, 3(4), 86–96.
Merrick, L., & Taly, A. (2020, August 25–28). The explanation game: Explaining machine learning models using shapley values. International Cross-Domain Conference for Machine Learning and Knowledge Extraction, Dublin, Ireland.
Mitchell, R. K., Van Buren, H. J., III, Greenwood, M., & Freeman, R. E. (2015). Stakeholder inclusion and accounting for stakeholders. Journal of Management Studies, 52(7), 851–877.
Mohamed, W. N. H. W., Salleh, M. N. M., & Omar, A. H. (2012, November 23–25). A comparative study of reduced error pruning method in decision tree algorithms. 2012 IEEE International Conference on Control System, Computing and Engineering, Penang, Malaysia.
Mousa, G. A., Elamir, E. A. H., & Hussainey, K. (2022). Using machine learning methods to predict financial performance: Does disclosure tone matter? International Journal of Disclosure and Governance, 19(1), 93–112.
Mubarek, A. M., & Adalı, E. (2017, October 5–8). Multilayer perceptron neural network technique for fraud detection. 2017 International Conference on Computer Science and Engineering (UBMK), Antalya, Turkey.
Murthy, U. S., Smith, T. J., Whitworth, J. D., & Zhang, Y. (2020). The effects of information systems compatibility on firm performance following mergers and acquisitions. Journal of Information Systems, 34(2), 211–233.
Myers, S. C., & Majluf, N. S. (1984). Corporate financing and investment decisions when firms have information that investors do not have. Journal of Financial Economics, 13(2), 187–221.
Myers, S. C., & Rajan, R. G. (1998). The paradox of liquidity. The Quarterly Journal of Economics, 113(3), 733–771.
Najem, R., Bahnasse, A., Fakhouri Amr, M., & Talea, M. (2025). Advanced AI and big data techniques in E-finance: A comprehensive survey. Discover Artificial Intelligence, 5(1), 102.
Nguyen Thanh, C., & Phan Huy, T. (2025). Predicting financial reports fraud by machine learning: The proxy of auditor opinions. Cogent Business & Management, 12(1), 2510556.
Nohara, Y., Matsumoto, K., Soejima, H., & Nakashima, N. (2022). Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Computer Methods and Programs in Biomedicine, 214, 106584.
Nwude, E. C., Allison, P. U., & Nwude, C. A. (2021). The relationship between working capital management and corporate returns of cement industry of emerging market. International Journal of Finance & Economics, 26(3), 3222–3235.
Opler, T., Pinkowitz, L., Stulz, R., & Williamson, R. (1999). The determinants and implications of corporate cash holdings. Journal of Financial Economics, 52(1), 3–46.
Ozkan, A., & Ozkan, N. (2004). Corporate cash holdings: An empirical investigation of UK companies. Journal of Banking & Finance, 28(9), 2103–2134.
Ozlem, S., & Tan, O. F. (2022). Predicting cash holdings using supervised machine learning algorithms. Financial Innovation, 8(1), 44.
Park, J. C., & Wu, Q. (2009). Financial restatements, cost of debt and information spillover: Evidence From the secondary loan market. Journal of Business Finance & Accounting, 36(9–10), 1117–1147.
Pinillos, J., Macías, H., Castrillon, L., Eslava, R., & De la Cruz, S. (2025). Analysis of the capital structure of Latin American companies in light of trade-off and pecking order theories. Journal of Risk and Financial Management, 18(7), 399.
Rozemberczki, B., Watson, L., Bayer, P., Yang, H.-T., Kiss, O., Nilsson, S., & Sarkar, R. (2022, July 23–29). The shapley value in machine learning. The 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence, Vienna, Austria.
Saldanha, T. J. V., Andrade-Rojas, M. G., Kathuria, A., Khuntia, J., & Krishnan, M. S. (2024). How the locus of uncertainty shapes the influence of CEO long-term compensation of information technology capital investments. MIS Quarterly, 48(2), 459–490.
Shubho, S. A., Razib, M. R. H., Rudro, N. K., Saha, A. K., Khan, M. S. U., & Ahmed, S. (2019, December 18–20). Performance analysis of NB Tree, REP tree and random tree classifiers for credit card fraud data. 2019 22nd International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh.
Silva, S. (2025). Trade credit and corporate profitability: Evidence from EU-based SMEs. Journal of Corporate Accounting & Finance, 36(1), 81–92.
Singhania, M., Sharma, N., & Yagnesh Rohit, J. (2014). Working capital management and profitability: Evidence from Indian manufacturing companies. Decision, 41(3), 313–326.
Situmeang, F. B. I., Gemser, G., Wijnberg, N. M., & Leenders, M. A. A. M. (2016). Risk-taking behavior of technology firms: The role of performance feedback in the video game industry. Technovation, 54, 22–34.
Sun, Q., Yung, K., & Rahman, H. (2012). Earnings quality and corporate cash holdings. Accounting & Finance, 52(2), 543–571.
Tan, J., & Peng, M. W. (2003). Organizational slack and firm performance during economic transitions: Two studies from an emerging economy. Strategic Management Journal, 24(13), 1249–1263.
Tellez Gaytan, J. C., Ateeq, K., Rafiuddin, A., Alzoubi, H. M., Ghazal, T. M., Ahanger, T. A., Chaudhary, S., & Viju, G. K. (2022). AI-based prediction of capital structure: Performance comparison of ANN SVM and LR models. Computational Intelligence and Neuroscience, 2022(1), 8334927.
Theissen, M. H., Jung, C., Theissen, H. H., & Graf-Vlachy, L. (2023). Cash holdings and firm value: Evidence for increasing marginal returns. Journal of Management Scientific Reports, 1(3–4), 260–300.
The United States Census Bureau. (2025). North American industry classification system. Available online: https://www.census.gov/naics/ (accessed on 11 October 2025).
Toms, S. (2010). Value, profit and risk: Accounting and the resource-based view of the firm. Accounting, Auditing & Accountability Journal, 23(5), 647–670.
Trustorff, J.-H., Konrad, P. M., & Leker, J. (2011). Credit risk prediction using support vector machines. Review of Quantitative Finance and Accounting, 36(4), 565–581.
Weigel, C., & Hiebl, M. R. (2023). Accountants and small businesses: Toward a resource-based view. Journal of Accounting & Organizational Change, 19(5), 642–666.
Wernerfelt, B. (1984). A resource-based view of the firm. Strategic Management Journal, 5(2), 171–180.
Wu, D., Ma, X., & Olson, D. L. (2022). Financial distress prediction using integrated Z-score and multilayer perceptron neural networks. Decision Support Systems, 159, 113814.
Xie, X.-T. (2020). Technology enterprise value assessment based on BP neural network. International Journal of Computing Science and Mathematics, 12(2), 192–203.
Xin, M., & Choudhary, V. (2019). IT investment under competition: The role of implementation failure. Management Science, 65(4), 1909–1925.
Yin, Y., & Yin, H. (2025, April 25–27). Optimization of the relationship between cash holding and corporate performance through digital technology. The 2025 International Conference on Digital Economy and Information Systems, Guangzhou, China.
Zahariev, A., Angelov, P., & Zarkova, S. (2022). Estimation of bank profitability using vector error correction model and support vector regression. Economic Alternatives, 28(2), 157–170.
Zakaria, N., Sulaiman, A., Min, F. S., & Feizollah, A. (2023). Machine learning in the financial industry: A bibliometric approach to evidencing applications. Cogent Social Sciences, 9(2), 2276609.
Zhang, C., Zhang, H., & Liu, D. (2019). A contrastive study of machine learning on energy firm value prediction. IEEE Access, 8, 11635–11643.
Zhang, S. (2024, December 4–5). Application of random forest algorithm in accounting data analysis and prediction. 2024 4th International Conference on Mobile Networks and Wireless Communications (ICMNWC), Tumakuru, India.

Table 1. Selected IT Industries Based on NAICS Classification.

NAICS Codes	Industries
334	Computer and Electronic Product Manufacturing
51121	Software Publishers
51321	Software Publishers
517	Telecommunications
518	Data Processing, Hosting, and Related Services
519	Other Information Services
54151	Computer Systems Design and Related Services

Table 2. Descriptive Statistics.

	N	25th Percentile	Mean	Median	75th Percentile	Standard Deviation
PROFIT	21,051	−0.115	−0.311	0.027	0.088	4.105
CASH	21,051	0.097	0.291	0.237	0.443	0.229
CASH_SQUARED	21,051	0.009	0.137	0.056	0.196	0.182
LIQUIDITY	21,051	1.169	3.043	1.969	3.460	5.412
INVESTMENT	21,051	0.037	0.140	0.082	0.175	0.158
GROWTH	21,051	−0.046	1.421	0.080	0.257	70.821
SIZE	21,051	37.573	5684.242	270.810	1727.153	25,421.916
LEVERAGE	21,051	0.003	0.552	0.131	0.335	9.241
R&D INTENSITY	21,051	0.011	1.234	0.094	0.195	28.665

Note: For each proxy, one representative variable is selected as follows. PROFIT is EBIT divided by total assets. CASH is the cash ratio, measured as cash and marketable securities divided by the book value of total assets. CASH_SQUARED is the square of the cash ratio. LIQUIDITY is current assets divided by current liabilities. INVESTMENT is tangible fixed assets divided by total assets. GROWTH is sales growth. SIZE is total assets, expressed in millions. LEVERAGE is total debt divided by total assets. R&D INTENSITY is R&D expenditure divided by total sales.
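
The ratio definitions in the note to Table 2 can be sketched in Python as follows. This is a minimal illustration rather than the authors' code: the field names (`cash_and_securities`, `total_assets`, and so on) are hypothetical placeholders for the underlying financial statement items, and sales growth is assumed to be the year-over-year relative change in sales, which the note does not spell out.

```python
def table2_variables(item, prior_sales):
    """Compute the Table 2 variables for one firm-year.

    `item` is a dict of raw financial statement values (hypothetical
    field names); `prior_sales` is the previous year's sales.
    """
    total_assets = item["total_assets"]
    cash = item["cash_and_securities"] / total_assets
    return {
        "PROFIT": item["ebit"] / total_assets,
        "CASH": cash,
        "CASH_SQUARED": cash * cash,
        "LIQUIDITY": item["current_assets"] / item["current_liabilities"],
        "INVESTMENT": item["tangible_fixed_assets"] / total_assets,
        # Assumption: growth is the year-over-year relative change in sales.
        "GROWTH": (item["sales"] - prior_sales) / prior_sales,
        "SIZE": total_assets,  # total assets, in millions
        "LEVERAGE": item["total_debt"] / total_assets,
        "RD_INTENSITY": item["rd_expense"] / item["sales"],
    }


# Made-up firm-year, for illustration only.
example = table2_variables(
    {
        "total_assets": 200.0,
        "cash_and_securities": 50.0,
        "ebit": 10.0,
        "current_assets": 90.0,
        "current_liabilities": 45.0,
        "tangible_fixed_assets": 20.0,
        "sales": 120.0,
        "total_debt": 40.0,
        "rd_expense": 12.0,
    },
    prior_sales=100.0,
)
```

Because CASH_SQUARED is a deterministic function of CASH, the two are highly correlated by construction, which is consistent with the 0.952 correlation reported in Table 3.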

Table 3. Pearson Correlation Matrix.

	(1)	(2)	(3)	(4)	(5)	(6)	(7)	(8)	(9)
PROFIT (1)	1.000
CASH (2)	0.017	1.000
CASH_SQUARED (3)	0.004	0.952 *	1.000
LIQUIDITY (4)	0.034 *	0.337 *	0.354 *	1.000
INVESTMENT (5)	−0.022 *	−0.317 *	−0.286 *	−0.119 *	1.000
GROWTH (6)	−0.002	−0.012	−0.008	−0.006	−0.008	1.000
SIZE (7)	0.022 *	−0.127 *	−0.106 *	−0.058 *	0.150 *	−0.004	1.000
LEVERAGE (8)	−0.525 *	−0.029 *	−0.022 *	−0.026 *	0.033 *	0.000	−0.007	1.000
R&D INTENSITY (9)	−0.030 *	0.061 *	0.078 *	0.043 *	−0.006	−0.001	−0.009	0.014	1.000

Note: * denotes significance at the 0.01 level. The definitions of variables are described in Table 2.

Table 4. Cash Holdings and Profitability.

	PROFIT
	Coef.	t-Value
CASH	1.7240	2.270 **
CASH_SQUARED	−2.4247	−2.995 ***
LIQUIDITY	0.0208	3.103 ***
INVESTMENT	−0.1179	−0.370
GROWTH	−0.0001	−0.433
SIZE	0.0001	5.100 ***
LEVERAGE	−0.2322	−7.630 ***
R&D INTENSITY	−0.0030	−3.760 ***
Year fixed effects	Yes
N	21,051
Adj. R-sq	0.277

Note: *, **, and *** denote significance at the 10%, 5%, and 1% (two-sided) level, respectively. Standard errors are robust to both clustering at the firm level and heteroscedasticity. The definitions of variables are described in Table 2.
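
With a positive coefficient on CASH and a negative coefficient on CASH_SQUARED, the fitted relationship peaks at the cash ratio where the marginal effect is zero, that is, at -b1 / (2 * b2). The short sketch below applies this formula to the Table 4 point estimates. It is plain arithmetic on the reported coefficients, not an estimate taken from the paper, and it ignores sampling uncertainty; the function name is hypothetical.

```python
def turning_point(b_cash, b_cash_sq):
    """Cash ratio at which b_cash * c + b_cash_sq * c**2 reaches its maximum.

    An interior maximum exists only when the squared term is negative.
    """
    if b_cash_sq >= 0:
        raise ValueError("no interior maximum unless the squared term is negative")
    return -b_cash / (2.0 * b_cash_sq)


# Coefficients on CASH and CASH_SQUARED as reported in Table 4.
tp_table4 = turning_point(1.7240, -2.4247)
print(f"Implied turning point: {tp_table4:.3f}")
```

The implied turning point is a cash ratio of roughly 0.36, which lies between the median (0.237) and the 75th percentile (0.443) of CASH in Table 2.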

Table 5. One-Year-Forward Dependent Variable.

	One-Year-Forward PROFIT
	Coef.	t-Value
CASH	2.0576	2.173 **
CASH_SQUARED	−3.2437	−2.459 **
LIQUIDITY	0.0161	2.151 **
INVESTMENT	−0.0132	−0.029
GROWTH	−0.0001	−0.612
SIZE	0.0001	2.968 ***
LEVERAGE	−0.2763	−4.164 ***
R&D INTENSITY	−0.0051	−2.143 **
Year fixed effects	Yes
N	18,281
Adj. R-sq	0.090

Note: *, **, and *** denote significance at the 10%, 5%, and 1% (two-sided) level, respectively. Standard errors are robust to both clustering at the firm level and heteroscedasticity. The dependent variable is the one-year-forward PROFIT, calculated as one-year-forward EBIT divided by one-year-forward total assets. The definitions of other variables are described in Table 2.
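
The construction of a one-year-forward dependent variable can be sketched as below. This is an illustrative assumption about the mechanics, not the authors' procedure: each firm-year is matched to the same firm's observation in the following year, and firm-years without a next-year match are dropped, which is one way the sample could shrink from 21,051 to 18,281 observations. The keys `firm`, `year`, and `profit` are hypothetical.

```python
def add_forward_profit(rows):
    """Attach next year's profit to each firm-year that has one.

    `rows` is a list of dicts with keys "firm", "year", and "profit".
    Firm-years with no observation for the same firm in year + 1 are dropped.
    """
    by_key = {(r["firm"], r["year"]): r for r in rows}
    matched = []
    for r in rows:
        nxt = by_key.get((r["firm"], r["year"] + 1))
        if nxt is not None:
            matched.append({**r, "profit_fwd": nxt["profit"]})
    return matched


# Made-up panel: firm B has a gap in 2021, so none of its rows can be matched.
panel = [
    {"firm": "A", "year": 2020, "profit": 0.02},
    {"firm": "A", "year": 2021, "profit": 0.05},
    {"firm": "A", "year": 2022, "profit": 0.04},
    {"firm": "B", "year": 2020, "profit": -0.10},
    {"firm": "B", "year": 2022, "profit": -0.03},
]
forward_panel = add_forward_profit(panel)
```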

Table 6. Subsample Analysis: Big IT Firms.

	One-Year-Forward PROFIT
	Coef.	t-Value
CASH	0.1207	2.903 ***
CASH_SQUARED	−0.1284	−1.776 *
LIQUIDITY	−0.0013	−1.065
INVESTMENT	0.0324	1.146
GROWTH	0.0014	0.962
SIZE	−0.0001	−1.810 *
LEVERAGE	0.0216	0.911
R&D INTENSITY	−0.3007	−4.909 ***
Year fixed effects	Yes
Firm fixed effects	Yes
N	5258
Adj. R-sq	0.040

Note: *, **, and *** denote significance at the 10%, 5%, and 1% (two-sided) level, respectively. The dependent variable is the one-year-forward PROFIT, calculated as one-year-forward EBIT divided by one-year-forward total assets. The definitions of other variables are described in Table 2.

Table 7. Effect of Recessionary Periods.

	PROFIT
	Coef.	t-Value
CASH	1.6279	2.587 ***
CASH_SQUARED	−2.7366	−2.565 **
LIQUIDITY	0.0248	1.928 *
INVESTMENT	−0.9977	−1.463
GROWTH	−0.0002	−1.907 *
SIZE	0.0001	3.480 ***
LEVERAGE	−0.2260	−6.207 ***
R&D INTENSITY	−0.0006	−1.326
Year fixed effects	Yes
N	5479
Adj. R-sq	0.613

Note: *, **, and *** denote significance at the 10%, 5%, and 1% (two-sided) level, respectively. Standard errors are robust to both clustering at the firm level and heteroscedasticity. The definitions of variables are described in Table 2.

Table 8. Master Proxy Variances.

Variables	Number of Proxies	Variance Explained by PC1
Profitability	10	39.7%
Liquidity	2	57.7%
Investment	2	51.1%
Growth	3	36.4%
Size	5	67.6%
Leverage	3	48.8%

Note: The definitions of proxies are provided in Appendix D. PC1 denotes the first principal component as the master proxy.
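
For the two constructs built from only two proxies (Liquidity and Investment), the share of variance explained by PC1 has a closed form, which helps in reading Table 8. The sketch below assumes the PCA is run on standardized proxies (that is, on the correlation matrix), which the table note does not state. Under that assumption the eigenvalues are 1 + |r| and 1 - |r|, so the PC1 share is (1 + |r|) / 2, where r is the correlation between the two proxies. The function names are hypothetical.

```python
def pc1_share_two_proxies(r):
    """PC1 variance share for two standardized variables with correlation r."""
    return (1.0 + abs(r)) / 2.0


def implied_abs_correlation(pc1_share):
    """Invert the formula: |r| implied by a reported two-proxy PC1 share."""
    return 2.0 * pc1_share - 1.0


# PC1 shares reported in Table 8 for the two-proxy constructs.
r_liquidity = implied_abs_correlation(0.577)
r_investment = implied_abs_correlation(0.511)
print(f"Implied |r|, liquidity proxies:  {r_liquidity:.3f}")
print(f"Implied |r|, investment proxies: {r_investment:.3f}")
```

Read this way, a PC1 share only slightly above 50% indicates that the two proxies are weakly correlated, so the master proxy captures little more than either proxy alone. With more than two proxies there is no such shortcut, and the leading eigenvalue has to be computed numerically.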

Table 9. Cash Holdings and Profitability Using Master Proxies.

	PROFIT_M
	Coef.	t-Value
CASH	0.0502	2.438 **
CASH_SQUARED	−0.0460	−2.380 **
LIQUIDITY_M	0.0068	2.363 **
INVESTMENT_M	−0.0047	−3.574 ***
GROWTH_M	0.0013	2.184 **
SIZE_M	0.0210	6.512 ***
LEVERAGE_M	−0.2958	−2.423 **
R&D INTENSITY	0.0001	2.101 **
Year fixed effects	Yes
N	21,051
Adj. R-sq	0.136

Note: *, **, and *** denote significance at the 10%, 5%, and 1% (two-sided) level, respectively. Standard errors are robust to both clustering at the firm level and heteroscedasticity. The definitions of variables are described in Appendix D.

Table 10. Comparison of Model Performances.

Models	MAE	RMSE	R²
Random Forest	0.0084	0.0723	0.6090
Neural Network	0.0128	0.0989	0.4293
Decision Tree	0.0140	0.1010	0.2174
Support Vector Machine	0.0128	0.1101	0.1207
OLS Regression	0.0824	1.0098	0.1360

Note: Model performance is assessed using Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R².
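
The three metrics in Table 10 follow their standard definitions, sketched below in plain Python. The sample values are made up for illustration. The last lines reproduce the gap in R² between the random forest and OLS from the table (about 0.47), which matches the "47% more variance" figure in the abstract when read as percentage points.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up values, for illustration only.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 2.5, 4.0]
metrics = (mae(y_true, y_pred), rmse(y_true, y_pred), r_squared(y_true, y_pred))

# R² values reported in Table 10.
r2_gap = 0.6090 - 0.1360
print(f"Random forest minus OLS R²: {r2_gap:.3f}")
```

Note that R² computed on held-out data can be negative when a model predicts worse than the sample mean, so it is not bounded below by zero in that setting.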

Table 11. Mean SHAP values in Random Forest Model.

	Mean SHAP Value
CASH	0.000140
CASH_SQUARED	−0.000222
LIQUIDITY_M	−0.000064
INVESTMENT_M	−0.000367
GROWTH_M	0.000479
SIZE_M	0.003200
LEVERAGE_M	−0.000068
R&D INTENSITY	0.001593

Note: Dependent variable is the master proxy for ten profitability variables (PROFIT_M). The definitions of variables are described in Appendix D.
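
One simple way to read Table 11 is to rank the features by the magnitude of their mean SHAP value, as in the sketch below. This ranking is only indicative: the table reports signed means, in which positive and negative contributions across observations can cancel, whereas global importance is more commonly summarized by the mean of absolute SHAP values, which cannot be recovered from the table.

```python
# Mean SHAP values as reported in Table 11.
mean_shap = {
    "CASH": 0.000140,
    "CASH_SQUARED": -0.000222,
    "LIQUIDITY_M": -0.000064,
    "INVESTMENT_M": -0.000367,
    "GROWTH_M": 0.000479,
    "SIZE_M": 0.003200,
    "LEVERAGE_M": -0.000068,
    "R&D INTENSITY": 0.001593,
}

# Rank features by the absolute value of the reported (signed) mean.
ranked = sorted(mean_shap.items(), key=lambda kv: abs(kv[1]), reverse=True)

for name, value in ranked:
    print(f"{name:<14} {value:+.6f}")
```

On this reading, SIZE_M and R&D INTENSITY have the largest mean contributions, while the opposite signs of CASH and CASH_SQUARED are consistent with the inverted U-shaped pattern found in the regressions.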

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
