Next Article in Journal
Enhanced Random Forest Classifier with K-Means Clustering (ERF-KMC) for Detecting and Preventing Distributed-Denial-of-Service and Man-in-the-Middle Attacks in Internet-of-Medical-Things Networks
Previous Article in Journal
Comparative GIS Analysis of Public Transport Accessibility in Metropolitan Areas
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Sports Analytics and Text Mining NBA Data to Assess Recovery from Injuries and Their Economic Impact

by
Vangelis Sarlis
,
George Papageorgiou
and
Christos Tjortjis
*
School of Science and Technology, International Hellenic University, 57001 Thessaloniki, Greece
*
Author to whom correspondence should be addressed.
Computers 2023, 12(12), 261; https://doi.org/10.3390/computers12120261
Submission received: 23 November 2023 / Revised: 10 December 2023 / Accepted: 12 December 2023 / Published: 16 December 2023

Abstract

:
Injuries are an unfortunate part of professional sports. This study aims to explore the multi-dimensional impact of injuries in professional basketball, focusing on player performance, team dynamics, and economic outcomes. Employing advanced machine learning and text mining techniques on suitably preprocessed NBA data, we examined the intricate interplay between injury and performance metrics. Our findings reveal that specific anatomical sub-areas, notably knees, ankles, and thighs, are crucial for athletic performance and injury prevention. The analysis revealed the significant economic burden that certain injuries impose on teams, necessitating comprehensive long-term strategies for injury management. The results provide valuable insights into the distribution of injuries and their varied effects, which are essential for developing effective prevention and economic strategies in basketball. By illuminating how injuries influence performance and recovery dynamics, this research offers comprehensive insights that are beneficial for NBA teams, healthcare professionals, medical staff, and trainers, paving the way for enhanced player care and optimized performance strategies.

1. Introduction

Sports analytics have found a strong position in the realm of sports, particularly in predicting potential setbacks and optimizing team and player performance. This is evident in diverse applications, from aiding team management in estimating and mitigating injury risks to assisting betting companies in forecasting player and team performance. The overarching goal remains consistent: to reduce associated costs and elevate performance metrics, either for an individual player or the entire team [1,2,3,4]. Engaging in sports boasts numerous advantages but also leads to injuries. Several parameters, such as age, sex, sport nature (contact or noncontact), and training intensity, govern the susceptibility of athletes to injuries [5,6].
According to the National Basket Association (NBA), injuries are a significant concern, as they can potentially detrimentally impact player performance. Although advancements have been made in prevention and rehabilitation, injury rates remain alarmingly high. This unpredictability in injury occurrence can be attributed to various factors, many of which are challenging to pinpoint and measure. Furthermore, the repercussions of such injuries are not only felt by players but also by management, fans, teams, and their overall dynamics [7,8].
Basketball’s universal appeal, much of which can be credited to the NBA’s prominence, spans from casual games to professional tournaments. The influence of the NBA is undeniable, impacting not only the sport’s popularity but also the socioeconomic narratives of players, teams, and the league itself [9,10].
The proliferation of big data has revolutionized how sports professionals, from trainers to healthcare experts, approach injury management. With the advent of biometric tracking tools such as accelerometers, RFID devices, HR monitors, and GPS wearables, the ability to diagnose team or player vulnerabilities has improved. These technologies play pivotal roles in refining performance, curtailing injury risks, and accelerating recovery trajectories [11,12,13].
An injury can have a significant impact on the economic data of a team. A key player sustaining an injury can cause a decline in team performance and ultimately affect revenues and profitability, as it can lead to a decrease in ticket and merchandise sales. If a player is not able to fully recover from an injury, this can cause a decline in their performance on the court and ultimately affect the team’s overall performance [14,15,16,17].
Another study reviewed in-depth injury analysis from the 2017–2021 NBA seasons. There was a rise in injuries and games missed, with the highest injury rates for guards and athletes with 6–15 years of experience. This study underscores the necessity of further research to understand and reduce injury risks in professional basketball [18].
This can also lead to a decline in revenue from sponsorships and advertising. Additionally, injuries can affect a team’s salary cap, which is the amount of money that the team has available to spend on player salaries. If a player sustains an injury, they may not be able to perform at the same level as they did before, which can affect their future salary and contract negotiations. This can also affect the team’s ability to sign new players, as they may have less money available to spend on player salaries. Overall, an injury can have a significant impact on the economic data of a team. Although the immediate effect may be a decrease in attendance, merchandise sales and revenue from sponsorship, the long-term effect can be a negative impact on a team’s overall performance, attendance, merchandise sales, revenue from sponsorship, salary cap, and ability to sign new players [19,20,21].
Researchers have investigated the use of a new ML approach to detect injury risk factors in young team-sport athletes [9]. The results showed that the new approach was effective at detecting injury risk factors and could be a useful tool in injury prevention and risk management.
Several researchers have investigated the knee movement patterns of injured and uninjured adolescent basketball players when landing from a jump. The authors conducted a case–control study and reported that injured players had different knee movement patterns than uninjured players when landing. The results suggest that certain knee movement patterns may contribute to an increased risk of injury in adolescent basketball players [22,23].
In one study, researchers identified the use of ML in classifying the integrity of articular cartilage in the knee joint. They used near-infrared spectroscopy to gather data on the biochemical composition of cartilage samples. Afterward, they applied ML algorithms to the data to classify the cartilage samples into categories of different levels of integrity. The results showed that the ML approach was able to accurately classify the samples, suggesting that the ML approach could be a useful tool for detecting and monitoring cartilage degradation in the knee joint [24,25,26].
The subsequent sections methodically explore the intricate connections between injuries and player performance, the economic repercussions for teams, and the efficacy of modern data science methodologies in addressing these challenges. Section 2, ‘Data and Methods’ details our approach, employing advanced data analytics and text mining techniques to scrutinize injury patterns and their impacts. Section 3, ‘Results,’ presents our findings, highlighting the significant influence of injuries on both player performance and team economics. Finally, in Section 4, ‘Discussion’,’ we interpret these findings, offering insights into the broader implications for injury prevention, team management strategies, and future research directions in the domain of sports analytics. This research aims to explore the multi-dimensional impact of injuries in professional basketball, with a focus on player performance, team dynamics, and economic outcomes. Utilizing text mining techniques on NBA data, this study examines the complex relationship between injury and performance metrics, revealing the significance of certain anatomical sub-areas. The analysis also uncovers the substantial economic burden that injuries impose on teams, highlighting the need for comprehensive, long-term strategies for injury management. The broader objectives of this research are to provide valuable insights into the distribution of injuries and their varied effects, which are crucial for developing effective prevention strategies and economic approaches in basketball. By illuminating the influence of injuries on performance and recovery dynamics, this study offers comprehensive insights beneficial to NBA teams, healthcare professionals, medical staff, and trainers. This contributes to enhancing player care and optimizing performance strategies. This study methodically explores the connections between injuries and player performance, the economic repercussions for teams, and the efficacy of modern data science methodologies in addressing these challenges By synthesizing data-driven analysis with practical considerations, this paper aims to contribute a comprehensive perspective on the complexities and nuances of managing sports injuries in the high-stakes environment of professional basketball.

2. Data and Methods

We aimed to apply advanced data analytics and text mining techniques to study the influence of sports injuries on athlete performance, recovery dynamics, and economic implications for basketball teams. These objectives were achieved using text mining to find patterns among specific anatomical injury sub-areas and shifts in advanced performance indicators across series (2-game, 5-game, and 10-game) were delineated. Subsequent analyses examined the recovery trajectories of patients with different injury types, elucidating the factors influencing recovery duration. Economic ramifications were assessed by scrutinizing the perturbations in pertinent basketball analytics post-injury events.

2.1. Research Questions/Hypothesis

  • Is it possible, using text mining, to distinguish patterns and relationships between references to injuries of anatomical sub-areas and changes in advanced performance indicators across different series, such as 2-game, 5-game, and 10-game series? (RQ1)
  • Which types of injuries take longer for athletes to recover from before they can return to action? (RQ2)
  • How do the different injury types vary in correlation with advanced basketball performance metrics? (RQ3)
  • How does an injury affect the economic data of a team based on basketball analytics? (RQ4)

2.2. Methodology

This study utilized data from multiple sources [27,28,29] to ensure comprehensive information for a robust analysis. Extensive data scraping and preprocessing techniques were utilized. Data was collected for NBA players’ performance, injuries, and salaries from the 2000-01 to 2022-23 seasons using the nba_api to extract data from the NBA’s official website [29] and then consolidated into one dataset based on the data cross-validation, without the information fulfillment and features-addition of the other data source [27,28]. Data retrieval and preprocessing were challenging and involved consolidating data into a supervised model and prioritizing the quality of the data. Our methodology encompasses two dimensions, Data Collection and Data Analysis, as detailed in the following subsections, and it includes data collection, preprocessing, analysis, and results evaluation [30]. Figure 1 presents the methodology outlined for this manuscript. Each step in the process is clearly delineated, providing a concise overview of the sequence from the data collection to the final analysis phases, including injury- and salary-data transformation. The preprocessing included removing irrelevant data, employing text mining to extract detailed information from textual descriptions of injuries and player contracts, and standardizing salary figures with inflation rates. The final dataset, containing 749,631 records across 158 columns, was meticulously prepared for analysis by sorting and dividing into subsets focused on performance and injury data, ensuring a thorough and detailed approach to data analysis.

2.2.1. Data Collection

The data collection involved handling disorganized and diverse data. The methodology included gathering, text mining, preprocessing, analysis, and result evaluation. We used Python scripts and KNIME Analytics Platform flows for data acquisition. Preprocessing included identifying and removing missing and irrelevant data. The data underwent Extract, Transform, and Load (ETL) processing using the NBA API, PostgreSQL, and Python scripts.
Our study acquired and analyzed a comprehensive dataset covering player performance, injuries, and salaries for NBA players from the 2000-01 and 2022-23 seasons. This section details the data sources, types, and dataset shapes. Player performance and injury data were obtained from the nba_api through the extraction of data from the NBA’s official website and database. Web scraping occurred from 2000-01 to 2022-23, involving 2296 players across those seasons. [29].
We conducted two scrape runs: one for regular season per-game_date player performance and another for playoffs. We compiled nine distinct datasets (leaguegamelog, players, boxscoreadvanced, boxscorefourfactors, boxscoremisc, boxscoreplayertrack, boxscorescoring, boxscoretraditional, and boxscoreusage). These datasets were merged using player_id and game_date, creating a complete dataset stored in the PostgreSQL database.
Additionally, the datasets included comment-based metadata explaining player absences, including coaching decisions and injuries. A schema summary of the datasets used is given in Table 1.

2.2.2. Data Engineering

After the raw data were acquired, the next phase involved data engineering. This section explains the processes for data cleansing, structuring, and enhancing the collected data to prepare them for detailed analysis.
  • Preprocessing Player Performance Data
The player performance dataset included a comprehensive array of advanced player statistics on a per-game–date basis, including various key performance indicators (KPIs). These KPIs covered general player information, advanced box score stats, “Four Factors”, miscellaneous metrics, player tracking, traditional scoring, and usage stats. “Four Factors” is a strategic framework focusing on key elements that significantly influence game outcomes. These factors include efficient shooting, characterized by high field goal percentages; ball control, emphasizing minimal turnovers; rebounding prowess, both offensive and defensive; and frequent, effective trips to the free-throw line. Dean’s methodology highlights the importance of these aspects in determining a team’s success on the court, offering a comprehensive blueprint for aspiring players and coaches to enhance their performance and strategies [31,32].
Although not all KPIs were consistently available throughout the entire data scraping period (e.g., specific advanced performance statistics were absent in earlier years, as in 2000), we decided to retain as much relevant information as possible for our comprehensive analysis. Duplicate records, particularly those related to primary player reference information such as player_name, were identified and removed.
  • Injuries Text Mining and Categorization (RQ1)
The injury dataset obtained via nba_api was integrated with our performance dataset. This integration involved adding a comment column within each performance sub-dataset, containing textual descriptions of player injuries. Due to the absence of standardized formats in these comments, we applied text mining techniques to extract injury information.
Next, a comprehensive analysis of injury text records and an initial exploratory analysis were conducted involving word frequency counts to identify the prominent terms (258 unique terms) within injury descriptions. First, we ensured that common stop words, such as “and”, “in”, and “on”, as well as positional words such as “left” and “right”, were filtered out to highlight the pertinent terms.
Subsequently, an n-gram analysis was performed, accounting for injuries, whose descriptions might contain “n” consecutive words for clarity, e.g., “strain knee”. From both our single-word and n-gram” analyses (e.g., 4451 bigram terms), we correctly identified the key aspects of all injuries.
Furthermore, a qualitative examination was crucial. By manually inspecting our results, we identified key terms and phrases that accurately represented the nature and body areas of the injuries. Using these keywords, we formulated our categorization system, wherein each injury description was mapped to a specific category or injury area. As a result, we developed a customized dictionary to classify injuries into predefined categories (244 unique terms).
In the final stage of our mapping, we utilized a predefined map for injuries based on body regions [11], culminating in detailed anatomical sub-areas. For instance, a comment such as “placed on IL with sprained left ankle/right ankle” was categorized as “sprained ankle” for the injury, and “ankle” as the anatomical sub-area.
Moreover, the dataset included multiple duplicates arising from cases where a player missed more than one game due to injury. To address this, we identified and retained only the initial instance and the date of the initial injury occurrence based on specific conditions. Consequently, after mapping the textual descriptions, records were flagged as ‘duplicate = TRUE’ if they referred to the same type of injury for the same player within a 15-day window from the last reported occurrence of that specific injury type.
  • Salaries-Data Transformation (RQ2)
To convert the contracts dataset into a more analytically valuable salaries dataset, we employed text mining techniques once again. This involved extracting contract durations and monetary values from the textual descriptions within the scraped data. For example, a contract described as a “signed restricted free agent (from Clippers) to a 6-year, USD 51 M contract” was parsed to determine its length (6 years) and amount (USD 51 million).
Additionally, we incorporated the inflation rate in the data from [33,34] to standardize the salary figures. This adjustment allowed for more meaningful comparisons of player salaries, considering the year the contract was signed and the economic context in the U.S., where the NBA operates.

2.2.3. Data Analysis

The final dataset for this study resulted from complex data integration, incorporating advanced performance metrics, injury records, and salary data from the 2000-01 to 2022-23 regular and playoff seasons. This comprehensive dataset comprises 749,631 records across 158 columns, encompassing player performance, injury history, and financials.
  • Primary Data Structure
Before the analytical procedures, this merged dataset underwent two additional preprocessing steps. First, records were sorted by player_name and game_date to ensure a sequential arrangement for each player. The dataset was subsequently divided into two datasets:
  • Performance: Data with non-null game dates and performance metrics.
  • Injury: Data with non-null injury dates.
  • Statistical Evaluations
Two statistical tests were applied to the performance data:
  • Paired Sample t-test: Compared pre- and post-injury performance means were compared, providing t-statistics and p-values for each performance metric.
  • Effect Size Estimation: Cohen’s D was used to measure differences between pre- and post-injury performance metrics.
  • Summary of Statistics
In summary, we compiled comprehensive summary statistics:
  • Unique Players: Determined the number of total unique players and those meeting the analysis criteria.
  • Non-NA Records: Counted non-null data points for pre- and post-injury performance metrics.
  • Injury Incidence and criteria metrics: Total injuries were calculated for those meeting specific criteria.
The results were stored in a PostgreSQL database and organized for subsequent exploratory data analysis and hypothesis testing.

3. Results

In this section, the results show the effects of injuries on basketball players’ performance by comparing a series of performance metrics captured during two, five, and ten games before and after injury events. Combining Cohen’s D with the t-test in research provides a more comprehensive understanding of the data’s statistical significance and practical importance in evaluating effect sizes in analysis. The Cohen’s D thresholds 0.2–0.5 for Small, 0.5–0.8 for Medium, and greater than 0.8 for Large Effect are used in science and represent standard benchmarks for interpreting the magnitude of an effect size, regardless of the direction (positive or negative) of the effect. In the context of our research, t-tests were used to identify metrics that showed significant differences pre- and post-injury. Hence, Cohen’s D was used to categorize these differences into small, medium, or large effects. This combination of tests allows us not only to report which differences are statistically significant but also to refer to the practical implications of these differences based on their magnitude. Utilizing statistical methods to assess the significance and magnitude of changes in player efficiency, we aimed to reveal actionable insights into the impact of injuries [35,36,37]. This study not only quantifies the immediate and short-term implications of injuries but also contributes to the strategic planning of player recovery and game strategy, offering a valuable resource for coaches, medical staff, and sports analysts.
Table 2, Table 3 and Table 4 show the results of analyzing anatomical sub-areas in correlation with the significance of the dataset’s performance metrics, as stated in A1 table of the Appendix, the effect size, and the average percentage of change in 2-, 5- and 10-game series (RQ1).
Two Games before/after the injury.
Areas with Significant Impact:
  • The anatomical sub-areas that had statistically significant impacts (p-values less than 0.05) included the ankle, knee, thigh, and abdominal areas and many others, totaling 18 different areas.
  • Cohen’s–D and t-statistic:
    • Large Effect (>0.8): No areas with a large effect size were identified.
    • Medium Effect (0.5–0.8): No areas with a medium effect size were identified.
    • Small Effect (0.2–0.5): The upper arm and forearm area was the only area with a small effect size.
Percentage Change:
  • Most Impacted: The abdominal area experienced the most significant average percentage change, suggesting a considerable decrease in performance metrics post-injury.
  • Least Impacted: The chest area was the least impacted based on the average percentage change, indicating a less substantial decrease in performance.
Areas of Concern:
  • The areas of concern due to significant p-values, but smaller effect sizes included the ankle, knee, and thigh areas and several others, highlighting the need for careful consideration of both statistical significance and effect size.
Five Games before/after the injury.
Areas with significant impact: Very significant impacts on performance were observed in various areas, such as the ankle, knee, thigh, thoracolumbar, and foot areas.
  • Cohen’s D and t-statistic:
    • Large Effect (>0.8): No areas fell under this category.
    • Medium Effect (0.5–0.8): No areas with a medium effect size were identified.
    • Small Effect (0.2–0.5): No areas with a small effect size were identified.
  • Percentage Change:
    • Most impacted: The upper arm–forearm area was identified as the most impacted area based on the average percentage change, indicating a considerable decrease in performance post-injury.
    • Least impacted: The shin area was identified as the least impacted area, suggesting a relatively small decrease in performance.
  • Areas of Concern:
    • Areas of concern with significant p-values but without a strong effect size included the ankle, knee, thigh, thoracolumbar, and foot areas, among others. These areas may require further attention due to this statistical significance.
Ten games before/after the injury.
Areas with significant impact: Very significant impacts on performance were noted in various areas, such as the ankle, knee, thigh, thoracolumbar, and foot areas, among others.
  • Cohen’s D and t-statistic:
    • Large Effect (>0.8): No areas fell under this category.
    • Medium Effect (0.5–0.8): No areas with a medium effect size were identified in the latest dataset.
    • Small Effect (0.2–0.5): No areas with a small effect size are identified. This finding suggested that either the effect sizes were less than 0.2 or that the criteria for categorization may need adjustment based on the dataset specifics.
  • Percentage Change:
    • Most impacted: The chest area is identified as the most impacted area based on the average percentage change, suggesting a notable decrease in performance metrics post-injury.
    • Least impacted: The upper arm–forearm area is identified as the least impacted area, which may suggest a relatively small change in performance.
  • Areas of Concern:
    • Areas of concern with significant p-values but without a corresponding large or medium effect size included the ankle, knee, thigh, thoracolumbar, and foot areas, among several others.
Table 5, Table 6 and Table 7 show the statistically significant results of the advanced performance metrics analyses, including test significance values, effect sizes, and average percentages of change in the 2-, 5-, and 10-game series (RQ1).
Two games before/after the injury.
Notable Observations:
  • Metrics such as POSS_ADVANCED, OFF_RATING_ADVANCED, DEF_RATING_ADVANCED, and others had average p-values less than 0.05, indicating significant impacts in these areas.
  • Cohen’s D and t-statistic:
    • Large Effect: No metrics displayed a large effect size (Cohen’s D > 0.8).
    • Medium Effect: Metrics such as PLUS_MINUS_TRADITIONAL, PCT_BLK_USAGE, and BLK_TRADITIONAL had medium effect sizes (Cohen’s D between 0.5 and 0.8).
    • Small Effect: PCT_TOV_USAGE was the only metric with a small effect size (Cohen’s D between 0.2 and 0.5).
  • Percentage Change:
    • Most Impacted: DFGM_PLAYER_TRACK was the metric with the most significant average percentage change, indicating a considerable decrease in performance metrics post-injury.
    • Least Impacted: BLK_TRADITIONAL was the metric with the least average percentage change, suggesting a small change in performance.
  • Areas of Concern: Metrics such as POSS_ADVANCED, OFF_RATING_ADVANCED, and DEF_RATING_ADVANCED had significant p-values with Cohen’s D <= 0.2, which may indicate areas of concern despite their statistical significance.
Five games before/after the injury.
Metrics with significant impact: All the provided metrics had average p-values less than 0.05, indicating statistically significant impacts.
  • Cohen’s D and t-statistic:
    • Large Effect: None.
    • Medium Effect (0.5–0.8): Metrics such as ‘OPP_FTA_RATE_FOUR_FACTORS’, ‘PCT_PF_USAGE’, ‘E_NET_RATING_ADVANCED’, and ‘OPP_TOV_PCT_FOUR_FACTORS’ fell into this category.
    • Small Effect (0.2–0.5): No metrics fell into the small effect category.
  • Percentage Change:
    • Most impacted: ‘E_NET_RATING_ADVANCED’ was the most impacted metric, with the highest median average percentage change.
    • Notably Positive: Metrics, including ‘POSS_ADVANCED’, ‘OPP_FTA_RATE_FOUR_FACTORS’, ‘PCT_PF_USAGE’, and ‘OPP_TOV_PCT_FOUR_FACTORS’ showed notably positive changes.
  • Metrics of Concern: Metrics such as ‘POSS_ADVANCED’, ‘FG_PCT_PLAYER_TRACK’, ‘OFF_RATING_ADVANCED’, ‘DEF_RATING_ADVANCED’, and ‘E_DEF_RATING_ADVANCED’ were of concern due to their negative Cohen’s D values.
Ten games before/after the injury.
Metrics with significant impact: All the provided metrics had average p-values less than 0.05, indicating statistically significant impacts.
  • Cohen’s D and t-statistic:
    • Large Effect (>0.8): None.
    • Medium Effect (0.5–0.8): ‘OPP_TOV_PCT_FOUR_FACTORS’ and ‘E_NET_RATING_ADVANCED’ were identified in this category.
    • Small Effect (0.2–0.5): Most of the other metrics fell into this category.
  • Percentage Change:
    • The most impacted genes, ‘OPP_TOV_PCT_FOUR_FACTORS’ and ‘E_NET_RATING_ADVANCED’, were identified in this category.
    • Notably Negative: ‘OPP_TOV_PCT_FOUR_FACTORS’ and ‘E_NET_RATING_ADVANCED’ showed notable positive changes.
    • Metrics of Concern: Metrics such as ‘OPP_PTS_FB_MISC’, ‘DEF_RATING_ADVANCED’, ‘OFF_RATING_ADVANCED’, ‘E_DEF_RATING_ADVANCED’, ‘POSS_ADVANCED’, and ‘OPP_EFG_PCT_FOUR_FACTORS’ were of concern due to their negative Cohen’s D values.
In sports, and more specifically in basketball which we examined through this research, understanding the intricate relationships between player injuries, their recovery time, and the resulting economic impact on teams is crucial. This understanding not only aids in better injury management but also in strategizing financial and team dynamics. To delve deeper into this aspect, our study presents two key tables—Table 8 and Table 9—each serving a distinct yet interrelated purpose. Data revealed a significant correlation between the duration of player recovery and the financial implications for NBA teams, underscoring the critical nature of injury management and prevention in professional basketball.
The dataset analyzed in Table 8 encompassed a total of 30 unique teams to assess the average recovery time from injuries and quantify the total economic losses incurred by these teams as a result of these injuries. This table is instrumental in highlighting the broader impact of injuries across the league, offering insights into the average duration players take to recover and how this downtime translates into financial terms for their respective teams. The average recovery time for each team was calculated by summing the recovery periods for all injuries incurred by players on each team over the years studied, and then dividing this sum by the total number of injuries. To determine the total financial losses, we multiplied the per-game salary of each player by the number of games missed due to injury, and then aggregated these totals for each team, corresponding to the teams to which the respective players belonged over the years.
The average recovery time across all teams was approximately 35.98 days, with the total sum of team losses reaching USD 21,208,828,385.5. The team with the highest average recovery time was NOH–NOK–NOP (New Orlean Hornets and New Orleans Pelicans are the same teams with rebranding or relocation), at 48.19 days, which was correlated with a sum of team losses of USD 850,911,070.3. Conversely, the team with the lowest average recovery time was SAS (San Antonio Spurs), at 18.89 days, associated with a sum of team losses of USD 484,022,964.1 (RQ4).
Table 9, on the other hand, takes a more granular approach by breaking down injuries into specific anatomical sub-areas. It examines the average recovery time and associated economic losses for each type of injury, categorized by its location on the body. This detailed analysis is pivotal in understanding which injuries are most detrimental in terms of recovery time and economic burden, thereby guiding teams and healthcare professionals in prioritizing injury prevention and treatment strategies based on the anatomical area affected. It shows the average recovery time in days and the associated financial losses for different anatomical sub-areas affected by injuries within a sport context (RQ2–4).
The data encompassed a range of anatomical sub-areas with varying average recovery times and sums of team losses. The area with the longest average recovery time was the hand–thumb–fingers area at 51.29 days, coinciding with a financial impact of USD 1,025,316,589.5. In contrast, the digestive area had the shortest recovery time at 13.02 days, with associated losses of USD 107,701,146.1.
The knee area had the highest financial burden at USD 4,223,672,393.1, aligning with a significant recovery time of 44.47 days. Respiratory issues, despite having a low average recovery time of 19.80 days, had a disproportionately high financial impact of USD 30,870,312.7, which could reflect the broader implications of respiratory problems on player health and availability. The total sum of team losses across all anatomical sub-areas was substantial, amounting to USD 21,208,828,385.1 for the period from 2000 to 2023.
Table 10, Table 11 and Table 12 provide a breakdown of injuries categorized by anatomical sub-areas with further classification into defensive, miscellaneous, offensive, and rating injury categories. The Grand Total represents the sum of all these categories for each anatomical area. Table 10, Table 11 and Table 12 are based on the proper categorization of Table A1 to achieve more focused analysis. Our filtered data analysis revealed a distribution of injuries across various anatomical sub-areas, with an emphasis on the effect size and significance of each injury. Table A1 shows the categorization of basketball performance analytics (defensive, miscellaneous, offensive, and rating). Rating metrics provide a high-level view of a player’s overall impact. Offensive and defensive metrics break down the specifics of how points are scored and prevented and what efficiencies exist in various facets of the game. The miscellaneous category offers additional context and insights into the nuances of gameplay, such as fast break effectiveness or how players indirectly contribute to scoring (RQ3).
In the analyses shown in Table 10, we examined the incidence of anatomical sub-area injuries across different performance analytics categories: miscellaneous (Misc), offensive, and defensive plays, in addition to the ratings of these injuries. Our dataset spanned two game seasons and included injuries related to COVID-19. The pelvic area exhibited the greatest number of statistically significant effects on performance (11), with the majority falling under the defensive category (3). Interestingly, COVID-19-related issues were notable, with a total of nine occurrences, indicating a significant impact on player availability. The wrist and abdominal areas were also common injury sites, with seven and eight incidences, respectively. Defensive play was associated with the highest number of injuries (24), followed by offensive play (16) and miscellaneous causes (8).
The analysis of anatomical sub-area injuries across a five-game series (Table 11) indicated that a total of 22 statistically significant effects were associated with performance variation. Defensive plays accounted for the highest number of injuries (11), suggesting a greater risk for this play type. The upper arm–forearm area, along with the abdominal area, had the highest number of injuries recorded (three each), reflecting their vulnerability or exposure during gameplay. COVID-19-related instances were recorded (two), but their impact was less pronounced than that of other injury types. Notably, only five injuries were rated, with the rest not specified for severity.
As shown in Table 12, over the course of ten games, our analysis revealed a total of 11 statistically significant effects on performance across four anatomical sub-areas. The upper arm–forearm area had the highest incidence of injuries, with a total of six injuries, four of which were rated for their impact. Defensive plays were again highlighted as the most common scenario for injuries, with a total of four occurrences. Injuries in other areas were less frequent, with the fibular and pelvic areas each reporting two cases and the abdominal area reporting one. Only one injury was attributed to offensive play, and one was categorized as miscellaneous.

4. Discussion

Cohen’s D provides a standardized metric of the difference between two means in terms of standard deviations, making it easier to understand the magnitude of the difference. On the other hand, t-test tests determine whether the difference is statistically significant. A large t-statistic often indicates a large effect size (although the relationship is not strictly linear due to the square root in the denominator) (RQ1).
According to the Results section, one of the aims of the paper is to identify the patterns and relationships of anatomical sub-areas and advanced performance metrics in game series in two-, five-, and ten-game series. In each case, the analysis has shown the following:
  • For two games before/after the injury.
  • Inconclusive Areas:
  • Inconclusive areas, such as the ankle, knee, and thigh areas, included those with significant p-values but were not the most or least impacted.
  • Notable Observations.
  • The Negative Cohen’s D, many areas, including the ankle, knee, and thigh areas, had negative Cohen’s D values, suggesting a decrease in performance post-injury across these areas.
  • Contrast Areas: There are no areas that had a positive average Cohen’s D but a significant p-value, which suggests that all significant areas had a negative impact on performance.
  • Percentage Change Insight: Many areas showed a negative average percentage change, indicating a decrease in performance post-injury. The significant areas with a negative change in performance cover a broad range, from the ankle area to the respiratory area, suggesting widespread impacts of injuries.
  • For five games before/after the injury.
  • Inconclusive Areas:
  • Inconclusive areas with significant p-values that were neither most nor least impacted included the ankle, knee, thigh, thoracolumbar, foot, abdominal, and other areas. These areas exhibited significant statistical findings but did not show the extremities of impact, warranting a more nuanced interpretation.
  • Notable Observations.
  • Negative Cohen’s D: Several areas showed a negative Cohen’s D, suggesting that injuries in these areas, such as the ankle, knee, and thigh areas and several others, generally lead to a decrease in performance post-injury.
  • Percentage Change Insight: The analysis showed that most of the significant areas had a negative average percentage change, indicating a general decrease in performance post-injury across various anatomical sub-areas.
  • For ten games before/after the injury.
  • Inconclusive Areas:
  • Inconclusive areas with significant p-values that were neither most nor least impacted included the ankle, knee, thigh, thoracolumbar, foot, abdominal, and other areas. The impact in these areas is significant but does not show extremes, which could warrant further investigation or a more detailed analysis.
  • Notable Observations.
  • Negative Cohen’s D: Several areas exhibited a negative Cohen’s D, indicating a general trend toward decreased performance post-injury. This includes the ankle, knee, thigh, thoracolumbar, foot, and additional areas.
  • Percentage Change Insight: A range of areas showed a negative average percentage change, denoting a decrease in performance post-injury. These included the ankle, thigh, abdominal, heel, toes, elbow, chest, pelvic, and fibular areas.
The matrix below (Table 13) and Figure 2 (bar chart that compares the percentage change of the anatomical sub-areas) above offers a comparative analysis of the performance impact before and after injury events across two-, five-, and ten-game series. We have structured our findings around key metrics: areas with significant impact, effect size based on Cohen’s D, percentage change in performance, and highlighted areas of concern. This approach provides a comprehensive view of how injuries affect player performance across different games, enabling us to pinpoint specific areas that require attention and possible intervention (RQ1).
  • For two games before/after the injury—Basketball Performance Analytics.
  • Notable Observations.
  • Negative Cohen’s D: Not explicitly listed; however, metrics with a negative Cohen’s D indicate a decrease in performance post-injury.
  • Percentage Change Insight: The most significant percentage change was found for DFGM_PLAYER_TRACK, with a change of −71.23%, and the least significant change was a 65.15% change for BLK_TRADITIONAL. This insight highlights the metrics that have undergone the most and least changes in terms of performance.
  • For five games before/after the injury—Basketball Performance Analytics.
  • Notable Observations.
  • Negative Cohen’s D: A significant number of performance metrics had negative values for Cohen’s D, indicating that, on average, post-injury performance tends to be lower than pre-injury performance.
  • Percentage Change Insight: ‘PCT_PF_USAGE’ stands out as the most impacted metric. The highest positive change observed was 61.32% (in ‘PCT_PF_USAGE’). The most significant negative change was −183.09% (in ‘E_NET_RATING_ADVANCED’).
  • For ten games before/after the injury—Basketball Performance Analytics.
  • Notable Observations.
  • Negative Cohen’s D: The same metrics listed as ‘Metrics of Concern’ also feature here, indicating their potential negative impact.
  • Percentage Change Insight: ‘E_NET_RATING_ADVANCED’ was the most impacted metric, with the highest positive change of 44.24%. The most significant negative change was −11.83% in ‘OPP_PTS_FB_MISC’.
The matrix in Table 14 and Figure 3 (bar chart comparison of the performance metrics median percentage change for two-, five-, and ten-game series) consolidate the information from the previous analyses. The “Most Impacted” column focuses on the largest negative impact, while the “Least Impacted/Notably Positive” column emphasizes metrics that either showed minor declines or demonstrated potential positive changes post-injury. The “Metrics of Concern” column underscores the metrics that exhibited notable declines in performance, which might be areas to prioritize in future analyses or interventions.
The data present the relationship between recovery time and team losses. Several hypotheses could be posited as follows (RQ2–4):
  • Team Performance: Longer recovery times could either indicate a more thorough recovery protocol, potentially leading to better long-term team performance, or could be a sign of more severe injuries. The data alone do not clarify this relationship.
  • Economic Impact: An injury’s economic impact (sum of team losses) does not have a clear connection with recovery time. This could be due to a multitude of factors not accounted for in this dataset, such as the nature of the sport, insurance policies, revenue streams of each team, or the impact of star players’ injuries.
The variation in recovery times and financial losses across anatomical sub-areas could be indicative of several underlying factors (RQ2 and RQ4):
  • Frequency of Injuries: Certain areas, such as the hand–thumb–finger and knee areas, may be more prone to injuries that are both severe and frequent, leading to longer recovery times and higher costs.
  • Economic Impact: This variable is proportional to recovery time, which suggests that some injuries, even those that are less frequent or have shorter recovery times, may still incur significant costs. This could be due to factors such as the player’s position, the importance of the player to the team, or medical expenses specific to certain types of injuries.
  • Implications for Player Health: Respiratory issues, while having a shorter average recovery time, have a high financial impact, possibly due to the COVID-19 pandemic’s effects, as shown by the separate listing of COVID-19 with a very high recovery time (76.44 days) and associated costs.
The data indicate a variance in injury occurrence according to anatomical sub-area, with some regions being more prone to injury in certain contexts (defensive vs. offensive play) than others (RQ3).
  • Injury distribution: A greater frequency of injuries in the pelvic and fibular areas might indicate that these regions are either more vulnerable during play or that the nature of the sport involves more activities that put these areas at risk.
  • Contextual Classification: The predominance of injuries in the miscellaneous category could suggest either a wide variety of other non-classifiable injuries or a potential issue with the classification system itself.
  • Offensive vs. Defensive Injuries: The data might reflect the different types of stresses placed on the body during offensive and defensive plays, with certain areas being more affected by offensive maneuvers and others by defensive actions.
  • Implications for Prevention and Treatment: Understanding which anatomical areas are most at risk and in which context can help in developing targeted injury prevention and rehabilitation programs.
Table 15 synthesizes the results of Table 10, Table 11 and Table 12 injuries across the three types of game series (two, five, and ten games) to highlight trends in physical injuries within different anatomical sub-areas and types of play. This study offers insights into the recurrent nature of certain injuries and informs targeted preventative strategies (RQ1).

5. Conclusions and Future Work

The anatomical impact assessment showed consistent involvement of the knee, ankle, and thigh areas across all of the game series, suggesting that these may be critical areas for athletic performance or injury prevention. In terms of the performance metrics, all of the selected metrics consistently demonstrated statistical significance, with p-values less than 0.05, indicating that these metrics were significantly different across games. Notably, the percentage changes in metrics, such as ‘BLK_TRADITIONAL’ in game two and ‘PCT_PF_USAGE’ in game five, exhibited substantial positive changes, while ‘E_NET_RATING_ADVANCED’ demonstrated the most significant negative change in game five and most positive change in game ten series, indicating a possible area of volatility or targeted improvement. Interestingly, ‘DEF_RATING_ADVANCED’ exhibited a persistent decrease in expression, warranting further investigation. These findings underscore the importance of continuous monitoring and analysis of both anatomical and performance metrics to optimize athlete performance and well-being over time. (RQ1).
Although we analyzed the range of recovery times and the total sum of team losses, the analysis does not offer a straightforward interpretation of the relationship between these variables. Future research should aim to incorporate additional data points, such as the severity of injuries, the number of players affected, the financial structure of each team, the particular sport in question, and any insurance compensation received.
Moreover, understanding the context of the losses—whether they pertain to missed games, decreased ticket sales, or other factors—is crucial for a more comprehensive analysis. Finally, longitudinal studies tracking these variables over multiple seasons could provide insights into whether the observed patterns are consistent and whether recovery times have a direct or indirect impact on the financial outcomes of sports teams. It would also be beneficial to include performance metrics post-recovery to evaluate the effectiveness of recovery time on player performance (RQ1–4).
The analysis revealed significant disparities in recovery times and financial impacts across different anatomical sub-areas. Although some injuries may require longer healing, others may have a greater economic impact regardless of recovery duration. Recovery time does not necessarily correlate directly with financial impact, highlighting the complex nature of sports injuries and their consequences. These data underscore the importance of targeted injury prevention and management strategies for different anatomical sub-areas. Financial implications of injuries extend beyond direct healthcare costs and can significantly affect a team’s finances. For a more comprehensive understanding, further research should incorporate additional variables, such as the frequency of injuries per anatomical area, the average contract value of injured players, and the timeline of injuries in relation to the sports season. Additionally, a more in-depth analysis might consider the role of player insurance policies and the effect of player absence on team performance and revenue streams (RQ2–4).
The analysis of the injury distribution across anatomical sub-areas reveals significant differences in the frequency of injuries associated with defensive and offensive actions, as well as other non-specified (Misc) activities. Certain anatomical areas, particularly the pelvic and fibular areas, are more susceptible to injury, emphasizing the need for focused prevention strategies. The large number of ‘Misc’ injuries suggests a diversity of incidents that occur outside of standard defensive or offensive plays, highlighting the multifaceted nature of sports injuries. The rating data are not directly interpretable without further context, but a high score might be indicative of more severe or costly injuries. To further this research, it would be beneficial to integrate these data with other datasets that provide additional context, such as the type of sport, level of play, condition of the player, and other potential risk factors. Additionally, qualitative data regarding the circumstances of each injury could provide insights into the causal factors and help tailor preventative measures more effectively (RQ3).
Throughout the three examined game series, the data consistently highlighted the upper arm–forearm area as the most injury prone, suggesting a critical focus for prevention and protection efforts. The persistently high risk associated with defensive plays calls for specialized training and potentially revised play strategies to mitigate injury risks. The evolving understanding of injury severity through more frequent ratings by the tenth game in a series provides valuable insights into the areas requiring the most immediate attention. Overall, the conclusions emphasize the necessity of continuous improvement in player safety measures, which should be informed by an ever-growing and precise body of performance analytics data (RQ1 and RQ3).
Sports injuries, beyond their immediate physical toll, cascade into realms of performance and economics in basketball. Recognizing these nuances can empower stakeholders—from medical staff to team management—to strategize injury management, optimize player welfare, and formulate robust team strategies.

Author Contributions

Conceptualization: V.S.; methodology: V.S. and G.P.; software: V.S. and G.P.; validation: C.T., V.S. and G.P.; formal analysis: V.S. and G.P.; investigation: V.S. and G.P.; resources: C.T., V.S. and G.P.; data curation: V.S. and G.P.; writing—original draft preparation: V.S.; writing—review and editing: C.T., V.S. and G.P.; visualization: V.S.; supervision: C.T; project administration: C.T. and V.S.; funding acquisition, (not applicable). All authors have read and agreed to the published version of the manuscript.

Funding

This research is co-financed by Greece and the European Union (European Social Fund-SF) through the Operational Programme “Human Resources Development, Education and Lifelong Learning 2014–2020” in the context of the project “Support for International Actions of the International Hellenic University”, (MIS 5154651).

Data Availability Statement

All data were contained in the main text.

Conflicts of Interest

The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. This manuscript complies with ethical standards.

Appendix A

Table A1 displays the categorization of basketball performance metrics into four main areas: rating, Misc, offensive, and defensive. Each area contains various basketball performance analytics for players and teams. In more detail, each performance category analyses:
  • Rating: Measures overall player efficiency and impact on the game. Advanced metrics that normalize performance to account for pace and playing time, such as NET_RATING_ADVANCED and PACE_PER40_ADVANCED, are included. Usage percentages such as USG_PCT highlight a player’s involvement in game plays.
  • Offensive: Quantifies scoring, playmaking, and efficiency of the offense. Traditional statistics such as FGM and FGA track shot-making, whereas advanced statistics such as OFF_RATING_ADVANCED measure offensive efficiency. Scoring percentages associated with specific play types, such as PCT_PTS_2PT_SCORING or PCT_PTS_3PT_SCORING, indicate where a player or team excels in scoring.
  • Defensive: Evaluates a player’s or team’s defensive effectiveness. DREB_PCT_ADVANCED could indicate a player’s ability to rebound on the defensive end. Steal-related statistics such as STL_TRADITIONAL and PCT_STL_USAGE measure defensive playmaking.
  • Miscellaneous: Captures diverse aspects of the game not strictly classified as offensive or defensive. PTS_FB_MISC and PTS_PAINT_MISC provide insight into how teams score in transition and in the paint. Player-tracking data, as indicated by metrics with “_PLAYER_TRACK”, offer a detailed look into player movements and actions. PFD_MISC and SAST_PLAYER_TRACK can indicate a player’s influence on the game beyond primary scoring and assists.
Table A1. Rating, Misc, Offensive and Defensive categorization of the Basketball Performance Analytics.
Table A1. Rating, Misc, Offensive and Defensive categorization of the Basketball Performance Analytics.
RatingMiscOffensiveDefensive
AST_PCT_ADVANCED: Percentage of team field goals a player assisted while on courtBLK_MISC: Miscellaneous block statistics not fitting traditional or advanced categoriesAST_TRADITIONAL: Traditional count of assists made by a player.DREB_PCT_ADVANCED: Advanced metric measuring the percentage of available defensive rebounds a player grabbed while on the court.
AST_RATIO_ADVANCED: Assists per 100 possessions used by a playerBLKA_MISC: Miscellaneous statistics for shots blocked by opponentsE_OFF_RATING_ADVANCED: Advanced metric evaluating a player or team’s offensive efficiency.DEF_RATING_ADVANCED: An advanced metric that estimates a player’s overall defensive impact per 100 possessions.
AST_TOV_ADVANCED: Ratio of a player’s assists to turnoversOPP_PTS_2ND_CHANCE_MISC: Points scored by opponents on second-chance opportunitiesFG_PCT_TRADITIONAL: Traditional field goal percentage, measuring overall shooting success.DREB_TRADITIONAL: Traditional count of defensive rebounds grabbed by a player.
E_NET_RATING_ADVANCED: Player’s net impact on team’s offensive and defensive efficiencyOPP_PTS_FB_MISC: Points scored by opponents on fast breaksFG3_PCT_TRADITIONAL: Traditional three-point field goal percentage.E_DEF_RATING_ADVANCED: Enhanced defensive rating, offering a more comprehensive view of a player’s defensive efficiency.
E_PACE_ADVANCED: Estimate of the pace at which a player plays (possessions per 48 min)OPP_PTS_OFF_TOV_MISC: Points scored by opponents off turnoversFG3A_TRADITIONAL: Traditional count of three-point field goal attempts by a player.BLK_TRADITIONAL: Traditional count of shots blocked by a player.
E_USG_PCT_ADVANCED: Usage rate, measuring the percentage of team plays involving a player while on courtOPP_PTS_PAINT_MISC: Points scored by opponents in the paintFG3M_TRADITIONAL: Traditional count of three-point field goals made.REB_PCT_ADVANCED: Advanced metric measuring the percentage of total rebounds (offensive and defensive) a player grabbed while on the court.
EFG_PCT_ADVANCED: Effective field goal percentage, accounting for 3-point field goalsPF_MISC: Personal fouls count in miscellaneous situationsFGA_TRADITIONAL: Traditional count of total field goal attempts.REB_TRADITIONAL: Traditional count of total rebounds (both offensive and defensive) grabbed by a player.
EFG_PCT_FOUR_FACTORS: Part of the ‘Four Factors’ of basketball success, measuring effective shooting efficiencyPFD_MISC: Count of personal fouls drawn in various scenariosFGM_TRADITIONAL: Traditional count of total field goals made.STL_TRADITIONAL: Traditional count of steals made by a player.
FTA_RATE_FOUR_FACTORS: Free-throw attempt rate in the context of the ‘Four Factors’PTS_2ND_CHANCE_MISC: Points scored on second-chance opportunitiesFT_PCT_TRADITIONAL: Traditional free-throw percentage.PCT_STL_USAGE: Percentage of a player’s steals relative to their overall on-court engagement and usage.
FTA_TRADITIONAL: Traditional count of a player’s free-throw attemptsPTS_FB_MISC: Points scored on fast breaksOFF_RATING_ADVANCED: Advanced metric for assessing a player’s or team’s offensive performance per 100 possessions.
FTM_TRADITIONAL: Traditional count of a player’s successful free throwsPTS_OFF_TOV_MISC: Points scored off turnoversOREB_PCT_ADVANCED: Advanced metric measuring the percentage of available offensive rebounds a player grabbed while on the court.
MIN_ADVANCED: Minutes played, an advanced metric considering various factorsPTS_PAINT_MISC: Points scored in the paintOREB_PCT_FOUR_FACTORS: Offensive rebound percentage as part of the ‘Four Factors’ in basketball analysis.
NET_RATING_ADVANCED: Team’s point differential per 100 possessions while the player is on the courtAST_PLAYER_TRACK: Assists tracked in specific player tracking scenariosOREB_TRADITIONAL: Traditional count of offensive rebounds grabbed by a player.
OPP_EFG_PCT_FOUR_FACTORS: Opponent’s effective field goal percentage, a defensive metricCFG_PCT_PLAYER_TRACK: Player’s catch-and-shoot field goal percentage in player trackingPCT_AST_2PM_SCORING: Percentage of two-point field goals made that were assisted.
OPP_FTA_RATE_FOUR_FACTORS: Opponent’s free-throw attempt rate, indicating defensive efficiencyCFGA_PLAYER_TRACK: Player’s catch-and-shoot field goal attempts in player trackingPCT_AST_3PM_SCORING: Percentage of three-point field goals made that were assisted.
OPP_OREB_PCT_FOUR_FACTORS: Opponent’s offensive rebound percentage, a defensive metricCFGM_PLAYER_TRACK: Player’s catch-and-shoot field goals made in player trackingPCT_AST_FGM_SCORING: Overall percentage of field goals made that were assisted.
OPP_TOV_PCT_FOUR_FACTORS: Opponent’s turnover percentage, a part of defensive metricsDFG_PCT_PLAYER_TRACK: Player’s defense against field goal percentage in player trackingPCT_PTS_2PT_MR_SCORING: Percentage of points scored from mid-range two-point shots.
PACE_ADVANCED: The pace factor, estimating the number of possessions per 48 minDFGA_PLAYER_TRACK: Player’s defense against field goal attempts in player trackingPCT_PTS_2PT_SCORING: Percentage of points scored from all two-point field goals.
PACE_PER40_ADVANCED: Similar to pace factor but calculated per 40 minDFGM_PLAYER_TRACK: Player’s defense against field goals made in player trackingPCT_PTS_3PT_SCORING: Percentage of points scored from three-point field goals.
PCT_AST_USAGE: Percentage of team’s assists for which a player accounts while playingFTAST_PLAYER_TRACK: Free-throw assists tracked in player tracking scenariosPCT_PTS_PAINT_SCORING: Percentage of points scored in the paint.
PCT_BLK_USAGE: Percentage of team’s blocks for which a player accounts while playingORBC_PLAYER_TRACK: Offensive rebounds captured in player trackingPCT_UAST_2PM_SCORING: Percentage of two-point field goals made without an assist.
PCT_BLKA_USAGE: Percentage of a player’s shots that are blocked by opponentsPASS_PLAYER_TRACK: Passes made tracked in player tracking scenariosPCT_UAST_3PM_SCORING: Percentage of three-point field goals made without an assist.
PCT_DREB_USAGE: Percentage of available defensive rebounds a player getsRBC_PLAYER_TRACK: Rebounds captured in player trackingPCT_UAST_FGM_SCORING: Overall percentage of field goals made without an assist.
PCT_FG3A_USAGE: Percentage of a team’s three-point attempts taken by a playerSAST_PLAYER_TRACK: Secondary assists tracked in player-tracking scenariosPCT_FGA_2PT_SCORING: Percentage of total field goal attempts that are two-point shots.
PCT_FG3M_USAGE: Percentage of a team’s three-point makes attributed to a playerSPD_PLAYER_TRACK: Speed of the player during play, tracked in player-tracking scenariosPCT_FGA_3PT_SCORING: Percentage of total field goal attempts that are three-point shots.
PCT_FGA_USAGE: Percentage of team’s field goal attempts taken by a playerTCHS_PLAYER_TRACK: Touches of the ball by the player tracked in player trackingPTS_TRADITIONAL: Traditional count of total points scored by a player.
PCT_FGM_USAGE: Percentage of team’s field goals made by a playerUFG_PCT_PLAYER_TRACK: Player’s unguarded field goal percentage in player trackingPCT_OREB_USAGE: Percentage of team’s offensive rebounds a player accounts for while on the court.
PCT_FTA_USAGE: Percentage of team’s free-throw attempts taken by a playerUFGA_PLAYER_TRACK: Player’s unguarded field goal attempts in player trackingPCT_PTS_USAGE: Percentage of team’s points for which a player accounts while on the court.
PCT_FTM_USAGE: Percentage of team’s free throws made by a playerUFGM_PLAYER_TRACK: Player’s unguarded field goals made in player trackingPCT_PTS_FB_SCORING: Percentage of points scored from fast breaks.
PCT_REB_USAGE: Percentage of team’s total rebounds grabbed by a playerPCT_TOV_USAGE: Percentage of a player’s turnovers relative to their usage ratePCT_PTS_FT_SCORING: Percentage of points scored from free throws.
PIE_ADVANCED: Player Impact Estimate, a measure of a player’s overall statistical contributionDIST_PLAYER_TRACK: Distance covered by the player during play, tracked in player trackingPCT_PTS_OFF_TOV_SCORING: Percentage of points scored off turnovers.
PLUS_MINUS_TRADITIONAL: The point differential when the player is on and off the courtDRBC_PLAYER_TRACK: Defensive rebounds captured in player tracking
POSS_ADVANCED: The number of possessions a player is involved inFG_PCT_PLAYER_TRACK: Player’s overall field goal percentage in player tracking
TM_TOV_PCT_ADVANCED: Team’s turnover percentage while a player is on the courtPF_TRADITIONAL: Traditional count of personal fouls committed by a player
TM_TOV_PCT_FOUR_FACTORS: Team’s turnover percentage as a part of the ‘Four Factors’TO_TRADITIONAL: Traditional count of turnovers committed by a player
TS_PCT_ADVANCED: True shooting percentage, measuring shooting efficiency (including free throws)PCT_PF_USAGE: Percentage of a player’s personal fouls relative to their usage rate
USG_PCT_ADVANCED: Usage percentage, indicating the proportion of team plays used by a playerPCT_PFD_USAGE: Percentage of a player’s personal fouls drawn relative to their usage rate
USG_PCT_USAGE: Similar to usage percentage, a measure of how involved a player is in team plays

References

  1. Sarlis, V.; Tjortjis, C. Sports analytics—Evaluation of basketball players and team performance. Inf. Syst. 2020, 93, 101562. [Google Scholar] [CrossRef]
  2. Cohan, A.; Schuster, J.; Fernandez, J. A deep learning approach to injury forecasting in NBA basketball. J. Sports Anal. 2021, 7, 277–289. [Google Scholar] [CrossRef]
  3. Malamatinos, M.-C.; Vrochidou, E.; Papakostas, G.A. On Predicting Soccer Outcomes in the Greek League Using Machine Learning. Computers 2022, 11, 133. [Google Scholar] [CrossRef]
  4. Cortez, A.; Trigo, A.; Loureiro, N. Football Match Line-Up Prediction Based on Physiological Variables: A Machine Learning Approach. Computers 2022, 11, 40. [Google Scholar] [CrossRef]
  5. Bullock, G.S.; Ferguson, T.; Arundale, A.H.; Martin, C.L.; Collins, G.S.; Kluzek, S. Return to performance following severe ankle, knee, and hip injuries in National Basketball Association players. PNAS Nexus 2022, 1, pgac176. [Google Scholar] [CrossRef] [PubMed]
  6. Cole, B.; Arundale, A.J.H.; Bytomski, J.; Amendola, A. Basketball Sports Medicine and Science; Springer: Berlin/Heidelberg, Germany, 2020. [Google Scholar] [CrossRef]
  7. Mateus, N.; Esteves, P.; Gonçalves, B.; Torres, I.; Gomez, M.A.; Arede, J.; Leite, N. Clustering performance in the European Basketball according to players’ characteristics and contextual variables. Int. J. Sport. Sci. Coach. 2020, 15, 405–411. [Google Scholar] [CrossRef]
  8. Nakase, J.; Kitaoka, K.; Shima, Y.; Oshima, T.; Sakurai, G.; Tsuchiya, H. Risk factors for noncontact anterior cruciate ligament injury in female high school basketball and handball players: A prospective 3-year cohort study. Asia-Pacific J. Sport. Med. Arthrosc. Rehabil. Technol. 2020, 22, 34–38. [Google Scholar] [CrossRef]
  9. Jauhiainen, S.; Kauppi, J.-P.; Leppänen, M.; Pasanen, K.; Parkkari, J.; Vasankari, T.; Kannus, P.; Äyrämö, S. New Machine Learning Approach for Detection of Injury Risk Factors in Young Team Sport Athletes. Int. J. Sports Med. 2021, 42, 175–182. [Google Scholar] [CrossRef]
  10. Kaplan, S. The Economic Value of Popularity: Evidence from Superstars in the National Basketball Association. SSRN Electron. J. 2020, 50. [Google Scholar] [CrossRef]
  11. Sarlis, V.; Chatziilias, V.; Tjortjis, C.; Mandalidis, D. A Data Science approach analysing the Impact of Injuries on Basketball Player and Team Performance. Inf. Syst. 2021, 99, 101750. [Google Scholar] [CrossRef]
  12. Sikka, R.S.; Baer, M.; Raja, A.; Stuart, M.; Tompkins, M. Analytics in sports medicine: Implications and responsibilities that accompany the era of big data. J. Bone Jt. Surg. Am. 2019, 101, 276–283. [Google Scholar] [CrossRef] [PubMed]
  13. Marks, D.L.; Vinegoni, C.; Bredfeldt, J.S.; Boppart, S.A. Methods, systems and software programs for enhanced sports analytics and applications. Appl. Phys. Lett. 2004, 85, 5787–5789. [Google Scholar] [CrossRef]
  14. McKeag, D.B. Handbook of Sports Medicine and Science; CRC Press: Boca Raton, FL, USA, 2020. [Google Scholar] [CrossRef]
  15. Krosshaug, T.; Nakamae, A.; Boden, B.P.; Engebretsen, L.; Smith, G.; Slauterbeck, J.R.; Hewett, T.E.; Bahr, R. Mechanisms of anterior cruciate ligament injury in basketball: Video analysis of 39 cases. Am. J. Sports Med. 2007, 35, 359–367. [Google Scholar] [CrossRef] [PubMed]
  16. Li, B.; Xu, X. Application of Artificial Intelligence in Basketball Sport. J. Educ. Health Sport 2021, 11, 54–67. [Google Scholar] [CrossRef]
  17. Terner, Z.; Franks, A. Modeling player and team performance in basketball. Annu. Rev. Stat. Its Appl. 2021, 8, 1–23. [Google Scholar] [CrossRef]
  18. Torres-Ronda, L.; Gámez, I.; Robertson, S.; Fernández, J. Epidemiology and injury trends in the National Basketball Association: Pre- and perCOVID-19 (2017–2021). PLoS ONE 2022, 17, e0263354. [Google Scholar] [CrossRef] [PubMed]
  19. NVaudreuil, N.J.; van Eck, C.F.; Lombardo, S.J.; Kharrazi, F.D. Economic and Performance Impact of Anterior Cruciate Ligament Injury in National Basketball Association Players. Orthop. J. Sports Med. 2021, 9, 1–6. [Google Scholar] [CrossRef]
  20. Franco, D. Sports Performance Measurement and Analytics: The Science of Assessing Performance, Predicting Future Outcomes, Interpreting Statistical Models, and Evaluating the Market Value of Athletes. Pearson Education LTD. Ric. di Mat. 2007, 56, 285–296. [Google Scholar] [CrossRef]
  21. Nagarajan, R.; Li, L. Optimizing NBA player selection strategies based on salary and statistics analysis. In Proceedings of the 2017 IEEE 15th Intl Conf on Dependable, Autonomic and Secure Computing, 15th Intl Conf on Pervasive Intelligence and Computing, 3rd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech), Orlando, FL, USA, 6–10 November 2017; pp. 1076–1083. [Google Scholar] [CrossRef]
  22. Louw, Q.; Grimmer, K.; Vaughan, C. Knee movement patterns of injured and uninjured adolescent basketball players when landing from a jump: A case-control study. BMC Musculoskelet. Disord. 2006, 7, 22. [Google Scholar] [CrossRef]
  23. Neilson, V.; Ward, S.; Hume, P.; Lewis, G.; McDaid, A. Effects of augmented feedback on training jump landing tasks for ACL injury prevention: A systematic review and meta-analysis. Phys. Ther. Sport 2019, 39, 126–135. [Google Scholar] [CrossRef]
  24. Afara, I.O.; Sarin, J.K.; Ojanen, S.; Finnilä, M.A.J.; Herzog, W.; Saarakkala, S.; Korhonen, R.K.; Töyräs, J. Machine Learning Classification of Articular Cartilage Integrity Using Near Infrared Spectroscopy. Cell. Mol. Bioeng. 2020, 13, 219–228. [Google Scholar] [CrossRef] [PubMed]
  25. Aljunid, M.F.; Manjaiah, D.H. Data Management, Analytics and Innovation; Springer: Berlin/Heidelberg, Germany, 2019; Volume 808. [Google Scholar] [CrossRef]
  26. McClusky, M. Faster, Higher, Stronger: How Sports Science Is Creating a New Generation of Superathletes and What We Can Learn from Them; Cambridge University Press: Cambridge, UK, 2014; Available online: https://www.cambridge.org/core/product/identifier/CBO9781107415324A009/type/book_part (accessed on 15 February 2023).
  27. WNBA Basketball Reference. NBA Basketball Reference. 2021. Available online: https://www.basketball-reference.com/ (accessed on 20 August 2023).
  28. ESPN Enterprises Inc. Website—NBA Stats. ESPN NBA Stats. 2023. Available online: https://www.espn.com/nba/stats (accessed on 1 September 2023).
  29. NBA.com Website. NBA.com. 2023. Available online: https://stats.nba.com (accessed on 1 September 2023).
  30. Jansen, B.J.; Moore, K.; Carman, S. Evaluating the performance of demographic targeting using gender in sponsored search. Inf. Process. Manag. 2013, 49, 286–302. [Google Scholar] [CrossRef]
  31. Oliver, D. Basketball on Paper: Rules and Tools for Performance Analysis; University of Nebraska Press: Lincoln, NE, USA, 2004. [Google Scholar]
  32. Groll, A.; Liebl, D. Editorial special issue: Statistics in sports. AStA Adv. Stat. Anal. 2022, 107, 0123456789. [Google Scholar] [CrossRef] [PubMed]
  33. Investopedia. U.S. Inflation Rate by Year: 1929–2023. Available online: https://www.investopedia.com/inflation-rate-by-year-7253832 (accessed on 1 September 2023).
  34. CoinNews Media Group Company. US Inflation Calculator. Available online: https://www.usinflationcalculator.com/inflation/current-inflation-rates/ (accessed on 1 September 2023).
  35. Donoghue, O.A.; Harrison, A.J.; Coffey, N.; Hayes, K. Functional data analysis of running kinematics in Chronic Achilles tendon injury. Med. Sci. Sports Exerc. 2008, 40, 1323–1335. [Google Scholar] [CrossRef]
  36. Thornton, H.R.; Delaney, J.A.; Duthie, G.M.; Dascombe, B.J. Developing athlete monitoring systems in team sports: Data analysis and visualization. Int. J. Sports Physiol. Perform. 2019, 14, 698–705. [Google Scholar] [CrossRef]
  37. Christmann, J.; Akamphuber, M.; Müllenbach, A.L.; Güllich, A. Crunch time in the NBA—The effectiveness of different play types in the endgame of close matches in professional basketball. Int. J. Sports Sci. Coach. 2018, 13, 1090–1099. [Google Scholar] [CrossRef]
Figure 1. Structure and flow of the methodological approach.
Figure 1. Structure and flow of the methodological approach.
Computers 12 00261 g001
Figure 2. Comparison of two-, five-, and ten-game series in median % change of anatomical sub-areas.
Figure 2. Comparison of two-, five-, and ten-game series in median % change of anatomical sub-areas.
Computers 12 00261 g002
Figure 3. Comparison of two-, five-, and ten-game series for median % change of performance metrics.
Figure 3. Comparison of two-, five-, and ten-game series for median % change of performance metrics.
Computers 12 00261 g003
Table 1. Datasets based on player performance metrics (regular and playoff season) and injury and salaries analytics for the period from 2000 to 2023.
Table 1. Datasets based on player performance metrics (regular and playoff season) and injury and salaries analytics for the period from 2000 to 2023.
NameType# Records# Features
Player performance statisticsRegular733193132
Player performance statisticsPlayoffs48213132
Injury dataOn and off game581514
Salaries dataPer season153654
Table 2. Two Games before/after the injury.
Table 2. Two Games before/after the injury.
Anatomical Sub-AreasAvg. p-ValueMedian p-ValueAvg. t-StatisticMedian t-StatisticAvg. Cohen’s DMedian Cohen’s DAverage of Avg. Percentage ChangeMedian Avg. Percentage Change
Ankle1.03 × 10−281.11 × 10−6818.7719.18−0.40−0.41−18.73−21.97
Knee2.78 × 10−391.33 × 10−4516.2914.39−0.32−0.28−14.00−15.47
Thigh7.85 × 10−204.45 × 10−3112.4512.41−0.38−0.40−15.60−17.80
Abdominal8.68 × 10−61.82 × 10−2810.8011.68−0.54−0.58−28.09−29.06
Foot3.67 × 10−174.80 × 10−2511.1210.62−0.41−0.39−23.16−24.21
Thoracolumbar5.98 × 10−149.26 × 10−2110.539.48−0.30−0.28−11.32−10.69
Hand–Thumb–Fingers5.73 × 10−62.67 × 10−188.559.03−0.38−0.40−14.90−16.58
Hip8.72 × 10−112.19 × 10−178.308.75−0.37−0.40−13.88−16.43
Shoulder2.25 × 10−87.90 × 10−168.498.78−0.37−0.42−17.46−16.97
Calf2.17 × 10−85.48 ×10−137.567.41−0.36−0.40−16.73−18.81
Wrist5.63 × 10−54.93 × 10−86.065.58−0.34−0.26−9.96−22.79
Heel2.97 × 10−41.78 × 10−75.695.32−0.32−0.25−14.11−19.22
Elbow2.64 × 10−31.07 × 10−52.974.51−0.23−0.311.73−13.62
Chest1.96 × 10−21.47 × 10−2−0.23−2.100.000.2217.2122.96
Pelvic3.33 × 10−22.17 × 10−21.052.15−0.16−0.32−1.18−23.34
Fibular3.38 × 10−22.45 × 10−21.632.31−0.28−0.39−5.57−15.27
Shin3.27 × 10−23.04 × 10−21.102.16−0.17−0.30−7.61−22.61
Upper arm–Forearm5.14 × 10−25.01 × 10−2−1.74−2.050.350.431.0940.49
Table 3. Five games before/after the injury.
Table 3. Five games before/after the injury.
Anatomical Sub-AreasAvg. p-ValueMedian p-ValueAvg. t-StatisticMedian t-StatisticAvg. Cohen’s DMedian Cohen’s DAverage of Avg. % ChangeMedian Avg. % Change
Ankle1.78 × 10−307.46 × 10−7418.0418.78−0.35−0.35−4.88−7.39
Knee6.39 × 10−336.94 × 10−5216.2015.43−0.28−0.26−1.56−4.12
Thigh4.32 × 10−229.77 × 10−3512.7312.70−0.35−0.34−9.91−12.09
Thoracolumbar3.68 × 10−192.37 × 10−2511.6310.93−0.30−0.28−3.27−2.16
Foot5.60 × 10−153.04 × 10−2410.5810.42−0.35−0.34−3.33−6.83
Abdominal6.03 × 10−92.79 × 10−209.489.59−0.42−0.43−8.54−10.39
Shoulder3.15 × 10−91.49 × 10−198.779.36−0.36−0.36−4.78−5.79
Hip2.30 × 10−104.40 × 10−178.378.66−0.35−0.35−3.70−4.24
Calf8.50 × 10−115.00 × 10−147.707.76−0.34−0.34−7.96−9.26
Hand–Thumb–Fingers1.33 × 10−78.27 × 10−138.167.30−0.35−0.29−5.57−5.41
Wrist2.60 × 10−68.93 × 10−127.017.07−0.42−0.43−10.10−9.99
Heel6.18 × 10−63.00 × 10−106.416.48−0.35−0.35−7.05−8.92
Toes5.40 × 10−56.23 × 10−96.036.01−0.35−0.33−4.87−2.27
Elbow1.68 × 10−41.51 × 10−53.874.43−0.28−0.34−3.50−8.18
Chest2.29 × 10−24.40 × 10−32.952.92−0.29−0.33−3.32−5.24
Shin2.29 × 10−26.91 × 10−31.342.44−0.18−0.274.89−1.10
Pelvic3.07 × 10−22.10 × 10−21.892.42−0.26−0.29−7.34−16.70
Upper arm–Forearm4.53 × 10−24.15 × 10−20.861.95−0.10−0.28−10.80−4.25
Fibular4.47 × 10−24.54 × 10−21.692.08−0.28−0.350.904.41
Table 4. Ten games before/after the injury.
Table 4. Ten games before/after the injury.
Anatomical Sub-AreasAvg. p-ValueMedian p-ValueAvg. t-StatisticMedian t-StatisticAvg. Cohen’s DMedian Cohen’s DAverage of Avg. % ChangeMedian Avg. % Change
Ankle2.05 × 10−412.65 × 10−5716.1916.34−0.28−0.26−0.76−3.83
Knee8.79 × 10−422.35 × 10−4515.9114.37−0.26−0.223.173.43
Thigh1.02 × 10−262.86 × 10−2911.9411.54−0.31−0.32−2.42−3.81
Thoracolumbar2.35 × 10−267.47 × 10−2912.0111.39−0.27−0.273.052.45
Foot2.34 × 10−156.41 × 10−2410.4910.36−0.31−0.311.18−1.29
Abdominal7.17 × 10−96.73 × 10−188.839.02−0.35−0.36−3.63−5.08
Shoulder5.64 × 10−145.82 × 10−147.847.69−0.29−0.270.57−1.47
Hip3.97 × 10−111.52 × 10−137.587.56−0.29−0.275.201.55
Calf3.12 × 10−101.17 × 10−117.196.95−0.29−0.250.570.54
Heel2.07 × 10−78.96 × 10−116.856.69−0.33−0.33−5.43−5.25
Hand–Thumb–Fingers1.10 × 10−83.50 × 10−107.116.67−0.28−0.261.490.68
Wrist3.40 × 10−51.43 × 10−95.676.23−0.33−0.351.49−2.68
Toes1.33 × 10−54.26 × 10−85.725.76−0.30−0.27−4.02−5.09
Elbow2.19 × 10−32.15 × 10−64.044.87−0.26−0.28−1.98−5.40
Chest1.87 × 10−21.02 × 10−33.163.49−0.32−0.31−49.15−6.78
Pelvic2.30 × 10−21.39 × 10−22.362.60−0.26−0.25−8.06−10.80
Fibular3.22 × 10−22.22 × 10−22.372.42−0.34−0.36−5.66−6.96
Shin3.13 × 10−23.25 × 10−22.222.18−0.24−0.234.532.28
Upper arm–Forearm4.17 × 10−24.12 × 10−20.871.98−0.11−0.277.946.24
Table 5. Two games before/after the injury—Basketball Performance Analytics.
Table 5. Two games before/after the injury—Basketball Performance Analytics.
Performance MetricMedian p-ValueAvg. p-ValueAvg. t-StatisticMedian t-StatisticMedian Cohen’s DAvg. Cohen’s DMedian Avg. % ChangeAverage of Avg. % Change
POSS_ADVANCED4.24 × 10−324.24 ×10−3212.5512.55−0.53−0.532.402.40
FG_PCT_PLAYER_TRACK1.51 × 10−141.51 × 10−148.058.05−0.54−0.54−15.97−15.97
OFF_RATING_ADVANCED2.32 × 10−73.00 × 10−47.136.82−0.61−0.62−15.85−15.51
DEF_RATING_ADVANCED7.02 × 10−106.34 × 10−47.017.09−0.58−0.59−14.09−15.22
E_DEF_RATING_ADVANCED1.04 × 10−21.04 × 10−22.792.79−0.66−0.66−5.49−5.49
OPP_FTA_RATE_FOUR_FACTORS3.56 × 10−23.56 × 10−2−2.24−2.240.550.5527.0027.00
PCT_PF_USAGE4.10 × 10−24.10 × 10−2−2.18−2.180.610.6161.3261.32
E_NET_RATING_ADVANCED4.15 × 10−24.15 × 10−2−2.16−2.160.600.60−183.09−183.09
OPP_TOV_PCT_FOUR_FACTORS4.95 × 10−24.95 ×10−2−2.07−2.070.650.6527.7927.79
Table 6. Five Games before/after the injury—Basketball Performance Analytics.
Table 6. Five Games before/after the injury—Basketball Performance Analytics.
Performance MetricMedian p-ValueAvg. p-ValueAvg. t-StatisticMedian t-StatisticMedian Cohen’s DAvg. Cohen’s DMedian Avg. % ChangeAverage of Avg. % Change
POSS_ADVANCED4.24 × 10−324.24 × 10−3212.5512.55−0.53−0.532.402.40
FG_PCT_PLAYER_TRACK1.51 × 10−141.51 × 10−148.058.05−0.54−0.54−15.97−15.97
OFF_RATING_ADVANCED2.32 × 10−73.00 × 10−47.136.82−0.61−0.62−15.85−15.51
DEF_RATING_ADVANCED7.02 × 10−106.34 ×10−47.017.09−0.58−0.59−14.09−15.22
E_DEF_RATING_ADVANCED1.04 × 10−21.04 × 10−22.792.79−0.66−0.66−5.49−5.49
OPP_FTA_RATE_FOUR_FACTORS3.56 × 10−23.56 × 10−2−2.24−2.240.550.5527.0027.00
PCT_PF_USAGE4.10 × 10−24.10 × 10−2−2.18−2.180.610.6161.3261.32
E_NET_RATING_ADVANCED4.15 × 10−24.15 × 10−2−2.16−2.160.600.60−183.09−183.09
OPP_TOV_PCT_FOUR_FACTORS4.95 × 10−24.95 × 10−2−2.07−2.070.650.6527.7927.79
Table 7. Ten games before/after the injury—Basketball Performance Analytics.
Table 7. Ten games before/after the injury—Basketball Performance Analytics.
Performance MetricMedian p-ValueAvg. p-ValueAvg. t-StatisticMedian t-StatisticMedian Cohen’s DAvg. Cohen’s DMedian Avg. % ChangeAverage of Avg. % Change
OPP_PTS_FB_MISC0.00050.00053.9653.965−0.650−0.650−11.830−11.830
DEF_RATING_ADVANCED0.00370.00345.6403.111−0.603−0.647−11.010−11.610
OFF_RATING_ADVANCED0.00510.00512.9902.990−0.538−0.538−10.590−10.590
E_DEF_RATING_ADVANCED0.00510.00513.0923.092−0.706−0.706−4.710−4.710
POSS_ADVANCED0.00910.00912.8532.853−0.543−0.543−11.230−11.230
OPP_EFG_PCT_FOUR_FACTORS0.02470.02472.4042.404−0.526−0.526−4.090−4.090
OPP_TOV_PCT_FOUR_FACTORS0.02640.0264−2.372−2.3720.6970.69717.44017.440
E_NET_RATING_ADVANCED0.04460.0446−2.125−2.1250.6160.61644.24044.240
Table 8. Team recovery time correlated with the sum of losses in the period 2000 to 2023.
Table 8. Team recovery time correlated with the sum of losses in the period 2000 to 2023.
TeamsAverage Recovery Time (2000–2023)Sum of Losses (2000–2023)
GSW38.33USD 980,043,613.6
DEN33.96USD 898,608,370.5
WAS42.38USD 896,393,189.0
CLE38.91USD 884,262,820.6
NYK43.09USD 882,353,294.8
NOH–NOK –NOP 147.13USD 850,911,070.3
HOU35.51USD 831,909,820.5
LAL43.89USD 829,088,874.5
MEM–VAN 237.27USD 818,408,844.2
MIL33.00USD 803,077,829.5
BKN–NJN 335.94USD 787,297,893.7
POR42.67USD 725,597,473.1
TOR37.21USD 719,025,631.9
DAL31.81USD 717,319,559.4
IND28.26USD 707,021,308.5
MIA27.46USD 696,390,943.7
MIN39.37USD 687,081,943.9
CHA–CHH 440.73USD 672,569,410.7
LAC32.64USD 669,659,380.7
ORL34.44USD 650,059,655.1
PHI29.59USD 633,630,120.4
ATL35.52USD 629,062,649.4
CHI41.94USD 620,958,846.2
SAC37.62USD 593,901,503.1
PHX37.92USD 553,901,960.3
UTA32.62USD 552,960,349.6
DET33.35USD 494,764,653.5
SAS21.36USD 484,022,964.1
BOS37.31USD 470,742,686.1
OKC and SEA29.40USD 467,801,724.6
Grand Total35.98USD 21,208,828,385.5
1 New Orleans Pelicans (previously the New Orleans Hornets, and before that, the Charlotte Hornets); 2 Memphis Grizzlies (The team started in Vancouver and moved to Memphis in 2001); 3 Brooklyn Nets (previously known as the New Jersey Nets until 2012); 4 Charlotte Hornets (The team was previously known as the Charlotte Bobcats).
Table 9. Anatomical sub-area relationships with average recovery time and economic team losses.
Table 9. Anatomical sub-area relationships with average recovery time and economic team losses.
Anatomical Sub-AreasAvg. Recovery TimeSum of Team Losses
Knee44.47USD 4,223,672,393.1
Unclassified30.32USD 3,923,783,660.9
Ankle32.67USD 2,509,238,498.5
Thigh33.75USD 1,544,221,395.7
Thoracolumbar30.02USD 1,345,058,412.9
Foot43.91USD 1,216,344,145.1
Hand–Thumb–Fingers51.29USD 1,025,316,589.5
Heel45.94USD 718,869,461.3
Shoulder47.04USD 691,206,599.1
Abdominal35.21USD 630,757,817.8
Calf36.75USD 595,174,649.9
Hip28.63USD 480,523,199.1
Wrist46.31USD 412,959,249.0
Cranial32.92USD 269,506,780.6
Toes40.45USD 225,354,084.5
Elbow48.64USD 220,358,271.9
Rest11.59USD 190,999,755.3
Other facial areas64.23USD 143,326,062.1
Neck20.94USD 138,930,281.6
Shin44.00USD 115,452,588.8
Digestive13.02USD 107,701,146.1
Mouth25.98USD 90,712,536.9
Eye36.47USD 79,136,024.3
Fibular75.04USD 64,233,781.7
Nose21.17USD 54,995,090.5
Pelvic30.78USD 53,377,027.9
Chest16.39USD 42,115,533.9
COVID-19-related76.44USD 32,549,095.0
Upper arm–Forearm43.45USD 32,083,939.4
Respiratory19.80USD 30,870,312.7
USD 21,208,828,385.1
Table 10. Anatomical sub-area injuries compared between performance analytics categories (defensive, miscellaneous, offensive, and rating) based on test significance and effect size for the two-game series before and after injury.
Table 10. Anatomical sub-area injuries compared between performance analytics categories (defensive, miscellaneous, offensive, and rating) based on test significance and effect size for the two-game series before and after injury.
Anatomical Sub-Areas (2d)RatingMiscOffensiveDefensiveGrand Total
Abdominal41128
Ankle 112
Calf 112
Chest 112
COVID-19-related42129
Cranial 112
Elbow 11
Eye 11
Fibular 314
Foot 11
Hand–Thumb–Fingers 112
Heel1 12
Hip 112
Mouth2 2
Pelvic431311
Shin 112
Shoulder 112
Thigh 112
Toes 11
Upper arm–Forearm11 13
Wrist41117
Grand Total208162468
Table 11. Anatomical sub-area injuries compared between performance analytics categories (defensive, miscellaneous, offensive, and rating) based on test significance and effect size for the five-game series before and after injury.
Table 11. Anatomical sub-area injuries compared between performance analytics categories (defensive, miscellaneous, offensive, and rating) based on test significance and effect size for the five-game series before and after injury.
Anatomical Sub-Areas (5d)RatingMiscOffensiveDefensiveGrand Total
Abdominal1 113
Chest 112
COVID-19-related11 2
Elbow 11
Hand–Thumb–Fingers 11
Heel 11
Other facial areas 11
Pelvic 112
Toes 11
Upper arm–Forearm3 25
Wrist 1113
Grand Total5241122
Table 12. Anatomical sub-area injuries compared between performance analytics categorization (defensive, miscellaneous, offensive, and rating) based on test significance and effect size for the ten-game series before and after injury.
Table 12. Anatomical sub-area injuries compared between performance analytics categorization (defensive, miscellaneous, offensive, and rating) based on test significance and effect size for the ten-game series before and after injury.
Anatomical Sub-Areas (10d)RatingOffensiveMiscDefensiveGrand Total
Upper arm–Forearm4 26
Fibular1 1 2
Abdominal 11
Pelvic 1 12
Grand Total511411
Table 13. Comparison matrix of Anatomical sub-areas for all the game series (two, five, and ten).
Table 13. Comparison matrix of Anatomical sub-areas for all the game series (two, five, and ten).
Metrics-Dimensions2 Games5 Games10 Games
Areas with Significant Impact on ConcernAbdominal, Foot, PelvicPelvic, Thigh, AbdominalPelvic, Chest, Ankle
% ChangeMost Impacted: Abdominal area (−29.06%);
Least Impacted: Upper Forearm area (40.49%)
Most Impacted: Pelvic area (−16.70%);
Least Impacted: Fibular area (4.41%)
Most Impacted: Pelvic area (−10.80%);
Least Impacted: Upper arm–Forearm area (6.24%)
Table 14. Comparison matrix of the performance metrics for all the game series (2, 5 and 10).
Table 14. Comparison matrix of the performance metrics for all the game series (2, 5 and 10).
Metric/Dimension2 Games5 Games10 Games
Significant ImpactAll metrics have p-values below 0.05All metrics have p-values below 0.05All metrics have p-values below 0.05
Cohen’s DPCT_TOV_USAGE, PLUS_MINUS_TRADITIONAL, PCT_BLK_USAGE, BLK_TRADITIONAL, BLK_MISC, PCT_PTS_FT_SCORING, OPP_TOV_PCT_FOUR_FACTORS, OREB_PCT_FOUR_FACTORSOPP_FTA_RATE_FOUR_FACTORS, PCT_PF_USAGE, E_NET_RATING_ADVANCED, OPP_TOV_PCT_FOUR_FACTORSOPP_TOV_PCT_FOUR_FACTORS, E_NET_RATING_ADVANCED
% ChangeMost impacted: +65.15% in BLK_TRADITIONAL;
Least impacted: −71.23% in DFGM_PLAYER_TRACK
Most impacted: +61.32% in PCT_PF_USAGE;
Least impacted: −183.09% in E_NET_RATING_ADVANCED
Most impacted: +44.24% in E_NET_RATING_ADVANCED;
Least impacted: −11.83% in OPP_PTS_FB_MISC
Areas of ConcernDEF_RATING_ADVANCED with the highest average % declineDEF_RATING_ADVANCED showing a continued declineMultiple metrics with both positive and negative changes; care needed in interpretation
Table 15. Comparison of injuries in the different game series.
Table 15. Comparison of injuries in the different game series.
AspectTwo-Game SeriesFive-Game SeriesTen-Game Series
Injury Prone AreasPelvic and wrist areas had notable injury incidences.Shift in high incidence of injuries to the upper arm–forearm area.Upper arm–forearm area
remained most prone to injuries.
Play Type RisksDefensive plays associated with the highest number of injuries.Defensive injuries continued to be significant, reflective of the physical demands of the sport.Defensive play injuries persisted, emphasizing the need for focused preventative strategies.
Injury RatingsSome injuries rated, but without a high severity in any specific area.Ratings not extensively reported, suggesting a need for more detailed injury impact assessments.Ratings more prevalent, especially in the upper arm–forearm area, indicating higher risk.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sarlis, V.; Papageorgiou, G.; Tjortjis, C. Sports Analytics and Text Mining NBA Data to Assess Recovery from Injuries and Their Economic Impact. Computers 2023, 12, 261. https://doi.org/10.3390/computers12120261

AMA Style

Sarlis V, Papageorgiou G, Tjortjis C. Sports Analytics and Text Mining NBA Data to Assess Recovery from Injuries and Their Economic Impact. Computers. 2023; 12(12):261. https://doi.org/10.3390/computers12120261

Chicago/Turabian Style

Sarlis, Vangelis, George Papageorgiou, and Christos Tjortjis. 2023. "Sports Analytics and Text Mining NBA Data to Assess Recovery from Injuries and Their Economic Impact" Computers 12, no. 12: 261. https://doi.org/10.3390/computers12120261

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop