Next Issue
Volume 4, June
Previous Issue
Volume 3, December
 
 

Analytics, Volume 4, Issue 1 (March 2025) – 10 articles

  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list.
  • You may sign up for e-mail alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.
Order results
Result details
Select all
Export citation of selected articles as:
32 pages, 3163 KiB  
Article
Unveiling the Impact of Socioeconomic and Demographic Factors on Graduate Salaries: A Machine Learning Explanatory Analytical Approach Using Higher Education Statistical Agency Data
by Bassey Henshaw, Bhupesh Kumar Mishra, William Sayers and Zeeshan Pervez
Analytics 2025, 4(1), 10; https://doi.org/10.3390/analytics4010010 - 11 Mar 2025
Viewed by 350
Abstract
Graduate salaries are a significant concern for graduates, employers, and policymakers, as various factors influence them. This study investigates determinants of graduate salaries in the UK, utilising survey data from HESA (Higher Education Statistical Agency) and integrating advanced machine learning (ML) explanatory techniques [...] Read more.
Graduate salaries are a significant concern for graduates, employers, and policymakers, as various factors influence them. This study investigates determinants of graduate salaries in the UK, utilising survey data from HESA (Higher Education Statistical Agency) and integrating advanced machine learning (ML) explanatory techniques with statistical analytical methodologies. By employing multi-stage analyses alongside machine learning models such as decision trees, random forests and the explainability with SHAP stands for (Shapley Additive exPanations), this study investigates the influence of 21 socioeconomic and demographic variables on graduate salary outcomes. Key variables, including institutional reputation, age at graduation, socioeconomic classification, job qualification requirements, and domicile, emerged as critical determinants, with institutional reputation proving the most significant. Among ML methods, the decision tree achieved a standout with the highest accuracy through rigorous optimisation techniques, including oversampling and undersampling. SHAP highlighted the top 12 influential variables, providing actionable insights into the interplay between individual and systemic factors. Furthermore, the statistical analysis using ANOVA (Analysis of Variance) validated the significance of these variables, revealing intricate interactions that shape graduate salary dynamics. Additionally, domain experts’ opinions are also analysed to authenticate the findings. This research makes a unique contribution by combining qualitative contextual analysis with quantitative methodologies, machine learning explainability and domain experts’ views on addressing gaps in the existing identification of graduate salary predicting components. Additionally, the findings inform policy and educational interventions to reduce wage inequalities and promote equitable career opportunities. Despite limitations, such as the UK-specific dataset and the focus on socioeconomic and demographic variables, this study lays a robust foundation for future research in predictive modelling and graduate outcomes. Full article
Show Figures

Figure 1

3 pages, 133 KiB  
Editorial
Updated Aims and Scope of Analytics
by Carson K. Leung
Analytics 2025, 4(1), 9; https://doi.org/10.3390/analytics4010009 - 6 Mar 2025
Viewed by 167
Abstract
Analytics [...] Full article
17 pages, 305 KiB  
Article
The Role of Cognitive Performance in Older Europeans’ General Health: Insights from Relative Importance Analysis
by Eleni Serafetinidou and Christina Parpoula
Analytics 2025, 4(1), 8; https://doi.org/10.3390/analytics4010008 - 4 Mar 2025
Viewed by 200
Abstract
This study explores the role of cognitive performance in the general health of older Europeans aged 50 and over, focusing on gender differences, using data from 336,500 respondents in the sixth wave of the Survey of Health, Aging, and Retirement in Europe (SHARE). [...] Read more.
This study explores the role of cognitive performance in the general health of older Europeans aged 50 and over, focusing on gender differences, using data from 336,500 respondents in the sixth wave of the Survey of Health, Aging, and Retirement in Europe (SHARE). Cognitive functioning was assessed through self-rated reading and writing skills, orientation in time, numeracy, memory, verbal fluency, and word-list learning. General health status was estimated by constructing a composite index of physical and mental health-related measures, including chronic diseases, mobility limitations, depressive symptoms, self-perceived health, and the Global Activity Limitation Indicator. Participants were classified into good or poor health status, and logistic regression models assessed the predictive significance of cognitive variables on general health, supplemented by a relative importance analysis to estimate relative effect sizes. The results indicated that males had a 51.1% lower risk of reporting poor health than females, and older age was associated with a 4.0% increase in the odds of reporting worse health for both genders. Memory was the strongest predictor of health status (26% of the model R2), with a greater relative contribution than the other cognitive variables. No significant gender differences were found. While this study estimates the odds of reporting poorer health in relation to gender and various cognitive characteristics, adopting a lifespan approach could provide valuable insights into the longitudinal associations between cognitive functioning and health outcomes. Full article
30 pages, 1939 KiB  
Article
Towards Visual Analytics for Explainable AI in Industrial Applications
by Kostiantyn Kucher, Elmira Zohrevandi and Carl A. L. Westin
Analytics 2025, 4(1), 7; https://doi.org/10.3390/analytics4010007 - 12 Feb 2025
Viewed by 623
Abstract
As the levels of automation and reliance on modern artificial intelligence (AI) approaches increase across multiple industries, the importance of the human-centered perspective becomes more evident. Various actors in such industrial applications, including equipment operators and decision makers, have their needs and preferences [...] Read more.
As the levels of automation and reliance on modern artificial intelligence (AI) approaches increase across multiple industries, the importance of the human-centered perspective becomes more evident. Various actors in such industrial applications, including equipment operators and decision makers, have their needs and preferences that often do not align with the decisions produced by black-box models, potentially leading to mistrust and wasted productivity gain opportunities. In this paper, we examine these issues through the lenses of visual analytics and, more broadly, interactive visualization, and we argue that the methods and techniques from these fields can lead to advances in both academic research and industrial innovations concerning the explainability of AI models. To address the existing gap within and across the research and application fields, we propose a conceptual framework for visual analytics design and evaluation for such scenarios, followed by a preliminary roadmap and call to action for the respective communities. Full article
(This article belongs to the Special Issue Visual Analytics: Techniques and Applications)
Show Figures

Figure 1

22 pages, 2383 KiB  
Article
Monetary Policy Sentiment and Its Influence on Healthcare and Technology Markets: A Transformer Model Approach
by Dongnan Liu and Jong-Min Kim
Analytics 2025, 4(1), 6; https://doi.org/10.3390/analytics4010006 - 11 Feb 2025
Viewed by 531
Abstract
This study investigates how the Federal Open Market Committee’s (FOMC) statements impact healthcare spending, mental health trends, and stock performance in healthcare and tech sectors By analyzing FOMC’s sentiment from 2018 to 2024, we found that higher sentiment correlates with increased depressive disorders [...] Read more.
This study investigates how the Federal Open Market Committee’s (FOMC) statements impact healthcare spending, mental health trends, and stock performance in healthcare and tech sectors By analyzing FOMC’s sentiment from 2018 to 2024, we found that higher sentiment correlates with increased depressive disorders (2019–2021) and tech stock returns, especially for the “Magnificent Seven” (like Apple and Amazon). Although healthcare stocks showed weaker ties to sentiment, Granger causality tests suggest some influence, hinting at ways to adjust stock strategies based on FOMC trends. These results highlight how central bank communication can shape both mental health dynamics and investment decisions in healthcare and technology. Full article
(This article belongs to the Special Issue Business Analytics and Applications)
Show Figures

Figure 1

22 pages, 2578 KiB  
Article
A Comparative Analysis of Machine Learning and Deep Learning Techniques for Accurate Market Price Forecasting
by Olamilekan Shobayo, Sidikat Adeyemi-Longe, Olusogo Popoola and Obinna Okoyeigbo
Analytics 2025, 4(1), 5; https://doi.org/10.3390/analytics4010005 - 11 Feb 2025
Viewed by 1252
Abstract
This study compares three machine learning and deep learning models—Support Vector Regression (SVR), Recurrent Neural Networks (RNN), and Long Short-Term Memory (LSTM)—for predicting market prices using the NGX All-Share Index dataset. The models were evaluated using multiple error metrics, including Mean Absolute Error [...] Read more.
This study compares three machine learning and deep learning models—Support Vector Regression (SVR), Recurrent Neural Networks (RNN), and Long Short-Term Memory (LSTM)—for predicting market prices using the NGX All-Share Index dataset. The models were evaluated using multiple error metrics, including Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Square Error (RMSE), Mean Percentage Error (MPE), and R-squared. RNN and LSTM were tested with both 30 and 60-day windows, with performance compared to SVR. LSTM delivered better R-squared values, with a 60-day LSTM achieving the best accuracy (R-squared = 0.993) when using a combination of endogenous market data and technical indicators. SVR showed reliable results in certain scenarios but struggled in fold 2 with a sudden spike that shows a high probability of not capturing the entire underlying NGX pattern in the dataset correctly, as witnessed by the high validation loss during the period. Additionally, RNN faced the vanishing gradient problem that limits its long-term performance. Despite challenges, LSTM’s ability to handle temporal dependencies, especially with the inclusion of On-Balance Volume, led to significant improvements in prediction accuracy. The use of the Optuna optimisation framework further enhanced model training and hyperparameter tuning, contributing to the performance of the LSTM model. Full article
Show Figures

Figure 1

14 pages, 2091 KiB  
Article
Personalizing Multimedia Content Recommendations for Intelligent Vehicles Through Text–Image Embedding Approaches
by Jin-A Choi, Taekeun Hong and Kiho Lim
Analytics 2025, 4(1), 4; https://doi.org/10.3390/analytics4010004 - 5 Feb 2025
Viewed by 395
Abstract
The ability to automate and personalize the recommendation of multimedia contents to consumers has been gaining significant attention recently. The burgeoning demand for digitization and automation of formerly analog communication processes has caught the attention of researchers and professionals alike. In light of [...] Read more.
The ability to automate and personalize the recommendation of multimedia contents to consumers has been gaining significant attention recently. The burgeoning demand for digitization and automation of formerly analog communication processes has caught the attention of researchers and professionals alike. In light of the recent interest and anticipated transition to fully autonomous vehicles, this study proposes a text–image embedding method recommender system for the optimization of personalized multimedia content for in-vehicle infotainment. This study leverages existing pre-trained text embedding models and pre-trained image feature extraction methods. Previous research to date has focused mainly on textual-only or image-only analyses. By employing similarity measurements, this study demonstrates how recommendation of the most relevant multimedia content to consumers is enhanced through text–image embedding. Full article
Show Figures

Figure 1

13 pages, 291 KiB  
Article
A Fuzzy Analytical Network Process Framework for Prioritizing Competitive Intelligence in Startups
by Arman Golshan, Soheila Sardar, Seyed Faraz Mahdavi Ardestani and Paria Sadeghian
Analytics 2025, 4(1), 3; https://doi.org/10.3390/analytics4010003 - 14 Jan 2025
Viewed by 727
Abstract
Competitive intelligence (CI) is a critical tool for startups, enabling informed decision making through the systematic gathering and analysis of relevant information. This study aims to identify and prioritize the key factors influencing CI in startups, providing actionable insights for entrepreneurs, educators, and [...] Read more.
Competitive intelligence (CI) is a critical tool for startups, enabling informed decision making through the systematic gathering and analysis of relevant information. This study aims to identify and prioritize the key factors influencing CI in startups, providing actionable insights for entrepreneurs, educators, and support organizations. Through a systematic literature review, key variables and components impacting competitive intelligence were identified. Two surveys were conducted to refine these components. The first employed a five-point Likert scale to evaluate the significance of each component, while the second used a pairwise comparison approach involving ten experts in CI and startup mentorship. Utilizing the fuzzy Analytical Network Process (ANP), this study ranked Technology Intelligence as the most critical factor, followed by market and Strategic Intelligence. Competitor Intelligence and Internet intelligence were deemed moderately important, while Organizational Intelligence ranked lowest. These findings emphasize the importance of technology-driven insights and market awareness in fostering startups’ competitive advantage and informed decision making. This study provides a structured framework to guide startups in prioritizing CI efforts, offering practical strategies for navigating dynamic market conditions and achieving long-term success. Full article
(This article belongs to the Special Issue Business Analytics and Applications)
12 pages, 2818 KiB  
Article
Use of Hazard Functions for Determining Power-Law Behaviour in Data
by Joseph D. Bailey
Analytics 2025, 4(1), 2; https://doi.org/10.3390/analytics4010002 - 9 Jan 2025
Viewed by 483
Abstract
Determining the ‘best-fitting’ distribution for data is an important problem in data analysis. Specifically, observing how the distribution of data changes as values below (or above) a threshold are omitted from analyses can be of use in various applications, from animal movement to [...] Read more.
Determining the ‘best-fitting’ distribution for data is an important problem in data analysis. Specifically, observing how the distribution of data changes as values below (or above) a threshold are omitted from analyses can be of use in various applications, from animal movement to the modelling of natural phenomena. Such truncated distributions, known as hazard functions, are widely studied and well understood in survival analysis, although rarely widely used in data analysis. Here, by considering the hazard and reverse-hazard functions, we demonstrate a qualitative assessment of the ‘best-fit’ distribution of data. Specifically, we highlight the potential advantages of this method when determining whether power-law behaviour may or may not be present in data. Finally, we demonstrate this approach using some real-world datasets. Full article
Show Figures

Figure 1

26 pages, 15401 KiB  
Article
Uncovering Patterns and Trends in Big Data-Driven Research Through Text Mining of NSF Award Synopses
by Arielle King and Sayed A. Mostafa
Analytics 2025, 4(1), 1; https://doi.org/10.3390/analytics4010001 - 6 Jan 2025
Viewed by 959
Abstract
The rapid expansion of big data has transformed research practices across disciplines, yet disparities exist in its adoption among U.S. institutions of higher education. This study examines trends in NSF-funded big data-driven research across research domains, institutional classifications, and directorates. Using a quantitative [...] Read more.
The rapid expansion of big data has transformed research practices across disciplines, yet disparities exist in its adoption among U.S. institutions of higher education. This study examines trends in NSF-funded big data-driven research across research domains, institutional classifications, and directorates. Using a quantitative approach and natural language processing (NLP) techniques, we analyzed NSF awards from 2006 to 2022, focusing on seven NSF research areas: Biological Sciences, Computer and Information Science and Engineering, Engineering, Geosciences, Mathematical and Physical Sciences, Social, Behavioral and Economic Sciences, and STEM Education (formally known as Education and Human Resources). Findings indicate a significant increase in big data-related awards over time, with CISE (Computer and Information Science and Engineering) leading in funding. Machine learning and artificial intelligence are dominant themes across all institutions’ classifications. Results show that R1 and non-minority-serving institutions receive the majority of big data-driven research funding, though HBCUs have seen recent growth due to national diversity initiatives. Topic modeling reveals key subdomains such as cybersecurity and bioinformatics benefiting from big data, while areas like Biological Sciences and Social Sciences engage less with these methods. These findings suggest the need for broader support and funding to foster equitable adoption of big data methods across institutions and disciplines. Full article
Show Figures

Figure 1

Previous Issue
Next Issue
Back to TopTop