Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains
Abstract
1. Introduction
2. Problem Statement, Purpose, and Objectives of the Study
- The coverage of specific subtopics (e.g., telerobotics or decision-support);
- The depth of research in critical areas (e.g., human–machine interaction in transport automation);
- Existing gaps in addressing essential issues such as system integration or behavioural modelling.
- Process custom collections of PDF articles independently of commercial platforms;
- Perform thematic normalization that accounts for terminology and rubric variations;
- Analyze category interrelations and article relevance from the perspective of different engineering and applied domains.
- Development of a modular, ML-based semantic analysis framework that combines vectorization, clustering, and fuzzy logic;
- Integration of topic modelling and 3D visualization for comprehensive full-text analysis;
- Demonstration of the framework’s applicability to safety-critical domains such as maritime safety and cybersecurity.
3. Materials and Methods
- article_metadata.csv or the ArticleMetadata sheet from Art_AI.xlsx—publication metadata (ArticleID, APA, Keywords, etc.).
- expert_matrix.csv or the ExpertMembership sheet—expert evaluations of each article’s correspondence to specific rubrics.
- pandas.read_excel(), pandas.read_csv()—for importing tables.
- ExcelFile(…).sheet_names—for previewing available sheets in the .xlsx file.
- Automatic conversion of identifiers (ArticleID) to string type (str) to avoid errors during table merging.
- pivot_expert.to_csv(r”C:\… Art_AI\expert_matrix.csv”)
- df_meta.to_csv(r”C:\…Art_AI\ article_metadata.csv”)
4. Formal Model for Analyzing the Core Collection of Scientific Publications
- Input Data
- 1.1.
- Primary sets:A = {a1, a2, …, an}—the set of scientific articles.R = {r1, r2, …, rm}—the set of rubrics (topics).
- 1.2.
- Rubric evaluation matrix:M = [mij] ∈ [0, 1]n×m—normalized correspondence matrix between articles ai and rubrics rj, formed automatically based on TF–IDF or classification models, where mij is the degree of relevance of ai to rj.
- 1.3.
- Article metadata (based on expert evaluation) (Figure 2):Ki—keywords;Ti—research objective;Si—results;Gi—identified gaps;APAi—bibliographic reference.
- Vectorization and Normalization
- 2.1.
- Article rubric vector (1):
- 2.2.
- Normalization (L2) (2):At this stage, component-wise normalization of each article vector is performed, i.e., for each row of matrix M. The norm is used to ensure scale invariance when comparing similarities:To assess external validity, the selected thematic clusters were manually cross-checked against Scopus and IEEE Xplore metadata to verify topic coherence. Future work will extend the validation to domain-specific databases.
- Coverage Analysis and Entropy
- 3.1.
- General article coverage (4):
- 3.2.
- Average rubric coverage (5):
- 3.3.
- Semantic entropy of a rubric (according to Shannon) (6):
- Dimensionality Reduction (PCA) (Figure 3)
- 4.1.
- Data cantering and feature matrix (7) and (8):Figure 3. Semantic clustering map illustrating topic relationships across the corpus. Each colour denotes a major thematic group (blue—cybersecurity; green—maritime safety; orange—energy systems; purple—AI safety). Node size indicates topic frequency, while edge thickness represents semantic similarity.Figure 3. Semantic clustering map illustrating topic relationships across the corpus. Each colour denotes a major thematic group (blue—cybersecurity; green—maritime safety; orange—energy systems; purple—AI safety). Node size indicates topic frequency, while edge thickness represents semantic similarity.
- 4.2.
- Covariance matrix (9):where Ω is a symmetric matrix subject to spectral decomposition (for PCA).
- 4.3.
- Component selection:Selection of principal components based on the eigenvalues of the matrix Ω.The first k principal components are selected to preserve more than 90% of the variance.
- Article Clustering
- 5.1.
- Euclidean distance (10):
- 5.2.
- K-Means (11):where μk is the cluster centre.
- 5.3.
- DBSCAN (12):Silhouette Score method for clustering quality assessment (13):where a(i) is the average distance to other points in the same cluster and b(i) is the minimum average distance to points in other clusters.
- Cosine Similarity Matrix
- 6.1.
- Similarity computation (14):
- 6.2.
- Use cases:For building a graph G = (V, E) or for a recommendation system (nearest neighbour search).
- Detection of Semantic Gaps
- 7.1.
- GapSet (15):whereθ ∈ [0.05, 0.15]—threshold for weak coverage,η ∈ [0.5, 1.5]—threshold for informational dispersion.
- Recommendation System
- 8.1.
- Averaged profile of nearest neighbours (16):
- 8.2.
- Recommended rubrics (17) (Figure 4):where τ is the importance threshold and δ is the absence threshold in the current profile.

5. Model Description for 3D Software Visualization of Publication Analysis Results
- Text vectorization (Count Vectorizer).We have a set of articles: D = {d1, d2, …, dn}, and a vocabulary of terms: T = {t1, t2, …, tm}.Next, we construct a document–term frequency matrix: ,where xij—the number of occurrences of term tj in document di,X—a sparse matrix, which is passed to the dimensionality reduction stage.
- Dimensionality reduction (Truncated SVD model).Transformation of into a three-dimensional projection .The Truncated SVD model approximates matrix X as (18)where —coordinates of the documents in the component space,—matrix of right singular vectors (terms in component space),WT—transposed matrix of terms in the new basis.That is, for each document (19):
- Meaning of the Axes (Research Directions—Component k).Top terms of each axis are defined via the weight influence Wkj, where k = 1, 2, 3. These are the absolute values of the term weights tj in each component k (Figure 5).Figure 5. Three-dimensional map of scientific publications generated using Truncated SVD, showing semantic grouping of the maritime safety literature (validation dataset).Figure 5. Three-dimensional map of scientific publications generated using Truncated SVD, showing semantic grouping of the maritime safety literature (validation dataset).
- Cluster assignment (DBSCAN).Input data: document coordinates .The DBSCAN algorithm classifies points based on Euclidean distance (20):Parameters: ε = 1.5—neighbourhood radius and minPts = 2—minimum number of points in a neighbourhood. A point zi belongs to cluster C if it satisfies . If not, the point is considered noise (cluster = −1). That is, a point zi in space is considered a “core” if it has more than minPts neighbours.
- Vectorization and positioning of the selected user article.The selected PDF article is converted to text duser, and vectorized using the previously trained model (21):Its coordinates in the component space are defined as (22)
- Space construction.Each point on the graph is: . Colouring is based on cluster labels Ci, and the user’s article is displayed as a separate point (in red, with a rectangular label): .
- Axis interpretation.Each axis is considered as a linear combination of terms (23):SVD 1 (x-axis): technical terminology (systems, safety, ship, information); SVD 2 (y-axis): HMI, software, sensors; SVD 3 (z-axis): simulation, maneuvering, testing.General view of the model:
6. Development of Software Modules for Analyzing Selected Groups of Publication Collections
- Clustering module (KMeans method) distributes the input array of articles into a fixed number of clusters based on the KMeans algorithm. Each article is described in a three-dimensional feature space (x, y, z), and assignment to a cluster is performed by minimizing intra-cluster dispersion.
- Clustering module (DBSCAN method) applies the DBSCAN algorithm, which allows clusters of arbitrary shape to be identified without the need to predefine their number. It is particularly effective for identifying dense thematic groups and detecting anomalies.
- Clustering quality assessment module performs quality assessment of the formed clusters using internal metrics, in particular the silhouette coefficient and the Davis–Boldin index. This allows you to quantitatively assess the degree of cohesion of clusters and their isolation.
- Thematic analysis module performs a frequency analysis of keywords for each cluster. Frequently used terms (with a frequency of more than 10 times) are interpreted as indicators of the cluster’s theme.
- Integrated analysis and report generation module summarizes the results of clustering, distance calculation, and thematic analysis. Generates a final report that includes the calculation of Euclidean distances to the user-uploaded article, the determination of cluster centres, and the visualization of results, Figure 6.
7. Discussion
- Article [37] presents a new method for controlling the pivot point position on conventional single-screw vessels without bow thrusters, refining the “centre of mass–centre of rotation–pivot point model.”
- Article [38] describes a method for automatically resetting kinetic energy in the case of an inevitable collision, based on gradient-based optimal control.
- In article [39], the concept of an intelligent control system for the redundant structure of vessel executive devices is proposed. The system ensures automatic redistribution control by simultaneously modulating multiple ship actuators.
- A simulation-based method using Markov processes to predict changes in the ship’s seaworthy condition under the influence of various factors [40];
- Evaluation of the quality of risk analysis for shipboard operations through expert review [41];
- A study on the impact of AIS manipulation on risk detection, safety, and proposed strategic countermeasures [42];
- A comprehensive interdisciplinary model of cybersecurity in maritime transport, including analysis of the regulatory environment, technical solutions, global incident registries, and training initiatives [43].
- The impact of excessive mobile device use on chronic stress [44];
- The role of coping styles and self-regulation in the formation of anxiety [45];
- Youth psychological defence strategies [46];
- Attributional style as a defence mechanism in the behaviour of student activists [47];
- Coping strategies of youth under martial law conditions [48];
- Psycho-emotional stability of individuals during periods of societal transformation [49].
8. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial intelligence |
| AIS | Automatic Identification System |
| APA | American Psychological Association (style) |
| BERT | Bidirectional Encoder Representations from Transformers |
| CPU | Central Processing Unit |
| CSV | Comma-Separated Values |
| DBSCAN | Density-Based Spatial Clustering of Applications with Noise |
| ECDIS | Electronic Chart Display and Information System |
| EMSA | European Maritime Safety Agency |
| HMI | Human–Machine Interface |
| HTML | HyperText Markup Language |
| IMO | International Maritime Organization |
| KMeans | K-Means clustering |
| LDA | Latent Dirichlet Allocation |
| MAS | Multi-Agent Systems |
| ML | Machine learning |
| PCA | Principal Component Analysis |
| Portable Document Format | |
| PRISMA | Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
| RAM | Random Access Memory |
| SVD | model Singular Value Decomposition |
| TF–IDF | Term Frequency–Inverse Document Frequency |
| t-SNE | t-distributed Stochastic Neighbour Embedding |
| UMAP | Uniform Manifold Approximation and Projection |
| XAI | Explainable AI |
| TruncatedSVD | Truncated Singular Value Decomposition (as implemented in scikit-learn for dimensionality reduction in sparse matrices) |
References
- Fan, S.; Shi, K.; Weng, J.; Yang, Z. Letting Losses Be Lessons: Human–Machine Cooperation in Maritime Transport. Reliab. Eng. Syst. Saf. 2025, 253, 110547. [Google Scholar] [CrossRef]
- Man, Y.; Brorsson, E.; Björndal, P. Human–Machine Interaction Challenges for Bridge Operations in Large Passenger Ships and Future Improvements from the Deck Officers’ Perspective. Hum. Factors Transp. 2023, 95, 730–739. [Google Scholar] [CrossRef]
- Saager, M.; Steinmetz, A.; Osterloh, J.-P.; Naumann, A.; Hahn, A. Ensuring Fast Interaction with HMIs for Safety-Critical Systems: An Extension of the Human–Machine Interface Design Method KONECT. Intell. Hum. Syst. Integr. 2024, 119, 193–203. [Google Scholar] [CrossRef]
- Veitch, E.; Alsos, O.A. Human–Centered Explainable Artificial Intelligence for Marine Autonomous Surface Vehicles. J. Mar. Sci. Eng. 2021, 9, 1227. [Google Scholar] [CrossRef]
- Peeters, G.; Yayla, G.; Catoor, T.; Van Baelen, S.; Afzal, M.R.; Christofakis, C.; Storms, S.; Boonen, R.; Slaets, P. An Inland Shore Control Centre for Monitoring or Controlling Unmanned Inland Cargo Vessels. J. Mar. Sci. Eng. 2020, 8, 758. [Google Scholar] [CrossRef]
- Martelli, M.; Virdis, A.; Gotta, A.; Cassarà, P.; Di Summa, M. An Outlook on the Future Marine Traffic Management System for Autonomous Ships. IEEE Access 2021, 9, 157316–157328. [Google Scholar] [CrossRef]
- Emad, G.R.; Narayanan, S.; Kataria, A. On the Road to Autonomous Maritime Transport: A Conceptual Framework to Meet Training Needs for Future Ship Operations. Adv. Transp. 2022, 60, 640–646. [Google Scholar] [CrossRef]
- Li, X.; Yuen, K.F. A Human-Centred Review on Maritime Autonomous Surface Ships: Impacts, Responses, and Future Directions. Transp. Rev. 2024, 44, 791–810. [Google Scholar] [CrossRef]
- Liu, J.; Aydın, M.; Akyüz, E.; Arslan, Ö.; Matyar, E.; Kurte, R.E.; Turan, O. Prediction of Human–Machine Interface (HMI) Operational Errors for Maritime Autonomous Surface Ships (MASS). J. Mar. Sci. Technol. 2021, 27, 293–306. [Google Scholar] [CrossRef]
- Song, R.; Papadimitriou, E.; Negenborn, R.R.; van Gelder, P. Safety and Efficiency of Human–MASS Interactions: Towards an Integrated Framework. J. Mar. Eng. Technol. 2024, 24, 159–178. [Google Scholar] [CrossRef]
- Gözalan, A.; John, O.; Lübcke, T.; Maier, A.; Reimann, M.; Richter, J.-G.; Zverev, I. Assisting Maritime Search and Rescue (SAR) Personnel with AI-Based Speech Recognition and Smart Direction Finding. J. Mar. Sci. Eng. 2020, 8, 818. [Google Scholar] [CrossRef]
- Danial, S.N.; Smith, D.; Veitch, B. A Method to Detect Anomalies in Complex Socio-Technical Operations Using Structural Similarity. J. Mar. Sci. Eng. 2021, 9, 212. [Google Scholar] [CrossRef]
- Pedersen, O.-M.; Kim, E. Arctic Vision: Using Neural Networks for Ice Object Classification, and Controlling How They Fail. J. Mar. Sci. Eng. 2020, 8, 770. [Google Scholar] [CrossRef]
- Simion, D.; Postolache, F.; Fleacă, B.; Fleacă, E. AI-Driven Predictive Maintenance in Modern Maritime Transport-Enhancing Operational Efficiency and Reliability. Appl. Sci. 2024, 14, 9439. [Google Scholar] [CrossRef]
- Cao-Feijóo, G.; Pérez-Canosa, J.M.; Pérez-Castelo, F.J.; Orosa, J.A. Deep Learning Methods to Mitigate Human-Factor-Related Accidents in Maritime Transport. J. Mar. Sci. Eng. 2024, 12, 1819. [Google Scholar] [CrossRef]
- Bhaganagar, K.; Kolar, P.; Faruqui, S.H.A.; Bhattacharjee, D.; Alaeddini, A.; Subbarao, K. A Novel Machine-Learning Framework with a Moving Platform for Maritime Drift Calculations. Front. Mar. Sci. 2022, 9, 831501. [Google Scholar] [CrossRef]
- Bandara, D.; Woodward, M.; Chin, C.; Jiang, D. Augmented Reality Lights for Compromised Visibility Navigation. J. Mar. Sci. Eng. 2020, 8, 1014. [Google Scholar] [CrossRef]
- Abdullah, A.; Blow, D.; Chen, R.; Uthai, T.; Du, E.J.; Islam, M.J. Human-Machine Interfaces for Subsea Telerobotics: From Soda-Straw to Natural Language Interactions [Preprint]. arXiv 2024. [Google Scholar] [CrossRef]
- Kari, R.; Steinert, M. Human Factor Issues in Remote Ship Operations: Lessons Learned by Studying Different Domains. J. Mar. Sci. Eng. 2021, 9, 385. [Google Scholar] [CrossRef]
- Fan, S.; Yang, Z. Analysing Seafarer Competencies in a Dynamic Human–Machine System. Ocean Coast. Manag. 2023, 240, 106662. [Google Scholar] [CrossRef]
- Misztal, L.; Hatlas-Sowinska, P. The Impact of the Human Factor on Communication during a Collision Situation in Maritime Navigation. Appl. Sci. 2025, 15, 2797. [Google Scholar] [CrossRef]
- Zhou, Y.; Liu, Z.; Wang, X.; Xie, H.; Tao, J.; Wang, J.; Yang, Z. Human Errors Analysis for Remotely Controlled Ships during Collision Avoidance. Front. Mar. Sci. 2024, 11, 1473367. [Google Scholar] [CrossRef]
- Aylward, K.; Weber, R.; Man, Y.; Lundh, M.; MacKinnon, S.N. “Are You Planning to Follow Your Route?” The Effect of Route Exchange on Decision Making, Trust, and Safety. J. Mar. Sci. Eng. 2020, 8, 280. [Google Scholar] [CrossRef]
- Smith, J.; Yazdanpanah, F.; Thistle, R.; Musharraf, M.; Veitch, B. Capturing Expert Knowledge to Inform Decision Support Technology for Marine Operations. J. Mar. Sci. Eng. 2020, 8, 689. [Google Scholar] [CrossRef]
- Wang, S.; Gang, L.; Liu, T.; Lan, Z.; Li, C. Analysis of the Characteristics of Ship Collision-Avoidance Behavior Based on Apriori and Complex Network. J. Mar. Sci. Eng. 2025, 13, 35. [Google Scholar] [CrossRef]
- Zhou, Y.; Du, W.; Liu, J.; Li, H.; Grifoll, M.; Song, W.; Zheng, P. Determination of Ship Collision Avoidance Timing Using Machine Learning Method. Sustainability 2024, 16, 4626. [Google Scholar] [CrossRef]
- Zheng, J.; Liu, B.; Li, Y.; Huang, C. Unmanned Ship Collision Avoidance Action Plan Deduction Method under Man–Machine Interactive Negotiation in Collision Avoidance Scenarios. J. Mar. Sci. Eng. 2024, 12, 1842. [Google Scholar] [CrossRef]
- Zhou, H.; Zheng, M.; Chu, X.; Yu, C.; Lei, J.; Lin, B.; Zhang, K.; Hua, W. Virtual Reality Fusion Testing–Based Autonomous Collision Avoidance of Ships in Open Water: Methods and Practices. J. Mar. Sci. Eng. 2024, 12, 2181. [Google Scholar] [CrossRef]
- Park, J.; Jeong, J.-S. An Estimation of Ship Collision Risk Based on Relevance Vector Machine. J. Mar. Sci. Eng. 2021, 9, 538. [Google Scholar] [CrossRef]
- Wu, X.; Liu, K.; Zhang, J.; Yuan, Z.; Liu, J.; Yu, Q. An Optimized Collision Avoidance Decision-Making System for Autonomous Ships under Human–Machine Cooperation Situations. J. Adv. Transp. 2021, 2021, 7537825. [Google Scholar] [CrossRef]
- Huang, Y.; Chen, L.; Negenborn, R.R.; van Gelder, P.H.A.J.M. A Ship Collision Avoidance System for Human–Machine Co-Operation during Collision Avoidance. Ocean Eng. 2020, 207, 107913. [Google Scholar] [CrossRef]
- Huang, Y. Supporting Human–Machine Interaction in Ship Collision Avoidance Systems. Doctoral Dissertation, Delft University of Technology, Delft, The Netherlands, 2019. [Google Scholar] [CrossRef]
- van den Oever, F.; Fjeld, M.; Sætrevik, B. A Systematic Literature Review of Augmented Reality for Maritime Collaboration. Int. J. Hum.-Comput. Interact. 2024, 40, 4116–4131. [Google Scholar] [CrossRef]
- Solovey, O.; Ben, A.; Dudchenko, S.; Nosov, P. Development of Control Model for Loading Operations on Heavy Lift Vessels Based on Inverse Algorithm. East. Eur. J. Enterp. Technol. 2020, 5, 48–56. [Google Scholar] [CrossRef]
- Nosov, P.; Zinchenko, S.; Ben, A.; Prokopchuk, Y.; Mamenko, P.; Popovych, I.; Moiseienko, V.; Kruglyj, D. Navigation Safety Control System Development through Navigator Action Prediction by Data Mining Means. East. Eur. J. Enterp. Technol. 2021, 2, 55–68. [Google Scholar] [CrossRef]
- Ponomaryova, V.; Nosov, P.; Ben, A.; Popovych, I.; Prokopchuk, Y.; Mamenko, P.; Dudchenko, S.; Appazov, E.; Sokol, I. Devising an Approach for the Automated Restoration of Shipmaster’s Navigational Qualification Parameters under Risk Conditions. East. Eur. J. Enterp. Technol. 2024, 1, 6–26. [Google Scholar] [CrossRef]
- Kobets, V.; Popovych, I.; Zinchenko, S.; Nosov, P.; Tovstokoryi, O.; Kyrychenko, K. Control of the Pivot Point Position of a Conventional Single–Screw Vessel. In CEUR Workshop Proceedings; CEUR-WS: Aachen, Germany, 2023; Volume 3513, pp. 130–140. Available online: https://ceur-ws.org/Vol-3513/paper11.pdf (accessed on 28 August 2025).
- Zinchenko, S.; Kyrychenko, K.; Grosheva, O.; Nosov, P.; Popovych, I.; Mamenko, P. Automatic Reset of Kinetic Energy in Case of Inevitable Collision of Ships. In Proceedings of the 13th International Conference on Advanced Computer Information Technologies (ACIT), Wrocław, Poland, 21–23 September 2023; pp. 496–500. [Google Scholar] [CrossRef]
- Zinchenko, S.; Kobets, V.; Tovstokoryi, O.; Nosov, P.; Popovych, I. Intelligent System Control of the Vessel Executive Devices Redundant Structure. In CEUR Workshop Proceedings; CEUR-WS: Aachen, Germany, 2023; Volume 3403, pp. 582–594. Available online: https://ceur-ws.org/Vol-3403/paper44.pdf (accessed on 28 August 2025).
- Melnyk, O.; Onyshchenko, S.; Onishchenko, O.; Shcherbina, O.; Vasalatii, N. Simulation-Based Method for Predicting Changes in the Ship’s Seaworthy Condition under Impact of Various Factors. In Studies in Systems, Decision and Control; Springer: Cham, Switzerland, 2023; Volume 481, pp. 653–664. [Google Scholar] [CrossRef]
- Melnyk, O.; Bychkovsky, Y.; Onishchenko, O.; Onyshchenko, S.; Volianska, Y. Development of the Method of Shipboard Operations Risk Assessment Quality Evaluation Based on Experts Review. In Studies in Systems, Decision and Control; Springer: Cham, Switzerland, 2023; Volume 481, pp. 695–710. [Google Scholar] [CrossRef]
- Melnyk, O.; Kuznichenko, S.; Onishchenko, O. Impact of AIS Manipulation on Shipping Safety and Strategic Countermeasures. Lex Portus 2024, 10, 31–39. [Google Scholar] [CrossRef]
- Melnyk, O.; Drozdov, O.; Kuznichenko, S. Cybersecurity in Maritime Transport: An International Perspective on Regulatory Frameworks and Countermeasures. Lex Portus 2025, 11, 7–19. [Google Scholar] [CrossRef]
- Limasheva, L.; Kariyev, A.; Berdibayeva, S.; Dautkaliyeva, P. Chronic Stress and Resilience of Students in Conditions of Excessive Mobile Phone Usage. Insight Psychol. Dimens. Soc. 2025, 13, 174–194. Available online: https://insight.journal.kspu.edu/index.php/insight/article/view/293 (accessed on 28 August 2025).
- Kalamazh, R.; Voloshyna-Narozhna, V.; Tymoshchuk, Y.; Balashov, E. Coping Styles and Self-Regulation Abilities as Predictors of Anxiety. Insight Psychol. Dimens. Soc. 2024, 12, 96–114. [Google Scholar] [CrossRef]
- Plokhikh, V.; Bilous, R. Psychological Defenses of Students in Confronting the Stressors of Military Actions. Insight Psychol. Dimens. Soc. 2025, 13, 18–48. [Google Scholar] [CrossRef]
- Bokhonkova, Y.; Serbin, I.; Zavatska, N.; Danko, D. Attributional Style as a Defense Mechanism of Student Activists’ Behavior. Insight Psychol. Dimens. Soc. 2025, 13, 280–304. [Google Scholar] [CrossRef]
- Halian, I.; Halian, O.; Myshchyshyn, M. Coping Strategies in the Behavioral Models of Youth in Martial Law Conditions. Insight Psychol. Dimens. Soc. 2024, 12, 44–65. [Google Scholar] [CrossRef]
- Raievska, Y.; Kulbida, S.; Lavlinskyy, R.; Senytsia, N. Psycho–Emotional Stability of the Individual in the Period of Societal Transformations. Insight Psychol. Dimens. Soc. 2025, 13, 519–543. Available online: https://ekhsuir.kspu.edu/server/api/core/bitstreams/cf11f1aa-8d4a-46b6-981d-c1eeffbe6866/content (accessed on 1 September 2025).
- Melnyk, O.; Onyshchenko, S.; Koryakin, K. Nature and Origin of Major Security Concerns and Potential Threats to the Shipping Industry. Sci. J. Silesian Univ. Technol. Ser. Transp. 2021, 113, 145–153. [Google Scholar] [CrossRef]
- Malaksiano, N.A. Exact Inclusions of Gehring Classes in Muckenhoupt Classes. Math. Notes 2001, 70, 673–681. [Google Scholar] [CrossRef]
- Lapkina, I.O.; Malaksiano, M.O. Modelling and Optimization of Perishable Cargo Delivery System through Odesa Port. Actual Probl. Econ. 2016, 177, 353–365. [Google Scholar]
- Romanuke, V.V.; Romanov, A.Y.; Malaksiano, M.O. A Genetic Algorithm Improvement by Tour Constraint Violation Penalty Discount for Maritime Cargo Delivery. Syst. Res. Inf. Technol. 2023, 2023, 104–126. [Google Scholar] [CrossRef]
- Wang, D.; Wang, Y.; Dou, M.; Qiao, M.; Fu, X.; Zhang, Y. Exploring the Nonlinear Impact of Visual Environment on Residents’ Happiness: A Computational Framework Integrating Semantic and Geometric Features. Geo-Spat. Inf. Sci. 2024, 27, 1–27. [Google Scholar] [CrossRef]
- Yang, J.; Wu, C.; Du, B.; Zhang, L. Enhanced Multiscale Feature Fusion Network for HSI Classification. IEEE Trans. Geosci. Remote Sens. 2021, 59, 10328–10347. [Google Scholar] [CrossRef]





| Criterion | PRISMA | Scopus/WoS Analytics | Proposed System |
|---|---|---|---|
| Accessibility | Partly open, requires manual work | Commercial licence, paid access | Open-access PDF inputs; runs with Jupyter/Anaconda; reproducible CSV/HTML/PNG exports; no extra licences |
| Selection method | Manual/semi-manual | Automated, but metadata only | Automated full-text parsing plus expert rubric mapping; similarity-weighted “truth” and centroid (prototype) assignment; hybrid scoring |
| Content analysis | Limited, keyword-based | Aggregation of bibliometric data | SBERT embeddings with clustering and topic grouping; interpretable themes; optional expert review loop |
| Visualization | Basic PRISMA flow | Network maps, graphs | Interactive 2D/3D plots |
| Domain adaptability | Low | Low | High—validated on maritime safety, but extendable to HMI, AI/ML, cybersecurity, energy, and related domains |
| ArticleID | Rubric | Expert Percent | Expert Summary |
|---|---|---|---|
| 1 | Human–machine interaction and HMI ergonomics | 40 | The article focuses on operator-automated subsystem interactions, analysis of HMI performance; it proposes a graph-theoretical approach for identifying bottlenecks in decision regions. |
| 1 | Human factors, risk analysis, and competency modelling | 40 | Primary attention is given to identifying and quantitatively analyzing risk influence factors (RIFs), accounting for their frequency and severity, and based on navigator errors and miscommunication. |
| 1 | Decision-support and expert systems | 30 | A graph-theoretical network is used to support decision-making, visualize key RIFs, and provide recommendations for training and design of MASS systems. |
| 1 | AI/ML for analytics and prediction | 15 | Correlation and network models are applied to uncover hidden relationships among RIFs, in line with data analytics methodologies. |
| 2 | Human–machine interaction and HMI ergonomics | 40 | The study of human–machine interaction on the bridge of large passenger vessels, including key issues of information availability, interface fragmentation, trust, and usability. |
| 2 | Human factors, risk analysis, and competency modelling | 30 | The study conducts an in-depth analysis of navigators’ cognitive workload under abnormal conditions, stress from distractions and limited signals, and the applicable 3D strategies of navigational crews. |
| 2 | Decision-support and expert systems | 15 | Gathering deck officers’ requirements for proactive decision-support and context-sensitive tools lays the foundation for developing DSS/expert systems. |
| Tool | Avg. Topic Coherence | Processing Time (100 PDFs) | Thematic Depth (Avg. Keywords/Topic) |
|---|---|---|---|
| Proposed Framework | 0.71 | 6.5 min | 12.4 |
| CiteSpace (6.4.R2) | 0.64 | 8.3 min | 8.9 |
| VOSviewer (1.6.20) | 0.60 | 7.5 min | 9.1 |
| OpenAlex Embeddings (0.19) | 0.67 | 6.8 min | 10.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Nosov, P.; Melnyk, O.; Malaksiano, M.; Mamenko, P.; Onyshko, D.; Fomin, O.; Píštěk, V.; Kučera, P. Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains. Mach. Learn. Knowl. Extr. 2025, 7, 150. https://doi.org/10.3390/make7040150
Nosov P, Melnyk O, Malaksiano M, Mamenko P, Onyshko D, Fomin O, Píštěk V, Kučera P. Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains. Machine Learning and Knowledge Extraction. 2025; 7(4):150. https://doi.org/10.3390/make7040150
Chicago/Turabian StyleNosov, Pavlo, Oleksiy Melnyk, Mykola Malaksiano, Pavlo Mamenko, Dmytro Onyshko, Oleksij Fomin, Václav Píštěk, and Pavel Kučera. 2025. "Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains" Machine Learning and Knowledge Extraction 7, no. 4: 150. https://doi.org/10.3390/make7040150
APA StyleNosov, P., Melnyk, O., Malaksiano, M., Mamenko, P., Onyshko, D., Fomin, O., Píštěk, V., & Kučera, P. (2025). Machine Learning-Based Semantic Analysis of Scientific Publications for Knowledge Extraction in Safety-Critical Domains. Machine Learning and Knowledge Extraction, 7(4), 150. https://doi.org/10.3390/make7040150

