1. Introduction
Intelligent machine tools, which are essential for modern manufacturing, drive significant improvements in production efficiency and precision, thereby enhancing the manufacturing sector’s competitiveness [
1]. These tools are crucial in sectors such as aerospace, automotive manufacturing, and precision instrument production, where industrial automation and digitalization are advancing rapidly. By integrating computer control, sensor technology, and network communications, intelligent machine tools progressively increase manufacturing intelligence [
2,
3]. In this evolving landscape, emerging technologies, marked by swift development and vast commercial potential [
4], are pivotal. Understanding these technologies is crucial not only for current operations, but also for shaping the future [
5], as informed decision making fundamentally depends on reliable information regarding these trends.
The interdisciplinary nature of machine tool technology, characterized by complex interactions across various technological domains, poses significant challenges in predicting emerging technologies. This paper aims to concentrate on intelligent machine tools, particularly on the advanced areas where new-generation information technology deeply integrates with machine tool technology. By examining these cutting-edge developments, we aim to propose a framework that identifies potential emerging technologies via technology communities and deep learning methods, providing methodological support for capitalizing on future development opportunities in the field.
Current methods for predicting emerging technologies often focus on broad industry analyses. For instance, Zhou et al. [
6] utilized a topic clustering model to identify emerging technologies in the 3D printing sector, while Wang et al. [
7] applied bibliometrics to uncover emerging areas in cancer research. However, the field of intelligent machine tools is characterized by rapid changes and significant differences in the development and focus of various research topics [
8]. The methodologies employed by these studies tend to spotlight emerging technologies within well-known topics, thereby neglecting potential advancements in less visible topics. Applying these traditional approaches to individual topics without accounting for the interconnectedness and mutual influences among different topics might overlook essential information, consequently diminishing the accuracy of predictions. This oversight not only hampers a comprehensive understanding of technological evolution in the realm of intelligent machine tools, but also risks leading businesses and researchers to miss crucial technological opportunities.
Another segment of emerging technology prediction methods focuses on patents or scholars. These methods identify influential patents and scholars for industry guidance using network analysis, text mining, and deep learning. For example, Zhou et al. [
9] applied deep learning models to identify high-quality patents in emerging technology, based on outlier patent theory. Kong et al. [
10] used a support vector machine-based classifier to extract high-quality patents representing technological innovation. Khan et al. [
11] built an author collaboration network to examine the profound impact of relational structures among various participants in emerging technology development.
An important characteristic of emerging technologies is their community nature [
12]. Although these approaches provide some insight into emerging technologies, they often overlook the broader community context, limiting a full understanding of technological evolution. This oversight can lead to inaccurate predictions. Current research attempts to address this by using IPC classification and cluster analysis to study patent groups while considering their community aspects. For instance, Geum et al. [
13] predict emerging technologies by analyzing patent characteristics within each IPC. Kim et al. [
14] use forward citations and independent claims of clustered patent groups for prediction. These methods partially consider the community aspect of technology, but lack a deep exploration of the complex interactions and subtle relationships within these communities [
15]. In the field of intelligent machine tools, the interdisciplinary nature emphasizes the importance of community characteristics. Internal interactions and collaborations within these communities are crucial for technology development and innovation.
Given the capability of knowledge graphs to associate multi-source information, some researchers have started using knowledge graph technologies in predicting emerging technologies. For example, Lee et al. [
16] have utilized deep learning and knowledge graphs to identify emerging technologies from three data sources. There are also studies on the technological system of intelligent machine tools, such as those by Liu et al. [
17], who determined the technology system in this field through the utilization of knowledge graphs and expert interaction. This provides a foundation for combining data knowledge with domain expertise. The recent development of graph classification methods, such as Graph Convolutional Networks (GCN) [
18], offers opportunities to mine and represent the complex structural characteristics of communities.
In summary, we built an emerging technology prediction model based on knowledge graphs and graph deep learning, approached from the perspective of the technology community, and comprised of patents and papers. Initially, we gather patents and papers in the field of intelligent machine tools to construct a technological knowledge graph. Subsequently, we employ community detection algorithms and reference technology systems to cluster and divide communities, achieving a balanced analysis across the topics. Following this, we utilize a community classification model based on GCN to learn the associations between community structure, various metrics, and community development. Ultimately, this relationship model is used to predict the developmental capacity of communities over the next five years, identifying communities that are likely to exert significant impacts in the future. This enables us to forecast emerging technologies within the intelligent machine tools sector.
3. Methods
Figure 1 illustrates our proposed framework for predicting emerging technologies, which can be broken down into the following steps:
Collect patent and paper data in the field of intelligent machine tools.
Process this data to extract relationships and construct a technological knowledge graph.
Use community detection algorithms to identify technology communities, and divide them based on the technology system.
Extract early features of these communities from the knowledge graph and analyze their evolution to assess their development potential.
Build a graph deep learning model to correlate community attributes with their future growth, thereby predicting emerging technologies.
3.1. Data Collection
In our research, which aims to forecast emerging technologies in the intelligent machine tool sector, we gathered data from academic papers and patents, considering their significance in indicating theoretical advancement and practical application, respectively. The search formula is derived from the “Green Paper on Technological Innovation in Key Fields of China’s Manufacturing Industry, Technology Roadmap: 2019” [
36]. We meticulously selected the Web of Science as our core academic database, and Derwent Innovation for our patents.
3.2. Knowledge Graph Construction
3.2.1. Entities Text Representation
In this section, we describe our approach for the text representation of entities, which is crucial for subsequent analyses. We employed Sentence-BERT [
37], an extension of the well-known BERT model, specifically designed for sentence embeddings. This choice was driven by Sentence-BERT’s ability to capture contextual nuances within texts, a critical aspect for our analysis of papers and patents. The process begins with the preprocessing of text data, where we remove irrelevant information, such as non-textual elements, and standardize the text format. Subsequently, each preprocessed text entity is converted into a high-dimensional vector using Sentence-BERT. These vectors effectively encapsulate the semantic essence of the texts, enabling accurate and nuanced similarity comparisons and clustering in the later stages of our analysis. The process is shown in
Figure 2.
3.2.2. Links
In our comprehensive analysis of the intricate relationships between patents and academic papers in intelligent machine tools, we recognize the necessity to construct a technical knowledge graph. This graph is integral for unraveling the complex web of interactions that propel technological advancements. Central to this endeavor are three pivotal types of relationships: citations, publications, and textual similarities. The citation relationship highlights the mutual progression of knowledge and innovation, providing insights into the evolution of technology. The publication linkage elucidates individual contributions to knowledge creation, revealing key figures and trends within the field. Textual similarity analysis bridges academic research and technological innovations, uncovering thematic overlaps and divergences.
By integrating these relationships into a knowledge graph, we gain a comprehensive view of the technological landscape. This integration is not just about mapping connections, but about understanding the dynamics of the flow of ideas and innovation. Our approach enables the effective tracking and prediction of emerging technologies. More importantly, it facilitates community detection within this knowledge graph, thus allowing us to identify and analyze distinct clusters of interrelated technologies and research areas.
Citation and publication relationships can be extracted from the downloaded document. For the similarity relationship, we employ cosine similarity measures on text representation vectors, using the detailed process depicted in
Figure 2.
3.2.3. Knowledge Graph Representation
The process of representing a knowledge graph involves transforming the graph’s elements into vector representations in mathematical terms. This transformation allows the nodes to encompass a wider range of semantic properties, aiding subsequent classification tasks. The GATNE [
38] algorithm is particularly effective at extracting complex nodes and relationships within the knowledge graph, and it also accommodates the initial input of node attribute vectors. This feature ensures that while the nodes assimilate complex semantic information, they also preserve a significant amount of their textual characteristics. Therefore, we utilize the GATNE algorithm to represent our knowledge graph, converting each node into a 768-dimensional vector. These vectors are then stored and used as the attributes of the nodes. The principle of GATNE is shown in
Figure 3. In this figure, different colors are used to underscore the attention given to distinct features during the graph representation process. Green highlights a focus on structural features, red emphasizes attention to attribute features, and blue marks the emphasis on edge features.
3.3. Community Detection and Evolution Analysis
Aligned with Rotolo’s theory [
12], a key characteristic of emerging technologies is their formation and development within communities. Therefore, this research predicts future technologies by focusing on the communities formed around patents and papers, employing the Louvain [
31] algorithm for efficient community detection in large-scale graphs. The identification of densely linked communities is completed via the optimization of modularity.
In the intelligent machine tool sector, diverse topics garner varying interest levels. This study segments data for an in-depth, balanced analysis across these topics. Following the technology system outlined by Liu et al. [
17], we aim to investigate emerging technologies in intelligent machine tools across five topics. We employ the K-means [
39] algorithm, an established method in unsupervised learning, to categorize communities into five separate clusters, representing distinct topics within the intelligent machine tool sector. Each cluster’s vector is calculated through averaging the knowledge graph representation vectors of all patents and paper nodes within that community.
To acknowledge the faster growth rate of emerging technology communities as per the technology lifecycle curve, we analyze these communities in five-year intervals. By analyzing the growth rates of technology communities from 2013 to 2017, we can ascertain their developmental potential. Communities exhibiting high developmental potential are considered emerging technology communities, and will be key labels in subsequent model training. The formula for growth rate is as indicated in Formula (1).
where
is the growth rate,
is the total number of papers and patents published by the community between 2013 and 2017, and
is the total published between 2018 and 2022.
3.4. Train Model and Forecasting Emerging Technology
We aim to use the emerging and non-emerging technology communities identified between 2013 and 2017 as a training set for a community classification model. This model will be applied to data from 2018 to 2022 to predict emerging technologies, as illustrated in
Figure 4.
Our objective is to utilize a graph deep learning model to correlate the initial traits of communities with their potential for future development. The future developmental capacity of these communities is as outlined in
Section 3.3. Following Kong’s research [
40], we have identified several indicators for community nodes. ‘Prior Knowledge’ [
41] signifies the depth of research grounded in existing literature—a robust indicator of a field’s maturity and focus. ‘Technology Cycle Time’ [
42] provides insight into the relevance and currency of the community’s knowledge, which is vital in rapidly advancing tech sectors. ‘Collaboration’ [
43] underscores the importance of cooperative efforts for innovation and knowledge sharing.
To enhance our model, we have integrated new indicators like ‘Betweenness Centrality’ and ‘Degree Centrality’, which provide perspective regarding a node’s influence and connectivity within the knowledge network, thus denoting any potential for knowledge spread and impact. ‘Total Knowledge Contributions’ illustrate the community’s research output and innovative strength, indicative of its dynamism and productivity. ‘Scope and Coverage’ measure the community’s participation and impact on external knowledge areas, reflecting its reach and interdisciplinary nature.
Collectively, these indicators are instrumental in providing a holistic understanding and prediction of the future development of technological communities through the encapsulation of both the internal and external factors that drive technological evolution.
Each node’s features are derived from both its own characteristics and those of its community. In total, 14 indicators have been established to capture the early features of community nodes, which are key for assessing a community’s development potential, as shown in
Table 1.
The model inputs are communities, which can also be understood as graphs. Specifically, these include the nodes within the community, their connections, and the attributes of each node, which are defined by 14 indicators. This comprehensive input captures both the structural and individual characteristics of each community, forming the basis for the model’s analysis and predictions. The output identifies whether a community represents an emerging technology. Communities are labeled as emerging technologies if they show high developmental potential; otherwise, they are labeled as non-emerging.
The model’s architecture includes graph convolutional layers, graph pooling layers, and fully connected layers, all specially designed to extract the structural and attribute features of the graph. The graph convolutional layers, utilizing GCN principles, accumulate features from neighboring nodes. This enables nodes to learn from both their own attributes and their position within the wider network. Graph pooling layers reduce the node count while retaining key structural information, thus aiding in computational efficiency. They focus on the graph’s most critical nodes and structures for more effective feature extraction. Finally, the fully connected layer merges features from the convolutional and pooling layers, culminating in the final classification of the graph. This layer captures intricate relationships between various features, facilitating the prediction of emerging patterns and trends within the graph data.
5. Conclusions
This study introduces a systematic framework to forecast emerging technologies in the intelligent machine tool domain. This is achieved via community detection, evolutionary analysis methods, and graph deep learning models. The central tenet of the framework is a GCN-based community classification model, tailored to establish the relationship between the development potential of technological communities and the 14 chosen static indicators. Moreover, the framework conducts a balanced analysis of various prominent topics by clustering communities in accordance with the technology system. Our results underscore the framework’s efficacy, evidenced by the classification model’s 81% accuracy rate. When applied to the latest data, this study precisely identifies five emerging technologies within the extensive community of the intelligent machine tool domain. These encompass intelligent control systems, intelligent applications, intelligent data-driven manufacturing, intelligent precision machining, and structural optimization. Additionally, the study provides detailed emerging keywords within each technological direction, thus offering a granular view of the innovative aspects driving these fields forward.
This research delivers two principal contributions. Firstly, from an academic perspective, this study proposes a novel framework which utilizes machine learning and other quantitative analysis methods to forecast emerging technologies in the intelligent machine tools sector. These quantitative approaches guarantee the framework’s objectivity and credibility, with its effectiveness being empirically validated. This framework can be customized and applied by researchers to meet their specific objectives. Within this framework, the research segments the field based on the technological systems of intelligent machine tools, identifying emerging technologies across diverse topics, thus enhancing the comprehensiveness of forecasts. The core of this framework is the analysis of the evolution of technological communities. This begins with the establishing of a comprehensive set of indicators that capture both the internal and external factors that influence the development of technology communities. Subsequently, a graph classification model is developed to accurately model the complex relationships between these factors and the evolution of communities, greatly enhancing the accuracy of the predictions.
Secondly, from a practical perspective, the framework proposed in this research efficiently analyzes and forecasts emerging technologies in the field of intelligent machine tools within acceptable ranges of both time and cost. The insights into forthcoming trends from these predictions enable industry professionals to strategically align their planning and resources towards areas which are ripe for growth and innovation. This foresight is essential for sustaining a competitive advantage and advancing the industry. Moreover, this supports governments and businesses in making informed decisions and formulating strategic plans, thereby accelerating intelligent and rapid development in the intelligent machine tool sector.
While this study makes meaningful contributions, it is not without its limitations, which future research will need to address. The proposed quantitative analysis framework, effective in ensuring objectivity and credibility, could be further improved by integrating qualitative analysis to enhance the robustness of the findings. Future efforts could gain from integrating expert knowledge into labeling technology communities, either through manual expert reviews of labels, or by developing a label library from the emerging technologies that have already been recognized and documented by experts. Moreover, the study primarily relies on analyzing technical literature to forecast emerging technologies. However, the advent of emerging technologies is influenced by a broad array of factors. Incorporating multidimensional data, including funding, policy, supply chain data [
52], and market knowledge [
53], could uncover new insights, albeit at the risk of overshadowing some forecasts derived from the technical literature.