The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures

Oumoussa, Idris; Saidi, Rajaa

doi:10.3390/computers14040133

Open AccessReview

The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures

by

Idris Oumoussa

^*

and

Rajaa Saidi

SI2M Laboratory, National Institute of Statistics and Applied Economics (INSEA), B.P. 6217 Rabat-Instituts, Rabat 10112, Morocco

^*

Author to whom correspondence should be addressed.

Computers 2025, 14(4), 133; https://doi.org/10.3390/computers14040133

Submission received: 27 February 2025 / Revised: 19 March 2025 / Accepted: 2 April 2025 / Published: 5 April 2025

(This article belongs to the Special Issue Best Practices, Challenges and Opportunities in Software Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

The Microservice Architecture Style (MSA) has emerged as a significant computing paradigm in software engineering, with companies increasingly restructuring their monolithic systems to enhance digital performance and competitiveness. However, the migration process, particularly the microservice identification phase, presents complex challenges that require careful consideration. This study aimed to provide developers and researchers with a practical roadmap for microservice identification during legacy system migration while highlighting crucial migration steps and research requirements. Through a systematic mapping study following Kitchenham and Petersen’s guidelines, we analyzed various microservice identification approaches and developed a middleweight ontology that can be queried for key inputs, data modeling, identification algorithms, and performance evaluation metrics. Our research makes several significant contributions: a comprehensive analysis of existing identification methodologies, a multi-dimensional framework for categorizing and evaluating approaches, an examination of current research trajectories and literature gaps, an ontological framework specifically designed for microservice identification, and an outline of pressing challenges and future research directions. The study concluded that microservice identification remains a significant barrier in system migration efforts, highlighting the need for more research focused on developing effective identification techniques that consider various aspects, including roles and dependencies within a microservice architecture. This comprehensive analysis provides valuable insights for professionals and researchers working on microservice migration projects.

Keywords:

microservices; microservice identification; monolith application decomposition; monolith-to-microservice migration; systematic mapping

1. Introduction

System complexity, sizes, and costs are all increasing as a result of the fast rise in demand and the adoption of advanced technologies in today’s society [1]. Additionally, rivalries force many businesses to undertake rapid system updates while also ensuring that their systems remain fully operational. This necessitates the use of proper concepts, designs, and process models. These demands can be successfully addressed by several paradigms in software engineering, which break down traditional monolithic applications into a set of fine-grained services that can be developed, tested, and deployed independently [2,3].

Software maintenance is a critical stage in the software life cycle. Numerous companies nowadays invest large efforts in correcting, adapting, augmenting, and restructuring their existing programs, particularly in the presence of monolithic systems. This type of system is widely regarded as a key and critical component of many businesses (especially banks and insurance companies), particularly given that these systems—often legacy systems—have demonstrated a certain level of efficiency in executing complicated and significant business logic over a long period of time [4]. On the other hand, monolithic systems have proven increasingly challenging to operate due to the tight connection between their internal components. Changing the functionality in one module frequently necessitates modifications in numerous other modules, increasing the work and attention necessary for development [5,6].

Microservice-based architectures are viewed as an excellent candidate for modernizing such systems, particularly because they enable the development of complex and inter-organizational applications through the integration of reusable, relatively autonomous, generally heterogeneous, and distributed microservices [7,8]. Recently, there has been a tremendous change in the approach used to develop and deliver applications or services. The MSA [1] has gained a foothold in the software development industry and has become one of the latest architectural trends in software engineering, with more and more cloud computing applications starting to adopt microservices [9,10].

However, migrating monolithic applications to microservices is a complicated process that requires the consideration of a number of aspects, including the migration approach used, the method used to identify microservices, and the quality of the discovered services [11]. The identification of microservices is regarded as the most significant and time-consuming stage in the process of migration, all the more so because the new architecture’s quality and resilience are contingent on the quality of the identified services. The identification of microservices entails finding modules of an existing system that may be encapsulated and represented as services [5].

In this paper, we undertake a systematic mapping analysis to discover the migration processes from monolithic applications to determine the primary approaches for supporting the migration process, specifically the microservice identification phase. We thoroughly explore recent studies addressing the identification of microservices in the context of shifting from a monolithic design to one based on microservices. We use a detailed methodology to extract, analyze, and categorize the reported techniques to identify microservices. The following are the study’s main purposes:

Identify the most relevant challenges and levels of automation in microservice identification.
Point out the strategies, approaches, metrics, and inputs used for microservice identification.
Determine the set algorithms and tools employed, along with trends and emerging research directions.
End up with a middleweight ontology for microservice identification.

The remainder of this article is organized as follows: The second section presents an overview of the methodology and approaches utilized in this study, including the identification of microservices and the systematic mapping development procedure. Section 3 provides an introduction to and discussion of related works, while Section 4 describes our study approach. Section 5 shows and examines the mapping findings; Section 6 discusses the intended ontology for microservice identification; Section 7 highlights potential research gaps; threats to validity are described in Section 8; and Section 9 wraps up the study.

2. Foundations

2.1. Microservice Architectures

Microservices [1] are a kind of architecture that evolved from service-oriented architectures (SOAs) [12]. The core concept is to organize systems through the use of tiny autonomous building components that communicate purely via message passing. These components are referred to as microservices; each microservice implements a single task from the application’s logical domain [1]. The new design is different from monolithic systems and traditional service-oriented architectures in one important way. It places more emphasis on the independence, scalability, and semantic cohesion of each component of the system [2]. As a result, microservices may be built in a variety of programming languages, utilize a variety of databases, and be tested independently of the rest of the system’s structure. This is due to their extremely effective loose coupling [13]. Microservices use lightweight communication protocols like HTTP and can also communicate in an indirect manner using message brokers (see Figure 1). Microservices may be deployed through a network on a variety of different execution platforms or lightweight containers [14]. Containers are great for microservice deployment due to being easy to use, inexpensive, and rapid to start and run.

In terms of software quality attributes, implementing microservices promotes the interoperability and reusability of complex software systems, facilitates scalability, and improves maintainability [2]. Microservices may be readily launched, duplicated, replaced, substituted, and removed independently of the rest of the system using appropriate distributed technologies and strategies. Additionally, by creating a single business feature per microservice, they can be applicable to a wide variety of implementations [15]. Adopting microservices necessitates either completely rebuilding programs, which is frequently time-consuming and error-prone, or contemplating application re-engineering (i.e., lifting—no paradigm change) or migrating from one paradigm to another (i.e., shifting—paradigm shift).

Microservices are gaining traction and acceptance in business. Several companies—Netflix, Amazon and Spotify—have succeeded in creating microservice architectures [10]. Specifically, microservices are gaining popularity in business as a result of their benefits [16]. Although the benefits of using microservice architectures in designing complex systems are numerous, migrating from a monolithic architecture to a microservice architecture is not always straightforward [5]. However, identifying microservices continues to be a significant difficulty, jeopardizing the effectiveness of this sort of migration. As a result, it is critical to detect and verify current trends in order to overcome migration obstacles, which is the purpose of this study.

2.2. From Monolithic to Microservices

Monolithic systems pose maintenance challenges due to their tightly interdependent components. Microservices offer a solution for modernization, enabling the development of complex applications using reusable, distributed microservices. However, migrating to microservices is a complex process involving migration procedures, microservice extraction techniques, and service quality [17].

Identifying microservices is a critical phase in the migration process, as the new architecture’s quality relies on the selected services. These services must be granular to facilitate change management, maintenance, and reuse [18]. Identifying them requires a systematic approach involving the examination and breakdown of the current system into functional units, defining module context boundaries, and using static and dynamic analysis. Identifying microservices is vital yet complex, requiring a methodical approach and potential assistance from automatic detection techniques to simplify the process.

3. Related Work

The research community has shown significant interest in exploring microservice architectures in recent years. A comprehensive literature analysis by Francesco et al. [7] examined emerging patterns in publications, key research directions, and the feasibility of implementing microservices in industrial settings. Building on this research foundation, Alshuqayran et al. [19] conducted a systematic review investigating the primary architectural obstacles encountered when implementing microservice-based systems.

Numerous studies have explored the motivations, advantages, and disadvantages associated with microservice migration. Christoforou et al. [20] conducted a systematic literature review to identify migration drivers, which were subsequently validated by domain experts, culminating in the development of a node-based decision support system to facilitate microservice migration. In parallel research, Kalske et al. [21] investigated the rationale behind organizations’ decisions to transition from monolithic systems to microservices, categorizing the encountered challenges based on technical and organizational dimensions. Wolfart et al. [22] analyzed the literature to determine organizational motivations for modernizing monolithic systems, documenting common activities and their associated inputs and outputs. Additionally, Soldani et al. [14] examined gray literature to identify the technical and operational benefits and drawbacks of microservices, encompassing architectural design considerations, security measures, testing protocols, storage solutions, management practices, resource utilization, and monitoring requirements within microservice architectures.

Abgaz et al. [10] introduced a comprehensive “Monolith to Microservices Decomposition Framework” that outlines the principal decomposition stages and components while evaluating current decomposition methodologies, the available tools, relevant metrics, and applicable datasets. In a similar vein, Oumoussa et al. [8] conducted a literature analysis examining the progression of decomposition techniques, documenting methodologies and tools developed to address migration challenges, and identifying both promising approaches and their inherent limitations. While our previous work [8] focused on the historical evolution of microservice identification techniques through a systematic literature review, it did not provide an ontological classification framework for practitioners to systematically select appropriate identification techniques based on their specific context and requirements. This limitation represents a significant gap that our current study addresses through a systematic mapping study approach.

Research examining migration to microservices has primarily concentrated on migration methodologies and the driving factors behind adopting microservice architectures. Razzaq et al. [9] conducted a literature analysis examining publication trends, research contexts, focus areas, migration approaches, challenges, success factors, and the industrial adoption potential regarding transitioning from monolithic to microservice systems. Regarding the migration process itself, Fritzsch et al. [5] explored the decomposition of a monolithic system into smaller services. Given its complexity and significance, this decomposition represents a critical migration milestone. Their systematic review identified ten primary techniques for microservice identification in the literature, culminating in a decision framework to guide migration efforts by selecting appropriate techniques for specific scenarios. Similarly, Ponce et al. [23] performed a rapid review that categorized migration approaches into three groups: model-driven, static analysis, and dynamic code analysis. Kazanavičius et al. [24] further evaluated various migration strategies, examining both advantages and limitations. Additionally, Saucedo et al. [25] synthesized existing research on migration case studies, identification methods, the available tools, motivating elements, obstacles, advantages, and both migration processes and techniques.

Similarly, Fritzsch et al. [26] examined industry-adopted migration processes, highlighting the absence of semi-automated migration support mechanisms. Their research revealed that organizations predominantly rely on unsystematic approaches or manual functional decomposition methods for migration implementation. Addressing this gap, Lapuz et al. [27] compiled a collection of dynamic data collection instruments from the existing literature that were utilized in or could potentially support monolithic-to-microservice migration through dynamic analysis. Taking a different approach, Mparmpoutis et al. [28] analyzed research utilizing data-driven artifacts from legacy systems to identify potential microservice candidates. Additionally, Di Francesco et al. [29] conducted an industry-focused survey to characterize practitioners’ activities and challenges during microservice migration. Their findings revealed several prevalent pre-migration activities, including domain decomposition, service identification, the implementation of domain-driven design practices, and system decomposition.

Finally, Velepucha et al. [30] concentrated their research on documenting migration-related issues and obstacles found in the academic literature. Building on this research foundation, Luz et al. [31] documented the motivations, advantages, and challenges encountered during the transition from a monolithic enterprise architecture to microservice-based systems across three Brazilian Government Institutions.

Despite the valuable contributions of these studies, several critical research gaps remain unaddressed. First, existing studies lack a formal ontological framework that can be queried to match specific project requirements with appropriate identification techniques. Second, while previous studies have cataloged identification approaches, they have not provided a multi-dimensional evaluation framework that considers input types, data modeling approaches, identification algorithms, and performance metrics in an integrated manner. Third, the majority of existing studies focus on theoretical aspects without providing practical guidance for implementing identification techniques in real-world scenarios.

This research distinguishes itself by conducting a methodical analysis and synthesis of microservice identification methodologies, including evaluation metrics, tools, and implementation approaches. The analysis encompasses key inputs, procedural steps, and automation levels utilized during system decomposition, while also addressing associated challenges, benefits, and emerging developments highlighted in recent research. Through careful differentiation between identification methodologies, technical implementations, and their integration with domain expertise, this research delivers practical insights for practitioners developing novel approaches. Unlike previous studies that have primarily offered descriptive reviews, our work provides an actionable framework through the development of a middleweight ontology for microservice identification, which establishes formal definitions of core concepts, interconnections, and assessment criteria. This ontological approach enables developers to systematically query appropriate identification techniques based on their specific contexts, available inputs, and desired outcomes, addressing a significant gap in the current literature. Our systematic mapping study complements and extends our previous work [8] by shifting the focus from historical evolution to practical implementation guidance, offering a structured methodology for evaluating and selecting identification approaches in real-world migration scenarios.

4. Methodology

We conducted this study using a systematic mapping study (SMS) methodology, following the established guidelines of Kitchenham and Petersen [1,12]. The systematic mapping process, adapted from [32,33], is illustrated in Figure 2, comprising three core phases: planning, conducting, and reporting. In accordance with SMS guidelines [33,34], the quality assessment phase was considered optional and is therefore represented as a dashed-line box in the figure to ensure methodological transparency while maintaining flexibility in our approach. The following subsections provide a detailed explanation of how we applied this systematic process in our investigation of microservice identification techniques.

4.1. Research Objectives

The adoption of microservice architectures for migrating monolithic systems raises important questions on the challenges associated with identifying microservices within distributed and highly dynamic environments. In this study, we aimed to identify the various strategies, techniques, and tools used to identify microservices and how they interact within a microservice architecture. Moreover, we aimed to collect and document all known approaches and methods for microservice identification and organize them into easily manageable, reliable, and extensible frameworks such as ontologies. This will enable (1) researchers and practitioners to have an overview of the identification methods that could be applied to their existing or future systems and (2) share experiences of how to successfully identify microservices in real-world scenarios. To sum up, the objectives of this research can be reformulated as follows:

O1. Identify and categorize the strategies (high-level approaches) and tools (software applications) used for microservice identification within microservice architectures.
O2. Recognize and document the methods (systematic procedures) and techniques (specific algorithms or processes) employed for microservice identification, along with the tools that support them.
O3. Analyze the validation techniques and tools used to assess the effectiveness of microservice identification.
O4. Develop and share a middleweight ontology to support microservice identification in distributed systems.

4.2. Research Questions

This study aimed to identify a set of microservice identification techniques and how they might be utilized to facilitate the migration of legacy systems to a microservice architecture. Thus, we developed our research questions in light of the objectives of our study and in accordance with Kuhrmann et al.’s [35] criteria. This study was conducted with five main questions in mind. These questions are described in Table 1 with their corresponding motivation and related objectives.

4.3. Literature Repository Selection

The search string employed in this investigation was intended to be simple and generic. As suggested by Petticrew and Roberts in [36], it was generated using search phrases pertaining to the population, intervention, and context. The population referred to the application domain of microservices and microservice architectures, where identification and decomposition were employed as interventions. The context represented the original systems being transformed, specifically monolithic, existing, or legacy applications. Consequently, the final search string used was as follows:

(“ monolith ” OR “ existing ” OR “ legacy ”) AND (“ microservices ” OR “ micro - services ” OR “ microservice architectures ”) AND (“ identification ” OR “ decomposition ” OR “ extraction ”)

To obtain relevant studies, we followed Kuhrmann et al.’s [35] instructions. The following technical publications provided our initial selection of papers:

ACM Digital Library;
Elsevier ScienceDirect;
IEEE Xplore Digital Library;
Springer Online Library;
Wiley Online Library.

We additionally supplemented our article selection by conducting a search in the Google Scholar database using “microservices identification” as an exact search phrase and analyzing the results. To prevent us overlooking significant research, we manually mined the references of each of the publications in our first selection, employing Wohllin’s [37,38] advised forward and backward snowballing (i.e., using Google Scholar to search for citations to and in a particular publication), to locate referenced articles that were found to report work within the realm of microservice identification based on their abstracts.

4.4. Selection Examination Procedure

The collection of publications retrieved through an algorithmic search underwent two steps of vetting. In the initial phase, abstracts and titles were analyzed for relevancy by reading them. In the following round, entire papers were reviewed to determine whether they met our criteria for inclusion. The list of all the articles was evaluated independently by each author; opinions were exchanged and disagreements were resolved. While determining whether to include or remove articles, the authors additionally conducted a second review of any publications discovered through snowballing.

4.5. Inclusion and Exclusion Criteria

Establishing a stringent set of inclusion and exclusion criteria decreased the quantity of publications retrieved from online academic libraries. Only articles from journals and conferences that had undergone peer review were included in this study. The automated search encompassed all articles published since 2016, including early releases. Given that there was no unanimity on the concept of microservice identification prior to 2016 and the work of Ahmadvand and Ibrahim [39], the term is a relatively new concept in software development; hence, the year 2017 was chosen as the starting point. Only work published in English that addressed microservice identification or legacy system migration to microservices was considered. Table 2 and Table 3 present the comprehensive set of approved inclusion and exclusion criteria, respectively. In Table 2, “earlier works” refers to foundational studies that contributed to the development of concepts and techniques relevant to microservice identification, even if they did not explicitly use the term “microservices” at the time. Similarly, “including early releases” refers to studies published in early-access formats (e.g., preprints or preliminary versions in conference proceedings) that were later finalized in journal or extended conference publications. As shown in Table 3, we excluded review articles, surveys, or supplementary research on the topic of identifying microservices or microservice architectures, as these types of studies typically summarize existing work rather than propose new methodologies or frameworks for microservice identification.

4.6. Data Extraction Process

As shown in Table 4, a data extraction form was constructed in accordance with Petersen et al.’s [33] recommendations. Each paper’s information, such as its publication year, source, and type, were given. In addition, a collection of data necessary for our investigation was extracted. Included were the principal input addressed by the study, data modeling, identification algorithm, and performance evaluation measures.

4.7. Overview of Primary Studies

In answer to research questions RQ1 and RQ2, we discovered a lack of agreement on precise taxonomies for microservice identification methodologies, which impeded the accurate mapping of the selected studies to relevant and unique categories. Furthermore, due to the variety of applications, targeted platforms, and verification and validation procedures used in the selected studies, it was critical to categorize these studies suitably in line with research questions RQ3, RQ4, and RQ5. In addition, we used grounded theory [40] as a supplement to build missing categories from the retrieved data items. To discover categories and their relationships to pre-existing categories, we used open coding and selective coding strategies. Grounded theory was employed in this study in an iterative manner, with categories and subcategories being adjusted in each iteration until they reached a condition of stability.

5. The Outcomes of the Mapping

In this section, we will elucidate and provide a comprehensive account of the findings from the mapping study, addressing the five research questions outlined in Section 3.

5.1. Overview of Selected Research

The search process was carried out in August 2024, resulting in the identification of 45 unique papers that were published from 2016 onwards. The formulated query was employed across the chosen libraries, and Table 5 illustrates the count of papers retrieved from each of these libraries.

After gathering the 1067 papers retrieved from various search engines, duplicates were eliminated, reducing the number to 1065. Upon screening the titles and abstracts of the remaining papers, 1015 were deemed irrelevant and excluded. Following a thorough review against the inclusion and exclusion criteria, only 37 papers were deemed suitable for inclusion. An additional eight papers were identified using recursive backward and forward snowballing techniques. Two rounds of snowballing were conducted to reach a state of stability. In the first round, five new papers were incorporated, while three more were added in the second round. The number of included and excluded papers for each phase is presented in Figure 3 using a PRISMA flowchart.

Figure 4 illustrates the distribution of the selected studies based on their publication year and source. Notably, despite the initial introduction of the MSA in 2016, interest in microservice identification and microservice architectures gained significant traction in 2020 and beyond. Furthermore, Figure 4 reveals that IEEE Xplorer and Springer were the primary sources for publications in this domain. Table 6 provides a comprehensive overview of the selected studies, including their publication year, publication type, and acquisition method.

To assess the significance of our primary investigations on microservice identification, we employed word clouds as recommended in [35]. Figure 5 displays the most prevalent terms in titles and abstracts, such as microservice, architecture, and service. Terms related to identification methods like clustering, classification, and similarity were also common. The word cloud underscores the importance of the breakdown process in identifying microservices and the need for effective extraction and classification procedures.

5.2. Challenges in Microservice Identification (RQ1)

Microservice identification, as a pivotal phase in migrating from monolithic to microservice architectures, brings unique challenges that threaten the quality, scalability, and maintainability of the resulting system. These challenges may stem from structural issues within the monolith, runtime complexities, or the inherent trade-offs in service design. To achieve effective decomposition, all challenges, regardless of their origin, need to be identified and addressed. In this study, we explored the focus of existing research with respect to the source of challenges (structural, runtime, or hybrid). Figure 6 depicts the distribution of the selected studies regarding the identified sources of challenges. The results indicate that 59% (27 studies) of primary studies focused on structural challenges, 22% (10 studies) addressed runtime complexities, and 19% (8 studies) considered hybrid challenges. This highlights a predominant focus on static structural issues, with less emphasis on runtime or combined approaches.

Given the diverse perspectives on microservice identification challenges and the lack of a unified taxonomy, we adopted a classification based on the targets of these challenges. Accordingly, challenges in microservice identification can be classified into the following:

Granularity Challenges: Issues related to defining service boundaries that balance granularity with maintainability and operational efficiency. Overly fine-grained services may lead to a high communication overhead, while coarse-grained services risk reintroducing monolithic dependencies.
Dependency Challenges: Addressing tight coupling and interdependencies within monolithic systems that make it difficult to isolate functionalities for effective service decomposition.
Domain Modeling Challenges: Ensuring semantic cohesion and domain alignment in the identified services, which requires comprehensive domain knowledge and precise business process understanding.
Data Distribution Challenges: Managing the decomposition of centralized monolithic databases into distributed microservices while preserving data integrity and consistency.

Table 7 shows the categorization of microservice identification challenges addressed by the primary studies. The results reveal that granularity and dependency challenges are the most treated and studied issues, while domain modeling and data distribution challenges receive comparatively less attention. Furthermore, hybrid approaches that combine structural and runtime perspectives are emerging but remain underexplored.

5.3. Categorization and Performance of Microservice Identification Approaches (RQ2)

Given the variety of proposed solutions in the field of microservice identification, we classified the microservice identification approaches addressed in the primary studies according to the nature of their methodologies and classification strategies as follows:

General identification strategies: Approaches using conventional methods to identify microservices, such as heuristic-based techniques or guidelines for selecting appropriate tools, languages, or technologies for microservice identification.
Framework-based solutions: Architectural frameworks for microservices that incorporate predefined modules or patterns for identifying and managing microservices, including service decomposition, interface identification, and microservice dependencies.
Technique-based solutions: New techniques or adaptations of existing methods from other domains applied specifically for the identification of microservices. These include machine learning and artificial intelligence approaches, such as clustering, classification, and deep learning models used to identify service boundaries and dependencies.
Tool-based solutions: Tools specifically designed for the identification of microservices in existing systems or for guiding the design of microservice architectures.
Algorithm-based solutions: New algorithms proposed for the automatic or semi-automatic identification of microservices, utilizing data analysis, clustering, or other optimization techniques to define microservice boundaries.
Protocol-based solutions: New communication protocols or methods for identifying interactions between microservices, ensuring efficient integration and service identification.
Analysis-based solutions: Studies focusing on the comparison, evaluation, or experimentation of existing microservice identification methodologies, highlighting their benefits, limitations, and performance in different contexts.

Our investigation (see Figure 7) revealed that 34% of the studies focused on new techniques for identifying microservices, while 31% proposed framework-based solutions and 13% emphasized general identification strategies. A smaller proportion of studies developed new tools (10%), algorithms (5%), or protocols (4%). Notably, 3% of the studies focused on the analysis of existing identification approaches. The authors of P20 analyzed existing identification approaches and proposed a refined framework based on insights derived from their analysis. The proposed solutions for identifying microservices can be classified into the following key categories based on their focus:

Service Decomposition: Techniques for breaking down monolithic applications into distinct, manageable microservices based on business logic or domain-driven design.
Interface Identification: Approaches aimed at identifying the interactions and communication patterns between services, which are crucial for defining microservice boundaries.
Dependency Mapping: Methods that focus on identifying service dependencies and interactions, often visualized through dependency graphs or matrices to clearly delineate service boundaries.
Performance Evaluation: Techniques that assess the effectiveness and performance of microservice identification approaches, ensuring that the identified services align with the performance goals of the architecture.
Adaptation and Refinement: Approaches that allow for the ongoing refinement of microservice identification as the system evolves or as new requirements emerge.

Table 8 presents a classification of the proposed microservice identification solutions, showing the proportion of studies in each category. The results indicate that a significant emphasis is placed on service decomposition (40%), interface identification (24.9%), and dependency mapping (15.5%), with less attention given to performance evaluation (13.4%) and adaptation and refinement (11.1%).

5.4. Key Inputs, Data Modeling, and Evaluation Metrics in Microservice Identification (RQ3)

Microservices identification relies on diverse inputs, sophisticated data modeling techniques, and precise evaluation metrics to ensure the effective and accurate decomposition of monolithic systems. The selection of inputs and the chosen modeling approach significantly influence the quality of the identified microservices, while evaluation metrics provide a means to measure the success of these efforts. In this study, we examined the focus of existing approaches with respect to the types of inputs, data modeling methods, and evaluation metrics employed.

The inputs used in microservice identification are varied and include both business and technical artifacts. Business-oriented inputs, such as process models, user stories, and functional or non-functional requirements, provide high-level guidance for defining microservices that align with organizational goals. Technical inputs, including the source code, execution logs, database schemas, and OpenAPI specifications, offer detailed insights into system operations and interactions. These inputs are critical for capturing the structural and behavioral aspects of the system under consideration.

Data modeling in microservice identification is predominantly based on graph or relational data representations. Graph-based approaches represent systems as nodes and edges, where nodes correspond to entities like classes, modules, or functions and edges depict their interactions or dependencies. Partitioning algorithms are then applied to identify cohesive clusters that can serve as microservices. Examples include semantic clustering, syntactic clustering, and knowledge graphs. Relational models, on the other hand, such as data flow matrices, focus on quantifying relationships between components, employing matrices to quantify the relationships between system components, with matrix factorization techniques employed to deduce independent service boundaries. Both approaches aim to balance granularity, cohesion, and independence in the identified services.

Evaluation metrics play an essential role in validating the effectiveness of microservice identification techniques. Metrics such as cohesion and coupling assess the internal consistency of services and their independence from one another. The precision and recall measure the alignment of the identified services with a predefined ground truth, while modularity indices evaluate the overall quality of the decomposition. Performance metrics, including the execution time and resource usage, are particularly relevant for approaches applied to large-scale systems. These metrics collectively provide a robust framework for assessing the feasibility and reliability of proposed identification methods.

Table 9 shows the distribution of approaches proposed by the primary studies across the different input types, data modeling techniques, and evaluation metrics. This study revealed that much emphasis is placed on graph-based and relational modeling techniques, which dominate data representation approaches, and on technical inputs such as the source code, execution traces, and API specifications. In contrast, less attention is given to business-oriented inputs, including BPMN diagrams, user stories, and transactional contexts, or to hybrid evaluation metrics that combine functional and non-functional assessments.

5.5. Automation in Microservice Identification Approaches (RQ4)

The reviewed studies indicate significant advancements in automation for microservice identification, with approaches varying in their degree of automation. These methods can be broadly categorized into manual, semi-automated, and fully automated techniques. As shown in Table 10, semi-automated techniques dominate the landscape, accounting for 62.22% of the proposed methods. These approaches leverage a combination of automated tools and expert validation to achieve accurate service decomposition. Examples include tools that generate dependency graphs or runtime logs, requiring human intervention to refine the resulting service candidates.

Fully automated techniques, constituting 33.33% of the studies, rely on advanced algorithms such as clustering, graph-based partitioning, machine learning models, or Natural Language Processing (NLP) to identify microservices with minimal human intervention. These methods are particularly effective for large-scale systems where manual processing is infeasible. However, challenges remain in ensuring the interpretability and trustworthiness of the results.

Manual approaches, which represented 4.44% of the studies, are rarely employed in niche contexts or when the data availability is limited. These techniques rely heavily on expert judgment and domain knowledge to identify microservices through processes such as domain-driven design or functional analysis.

5.6. Trends and Emerging Research Directions in Microservice Identification (RQ5)

The reviewed studies reveal key trends and emerging research directions in the field of microservice identification. Automation continues to dominate as a critical focus area, with significant efforts directed toward fully automating the identification process. Researchers are also exploring diverse methodologies to address the complexities of modern software systems and to adapt microservice architectures to meet evolving demands. We determined the trends and research directions:

Automation: Studies like P5 and P38 emphasized the application of advanced algorithms, including the semantic analysis of application specifications and machine learning techniques, to streamline microservice identification and ensure loose coupling and high cohesion.
Service Mesh Technologies: Tools such as Istio and Linkerd are increasingly being adopted to manage communication between microservices. Studies such as P13 and P39 highlighted their role in providing advanced routing, monitoring, and security features essential for scaling complex microservice architectures.
Edge Computing: The integration of microservices with edge computing, as explored in P16 and P27, enhances responsiveness by processing data closer to their source. This trend is particularly significant for IoT systems and real-time applications where latency reduction is critical.
AI-Driven Solutions: Artificial intelligence plays an essential role in microservice management. Studies such as P8 and P14 highlighted the use of AI algorithms for tasks such as anomaly detection, resource allocation, predictive monitoring, and intelligent fault recovery.
Low-Code/No-Code Platforms: P18 demonstrated how these platforms are enabling rapid application development within microservice frameworks, empowering non-technical users and reducing the time to market for new solutions.
Enhanced Observability: Distributed systems require robust observability tools. Studies like 11 and P13 emphasized the development of AI-powered observability solutions that provide real-time insights into the system performance and offer end-to-end tracing for addressing bottlenecks.
Decomposition Techniques: Research on defining optimal service boundaries is ongoing. Studies such as P31 and P45 explored approaches like graph-based partitioning and domain-driven design to ensure microservices were cohesive and independent.
Migration Strategies: Frameworks for transitioning from monolithic systems to microservices, as seen in P3 and P10, address challenges in software re-engineering and offer systematic methods for decomposition.
Performance Analysis: Techniques for evaluating the latency, throughput, and resource utilization are becoming increasingly important. Studies like P20 and P26 focused on understanding the performance trade-offs associated with different microservice configurations.
Standardized Metrics: Despite the variety of proposed metrics, a unified framework for evaluating microservice identification techniques is still lacking. Studies such as P20 and P24 proposed cohesion, coupling, and modularity metrics, but further standardization is needed.
Industry Benchmarks: Establishing benchmarks using real-world datasets and open-source systems is critical for validating microservice identification approaches. P12 and P35 advocated for these efforts to ensure practical applicability and foster collaboration across the community.

Based on the analysis of research directions in microservice identification, Table 11 shows that automation emerged as the predominant focus of the reviewed studies, closely followed by decomposition techniques. The significant attention to AI-driven solutions demonstrates the field’s increasing emphasis on artificial intelligence applications. Migration strategies and performance analysis are also widely explored, which indicates a focus on optimizing and improving the migration and performance aspects of microservices. Conversely, service mesh technologies and standardized metrics are among the least-addressed areas, demonstrating a moderate but consistent research interest. The data reveal balanced attention across several areas, with edge computing, enhanced observability, low-code/no-code platforms, and industry benchmarks each comprising 9% of the research landscape. This distribution suggests a comprehensive approach to microservice identification rather than significant research gaps.

A notable pattern emerged in the interconnected nature of these research directions, with many studies incorporating multiple approaches. For instance, several papers appeared across different categories, such as combining AI-driven solutions with decomposition techniques. This integration suggests a holistic approach to addressing the complexities of microservice identification. While studies like P2 and P20 focused on automation, they also contributed to other areas such as low-code platforms and standardized metrics, indicating a broader scope than initially suggested. It is important to note that the studies listed in the table may span multiple research directions, leading to some duplication. As a result, the total number of unique studies (45) is not fully represented in the distribution percentages of each category. This reflects the multidisciplinary nature of the field and the tendency for research to address overlapping challenges in microservice identification.

6. An Ontology for Microservice Identification

Making the results of our study practical and extendable, and considering the static nature of taxonomies, we propose an ontology-based representation of our findings. To the best of our knowledge, this work is the first to propose an ontology for microservice identification. This approach formalizes the relationships among key concepts and entities within the microservice identification domain, enabling the creation of a reusable, extensible framework for researchers and practitioners. The ontology models the relationships between system artifacts, decomposition techniques, evaluation metrics, microservice candidates, domain knowledge, and validation methods. Figure 8 illustrates an excerpt of the ontology structure derived from this study. While the complete ontology is too extensive to display in its entirety, the figure shows its core structural organization while still capturing the key structural elements.

The ontology was developed using Protégé (https://protege.stanford.edu/, accessed on 5 February 2025), and its consistency and coherence were validated using the Protégé Debugger option. The ontology organizes domain knowledge into key concepts that collectively represent the microservice identification process. The main concepts of the ontology are as follows:

Studies: Encompass the 45 selected research studies (P1 to P45). Each study in this class contains essential information, including its title, DOI, automation degree, challenges, and other relevant attributes.
SystemArtifact: Represents the primary inputs required for microservice identification, including runtime logs, the source code, architectural models, and business process descriptions.
DecompositionTechnique: Defines the techniques used for service decomposition, such as static analysis, dynamic analysis, and clustering algorithms.
EvaluationMetric: Captures the metrics used to evaluate the quality of the identified microservices, including the cohesion, coupling, modularity, and granularity.
MicroserviceCandidate: Describes the potential microservices identified during decomposition, characterized by their boundaries, dependencies, and functional roles.
DomainKnowledge: Represents business processes, workflows, and other domain-specific data that inform the decomposition process and ensure alignment with organizational objectives.
ValidationMethod: Details the techniques used to validate the identified microservices, including case studies, simulation environments, and proof-of-concept implementations.

Relationships between these classes are represented using Protégé object properties, as outlined in Table 12. For instance, the relationship isEvaluatedBy links MicroserviceCandidate to EvaluationMetric, while isDecomposedBy connects SystemArtifact to DecompositionTechnique.

For usability, the ontology supports OWL-DL queries to explore its structure and retrieve actionable insights. Table 13 describes several useful queries. For example, Q1 retrieves a list of decomposition techniques applied to business processes, while Q2 identifies evaluation metrics most commonly associated with clustering algorithms. Q3 lists all system artifacts used in studies that employed graph-based partitioning, and Q4 retrieves the evaluation metrics used to evaluate microservice candidates.

The proposed ontology is available in OWL format at https://github.com/Ioumoussa/MicroservicesIdentification (accessed on 20 February 2025), allowing researchers and practitioners to extend and adapt it for various contexts. By formalizing the microservice identification process, this ontology provides a foundational framework for advancing research and developing standardized tools in this domain.

7. Discussion

This study sought to comprehensively characterize the migration of monolithic systems to microservices by analyzing microservice identification techniques, presenting key findings, identifying research gaps, and outlining future research directions. The insights derived from the systematic mapping study establish a robust foundation for understanding the challenges, methodologies, and emerging trends within this domain.

7.1. Principal Findings and Lessons Acquired

This investigation extended beyond the taxonomic framework proposed by Fritzsch et al. [5], introducing a more comprehensive classification methodology for microservice identification techniques. While previous research has primarily focused on monolithic system attributes that facilitate identification, our analysis encompassed multiple dimensions: the automation sophistication, auxiliary methodologies, granularity considerations, and evaluation frameworks. To systematically codify this knowledge domain, we developed and implemented a middleweight ontological framework for identification methodologies. This ontological structure enables the systematic querying of essential components, including input parameters, data modeling paradigms, identification algorithms, and performance evaluation metrics.

The implementation of this ontological framework addresses a significant lacuna in the field by establishing a standardized methodology for the comparative analysis of identification approaches. This structured knowledge representation enables practitioners and researchers to conduct systematic analyses of methodological interrelationships and make empirically informed decisions regarding approach selection based on specific contextual requirements.

Our empirical analysis revealed significant heterogeneity in the proposed methodologies, each presenting distinct trade-offs between technological constraints and requisite manual intervention. A noteworthy finding indicates that semi-automated methodologies maintain a substantial dependence on domain expertise, potentially creating operational bottlenecks in large-scale identification initiatives. This dependency is particularly pronounced in organizations attempting to identify microservices within complex, enterprise-scale systems.

The establishment of uniform quality assessment metrics for microservice identification necessitates the development of industry-standard and academically validated evaluation criteria. The current absence of comprehensive boundary evaluation metrics and standardized validation frameworks significantly impedes the comparative analysis of different approaches. This limitation is particularly evident in industrial applications, where the lack of standardized benchmarks complicates the evaluation of identification strategy efficacy. In this context, Aderaldo et al. [83] have proposed essential criteria for architectural benchmark selection to enhance research reproducibility. Similarly, Santos et al. [84] have introduced metrics for quantifying the identification complexity and implementation costs.

The analysis of the motivating factors for microservice identification revealed predominant themes of scalability enhancement, maintainability improvement, technological independence, and implementation efficiency. Empirical evidence suggests that the anticipated benefits of microservice adoption align closely with these primary drivers. However, it is noteworthy that a minority of studies proposing identification techniques have conducted post-implementation evaluations to verify the realization of these expected benefits.

Our research findings align with those of Saucedo et al. [25] regarding the significant complexities inherent in service boundary identification during microservice adoption. While our investigation did not specifically address database integration, incorporating database components into the identification methodology presents a promising direction for enhancing service boundary delineation. Current methodologies generally demonstrate an insufficient consideration of database elements during the identification phase. The incorporation of database schemas, data flow relationships, and quantitative decomposition metrics could substantially improve the efficacy of microservice identification methodologies.

Furthermore, an emerging consideration in this domain pertains to retroactive optimization capabilities, specifically the potential requirement to reassess or modify previously identified services due to technical or organizational constraints. With the exception of the methodology proposed by Freire et al. [56], the reviewed approaches do not explicitly incorporate provisions for such retroactive modifications. Although a comprehensive evaluation of microservice architectures’ suitability should be conducted during the initial identification phase, unanticipated challenges may manifest subsequent to the completion of service identification. The implementation of semi-automated refinement methodologies or service boundary adjustment mechanisms could potentially reduce the manual effort associated with such modifications, thereby enhancing the adaptability and sustainability of microservice identification processes.

7.2. Practical Implications for Practitioners

This investigation yields several significant implications for practitioners engaged in microservice identification initiatives. The following empirically derived insights merit particular consideration.

Systematic Service Identification for Enhanced Scalability. The methodological identification of microservices demonstrates substantial potential for improving system scalability characteristics. Through the precise identification of autonomous services, organizations can implement granular scaling mechanisms, enabling independent resource allocation based on service-specific demand patterns. This granular approach to scalability optimization facilitates enhanced performance metrics while simultaneously reducing operational resource consumption.

The Optimization of Development and Deployment Methodologies. The identification of well-defined service boundaries significantly enhances the efficiency of deployment and development processes. The implementation of continuous integration and continuous deployment (CI/CD) methodologies becomes more systematic when founded upon properly identified service boundaries. This enhancement enables accelerated release cycles and reduced time-to-market metrics. Furthermore, organizations can leverage precisely identified microservices to implement sophisticated automated testing frameworks, streamlined deployment processes, and efficient rollback mechanisms.

The Implementation of Domain-Driven Design Paradigms. The identification of microservices necessitates the adoption of a methodologically rigorous approach founded upon domain-driven design principles. This approach encompasses the comprehensive analysis of business domains, systematic identification of bounded contexts, and precise alignment of service boundaries with fundamental business capabilities. The integration of hybrid evaluation metrics, incorporating both quantitative performance indicators and qualitative domain expertise, substantially enhances the precision of service identification outcomes.

Architectural Decision Support Framework. Our findings provide a structured approach for evaluating competing microservice identification methodologies. The ontological framework enables architects to systematically compare the available techniques based on system-specific characteristics, including the codebase complexity, business domain nature, and existing architectural constraints. This methodological comparison facilitates evidence-based decision-making when selecting identification approaches aligned with architectural quality attributes prioritized by the organization.

Implementation Guidance for Development Teams. This research offers practical guidance on implementing the identified microservice boundaries. The analysis of automation sophistication levels helps development teams anticipate the required technical expertise and manual intervention during the identification process. Our findings regarding communication patterns provide developers with insights for implementing loose coupling between services, reducing system instability risks, and facilitating independent deployment capabilities.

Business–Technical Alignment Methodology. Our research bridges technical and business perspectives by emphasizing domain-driven design principles in service identification. This study highlights the importance of aligning service boundaries with business capabilities, enabling analysts to effectively communicate technical decomposition decisions using business domain terminology. This alignment enhances collaboration between technical and business teams, ensuring that the identified microservices support both technical objectives and business agility.

7.3. Emerging Research Trajectories

The analysis of primary research studies revealed several critical trajectories requiring further investigation within the domain of microservice identification. The following areas present significant opportunities for advancement in both theoretical frameworks and practical methodologies:

Data-Intensive Systems and Technological Evolution. Contemporary data-intensive systems present unique methodological challenges for microservice identification, primarily due to complex data processing requirements. The emergence of advanced technologies, including 5G infrastructure and edge computing paradigms, introduces additional complexity dimensions, particularly regarding latency-sensitive service identification and dynamic reconfiguration requirements. These evolutionary paradigms necessitate innovative identification methodologies capable of accommodating both performance requirements and architectural flexibility considerations.

Business Logic Integration and Data Dependency Analysis. Legacy system architectures frequently incorporate business rules directly within database structures, introducing significant complexity in service boundary identification. This historical architectural pattern necessitates sophisticated approaches to service identification that consider both application logic and data dependency patterns. Further research is required to develop methodological frameworks for identifying service boundaries while maintaining data integrity and business rule coherence.

Security Integration in Service Identification Methodology. The integration of security considerations must be fundamental to the service identification process rather than treated as a supplementary concern. Contemporary approaches, including Moving Target Defense (MTD) and artificial intelligence-based anomaly detection systems, present promising frameworks for securing the identified microservices. These advanced security paradigms require integration within core identification methodologies, particularly concerning inter-service communication protocols and dependency structures.

Communication Pattern Analysis in Service Identification. The identification of microservices must incorporate the sophisticated analysis of communication patterns and service dependencies. Contemporary best practices emphasize the minimization of synchronous communication patterns, as tightly coupled services can propagate system instability. Future research should focus on identification methodologies that promote loose coupling architectures and support asynchronous communication patterns.

Organizational Alignment and Conway’s Law’s Implications. The influence of Conway’s Law on microservice identification presents significant implications for organizational structure consideration. The efficacy of microservice identification demonstrates a strong correlation with organizational structural alignment. This necessitates the development of identification strategies that incorporate both technical and organizational dimensions. An alignment between organizational team structures and service boundaries facilitates enhanced development efficiency and long-term maintainability characteristics.

The synthesis of these emerging perspectives, supported by our ontological framework, establishes a comprehensive foundation for future research initiatives in microservice identification. Subsequent investigations should focus on the development of sophisticated identification methodologies that address these emerging challenges while maintaining practical applicability in industrial contexts. The integration of our findings with established theoretical frameworks suggests several trajectories for future research, particularly in the development of standardized evaluation metrics and the enhancement of automated identification tools. These research directions, coupled with our ontological framework, provide a robust foundation for advancing the field of microservice identification.

8. Threats to Validity

In this section, we describe the most common threats to the validity of an SMS in software engineering, including the construct, internal, external, and conclusion validity, as reported in [33,85], and explain how we mitigated their effects on the obtained results.

8.1. Construct Validity

To ensure the robustness of our study’s construct validity, we implemented several mitigation strategies. We carefully designed our search methodology following Kuhrmann et al.’s [35] guidelines for search engine selection and Petticrew and Roberts’ [36] principles for keyword construction. The search terms were deliberately kept general to maximize coverage while being precise enough to maintain relevance. We validated our search string through preliminary testing before executing the full research protocol. To minimize search bias and ensure comprehensive coverage, we employed multiple complementary approaches: (1) automated searches across diverse academic databases, which helped reduce subjective errors and enhance replicability, (2) a snowballing technique applied iteratively [38], which successfully identified additional relevant papers not indexed in the selected search engines, and (3) strict inclusion/exclusion criteria focusing on peer-reviewed journal and conference papers to maintain high quality standards and ensure the completeness of the results.

8.2. Internal Validity

To address internal validity concerns, we implemented a rigorous data extraction and study selection process. Following Peterson et al.’s [33] guidelines, we developed a comprehensive data extraction form through collaborative refinement by the authors. Both authors independently performed study selection and data extraction tasks, maintaining detailed documentation of accept/reject decisions. Regular consensus meetings were held to discuss and resolve any differences in assessment. In cases where an initial disagreement occurred, the authors engaged in detailed discussion until a consensus was reached. To quantify the reliability of our process, we calculated the kappa coefficient following Kitchenham et al.’s [34] guidelines, achieving a score of 0.94, which indicated “very good” or “almost perfect” inter-rater agreement. The final data extraction form was adopted only after resolving all disagreements between the authors, ensuring consistency throughout the remainder of the study.

8.3. External Validity

Regarding the external validity and the generalizability of our results, we implemented several strategies. Our search methodology combined automated database searches with backward snowballing, following Mourão et al.’s [86] recommended approach of using a Scopus database search complemented by backward snowballing. While we acknowledge that forward snowballing could have potentially identified additional recent studies, our extensive initial search pool provided a representative overview of advancements in monolithic-to-microservice migration. To ensure structural flexibility, we developed an ontology incorporating all classification schemes based on the retrieved papers. This ontology has been made freely available to researchers and practitioners, allowing for future updates as new studies emerge that may not fit the current classification schemes. Additionally, we strengthened the external validity by including only peer-reviewed studies without date restrictions, ensuring comprehensive coverage of the field’s evolution while maintaining quality standards.

8.4. Conclusion Validity

A conclusion validity threat to our study concerned both the replicability of the results and the adoption of taxonomies for microservice identification approaches. To address replicability, we defined a detailed protocol that was cross-validated by both researchers, with each researcher working independently before discussing and cross-validating individual findings. Regarding taxonomies, while several existing classification schemes for microservice identification were investigated, none of these taxonomies enabled the proper classification of all the identified studies. Therefore, we employed open and selective coding from grounded theory [40] to develop a comprehensive classification based on a deeper analysis of the focus and proposed solutions in the identified papers. Our final classification scheme incorporates both existing taxonomic elements from the literature (either as is or adapted) and new categories that emerged from our systematic analysis of microservice identification approaches, ensuring a complete and accurate representation of the field.

Additionally, our study faces several methodological limitations worth acknowledging. First, our focus on peer-reviewed academic literature may have excluded valuable insights from industry-based approaches that haven’t been formally published. Second, the rapidly evolving nature of microservice technologies means that some cutting-edge identification approaches may have emerged after our literature collection phase. Third, the evaluation of identification techniques in the analyzed studies often lacked standardized metrics, making objective comparisons challenging. Finally, while our ontological framework provides a comprehensive classification, its practical application may require adaptation to specific organizational contexts and technological environments.

9. Conclusions

This systematic mapping study presents a comprehensive analysis of the current research landscape regarding microservice identification methodologies in the context of monolithic-to-microservice migration. The investigation encompassed identification techniques, supporting tools, decision criteria, and the determination of optimal service boundaries. Furthermore, this research provides a methodical examination of the identification process, with particular emphasis on its inherent challenges, quantitative metrics, and degrees of automation.

The findings indicate that no singular microservice identification methodology demonstrates universal efficacy across all scenarios. The heterogeneity in automation capabilities, granularity specifications, input prerequisites, and technological support mechanisms presents substantial obstacles in formulating a generalized identification framework applicable across diverse systems and domains. Moreover, the analyzed literature reveals significant disparities regarding essential aspects such as input requirements, granularity determinations, supporting methodologies, and evaluation metrics for microservice identification. These inconsistencies significantly impede both the comparative analysis of different approaches and the development of standardized methodologies.

This systematic mapping study further contributes by offering an analytical perspective on the specific challenges and advantages associated with microservice identification, thereby establishing a foundation for the development of standardized tools and evaluation frameworks. The proposed middleweight ontology serves as a taxonomic framework for organizing and categorizing microservice identification techniques, facilitating the establishment of a standardized vocabulary and enhancing collaboration among stakeholders involved in the identification process.

Given the current absence of standardized and extensively validated identification frameworks, future research directions will prioritize the integration of industrial insights to enhance the ontological framework and improve the practical applicability of identification techniques. Additionally, the expansion of this research to include gray literature—encompassing industry reports, professional blogs, and practitioner experiences—will facilitate a multivocal analysis, thereby strengthening the bridge between academic research and industrial implementation practices.

Author Contributions

I.O. conducted the primary literature review following the PRISMA 2020 guidelines, including the comprehensive search for and selection of relevant studies, performed data extraction and synthesis, and proposed the framework for microservice identification. I.O. also contributed to analyzing and mapping the data, identifying research gaps, and preparing the initial draft of the manuscript. The study selection and data extraction processes were conducted in accordance with the PRISMA 2020 standards to ensure transparency and reproducibility. R.S. provided continuous guidance throughout the study, supervising the entire research process and ensuring methodological rigor in line with PRISMA recommendations. R.S. also reviewed, validated, and refined the proposed framework and critically assessed the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Relevant data are provided within the paper, with the complete dataset and scripts used in this systematic mapping study available at https://github.com/Ioumoussa/MicroservicesIdentification (accessed on 25 February 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

References

Dragoni, N.; Giallorenzo, S.; Lluch-Lafuente, A.; Mazzara, M.; Montesi, F.; Mustafin, R.; Safina, L. Microservices: Yesterday, Today, and Tomorrow; Springer International Publishing: Cham, Switzerland, 2017; Chapter 12; pp. 195–216. [Google Scholar]
Newman, S. Building Microservices: Designing Fine-Grained Systems; O’Reilly Media Inc.: Sebastopol, Australia, 2015. [Google Scholar]
Pahl, C.; Jamshidi, P. Microservices: A Systematic Mapping Study. In Proceedings of the 6th International Conference on Cloud Computing and Services Science, Rome, Italy, 23–25 April 2016; pp. 137–146. [Google Scholar]
Krause, A.; Zirkelbach, C.; Hasselbring, W.; Lenga, S.; Kröger, D. Microservice decomposition via static and dynamic analysis of the monolith. In Proceedings of the 2020 IEEE International Conference on Software Architecture Companion, Salvador, Brazil, 16–20 March 2020; pp. 9–16. [Google Scholar]
Fritzsch, J.; Bogner, J.; Zimmermann, A.; Wagner, S. From monolith to microservices: A classification of refactoring approaches. In Software Engineering Aspects of Continuous Development and New Paradigms of Software Production and Deployment; Springer: Cham, Switzerland, 2018; pp. 128–141. [Google Scholar]
Balalaie, A.; Heydarnoori, A.; Jamshidi, P. Microservices Architecture Enables DevOps: Migration to a Cloud-Native Architecture. IEEE Softw. 2016, 33, 42–52. [Google Scholar] [CrossRef]
Francesco, P.D.; Malavolta, I.; Lago, P. Research on architecting microservices: Trends, focus, and potential for industrial adoption. In Proceedings of the 2017 IEEE International Conference on Software Architecture (ICSA), Gothenburg, Sweden, 5–7 April 2017; pp. 21–30. [Google Scholar]
Oumoussa, I.; Saidi, R. Evolution of microservices identification in monolith decomposition: A systematic review. IEEE Access 2024, 12, 23389–23405. [Google Scholar]
Razzaq, A.; Ghayyur, S. A systematic mapping study: The new age of software architecture from monolithic to microservice architecture—Awareness and challenges. Comput. Appl. Eng. Educ. 2022, 31, 421–451. [Google Scholar]
Abgaz, Y.; McCarren, A.; Elger, P.; Solan, D.; Lapuz, N.; Bivol, M.; Jackson, G.; Yilmaz, M.; Buckley, J.; Clarke, P. Decomposition of monolith applications into microservices architectures: A systematic review. IEEE Trans. Softw. Eng. 2023, 49, 4213–4242. [Google Scholar]
Taibi, D.; Systä, K. A decomposition and metric-based evaluation framework for microservices. In Proceedings of the Cloud Computing and Services Science 9th International Conference, CLOSER 2019, Heraklion, Crete, Greece, 2–4 May 2020; pp. 133–149. [Google Scholar]
MacKenzie, C.M.; Laskey, K.; McCabe, F.; Brown, P.F.; Metz, R. Reference Model for Service Oriented Architecture 1.0; Technical Report 12; OASIS Standard: Woburn, MA, USA, 2006. [Google Scholar]
Oumoussa, I.; Faieq, S.; Saidi, R. Microservices: Investigating Underpinning. In Proceedings of the Third International Conference on Digital Age and Technological Advances for Sustainable Development, Al Hoceima, Morocco, 27–28 May 2022; Springer: Cham, Switzerland, 2022; pp. 343–351. [Google Scholar]
Soldani, J.; Tamburri, D.A.; Heuvel, W.J.V.D. The pains and gains of microservices: A systematic grey literature review. J. Syst. Softw. 2018, 146, 215–232. [Google Scholar]
Oumoussa, I.; Faieq, S.; Saidi, R. When Microservices Architecture and Blockchain Technology Meet: Challenges and Design Concepts. In Proceedings of the International Conference on Advanced Technologies for Humanity, Rabat, Morocco, 26–27 November 2021; Springer: Cham, Switzerland, 2022; pp. 161–172. [Google Scholar]
Bogner, J.; Fritzsch, J.; Wagner, S.; Zimmermann, A. Microservices in Industry: Insights into Technologies, Characteristics, and Software Quality. In Proceedings of the 2019 IEEE International Conference on Software Architecture Companion (ICSA-C), Hamburg, Germany, 25–26 March 2019; pp. 187–195. [Google Scholar]
Taibi, D.; Lenarduzzi, V.; Pahl, C. Processes, Motivations, and Issues for Migrating to Microservices Architectures: An Empirical Investigation. IEEE Cloud Comput. 2017, 4, 22–32. [Google Scholar]
Oumoussa, I.; Saidi, R.; Daoud, M.; Moha, N.; Faieq, S. A Business-Centric Approach to Automated Microservices Identification. In Proceedings of the International Conference on Digital Technologies and Applications, Benguerir, Morocco, 7–8 November 2024; Springer: Cham, Switzerland, 2024; pp. 240–249. [Google Scholar]
Alshuqayran, N.; Ali, N.; Evans, R. A systematic mapping study in microservice architecture. In Proceedings of the 2016 IEEE 9th International Conference on Service-Oriented Computing and Applications (SOCA), Macau, China, 4–6 November 2016; pp. 44–51. [Google Scholar]
Christoforou, A.; Garriga, M.; Andreou, A.S.; Baresi, L. Supporting the decision of migrating to microservices through multi-layer fuzzy cognitive maps. In Service-Oriented Computing; Maximilien, M., Vallecillo, A., Wang, J., Oriol, M., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2017; pp. 471–480. [Google Scholar]
Kalske, M.; Mäkitalo, N.; Mikkonen, T. Challenges when moving from monolith to microservice architecture. In Current Trends in Web Engineering; Garrigós, I., Wimmer, M., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2018; pp. 32–47. [Google Scholar]
Wolfart, D.; Assunção, W.K.G.; da Silva, I.F.; Domingos, D.C.P.; Schmeing, E.; Villaca, G.L.D.; Paza, D.d.N. Modernizing legacy systems with microservices: A roadmap. In Proceedings of the Evaluation and Assessment in Software Engineering, EASE 2021, New York, NY, USA, 21–23 June 2021; pp. 149–159.
Ponce, F.; Márquez, G.; Astudillo, H. Migrating from monolithic architecture to microservices: A rapid review. In Proceedings of the 2019 38th International Conference of the Chilean Computer Science Society (SCCC), Concepcion, Chile, 4–9 November 2019; pp. 1–7. [Google Scholar]
Kazanavičius, J.; Mažeika, D. Migrating legacy software to microservices architecture. In Proceedings of the 2019 Open Conference of Electrical, Electronic and Information Sciences (EStream), Vilnius, Lithuania, 25 April 2019; pp. 1–5. [Google Scholar]
Saucedo, A.M.; Rodríguez, G.; Rocha, F.G.; dos Santos, R.P. Migration of monolithic systems to microservices: A systematic mapping study. Inf. Softw. Technol. 2025, 177, 107590. [Google Scholar]
Fritzsch, J.; Bogner, J.; Wagner, S.; Zimmermann, A. Microservices migration in industry: Intentions, strategies, and challenges. In Proceedings of the 2019 IEEE International Conference on Software Maintenance and Evolution (ICSME), Cleveland, OH, USA, 30 September–4 October 2019; IEEE: Piscataway, NJ, USA, 2019. [Google Scholar]
Lapuz, N.; Clarke, P.; Abgaz, Y. Digital transformation and the role of dynamic tooling in extracting microservices from existing software systems. In Systems, Software and Services Process Improvement; Yilmaz, M., Clarke, P., Messnarz, R., Reiner, M., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 301–315. [Google Scholar]
Mparmpoutis, A.; Kakarontzas, G. Using database schemas of legacy applications for microservices identification: A mapping study. In Proceedings of the 6th International Conference on Algorithms, Computing and Systems (ICACS ’22), New York, NY, USA, 16–18 September 2022. [Google Scholar]
Francesco, P.D.; Lago, P.; Malavolta, I. Migrating towards microservice architectures: An industrial survey. In Proceedings of the 2018 IEEE International Conference on Software Architecture, ICSA, Seattle, WA, USA, 30 April–4 May 2018; pp. 29–2909. [Google Scholar]
Velepucha, V.; Flores, P. Monoliths to microservices—Migration problems and challenges: A SMS. In Proceedings of the 2021 Second International Conference on Information Systems and Software Technologies, ICI2ST, Quito, Ecuador, 23–25 March 2021; pp. 135–142. [Google Scholar]
Luz, W.; Agilar, E.; de Oliveira, M.C.; de Melo, C.E.R.; Pinto, G.; Bonifácio, R. An experience report on the adoption of microservices in three Brazilian government institutions. In Proceedings of the XXXII Brazilian Symposium on Software Engineering, SBES ’18, New York, NY, USA, 17–21 September 2018; pp. 32–41. [Google Scholar]
Petersen, K.; Feldt, R.; Mujtaba, S.; Mattsson, M. Systematic Mapping Studies in Software Engineering. In Proceedings of the 12th International Conference on Evaluation and Assessment in Software Engineering, Swindon, UK, 26–27 June 2008; EASE’08. pp. 68–77. [Google Scholar]
Petersen, K.; Vakkalanka, S.; Kuzniarz, L. Guidelines for Conducting Systematic Mapping Studies in Software Engineering: An Update. Inf. Softw. Technol. 2015, 64, 1–18. [Google Scholar] [CrossRef]
Kitchenham, B.A.; Budgen, D.; Brereton, P. Evidence-Based Software Engineering and Systematic Reviews; Chapman and Hall/CRC: Boca Raton, FL, USA, 2015. [Google Scholar]
Kuhrmann, M.; Fernández, D.M.; Daneva, M. On the Pragmatic Design of Literature Studies in Software Engineering: An Experience-Based Guideline. Empir. Softw. Eng. 2017, 22, 2852–2891. [Google Scholar] [CrossRef]
Petticrew, M.; Roberts, H. Systematic Reviews in the Social Sciences: A Practical Guide; John Wiley and Sons, Ltd.: Hoboken, NJ, USA, 2006. [Google Scholar]
Wohlin, C. Guidelines for Snowballing in Systematic Literature Studies and a Replication in Software Engineering. In Proceedings of the 18th International Conference on Evaluation and Assessment in Software Engineering, London, UK, 13–14 May 2014; EASE’14. pp. 1–10. [Google Scholar]
Wohlin, C. Second-Generation Systematic Literature Studies Using Snowballing. In Proceedings of the 20th International Conference on Evaluation and Assessment in Software Engineering, Limerick, Ireland, 1–3 June 2016; EASE’16. pp. 1–6. [Google Scholar]
Ahmadvand, M.; Ibrahim, A. Requirements Reconciliation for Scalable and Secure Microservice (De)composition. In Proceedings of the IEEE 24th International Requirements Engineering Conference Workshops (REW), Beijing, China, 12–16 September 2016; pp. 68–73. [Google Scholar]
Strauss, A.L.; Corbin, J.M. Basics of Qualitative Research: Techniques and Procedures for Developing Grounded Theory; Sage Publications: Thousand Oaks, CA, USA, 1998. [Google Scholar]
Sun, X.; Boranbaev, S.; Han, S.; Wang, H.; Yu, D. Expert System for Automatic Microservices Identification Using API Similarity Graph. Expert Syst. 2024, 41, e13158. [Google Scholar]
Quattrocchi, G.; Cocco, D.; Staffa, S.; Margara, A.; Cugola, G. Cromlech: Semi-Automated Monolith Decomposition into Microservices. IEEE Trans. Serv. Comput. 2024, 17, 466–481. [Google Scholar] [CrossRef]
Trabelsi, I.; Abdellatif, M.; Abubaker, A. From legacy to microservices: A type-based approach for microservices identification using machine learning and semantic analysis. J. Softw. Evol. Proc. 2023, 35, e2503. [Google Scholar] [CrossRef]
Qian, L.; Li, J.; He, X.; Gu, R.; Shao, J.; Lu, Y. Microservice extraction using graph deep clustering based on dual view fusion. Inf. Softw. Technol. 2023, 158, 107171. [Google Scholar] [CrossRef]
Filippone, G.; Mehmood, N.Q.; Autili, M.; Rossi, F.; Tivoli, M. From monolithic to microservice architecture: An automated approach based on graph clustering and combinatorial optimization. In Proceedings of the 2023 IEEE 20th International Conference on Software Architecture (ICSA), L’Aquila, Italy, 13–17 March 2023; pp. 47–57. [Google Scholar]
Li, Z.; Shang, C.; Wu, J.; Li, Y. Microservice extraction based on knowledge graph from monolithic applications. Inf. Softw. Technol. 2022, 150, 106992. [Google Scholar] [CrossRef]
Sooksatra, K.; Maharjan, R.; Cerny, T. Monolith to microservices: VAE-based GNN approach with duplication consideration. In Proceedings of the 2022 IEEE International Conference on Service-Oriented System Engineering (SOSE), Newark, CA, USA, 15–18 August 2022; pp. 1–10. [Google Scholar]
Romani, Y.; Tibermacine, O.; Tibermacine, C. Towards migrating legacy software systems to microservice-based architectures: A data-centric process for microservice identification. In Proceedings of the 2022 IEEE 19th International Conference on Software Architecture Companion (ICSA-C), Honolulu, HI, USA, 12–15 March 2022; pp. 15–19. [Google Scholar]
Liu, B.; Xiong, J.; Ren, Q.; Tyszberowicz, S.; Yang, Z. Log2MS: A framework for automated refactoring monolith into microservices using execution logs. In Proceedings of the 2022 IEEE International Conference on Web Services (ICWS), Barcelona, Spain, 10–16 July 2022; pp. 391–396. [Google Scholar]
Al-Debagy, O.; Martinek, P. A microservice decomposition method through using distributed representation of source code. Scalable Comput. Pract. Exp. 2021, 22, 39–52. [Google Scholar] [CrossRef]
Assunção, W.K.; Colanzi, T.E.; Carvalho, L.; Pereira, J.A.; Garcia, A.; de Lima, M.J.; Lucena, C. A multi-criteria strategy for redesigning legacy features as microservices: An industrial case study. In Proceedings of the IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Honolulu, HI, USA, 9–12 March 2021; pp. 377–387. [Google Scholar]
Daoud, M.; El Mezouari, A.; Faci, N.; Benslimane, D.; Maamar, Z.; El Fazziki, A. A multi-model based microservices identification approach. J. Syst. Archit. 2021, 118, 102200. [Google Scholar] [CrossRef]
Desai, U.; Bandyopadhyay, S.; Tamilselvam, S. Graph neural network to dilute outliers for refactoring monolith application. In Proceedings of the 35th AAAI Conference on Artificial Intelligence, Virtually, 2–9 February 2021; pp. 72–80. [Google Scholar]
Brito, M.; Cunha, J.; Saraiva, J. Identification of microservices from monolithic applications through topic modelling. In Proceedings of the 36th Annual ACM Symposium on Applied Computing, New York, NY, USA, 22–26 March 2021; pp. 1409–1418. [Google Scholar] [CrossRef]
De Alwis, A.A.C.; Barros, A.; Fidge, C.; Polyvyanyy, A. Microservice remodularisation of monolithic enterprise systems for embedding in industrial IoT networks. In Proceedings of the Advanced Information Systems Engineering, Melbourne, VIC, Australia, 28 June–2 July 2021; pp. 432–448. [Google Scholar]
Freire, A.F.A.A.; Sampaio, A.F.; Carvalho, L.H.L.; Medeiros, O.; Mendonça, N.C. Migrating production monolithic systems to microservices using aspect oriented programming. Softw. Pract. Exp. 2021, 51, 1280–1307. [Google Scholar] [CrossRef]
Kalia, A.K.; Xiao, J.; Krishna, R.; Sinha, S.; M, V.; Banerjee, D. Mono2Micro: A practical and effective tool for decomposing monolithic Java applications to microservices. In Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, New York, NY, USA, 23–28 August 2021; pp. 1214–1224. [Google Scholar]
Agarwal, S.; Sinha, R.; Sridhara, G.; Das, P.; Desai, U.; Tamilselvam, S.; Singhee, A.; Nakamuro, H. Monolith to microservice candidates using business functionality inference. In Proceedings of the IEEE International Conference on Web Services (ICWS), Chicago, IL, USA, 5–10 September 2021; pp. 758–763. [Google Scholar]
Bucchiarone, A.; Soysal, K.; Guidi, C. A model-driven approach towards automatic migration to microservices. In Software Engineering Aspects of Continuous Development and New Paradigms of Software Production and Deployment; Bruel, J.M., Mazzara, M., Meyer, B., Eds.; Springer: Cham, Switzerland, 2020; pp. 15–36. [Google Scholar]
Zhang, Y.; Liu, B.; Dai, L.; Chen, K.; Cao, X. Automated microservice identification in legacy systems with functional and non-functional metrics. In Proceedings of the IEEE International Conference on Software Architecture (ICSA), Salvador, Brazil, 16–20 March 2020; pp. 135–145. [Google Scholar]
Matias, T.; Correia, F.F.; Fritzsch, J.; Bogner, J.; Ferreira, H.S.; Restivo, A. Determining microservice boundaries: A case study using static and dynamic software analysis. In Proceedings of the Software Architecture 14th European Conference, ECSA 2020, L’Aquila, Italy, 14–18 September 2020; Jansen, A., Malavolta, I., Muccini, H., Ozkaya, I., Zimmermann, O., Eds.; Springer: Cham, Switzerland, 2020; pp. 315–332. [Google Scholar]
Al-Debagy, O.; Martinek, P. Extracting microservices’ candidates from monolithic applications: Interface analysis and evaluation metrics approach. In Proceedings of the IEEE 15th International Conference of System of Systems Engineering (SoSE), Budapest, Hungary, 2–4 June 2020; pp. 289–294. [Google Scholar]
Selmadji, A.; Seriai, A.; Bouziane, H.L.; Mahamane, R.O.; Zaragoza, P.; Dony, C. From monolithic architecture style to microservice one based on a semi-automatic approach. In Proceedings of the IEEE International Conference on Software Architecture (ICSA), Salvador, Brazil, 16–20 March 2020; pp. 157–168. [Google Scholar]
Bajaj, D.; Bharti, U.; Goel, A.; Gupta, S.C. Partial migration for re-architecting a cloud native monolithic application into microservices and FaaS. In Proceedings of the Information, Communication and Computing Technology 5th International Conference, ICICCT 2020, New Delhi, India, 9 May 2020; Badica, C., Liatsis, P., Kharb, L., Chahal, D., Eds.; Springer: Singapore, 2020; pp. 111–124. [Google Scholar]
Alwis, A.A.C.D.; Barros, A.; Fidge, C.; Polyvyanyy, A. Remodularization analysis for microservice discovery using syntactic and semantic clustering. In Proceedings of the Advanced Information Systems Engineering 32nd International Conference, CAiSE 2020, Grenoble, France, 8–12 June 2020; Dustdar, S., Yu, E., Salinesi, C., Rieu, D., Pant, V., Eds.; Springer: Cham, Switzerland, 2020; pp. 3–19. [Google Scholar]
Eyitemi, F.D.; Reiff-Marganiec, S. System decomposition to optimize functionality distribution in microservices with rule based approach. In Proceedings of the IEEE International Conference on Service Oriented Systems Engineering (SOSE), Oxford, UK, 3–6 August 2020; pp. 65–71. [Google Scholar]
Bandara, C.; Perera, I. Transforming monolithic systems to microservices—An analysis toolkit for legacy code evaluation. In Proceedings of the 20th International Conference on Advances in ICT for Emerging Regions (ICTer), Colombo, Sri Lanka, 4–7 November 2020; pp. 95–100. [Google Scholar]
Li, S.; Zhang, H.; Jia, Z.; Li, Z.; Zhang, C.; Li, J.; Gao, Q.; Ge, J.; Shan, Z. A dataflow-driven approach to identifying microservices from monolithic applications. J. Syst. Softw. 2019, 157, 110380. [Google Scholar] [CrossRef]
Nunes, L.; Santos, N.; Silva, A.R. From a monolith to a microservices architecture: An approach based on transactional contexts. In Proceedings of the Software Architecture 13th European Conference, ECSA 2019, Paris, France, 9–13 September 2019; Bures, T., Duchien, L., Inverardi, P., Eds.; Springer: Cham, Switzerland, 2019; pp. 37–52. [Google Scholar]
Christoforou, A.; Odysseos, L.; Andreou, A. Migration of software components to microservices: Matching and synthesis. In Proceedings of the 14th International Conference on Evaluation of Novel Approaches to Software Engineering, Heraklion, Crete, Greece, 4–5 May 2019; pp. 134–146. [Google Scholar]
Jin, W.; Liu, T.; Cai, Y.; Kazman, R.; Mo, R.; Zheng, Q. Service candidate identification from monolithic systems based on execution traces. IEEE Trans. Softw. Eng. 2021, 47, 987–1007. [Google Scholar] [CrossRef]
Pigazzini, I.; Fontana, F.A.; Maggioni, A. Tool support for the migration to microservice architecture: An industrial case study. In Proceedings of the Software Architecture 13th European Conference, ECSA 2019, Paris, France, 9–13 September 2019; Bures, T., Duchien, L., Inverardi, P., Eds.; Springer: Cham, Switzerland, 2019; pp. 247–263. [Google Scholar]
Saidani, I.; Ouni, A.; Mkaouer, M.W.; Saied, A. Towards automated microservices extraction using multi-objective evolutionary search. In Proceedings of the Service-Oriented Computing 17th International Conference, ICSOC 2019, Toulouse, France, 28–31 October 2019; Yangui, S., Rodriguez, I.B., Drira, K., Tari, Z., Eds.; Springer: Cham, Switzerland, 2019; pp. 58–63. [Google Scholar]
Abdullah, M.; Iqbal, W.; Erradi, A. Unsupervised learning approach for web application auto-decomposition into microservices. J. Syst. Softw. 2019, 151, 243–257. [Google Scholar] [CrossRef]
Eski, S.; Buzluca, F. An automatic extraction approach: Transition to microservices architecture from monolithic application. In Proceedings of the 19th International Conference on Agile Software Development, Porto, Portugal, 21–25 May 2018. [Google Scholar]
Kamimura, M.; Yano, K.; Hatano, T.; Matsuo, A. Extracting candidates of microservices from monolithic application code. In Proceedings of the 2018 25th Asia-Pacific Software Engineering Conference (APSEC), Nara, Japan, 4–7 December 2018; pp. 571–580. [Google Scholar]
De Alwis, A.A.C.; Barros, A.; Polyvyanyy, A.; Fidge, C. Function-splitting heuristics for discovery of microservices in enterprise systems. In Service-Oriented Computing; Springer: Cham, Switzerland, 2018; pp. 37–53. [Google Scholar]
Ren, Z.; Wang, W.; Wu, G.; Gao, C.; Chen, W.; Wei, J.; Huang, T. Migrating web applications from monolithic structure to microservices architecture. In Proceedings of the Tenth Asia-Pacific Symposium on Internetware, Beijing, China, 16 September 2018. [Google Scholar]
Mazlami, G.; Cito, J.; Leitner, P. Extraction of microservices from monolithic software architectures. In Proceedings of the IEEE International Conference on Web Services (ICWS), Honolulu, HI, USA, 25–30 June 2017; pp. 524–531. [Google Scholar]
Chen, R.; Li, S.; Li, Z. From monolith to microservices: A dataflow-driven approach. In Proceedings of the 24th Asia-Pacific Software Engineering Conference (APSEC), Nanjing, China, 4–8 December 2017; pp. 466–475. [Google Scholar]
Baresi, L.; Garriga, M.; Renzis, A.D. Microservices identification through interface analysis. In Proceedings of the Service-Oriented and Cloud Computing 6th IFIP WG 2.14 European Conference, ESOCC 2017, Oslo, Norway, 27–29 September 2017; Springer: Cham, Switzerland, 2017; pp. 19–33. [Google Scholar]
Gysel, M.; Kölbener, L.; Giersche, W.; Zimmermann, O. Service cutter: A systematic approach to service decomposition. In Proceedings of the Service-Oriented and Cloud Computing 5th IFIP WG 2.14 European Conference, ESOCC 2016, Vienna, Austria, 5–7 September 2016; Springer: Cham, Switzerland, 2016; pp. 185–200. [Google Scholar]
Aderaldo, C.M.; Mendonça, N.C.; Pahl, C.; Jamshidi, P. Benchmark requirements for microservices architecture research. In Proceedings of the 2017 IEEE/ACM 1st International Workshop on Establishing the Community-Wide Infrastructure for Architecture-Based Software Engineering, ECASE, Buenos Aires, Argentina, 22 May 2017; pp. 8–13. [Google Scholar]
Santos, N.; Silva, A.R. A Complexity Metric for Microservices Architecture Migration. In Proceedings of the 2020 IEEE International Conference on Software Architecture (ICSA), Salvador, Brazil, 16–20 March 2020; pp. 169–178. [Google Scholar]
Zhou, X.; Jin, Y.; Zhang, H.; Li, S.; Huang, X. A map of threats to validity of systematic literature reviews in software engineering. In Proceedings of the 23rd Asia-Pacific Software Engineering Conference, APSEC, Hamilton, New Zealand, 6–9 December 2016; pp. 153–160. [Google Scholar]
Mourão, E.; Pimentel, J.F.; Murta, L.; Kalinowski, M.; Mendes, E.; Wohlin, C. On the performance of hybrid search strategies for systematic literature reviews in software engineering. Inf. Softw. Technol. 2020, 123, 106294. [Google Scholar] [CrossRef]

Figure 1. Microservice architecture.

Figure 2. The comprehensive procedure for undertaking systematic mapping.

Figure 3. Systematic review flowchart.

Figure 4. Distribution of selected studies by year and digital library.

Figure 5. Keyword cloud for primary research.

Figure 6. Sources of challenges in microservice identification.

Figure 7. Microservice identification solutions.

Figure 8. Ontology for microservice identification.

Table 1. Research questions, main motivations, and objectives.

Research Question	Main Motivation	Objective
RQ1. How can challenges encountered during the microservice identification phase of monolith-to-microservice migration be classified and addressed systematically?	This research question aimed to classify and address challenges in microservice identification during migration.	O1, O2
RQ2. How can existing microservice identification strategies be categorized, and what are their performance and effectiveness metrics?	This research question aimed to categorize strategies and explore their performance metrics.	O2
RQ3. What are the typical inputs, data models, algorithms, and performance metrics used in microservice identification strategies, and how are they grouped?	This research question examined inputs, models, algorithms, and metrics, grouping them for a better understanding.	O3
RQ4. What levels of automation exist in microservice identification approaches, and how are they distributed across different methodologies?	This research question investigated automation levels and their distribution across methodologies.	O3
RQ5. What trends and emerging research directions are evident in microservice identification, and how can they be visualized in a systematic mapping framework?	This research question identified trends and visualized them using systematic mapping.	O4

Table 2. Criteria for inclusion.

#	Criteria
In1	Articles released since 2017, including earlier work
In2	English-language publications
In3	Articles undergoing peer review
In4	Studies undertaken with identification of microservices or migration to microservice architectures as their core subject matter
In5	Research articles providing frameworks, approaches, techniques, or mechanisms to identify microservices or switch to microservice architectures
In6	Articles offering a qualitative or quantitative assessment of microservice identification initiatives for moving to microservice architectures

Table 3. Criteria for exclusion.

#	Criteria
Ex1	Articles discussing migration to distributed technologies and platforms like clouds, without specifically addressing the microservice identification phase
Ex2	Articles outlining broad aspects of the transition to microservice architectures but giving little consideration to the identification issue
Ex3	Editorials and pedagogical papers
Ex4	Review articles, surveys, or supplementary research on the topic of identifying microservices or microservice architectures
Ex5	Writing and book chapters that seldom receive peer review and convey general concepts that have already been published in journals or presented at conferences
Ex6	Articles lacking complete text

Table 4. Form for extracting data.

#	Data Element	Description	RQ
D1	Study ID	First author’s name and year
D2	Year	The publication year
D3	Source	Publication’s original source
D4	Type	Conference or journal article
D5	Category	Analysis, approach proposal
D6	Challenges	Challenges in microservice identification during migration process	RQ1
D7	Main input	The approach’s primary input	RQ2, RQ3
D8	Data modeling	The standardization of input data	RQ2, RQ3
D9	Identification algorithm	Techniques utilized for identification	RQ2, RQ3
D10	Evaluation	Indicates whether the experiments were based on an industrial application or a case study	RQ2, RQ3
D11	Metric details	The investigated quality metrics for the identified microservices	RQ2, RQ3
D12	Automation	The extent to which a microservice identification approach requires human specialists	RQ4
D13	Trends	Refers to current trends in microservice identification	RQ5

Table 5. Number of studies returned by each repository.

Repository	Search Results
IEEE Xplorer	275
ACM Digital Library	71
SpringerLink	519
ScienceDirect	95
Wiley Online Library	107
Total	1067

Table 6. List of selected studies: C: conference paper; J: journal paper; W: workshop; WO: Wiley Online; SD: ScienceDirect.

ID	Title	Year	Source	Type
P1 [41]	A Expert system for automatic microservices identification using API similarity graph	2024	WO	J
P2 [18]	A Business-Centric Approach to Automated Microservices Identification	2024	Springer	C
P3 [42]	Cromlech: Semi-Automated Monolith Decomposition Into Microservices	2024	IEEE	J
P4 [43]	From Legacy To Microservices: A Type-based Approach For Microservices Identification Using Machine Learning And Semantic Analysis	2023	WO	J
P5 [44]	Microservice extraction using graph deep clustering based on dual view fusion	2023	SD	J
P6 [45]	From Monolithic To Microservice Architecture: An Automated Approach Based On Graph Clustering And Combinatorial Optimization	2023	IEEE	C
P7 [46]	Microservice extraction based on knowledge graph from monolithic applications	2022	SD	J
P8 [47]	Vae-based Gnn Approach With Duplication Consideration	2022	IEEE	C
P9 [48]	Towards Migrating Legacy Software Systems To Microservice-based Architectures: A Data-centric Process For Microservice Identification	2022	IEEE	C
P10 [49]	Log2ms: A Framework For Automated Refactoring Monolith Into Microservices Using Execution Logs	2022	IEEE	C
P11 [50]	A Microservice Decomposition Method Through Using Distributed Representation of Source Code	2021	WO	J
P12 [51]	A Multi-criteria Strategy for Redesigning Legacy Features as Microservices: An Industrial Case Study	2021	IEEE	C
P13 [52]	A Multi-model Based Microservices Identification Approach	2021	SD	J
P14 [53]	Graph Neural Network to Dilute Outliers for Refactoring Monolith Application	2021	ACM	C
P15 [54]	Identification of Microservices from Monolithic Applications Through Topic Modelling	2021	ACM	C
P16 [55]	Microservice Remodularisation of Monolithic Enterprise Systems for Embedding in Industrial Iot Networks	2021	Springer	C
P17 [56]	Migrating Production Monolithic Systems to Microservices Using Aspect Oriented Programming	2021	WO	J
P18 [57]	Mono2micro: A Practical and Effective Tool for Decomposing Monolithic Java Applications to Microservices	2021	ACM	C
P19 [58]	Monolith to Microservice Candidates Using Business Functionality Inference	2021	IEEE	C
P20 [11]	A Decomposition and Metric-based Evaluation Framework for Microservices	2020	Springer	C
P21 [59]	A Model-driven Approach Towards Automatic Migration to Microservices	2020	Springer	W
P22 [60]	Automated Microservice Identification in Legacy Systems with Functional and Non-functional Metrics	2020	IEEE	C
P23 [61]	Determining Microservice Boundaries: A Case Study Using Static and Dynamic Software Analysis	2020	Springer	C
P24 [62]	Extracting Microservices’ Candidates from Monolithic Applications: Interface Analysis and Evaluation Metrics Approach	2020	IEEE	C
P25 [63]	From Monolithic Architecture Style to Microservice One Based on A Semi-automatic Approach	2020	IEEE	C
P26 [4]	Microservice Decomposition Via Static And Dynamic Analysis of The Monolith	2020	IEEE	C
P27 [64]	Partial Migration for Re-architecting a Cloud Native Monolithic Application into Microservices and Faas	2020	Springer	C
P28 [65]	Remodularization Analysis for Microservice Discovery Using Syntactic and Semantic Clustering	2020	Springer	C
P29 [66]	System Decomposition to Optimize Functionality Distribution in Microservices with Rule Based Approach	2020	IEEE	C
P30 [67]	Transforming Monolithic Systems to Microservices - An Analysis Toolkit for Legacy Code Evaluation	2020	IEEE	C
P31 [68]	A Dataflow-driven Approach to Identifying Microservices From Monolithic Applications	2019	Springer	C
P32 [69]	From a Monolith to a Microservices Architecture: an Approach Based on Transactional Contexts	2019	Springer	C
P33 [70]	Migration of Software Components to Microservices: Matching and Synthesis	2019	ACM	C
P34 [71]	Service Candidate Identification From Monolithic Systems Based On Execution Traces	2019	IEEE	J
P35 [72]	Tool Support for the Migration to Microservice Architecture: an Industrial Case Study	2019	Springer	C
P36 [73]	Towards Automated Microservices Extraction Using Multi-objective Evolutionary Search	2019	Springer	C
P37 [74]	Unsupervised Learning Approach for Web Application Auto-decomposition Into Microservices	2019	SD	J
P38 [75]	An Automatic Extraction Approach: Transition to Microservices Architecture From Monolithic Application	2018	ACM	C
P39 [76]	Extracting Candidates of Microservices From Monolithic Application Code	2018	IEEE	C
P40 [77]	Functionsplitting Heuristics for Discovery of Microservices in Enterprise Systems	2018	Springer	C
P41 [78]	Migrating Web Applications From Monolithic Structure to Microservices Architecture	2018	ACM	C
P42 [79]	Extraction of Microservices From Monolithic Software Architectures	2017	IEEE	C
P43 [80]	From Monolith to Microservices: a Dataflowdriven Approach	2017	IEEE	C
P44 [81]	Microservices Identification Through Interface Analysis	2017	Springer	C
P45 [82]	Service Cutter: a Systematic Approach to Service Decomposition	2016	Springer	C

Table 7. Classification of challenges in microservice identification.

Challenges	Percentage	Studies
Granularity Challenges	35.56%
Service Size Optimization	15.56%	P1, P3, P4, P12, P15, P25, P32
Resource Allocation Balance	6.67%	P18, P23, P29
Communication Overhead Management	13.33%	P31, P34, P36, P38, P41, P42
Dependency Challenges	31.11%
Tight Coupling Resolution	13.33%	P5, P13 P14, P17, P27, P39
Interface Design Optimization	8.89%	P7, P16, P24, P35,
Service Communication Patterns	8.89%	P9, P11, P21, P40
Domain Modeling Challenges	20%
Business Logic Alignment	13.33%	P2, P6, P19, P26, P28, P43
Semantic Cohesion Maintenance	6.67%	P10, P30, P44
Data Distribution Challenges	13.33%
Data Consistency Management	4.44%	P8, P37
Data Integrity Maintenance	8.89%	P20, P22, P33, P45

Table 8. Classification of identification technique solutions.

Primary Category	Focus Area	Studies	%	Key Characteristics	Main Outcomes
Technique-Based Solutions	Service decomposition	P1, P2, P4, P5, P8, P9, P14, P27, P37	20%	ML/AI-driven analysis	Automated boundary detection
	Interface identification	P5, P7, P11, P15, P18, P35	13.8%	Pattern recognition	Service interface specifications
Framework-Based Solutions	Dependency mapping	P16, P28, P29, P34, P38	11.1%	Architectural patterns	Dependency graphs
	Adaptation and refinement	P10, P12, P20, P30, P42	11.1%	Iterative improvement	Refined service boundaries
General Identification	Service decomposition	P3, P13, P31, P40, P43	11.1%	Heuristic guidelines	Decomposition strategies
	Performance evaluation	P25, P26, P39	6.7%	Best practices	Performance metrics
Tool-Based Solutions	Interface identification	P6, P17, P32	6.7%	Automated tools	Interface specifications
	Dependency mapping	P21, P35	4.4%	Visualization tools	Dependency analysis
Algorithm-Based	Service decomposition	P23, P36, P41, P45	8.9%	Optimization algorithms	Optimal service boundaries
Protocol-Based	Interface identification	P19, P33	4.4%	Communication protocols	Service interaction patterns
Analysis-Based	Performance evaluation	P22, P24, P44	6.7%	Comparative analysis	Evaluation frameworks

Table 9. Comprehensive analysis of microservice identification approaches.

Category	Type	Components	Studies	Percentage	Key Considerations
Input Types	Business-oriented	- Process models - User stories - Functional requirements - BPMN diagrams - Domain expertise	P2, P12, P19, P32, P36	11%	- Strategic alignment - Business value focus - Domain integrity
	Technical	- Source code - Execution logs - API specifications - Database schemas - Runtime traces	P1, P3, P4, P6, P7, P8, P9, P10, P11, P14, P15, P17, P18, P23, P24, P26, P29, P30, P31, P33, P34, P38, P39, P41, P42, P43, P44	60%	- Implementation accuracy - Behavioral patterns - Technical dependencies
	Hybrid	- Combined technical–business - Stakeholder inputs - System context	P13, P16, P20, P21, P25, P27, P28, P35, P37, P40, P45	29%	- Balanced perspective - Comprehensive coverage - Reality alignment
Data Modeling	Graph-based	- Semantic clustering - Knowledge graphs - Neural networks	P1, P4, P5, P6, P7, P8, P14, P28, P30, P34, P38, P39, P40, P42	32%	- Relationship mapping - Pattern detection - Scalability
	Relational	- Data flow matrices - Component matrices - Dependency analysis	P2, P3, P9, P10, P13, P16, P17, P20, P21, P22, P24, P25, P26, P27, P29, P31, P32, P33, P35, P41, P43, P44, P45	51%	- Quantitative analysis - Mathematical precision - Structural clarity
	AI-based	- Deep learning - NLP models - Evolutionary algorithms	P11, P12, P15, P18, P19, P23, P36, P37	17%	- Automated discovery - Pattern learning - Adaptive modeling
Evaluation Metrics	Functional	- Cohesion - Coupling - Modularity	P4, P6, P9, P15, P16, P20, P22, P24, P25, P29, P30, P31, P35, P37, P38, P39, P41, P42, P44	42%	- Service quality - Design principles - Maintainability
	Performance	- Response time - Resource usage - Scalability	P1, P3, P7, P8, P10, P11, P18, P21, P23, P26, P27, P33, P34, P40, P43	33%	- System efficiency - Resource optimization - Operational fitness
	Hybrid	- Combined metrics - Integrated assessment - Multi-dimensional	P5, P12, P13, P14, P17, P22, P36	16%	- Comprehensive view - Balanced evaluation - Holistic assessment
	Business value	- ROI metrics - Time to market - Customer satisfaction	P2, P19, P28, P32, P45	9%	- Business impact - Value delivery - Stakeholder satisfaction

Table 10. Automation degree classification of identification techniques.

Automation Degree	Percentage	Studies
Manual	4.44%	P17, P26
Semi-automatic	62.22%	P3, P4, P6, P7, P10, P11, P12, P13, P15, P18, P19, P22, P24, P25, P27, P28, P29, P30, P31, P32, P33, P34, P35, P39, P40, P43, P44, P45
Automatic	33.33%	P1, P2, P5, P8, P9, P14, P16, P20, P21, P23, P36, P37, P38, P41, P42

Table 11. Microservice identification research directions.

Research Direction	Studies	Percentage
Automation	P1, P3, P10, P18, P25, P30, P36, P37, P38, P39	22%
Service Mesh Technologies	P13, P16, P27, P35, P40	11%
Edge Computing	P16, P27, P40, P41	9%
AI-Driven Solutions	P4, P5, P8, P11, P14, P15, P19, P37	18%
Low-Code/No-Code Platforms	P2, P12, P25, P32	9%
Enhanced Observability	P10, P23, P26, P34	9%
Decomposition Techniques	P5, P6, P7, P9, P28, P29, P31, P42, P45	20%
Migration Strategies	P3, P17, P21, P33, P35, P41, P43	16%
Performance Analysis	P20, P22, P24, P26, P44	11%
Standardized Metrics	P13, P20, P22, P24, P30	11%
Industry Benchmarks	P12, P18, P35, P36	9%

Table 12. Relationships between ontology classes.

Property	Domain	Range
isDecomposedBy	SystemArtifact	DecompositionTechnique
isEvaluatedBy	MicroserviceCandidate	EvaluationMetric
usesDomainKnowledge	DecompositionTechnique	DomainKnowledge
isValidatedBy	MicroserviceCandidate	ValidationMethod
dependsOn	MicroserviceCandidate	SystemArtifact
discoversTechnique	Studies	DecompositionTechnique

Table 13. OWL-DL query samples.

Query ID	Query Description
Q1	DecompositionTechnique and (appliedTo value Business_Processes)
Q2	EvaluationMetric and (associatedWith value Clustering_Algorithms)
Q3	SystemArtifact and (usedIn value Graph_Based_Partitioning)
Q4	EvaluationMetric and (assesses value MicroserviceCandidate)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Oumoussa, I.; Saidi, R. The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures. Computers 2025, 14, 133. https://doi.org/10.3390/computers14040133

AMA Style

Oumoussa I, Saidi R. The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures. Computers. 2025; 14(4):133. https://doi.org/10.3390/computers14040133

Chicago/Turabian Style

Oumoussa, Idris, and Rajaa Saidi. 2025. "The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures" Computers 14, no. 4: 133. https://doi.org/10.3390/computers14040133

APA Style

Oumoussa, I., & Saidi, R. (2025). The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures. Computers, 14(4), 133. https://doi.org/10.3390/computers14040133

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Ontology-Based Mapping of Microservice Identification Approaches: A Systematic Study of Migration Strategies from Monolithic to Microservice Architectures

Abstract

1. Introduction

2. Foundations

2.1. Microservice Architectures

2.2. From Monolithic to Microservices

3. Related Work

4. Methodology

4.1. Research Objectives

4.2. Research Questions

4.3. Literature Repository Selection

4.4. Selection Examination Procedure

4.5. Inclusion and Exclusion Criteria

4.6. Data Extraction Process

4.7. Overview of Primary Studies

5. The Outcomes of the Mapping

5.1. Overview of Selected Research

5.2. Challenges in Microservice Identification (RQ1)

5.3. Categorization and Performance of Microservice Identification Approaches (RQ2)

5.4. Key Inputs, Data Modeling, and Evaluation Metrics in Microservice Identification (RQ3)

5.5. Automation in Microservice Identification Approaches (RQ4)

5.6. Trends and Emerging Research Directions in Microservice Identification (RQ5)

6. An Ontology for Microservice Identification

7. Discussion

7.1. Principal Findings and Lessons Acquired

7.2. Practical Implications for Practitioners

7.3. Emerging Research Trajectories

8. Threats to Validity

8.1. Construct Validity

8.2. Internal Validity

8.3. External Validity

8.4. Conclusion Validity

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI