1. Introduction
The rapid growth of heterogeneous data environments has increased the analytical challenges faced by contemporary research. In ecology, environmental sciences, genomics, health studies, and decision sciences, researchers increasingly work with multiple data blocks that describe the same observational units from different but complementary perspectives. As a result, single-table exploratory approaches are often insufficient because they cannot fully capture the relationships between datasets that differ in scale, content, or informational characteristics. The search for methods capable of integrating several tables while preserving the geometric properties of multivariate spaces has a long lineage in statistical science, including the treatment of vector variables, the development of the RV coefficient as a measure of similarity between configurations, and the broader family of methods for the canonical analysis of several sets of variables [
1,
2,
3]. Within this tradition, co-inertia analysis (CoIA) emerged as one of the most elegant and operational frameworks for studying the shared structure between two tables observed on the same sampling units [
4,
5].
Co-inertia analysis was originally proposed as a multivariate method for maximizing the covariance between the projections of two linked data tables [
4]. Unlike procedures that focus primarily on prediction, CoIA is designed to reveal the common latent structure underlying paired datasets, making it especially valuable when the aim is exploratory integration rather than one-directional modeling. One of the main strengths of CoIA is that it preserves the internal structure of each dataset while identifying a shared space that facilitates interpretation of their common patterns [
5]. In this sense, co-inertia analysis belongs to the duality diagram family and forms part of a broader geometric approach to multivariate integration [
6,
7,
8].
The ecological sciences provided the earliest and perhaps most influential applications of CoIA. In these contexts, researchers frequently needed to relate species assemblages to environmental descriptors, biological traits to habitat gradients, or temporal changes in communities to changing ecological conditions [
4,
5]. Over time, its interpretive potential was strengthened by its relationship with adjacent approaches such as RLQ analysis and fourth-corner methods, which made it possible to connect species traits, environmental variation, and species composition in a more integrated inferential framework [
9,
10]. Multiple co-inertia analysis later expanded this rationale to more than two tables, allowing the study of synchrony and shared variability across temporal or multiblock ecological datasets [
11].
A major reason for the diffusion of CoIA has been the availability of accessible computational implementations. The ADE-4 platform and, later, the ade4 ecosystem in R contributed decisively to translating theoretical developments into reproducible workflows for applied researchers [
6,
7]. The consolidation of these tools did more than simplify computation; it widened the methodological reach of co-inertia analysis by allowing users from diverse disciplines to perform advanced geometric data analysis without having to derive the mathematics from first principles. As a result, CoIA evolved from a specialized ecological technique into a more general strategy for multiblock data integration [
8].
The interdisciplinary expansion of CoIA became especially visible with the rise of genomics and, later, multi-omics research. High-dimensional biological data often arise in parallel blocks, such as transcriptomics, proteomics, metabolomics, epigenomics, and genotype-derived marker sets. These blocks are typically measured on the same samples but express different layers of biological organization, making them ideal candidates for integrative multivariate analysis. Co-inertia was quickly adopted because it allowed cross-platform comparison and joint exploration of different omics spaces [
12,
13]. Subsequent work on multivariate integration of multi-omics datasets confirmed that CoIA and related latent-variable approaches occupy an important place in the broader toolbox of integrative biology [
14]. In parallel, packages such as mixOmics and DIABLO helped establish data integration as a mature methodological field [
15,
16].
Despite the growing use of co-inertia analysis in multiblock and high-dimensional data integration, its application in environments characterized by indeterminate or partially reliable information remains limited. At the same time, neutrosophic theory has emerged as a relevant framework for representing uncertainty, inconsistency, and indeterminacy in complex analytical settings. However, the potential methodological relationship between CoIA Cd neutrosophic approaches has not yet been systematically explored in the scientific literature. This study addresses that gap by examining the intellectual and thematic development of research at the intersection of co-inertia analysis and neutrosophic spaces through a bibliometric perspective.
This expansion toward multi-omics signals a deeper epistemological transformation. Biological and environmental systems are increasingly studied as layered systems in which information is partial, heterogeneous, and sometimes internally inconsistent. Reviews on multi-omics integration have emphasized that the field is moving from isolated high-throughput measurements toward true systems-level synthesis [
17,
18,
19,
20].
In this context, integrative multivariate methods have become increasingly important because the scientific problem itself is multiblock in nature. Co-inertia analysis is particularly valuable because it supports exploratory analysis of how different sources of information converge or diverge while preserving the internal structure of each dataset [
12,
13,
14,
15,
16]. Unlike methods focused exclusively on prediction, CoIA facilitates the interpretation of shared patterns across heterogeneous data sources.
This characteristic becomes especially important in contexts where uncertainty is inherent to the data rather than being treated only as statistical noise. In many real-world scenarios, researchers must interpret relationships between multiple datasets affected by incomplete information, ambiguity, or inconsistent observations. Such conditions highlight the need for analytical approaches capable of handling both multivariate complexity and epistemic uncertainty simultaneously.
The empirical literature further shows that CoIA has demonstrated considerable adaptability across application domains. Beyond classical ordination problems, it has been used to assess anthropogenic impacts on marine organisms, to integrate environmental datasets for contamination assessment, and to study fish–habitat relationships under disturbance [
21,
22,
23]. These studies confirm the usefulness of co-inertia analysis in contexts characterized by ecological complexity, correlated gradients, and heterogeneous sources of evidence. Nevertheless, even in these advanced applications, uncertainty is generally handled within conventional probabilistic or geometric-statistical assumptions. Ambiguity, incompleteness, and indeterminacy may be acknowledged substantively, but they are not usually modeled as explicit analytical dimensions. This limitation becomes increasingly relevant as researchers confront datasets that are not only multiblock and high-dimensional, but also epistemically unstable: partially observed, semantically vague, or characterized by conflicting information from different sources.
In this context, neutrosophic theory becomes particularly relevant. Introduced as a framework that extends beyond binary and fuzzy logics by explicitly incorporating indeterminacy alongside truth and falsity, neutrosophy offers a conceptual architecture for representing information that is incomplete, inconsistent, vague, or partially unknown [
24]. Its later formalization through single-valued, interval, and multi-valued neutrosophic sets made the framework operational for applied mathematical and decision-oriented contexts [
25,
26,
27]. In contrast to classical crisp representations, and even in contrast to many fuzzy formulations, neutrosophic approaches seek to preserve the independent role of indeterminacy rather than absorbing it into a single uncertainty parameter. This property is particularly significant for real-world datasets in which ambiguity does not merely reflect measurement error, but structural indeterminacy arising from conflicting evidence, missingness, or unstable classification rules.
Over the last decade, neutrosophic methods have expanded rapidly into multicriteria decision-making, similarity analysis, pattern recognition, medical diagnosis and complex decision systems [
27,
28,
29,
30,
31,
32]. Researchers have developed neutrosophic operators, similarity coefficients, aggregation procedures, and cross-entropy measures aimed at preserving the informational richness of indeterminate environments [
28,
29].
More recent studies have extended neutrosophic approaches to multivariate monitoring and quality control settings, including neutrosophic multivariate EWMA control charts and bootstrap-based multivariate schemes under neutrosophic conditions [
30,
31]. In parallel, conceptual discussions have emphasized that indeterminacy cannot always be adequately represented through probabilistic or fuzzy approaches alone [
32]. Together, these developments show that neutrosophic theory has evolved from a mainly theoretical concept into a practical framework for analyzing complex and uncertain data environments.
The present study addresses a gap that has not yet been examined through systematic bibliometric means. Although both co-inertia analysis and neutrosophic theory have each accumulated substantial studies, no prior study has mapped their joint scientific landscape, identified where their traditions overlap, or quantified the degree to which their conceptual dialog has developed. This absence is not merely bibliographic: without a structured overview of how these two traditions relate, it is difficult to assess whether their integration is conceptually feasible, methodologically opportune, or scientifically premature. A bibliometric investigation is therefore not only descriptive but diagnostic—it reveals the structural conditions under which a new methodological synthesis may or may not be emerging.
Despite the maturity of both research traditions, the methodological relationship between CoIA and neutrosophic theory remains limited. This gap is noteworthy because the two approaches appear conceptually compatible in several respects. Co-inertia analysis focuses on identifying concordance between linked datasets while preserving multivariate geometry, whereas neutrosophic methods focus on representing information characterized by truth, falsity, and indeterminacy. In principle, integrating these perspectives could provide a more flexible framework for analyzing datasets that contain not only numerical variability but also partially reliable or indeterminate information. Such an approach could be particularly relevant in environmental risk assessment, socio-ecological monitoring, multi-omics diagnostics, and decision-support systems where data sources differ in precision, completeness, and semantic certainty. However, the current literature remains fragmented, with CoIA mainly advancing in ecological and omics integration, while neutrosophic approaches have developed primarily within decision analysis and uncertainty modeling [
12,
13,
14,
15,
16,
24,
25,
26,
27,
28,
29,
30,
31,
32].
This fragmentation creates a clear need for structured bibliometric mapping. Bibliometric analysis is particularly suitable for identifying how a field evolves over time, which authors and journals shape its intellectual core, how thematic clusters are organized, and where emerging research fronts begin to appear [
33,
34,
35,
36,
37]. In rapidly diversifying domains, bibliometric methods do more than summarize productivity; they make visible the structure of scientific knowledge production, including co-citation structures, keyword co-occurrence, source concentration, and collaboration patterns [
33,
34,
35,
36,
37].
For a topic such as the intersection between co-inertia analysis and neutrosophic approaches, bibliometric mapping can help determine whether this area is evolving into a coherent research domain or whether it remains dispersed across related studies. Therefore, bibliometric analysis serves not only as a descriptive tool but also as a way to evaluate the maturity and conceptual organization of an emerging field.
The study therefore examines whether this intersection is evolving into a coherent research area or whether it remains dispersed across related disciplines such as multivariate statistics, ecology, omics integration, and neutrosophic uncertainty modeling. On that basis, the article addresses three guiding questions: What has been the evolution of scientific production on CoIA applied to neutrosophic spaces? What are the main thematic lines and predominant analytical approaches? Which authors, journals, institutions, and countries exert the greatest scientific influence in this emerging domain?
4. Discussion
The present bibliometric analysis provides a structured interpretation of how co-inertia analysis has evolved from a specialized multivariate tool into a broader framework for integrating heterogeneous data structures across ecology, bioinformatics, and complex-system analytics. The results indicate that the field has passed through a recognizable sequence of emergence, consolidation, and diversification [
4,
5,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24].
In response to the first research question, the temporal pattern confirms that scientific production has grown progressively, albeit unevenly, across the study period. The early years were characterized by low annual output but high long-term citation accumulation, a pattern typical of fields structured around seminal methodological contributions. This first stage was dominated by foundational work on co-inertia analysis and related ordination logic, as well as by the early software infrastructures that enabled its dissemination [
1,
2,
3,
4,
5]. The subsequent expansion phase reflects the widening relevance of multivariate coupling methods in environmental and biological research [
44,
45,
46,
47,
48,
49,
50,
51,
52]. The growth of the field parallels the broader movement in multivariate science toward multiblock, multidomain, and systems-oriented analytical strategies [
18,
19,
20,
21,
22,
23,
24,
25,
49,
50,
51,
52,
53].
The productivity and citation patterns reveal a pronounced intellectual concentration. The leading authors—especially Dray, Thioulouse, Dolédec, and Chessel—form a cohesive methodological nucleus that has shaped both the technical language and the empirical legitimacy of co-inertia analysis. Its influence is particularly evident in the continuity between foundational ordination studies, software implementation, ecological applications, and subsequent multi-board extensions [
1,
2,
3,
4,
5,
6,
7,
8,
9,
15,
16,
17]. The bibliometric data support the interpretation that CoIA is a field with a stable canonical core.
The journal structure reinforces this interpretation. The contrast between high-output and high-impact journals is telling. Broad multidisciplinary venues have widened the field’s visibility, but the heaviest citation concentrations remain in journals central to ecology, bioinformatics, and statistical computing. This suggests that co-inertia analysis has spread through disciplinary layering—not by displacing existing fields, but by being reinterpreted within them. Ecology remains the field’s historical anchor, while bioinformatics and multi-omics research have become key spaces for methodological renewal [
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
37,
38,
39,
40,
41,
42,
43,
44,
45,
46,
47,
48,
49,
50,
51,
52]. This pattern aligns with the conceptual centrality of “multivariate analysis” in the co-occurrence network and with the growing strategic weight of data integration as a thematic line.
The geographic and institutional findings point in the same direction. France leads in publication volume and acts as a hub in a broader transnational network. The collaboration map shows that the Franco-European core was the main channel through which CoIA spread to other scientific communities—particularly in the United Kingdom, Ireland, and the United States. The dual prominence of CNRS and University College Dublin reflects two distinct but complementary roles: one anchored in foundational methodology, the other in applied high-dimensional data analysis. That division matters analytically, because it mirrors the field’s own shift from ecological multivariate geometry toward large-scale integrative analytics [
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
38,
39,
40,
41,
42,
43,
44,
45,
46,
47,
48,
49,
50,
51,
52].
The disconnect between co-inertia analysis and neutrosophic theory—despite their apparent complementarity—comes down to structural differences between two largely separate disciplinary communities. CoIA grew out of the French ecological statistics tradition and spread mainly through software ecosystems (ade4, ADE-4) adopted by ecologists and bioinformaticians. Its practitioners have worked almost entirely within an ecology–multivariate statistics–bioinformatics triangle. Neutrosophic theory, by contrast, has roots in philosophical logic and found its audience among researchers in decision science, operations research, and engineering. These communities have had little methodological overlap: different journals, different citation habits, different empirical priorities. The bibliometric evidence—notably the absence of a consolidated neutrosophic cluster in the co-citation network and the peripheral position of neutrosophic journals in the three-field plot—reflects those structural separations, not any inherent incompatibility between the methods themselves.
From the perspective of the second research question, the conceptual structure of the field is highly coherent. The keyword co-occurrence map identifies “multivariate analysis” as the dominant organizing term, while co-inertia analysis occupies a central but intermediate position. The MCA factorial space supports this view by revealing a polarized but articulated domain: one pole is more formal, associated with multivariate and phylogenetic statistics; the other is more application-driven, structured around ecological traits, disturbance, biodiversity, and data integration [
4,
5,
6,
7,
8,
9,
33,
34,
35,
36,
37,
44,
45,
46,
47,
48,
49,
50,
51,
52].
The thematic map provides additional insight into the developmental status of the field. The location of motor themes in the ecological-functional zone confirms that the strongest and most mature applications of CoIA remain linked to biodiversity, macroinvertebrate research, and trait-based ecological interpretation. At the same time, the position of multiple co-inertia analysis and related consensus approaches as niche themes suggests technical sophistication combined with limited diffusion, consistent with their specialized role in multitable coupling and chemometric or omics-oriented integration [
15,
16,
17,
23,
24,
25,
53,
54]. Recent applications in mathematical biology have further extended the relevance of multivariate integration approaches in contexts of biological complexity [
55,
56].
The most consequential result of this study lies in the asymmetry between the mature co-inertia literature and the still peripheral presence of neutrosophic. Although journals and keywords associated with neutrosophy appear in the corpus, they do not yet form a dense or central cluster. The early neutrosophic literature established a wide conceptual foundation by formalizing neutrosophic logic, single-valued neutrosophic sets, neutrosophic measures, similarity concepts, and related epistemic constructs [
52,
53,
54,
57,
58,
59,
60,
61,
62,
63,
64].
This point is especially important because current developments in co-inertia analysis are increasingly placing it in contexts where uncertainty is a significant factor. In ecological monitoring, multi-omics integration, environmental contamination studies, and other high-dimensional settings, heterogeneity is rarely only numerical. It is often epistemic. Data blocks may differ in reliability, represent conflicting biological or environmental signals, or contain uncertainty that cannot be treated as ordinary noise. In such settings, the neutrosophic framework becomes relevant not simply as an abstract philosophical extension, but as a possible way to preserve indeterminacy explicitly within multivariate integration. Recent neutrosophic applications in market segmentation, multivariate control charts, bootstrap monitoring, and the distinction between uncertainty and indeterminacy show that the neutrosophic tradition is already moving toward more operational and quantitative contexts [
65,
66,
67,
68,
69,
70]. This makes the absence of a strong co-inertia–neutrosophy cluster all the more striking.
A substantial body of work has developed similarity measures, cross-entropy formulations, trapezoidal and interval neutrosophic representations, and weighted aggregation operators [
71,
72,
73,
74,
75,
76,
77]. A second set of contributions has extended these developments toward multivalued neutrosophic sets, linguistic uncertainty, DEMATEL–TOPSIS combinations, and multi-scale three-way decision systems [
78,
79,
80,
81,
82,
83,
84,
85,
86,
87]. Taken together, these studies demonstrate that neutrosophic methodology already possesses a sophisticated apparatus for handling structured indeterminacy. Yet this apparatus has remained largely external to the mainstream literature on co-inertia analysis. The bibliometric evidence thus indicates not the absence of tools, but the absence of integration.
A concrete pathway toward methodological integration could proceed through a sequential methodological workflow involving at least three stages. First, the original linked data tables could be transformed into neutrosophic matrices in which each observation is represented through truth (T), indeterminacy (I), and falsity (F) memberships. Within this framework, neutrosophic weighting schemes could be incorporated into the duality diagram structure underlying CoIA, allowing observations, variables, or entire data tables to carry different levels of epistemic certainty rather than a single deterministic reliability weight. This would enable the co-inertia compromise axes to reflect not only the shared variance between tables but also the degree of uncertainty associated with each observational unit.
Second, the RV coefficient used to assess concordance between tables in CoIA could be reformulated within a neutrosophic framework by replacing deterministic vector norms with single-valued neutrosophic similarity or distance measures, as developed in [
81,
82,
83]. Such adaptation would allow the concordance structure between linked datasets to preserve ambiguity and partial reliability during multivariate integration.
Third, the resulting co-inertia axes could be projected into neutrosophic compromise coordinates, producing ordination configurations that explicitly preserve indeterminacy instead of collapsing uncertain observations into single point estimates. Each of these stages builds on existing developments in neutrosophic similarity measures, aggregation operators, and multivalued uncertainty modeling [
81,
82,
83,
85,
86,
87,
88,
89,
90,
91,
92,
93,
94], and could be validated in ecological monitoring, multi-omics integration, or environmental contamination datasets characterized by partial observability and conflicting signals.
In summary, the discussion supports four main conclusions. Firstly, co-inertia analysis has consolidated itself as a methodologically stable and bibliometrically coherent field anchored in multivariate ecology and subsequently extending into bioinformatics and integration in high-dimensional spaces. Secondly, the field remains strongly structured by a canonical authorial and journal core, which explains both its stability and the citation concentration. Thirdly, the conceptual structure confirms that CoIA operates as a multivariate linking framework, rather than as a specialized technique of limited scope. Fourthly, the weak but detectable presence of neutrosophic-oriented literature suggests that the most promising future direction is methodological synthesis between multitable integration and explicit representations of indeterminacy [
58,
59,
60,
61,
62,
63,
64,
65,
66,
67,
68,
69,
70,
71,
72,
73,
74,
75,
76,
77,
78,
79,
80,
81,
82,
83,
84,
85,
86,
87]—a synthesis whose structural conditions are now in place.
6. Study Limitations and Future Directions
Several limitations should be considered when interpreting the findings of this study. The first is related to database coverage. The corpus was built exclusively from Scopus, which offers standardized metadata and broad journal coverage, making it highly suitable for bibliometric analysis. However, single-database studies inevitably risk underrepresenting literature indexed in other sources, including Web of Science, Dimensions, Crossref-linked repositories, or specialized disciplinary databases. This issue is recognized in methodological discussions on bibliometric design, which note that differences in indexing policies, source coverage, and metadata completeness can influence both productivity indicators and network structures [
88,
89,
90,
91,
92,
94,
95,
96]. Consequently, the present study should be interpreted as a rigorous map of the Scopus-indexed landscape rather than as an exhaustive census of all possible literature on CoIA and neutrosophic approaches.
A second limitation concerns the search strategy and keyword dependence of the corpus. Bibliometric analyses are highly sensitive to lexical formulation, especially in emerging or interdisciplinary fields where equivalent concepts may circulate under partially different terminologies. Although the search strategy was progressively refined and coupled with expert review, it is still possible that some relevant contributions were omitted because they used adjacent descriptors rather than the specific terms prioritized here. This limitation is inherent to bibliometric retrieval and has been repeatedly discussed in the bibliometric methods literature, particularly in relation to co-word mapping and emerging topic detection [
33,
34,
37,
88,
89,
92,
95,
96]. In fields where conceptual bridges are still weak, terminology may lag behind substantive similarity, making query design a decisive source of inclusion or exclusion.
A third limitation is associated with the restriction to English-language journal articles. This decision improved metadata consistency and comparability, but it also reduced the corpus to the most internationally visible segment of the literature. As a consequence, conceptually relevant contributions published in other languages, or in formats such as proceedings, book chapters, reports, or monographs, may have been excluded. This issue may be especially relevant for methodological traditions with strong regional development, where part of the foundational debate may not be fully visible in mainstream journal indexing systems.
A fourth limitation concerns the interpretive nature of citation-based evidence. Citations indicate visibility, uptake, and relational centrality, but they do not directly measure theoretical superiority or methodological adequacy. A highly cited document may be dominant because it offers software accessibility, broad disciplinary relevance, or timing advantages, rather than because it exhausts the theoretical possibilities of a field. Likewise, peripheral themes in a thematic map are not necessarily weak in epistemic terms; they may simply be recent, fragmented, or insufficiently stabilized. This is particularly important when interpreting the marginality of neutrosophic literature in the current corpus. Its weak bibliometric density does not imply low conceptual value. It may instead reflect recency, disciplinary separation, or differences in indexing intensity.
A fifth limitation derives from the use of science-mapping indicators themselves. Co-citation, bibliographic coupling, and co-word analyses are powerful for reconstructing relational structures, but each emphasizes different aspects of scientific organization. Co-citation privileges shared reception over time, bibliographic coupling emphasizes common reference behavior at the publication moment, and co-word analysis captures lexical proximity rather than necessarily deep conceptual convergence [
90,
91,
92]. Therefore, the present maps should be understood as complementary views of the field rather than as definitive representations of a single latent structure. In the same way, Bradford-type source concentration and Lotka-type author productivity laws provide useful heuristics for interpreting concentration patterns, but they should not be reified as deterministic laws governing all scientific production [
93,
94].
A sixth limitation concerns recency effects. Newer publications have had less time to accumulate citations, and emerging topics often appear bibliometrically weak during their early diffusion phase. This is especially relevant for the neutrosophic literature included in the corpus, much of which belongs to a more recent wave of applied uncertainty modeling. Therefore, the relatively peripheral position of neutrosophic terms and sources may partly reflect citation latency rather than purely structural marginality.
Despite these limitations, the study also opens several clear future directions. The first is methodological: future bibliometric work should adopt multi-database designs and compare how corpus structure changes when Scopus is combined with other indexing systems. Such triangulation would improve recall, reduce database-specific bias, and allow stronger robustness checks [
88,
95,
96]. The second direction is conceptual: future research should move beyond bibliometric mapping and develop formal analytical models that explicitly integrate co-inertia logic with neutrosophic representations of indeterminacy. This could include neutrosophic weighting schemes for linked data tables, extensions of covariance-based ordination under indeterminate information, or hybrid frameworks for multiblock analysis in which truth, falsity, and uncertainty are preserved simultaneously rather than collapsed into a single error term [
53,
54,
57,
58,
59,
60,
61,
62,
63,
64,
65,
66,
67,
68,
69,
70,
71,
72,
73,
74,
75,
76,
77,
78,
79,
80,
81,
82,
83,
84].
A third future direction is empirical. The thematic structure identified in this study suggests that environmental monitoring, ecological disturbance analysis, biodiversity assessments, and multi-omics integration are especially promising domains for testing a future co-inertia–neutrosophic synthesis. These are precisely the settings in which multiple data blocks, partial observability, and ambiguous signals tend to coexist. In such cases, the conventional strength of co-inertia analysis—its ability to reveal concordance between linked datasets—could be substantially enhanced by a framework that treats indeterminacy as analytically meaningful rather than merely residual.
Ultimately, the main limitation of the current literature is also its greatest opportunity. The explicit integration between CoIA and neutrosophic theory remains underdeveloped. Yet the bibliometric evidence presented here shows that both traditions have already matured enough, on their own terms, to make such a convergence plausible. The next step for the field is therefore not only to continue describing its structure, but to build the methodological bridge that its current conceptual configuration now makes visible.