1. Introduction
Horizon Europe is the European Union’s flagship research and innovation programme. It implements the key policies of the European Commission (EC), such as the climate ambitions of the European Green Deal, by funding projects through a complex architecture of funding calls. While this diversity encourages innovation through both top-down and bottom-up approaches, it also creates challenges related to thematic coherence, fragmentation and overlap. These funding calls are also helpful to identify research trends and understand policy targets. In addition to guiding and allocating funding, these calls encode priorities, frame problems, and refine the approaches used by applicants to formulate their innovations and ideas. Therefore, understanding Horizon Europe’s internal structure and reasoning is needed to assess whether it is delivering on its mission-oriented policy ambitions.
Despite this, the literature has largely focused on analysing funded projects [
1], consortia structures, and national participation patterns [
2]. Very few studies examine the design phase of funding programmes, and even fewer approach the call texts themselves as cognitive artefacts that can shed light on the European Commission’s logic and strategic framing. Unlike prior work, which concentrates on projects after funding, our study shifts attention to the design stage of Horizon Europe by analysing the call texts directly. This provides a novel upstream perspective on how missions are operationalised, capturing the policy logic embedded in funding calls before any project selection occurs. Addressing this is important to comprehend the previous step of how collaboration structures are formed and how research funding shapes the development of emerging technologies. This gap in the literature raises a fundamental challenge: the need for a better understanding of how the Commission structures its thinking and what might potentially constrain the strategic ambitions of mission-oriented policy.
Network science may become the best approach to address this challenge, as it is an accessible tool for identifying, visualising and interpreting connections in varied and complex data. It has long been applied in innovation studies, specifically in the assessment of how network structures relate to European policy objectives [
3] and country-level collaborative research [
4]. In the context of research funding programmes, network analysis has proven effective in showing patterns of cooperation among institutions [
5], creation of innovation systems promoted by funding consortia [
6]. Previous studies on EU frameworks have also used it to map collaboration networks and estimate their influence on research outcomes [
7], or to track the journey of cities towards decarbonisation [
8]. More recently, cognitive network analyses demonstrated that these methods can capture explicit priorities while also showing implicit framing structures in textual and organisational data [
9].
While existing research has examined EU research collaboration patterns and analysed organisation-level networks in EU-funded projects [
10], these studies focus on institutional collaboration structures derived from project-level data instead of on the thematic architecture of the funding programme itself. To address this limitation, our study introduces a methodologically distinct approach: a dual-layered thematic network analysis applied directly to a set of Horizon Europe call texts disaggregated into their Scopes and Expected Outcomes. This enabled us to map conceptual priorities at the stage of designing the funding calls, and to capture thematic intent before any proposal selection occurs, studying whether early-stage policy framing shapes thematic coherence across the energy Work Programme. This provides a novel perspective to existing project-level approaches and extends the understanding of how EU research priorities are conceptually organised and potentially influence downstream the funded projects.
The analysis of call texts becomes even more relevant when considering the case of energy-related funding, as the energy transition is characterised by fast-moving innovation and deep systemic shifts, which is the type of transformation described in the emerging-technologies literature. According to Rotolo, Hicks and Martin, the key features of emergence (radical novelty, rapid growth, thematic coherence, and impact under uncertainty and ambiguity) [
11] can be traced in policy documents. These attributes are aligned closely with the energy-transition domain addressed by Cluster 5 of Horizon Europe, i.e., breakthrough innovative technologies are the radical novelty; growth and coherence are present in the expanding range of renewable energy topics; and impact-under-uncertainty is in the EU’s long-term climate and sustainability goals. Therefore, quantifying thematic coherence and highlighting low-centrality niches provide indirect signals about whether Horizon Europe is already supporting technological emergence and where policy focus remains isolated. In this way, linking emergence metrics with call-text networks reinforces our broader thought that semantic structures can serve as tools of policy direction.
Furthermore, this gap is especially noticeable for energy-related calls, where the coordination of technical, environmental, and societal dimensions is of utmost importance. These domains are intrinsically connected in energy systems, where, for instance, technological innovation must align with climate goals, regulatory frameworks, and citizen acceptance. The clean energy transition pushed by the European Commission involves high levels of complexity, cross-sectoral dependencies, and long-term investment strategies, which requires total internal coherence in policy framing. Consequently, any misalignment or fragmentation in the early design of funding calls can lead to inefficiencies, missed synergies, and potentially conflicting priorities downstream in the research and implementation funding process.
This work does not aim to provide an evaluation of the success of the funding programme nor to assess its value as a policy instrument; our objective is to identify thematic clustering, intersecting areas of research and potential gaps in thematic balance and importance. In doing so, this article makes three contributions: conceptually, it reframes Horizon Europe call texts as deliberate policy instruments rather than neutral administrative documents; methodologically, it introduces a dual-layered Textual Network Analysis (TNA) that disaggregates Scope and Outcomes and applies centrality, modularity, and assortativity metrics; practically, it derives lessons for policymakers (to identify gaps and improve Work Programme design) and applicants (to align proposals with the thematic architecture of EU missions).
Building on these foundations, this research applies TNA to all energy-related calls published under Horizon Europe’s 2021–2022 Work Programme. Instead of focusing on funded projects and consortia, integrating TNA at the call-design stage supports the theoretical justification for studying the calls themselves, where the main priorities and internal consistency of European policy are first set. The analysis is focused on the mapping of the interconnections between the different calls, breaking it down in their Scope and Outcomes sections, considering these as dual layers of the funding programme’s policy logic: the technical context and the impacts expected.
To facilitate clarity and transparency,
Figure 1 presents the structure of the research process. It illustrates the sequential stages followed: (i) manual retrieval of Horizon Europe call texts from the Funding & Tenders Portal; (ii) construction of two corpora corresponding to Scope and Expected Outcomes sections; (iii) keyword extraction using KeyBERT; (iv) network construction and computation of centrality and community metrics in R; and (v) interpretation of results in light of Mission-Oriented Innovation Policy (MOIP) theory. The framework is transferable to other Work Programmes, clusters, or even national funding schemes, making it broadly applicable beyond the immediate dataset.
3. Materials and Methods
3.1. Data
Our study focuses on the “Climate, Energy and Mobility” Work Programme under Cluster 5 of Horizon Europe for 2021–2022. We manually retrieved the full set of calls, sourced directly from the European Commission’s Funding & Tenders Portal [
32], which provides official and publicly accessible versions of all Horizon Europe calls.
A total of 188 calls were identified under HORIZON-CL5-2021 and HORIZON-CL5-2022, covering the actions defined by the European Commission on Climate, Energy, and Mobility in those years. For each of these calls, we extracted two key textual sections:
Scope: describing the expected technical and scientific focus of the call.
Expected Outcomes: intended results, benefits, or impacts as defined by the European Commission.
These sections were processed and stored as two parallel corpora, which were used as the foundational material for the subsequent semantic and network analyses. This dual structure enabled us to separately explore the alignment between strategic objectives and implementation mechanisms in the design of the calls.
3.2. Text Mining
For keyword extraction, we used KeyBERT [
33], a tool based on Bidirectional Encoder Representations from Transformers (BERT) embeddings, to identify semantically representative keywords from unstructured text. KeyBERT generates document embeddings and extracts words or phrases that align closely semantically with the overall content.
The text-mining pipeline followed four steps: (i) manual compilation of Scope and Expected Outcomes texts into an Excel file to create two structured corpora; (ii) application of KeyBERT to each entry in the dataset to extract representative keywords (top ten per section, using unigrams and bigrams); (iii) post-processing of results, which included removing common stop words and manually validating random samples against the original texts; and (iv) export of the cleaned keyword lists into structured files for subsequent network construction.
The extraction process was carried out manually: all Scope and Expected Outcomes texts were copied from the original Work Programme documents into a Microsoft Excel file, which served as the structured dataset. Each text entry was then processed with KeyBERT to extract representative keywords. The model computed BERT embeddings for the entire section and evaluated the semantic similarity of candidate keywords using cosine similarity. The script was configured to automatically remove common English words (e.g., “and”, “the”, “with”) that do not contribute meaningful semantic content. This step ensured that the resulting keywords were conceptually informative.
We also defined the number of keywords to retrieve to control the output size and ensure comparability across calls. For that, we chose an n-gram range of 1 to 2 to capture both unigrams and bigrams. Then, the script extracted the top ten keywords in each call for both their Scope and their Outcomes. The decision to extract the top ten keywords per section was guided by the need for comparability across calls of variable length. While the number is necessarily a simplification, it ensured a manageable, consistent representation of each call while avoiding dominance of longer calls. To test robustness, we re-ran the extraction at thresholds of 5 and 15 keywords; the relative rankings of centrality measures, as well as the qualitative conclusions, remained stable. This indicates that our results are not sensitive to the exact threshold chosen.
For easier readability and to facilitate manual inspection, results were saved in a structured format using separation with semicolons. To ensure the integrity and accuracy of the extracted terms, we validated the method by manually cross-checking random keyword samples against the original texts, confirming that extracted terms aligned with the intended meaning of the calls.
We selected KeyBERT for keyword extraction because it leverages transformer-based embeddings (BERT), which have been demonstrated to outperform traditional frequency-based or Term Frequency-Inverse Document Frequency (TF-IDF) methods in capturing semantic similarity [
34]. Compared to alternatives such as YAKE, RAKE, or topic modelling with Latent Dirichlet Allocation (LDA), KeyBERT provides a balance of interpretability and contextual sensitivity, making it particularly suitable for short, policy-oriented texts, such as Horizon Europe calls.
3.3. Network Analysis
To construct the cognitive networks used in this study, we developed a custom R script [
35] (R version 4.3.2, executed on Windows 10). The extracted keywords were treated as nodes in our co-occurrence networks, with edges formed based on their co-occurrence. The script made use of several packages, notably
igraph [
36] (for graph operations),
dplyr [
37] and
stringr [
38] (for data wrangling), and
readxl [
39] and
readr [
40] (for file input and output).
The first step involved parsing each topic entry to extract all keyword pairs appearing within a single outcome. These pairs were interpreted as undirected edges, and their frequency of co-occurrence across calls was recorded as edge weights, thereby reflecting the strength of thematic association. The aggregated edge list was then used to construct a weighted undirected graph using the graph_from_data_frame() function from the igraph package.
Once the network was built, we computed standard centrality metrics to quantify the structural role of every keyword. Each of these metrics captures a different facet of conceptual relevance or influence [
41], which help identify predominant themes and highlight structurally marginal or disconnected areas in the programme’s thematic architecture:
Degree: direct thematic breadth, which measures the number of direct connections a keyword has with others, indicating how broadly a concept is associated across the corpus.
Betweenness: bridging potential, identifying nodes that frequently occur on the shortest paths between other nodes, thus highlighting keywords that act as connectors between otherwise distant keywords.
Closeness: semantic accessibility, calculated as the average length of the shortest paths from a node to all others. This shows how quickly information can spread from a given concept.
Eigenvector centrality: influence through influential neighbours. This assigns relative scores to all nodes based on the principle that connections to highly connected nodes contribute more to the score of the node itself. In addition to the most frequently co-occurring ones, it identifies those keywords embedded in strategically important clusters.
To assess the structural properties of the network and evaluate the quality of clustering, we also computed assortativity coefficients, which shows the tendency of nodes to connect with others that have similar levels of connectivity, and modularity scores, which quantify the strength of division of a network into distinct communities (higher values indicating clearer thematic clustering). To identify thematic clusters, we applied the Louvain community detection algorithm (cluster_louvain() function in igraph), assigning each keyword to its corresponding community.
Finally, a basic network visualisation was generated, using colour coding to highlight low-degree nodes, with edge thickness proportional to co-occurrence weight. All output files, including centrality scores and summary statistics, were exported as CSV files for further inspection and integration into the broader analysis.
Formal definitions of the centrality measures used are provided in
Appendix A (Equations (A1)–(A4)). The modularity function used for community detection is given in Equation (A5).
3.4. Hypothesis Verification
We interpreted the defined hypotheses H1–H4 using established network measures computed for both layers.
For H1, we compared degree assortativity and modularity and examined degree distributions; directional support is reported when Outcomes exhibits lower or negative assortativity and comparable or slightly lower modularity than in the Scope.
For H2, we inspected the rank distribution of betweenness and identified keywords with betweenness values several orders of magnitude above the median as bridges.
For H3, we compared top-k hubs across layers using overlap in degree/eigenvector rankings and noted layer-specific policy anchors.
For H4, we flagged keywords with degree and closeness values below the 10th percentile as peripheral and qualitatively assessed their policy relevance.
To ensure robustness of results, we compared outputs across alternative parsing configurations and extraction thresholds. Specifically, we varied the number of keywords per section (n = 5, 10, 15) and the n-gram range (unigrams only vs. unigrams and bigrams), and in KeyBERT. Across all settings, the identity of top-ranked hubs, the relative ranking of centrality measures, and the qualitative interpretation of bridging and peripheral terms remained stable. This suggests that our results are not artefacts of a specific extraction configuration but instead reflect structural features of the Horizon Europe call texts. While alternative NLP methods (e.g., YAKE, RAKE, LDA topic models) might yield slight variations in keyword selection, the persistence of high-level patterns across robustness checks supports the reliability of our conclusions.
4. Results
This section presents the results of our network analysis across both Scope and Expected Outcomes sections of Horizon Europe energy-related calls. Our analysis focuses on better understanding the functions of specific terms, as well as revealing bridges, clusters, and structurally marginal themes in the architecture of the Work Programme. Instead of treating each layer (Scope and Outcomes) separately, we facilitate comparison of their structures and focus on areas of convergence and divergence, which allows our subsequent discussion on overall thematic orientation and policy coherence. See
Appendix A for formal definitions of the centrality measures.
4.1. Structural Features and Network Architecture
The overall topology of the networks offers insights into Horizon Europe’s thematic structuring logic. To assess the internal organisation of each layer and facilitate visualisation, we first limited the analysis to the top 30 most connected keywords and applied the Louvain community detection algorithm.
Shown in
Figure 2, the Scope network reported a modularity score of 0.54, which indicates substantial semantic clustering with cohesive communities of technical terms. Additionally, the positive assortativity coefficient (0.206) implied preferential attachment between highly connected keywords. This means that central keywords tend to reinforce each other and form stable thematic clusters. This architecture mirrors bottom-up structuring of technical areas, where domain-specific text blends naturally around a set of identified key challenges.
By contrast, the Outcomes network reported a modularity score of 0.50, which was slightly lower but still indicated moderate clustering. In this case, the assortativity coefficient was negative (−0.035) and indicates a weakly disassortative configuration; in other words, highly connected keywords are not more likely to link with each other, and they only connect broadly across the network. This architecture, shown in
Figure 3, resembles a hierarchical, policy-oriented structure with a limited number of central and overarching concepts (e.g., “renewable energy”, “climate change”), which are deliberately repeated in the call texts aiming to unify fragmented calls.
The Scope network favours thematic depth and technical cohesion, while the Outcomes network reflects deliberate convergence around impact-oriented goals. This divergence between technical formulation and expected impacts stresses the importance of bridging elements for effective alignment in research proposals.
In the full, unrestricted co-occurrence network derived from the Scope sections, the Louvain algorithm identified 61 communities. Its structure was fragmented with a majority of small clusters consisting of fewer than 10 nodes, alongside a small number of large communities (e.g., 118, 96, and 95 keywords). In contrast, the full network built from the Outcomes produced a modular structure with 43 communities. Most clusters were relatively small, with over 30 of them containing 10 nodes or fewer, but several large communities were also identified, including one with 140 keywords and others with 110, 97 and 86 nodes, respectively.
4.2. Thematic Anchors and Conceptual Hubs
Degree and eigenvector centrality were used to assess the importance and influence of keywords in the networks. Both Scope and Outcomes layers showed marked asymmetry in the distribution of keyword connectivity and displayed a pronounced long-tail distribution. A small set of hub terms had high connectivity and influence, while the majority of keywords presented minimal links.
In the Scope network, the mean degree across all nodes was 9.0, and the 10th percentile (threshold for low connectivity) was even lower. Similarly, the Outcomes network displayed a median degree of 9, and although its top nodes exhibited approximately double the connections of the Scope network, the long-tail distribution persisted. This suggests that although both layers share structural asymmetry, the Outcomes layer is more concentrated at the upper end of the distribution, reinforcing the assumption that the Work Programme architecture focuses on a compact set of programmatic terms. At the same time, it highlights the asymmetry of the network, where a few terms act as hubs and the majority remain in the periphery.
For instance, in the Scope network (
Table 1), “renewable energy” (77 connections), “climate change” (65), “energy storage” (63), and “energy generation” (60) were identified as these thematic anchors. They appeared frequently and also connected across multiple conceptual domains, reflecting the Commission’s emphasis on systemic challenges like the energy transition and climate resilience. The same pattern was mirrored in the Outcomes layer (
Table 2), where “renewable energy” (with 142 connections), “project results” (119), and “emissions” (98) dominated. The most notable difference was the presence of “project results”, absent from the Scope layer, which seems logical considering its role as an impact-oriented focal point. Complementing the technical specifications detailed in the Scope section, this clearly indicates a turn in the emphasis of the text from research orientation to the delivery of tangible impacts.
Eigenvector centrality confirmed the structural importance of these terms, evaluating their influence by weighting connections according to the centrality of neighbouring nodes, thereby capturing how influential these connections were in our study. “Renewable energy” returned the highest eigenvector score in both networks, which further validated its role as a conceptual nucleus and gravitational centre of the keyword networks. Other influential keywords such as “climate change”, “energy generation”, and “energy storage” also ranked highly and were likely benefiting from their proximity to other well-connected nodes.
4.3. Bridging Concepts and Thematic Integration
Then, we analysed betweenness and closeness centrality, which help identify concepts that link disconnected topics, as well as the overall accessibility of themes in the networks. First, we focused on betweenness (
Table 3), since keywords with high values can be considered bridges between otherwise disconnected thematic clusters.
In the Scope network, “energy storage”, “energy generation” and “emissions” show betweenness values several orders of magnitude higher than the average. This high betweenness placed them in structurally strategic positions that connect distinct domains such as renewable technologies, infrastructure, and mobility-related areas. Notably, mobility-related keywords, mostly those concerning transport, infrastructure, and logistics, also showed high centrality values, stressing their importance in the network. “Energy storage”, for example, was identified as a nexus linking technical generation systems (e.g., solar, wind), grid infrastructure, and application domains like buildings and transport.
In the Outcomes layer, the bridging function was even more concentrated. Most terms displayed zero betweenness, but a handful showed values that were orders of magnitude higher, notably “project results”, “climate change”, and “renewable energy”. These keywords allow cross-sectoral integration and connect areas such as agriculture, urban mobility, and digitalisation. This difference in distribution suggests a tighter concentration of holistic solutions in the Expected Outcomes section; at the same time, this may demonstrate top-down coherence around MOIP objectives rather than sector-specific interventions as the result of the funding calls.
Additionally, closeness centrality was a complementary tool to assess the overall thematic reach and systemic accessibility. In both layers, keywords like “energy generation”, “energy storage”, and “climate change” held central positions with minimal semantic distance from the rest of the network. This clearly highlights their role as the main entry points for navigating the thematic architecture of Horizon Europe. Conversely, peripheral terms with low closeness values, such as “methane”, “materials renovation”, or “aviation research”, were found in both Scope and Outcomes, which indicated areas of limited integration. These may represent both niche opportunities for researchers and neglected policy domains of the European Commission.
Through the analysis of bridging and proximity metrics, we found a dual structure: a consolidated core of cross-cutting themes that reinforce Horizon Europe’s general ambitions, and a periphery of isolated topics that may require deliberate linking strategies to achieve complete policy coherence.
5. Discussion
Building on the structural characteristics identified in our network analysis, we explore the broader implications for programme design, policy coherence, and MOIP. We acknowledge that certain general-purpose terms (e.g., “project results”) may partly reflect drafting conventions of the European Commission rather than thematic strategy. However, their structural prominence in the network is itself meaningful, as it highlights the institutional emphasis consistently encoded in the calls. Such terms may shape applicants’ framing and evaluators’ expectations, thereby carrying policy relevance regardless of their generic nature. This finding supports our central argument that Horizon Europe call texts should be treated not as neutral administrative templates but as cognitive artefacts of policy design, where textual choices can actively structure the dynamics of both application behaviour and programme coherence.
Assessing the main research hypotheses defined, the results are consistent with H1: compared with Scope, the Outcomes network exhibits a slightly lower modularity (0.50 vs. 0.54) and negative degree assortativity (−0.035 vs. +0.206), indicating top-down convergence around impact-level anchors (
Section 4.1). H2 is corroborated by the highly skewed betweenness distributions, with a small set of integrators such as “energy storage”, “emissions” and “climate change” in Scope, and “renewable energy”, “emissions” and “climate change” in Outcomes, which show values orders of magnitude above the median. H3 is supported by the continuity of hubs across layers (“renewable energy” ranks first by degree and eigenvector in both networks), with a policy-proximate anchor (“project results”) emerging only in Outcomes. H4 is evidenced by repeated low-degree/low-closeness terms that remain weakly integrated, signalling policy-relevant niches. These results jointly validate our initial expectations while also delineating actionable levers for MOIP-consistent call drafting and proposal design.
For easier readability, we organise our discussion in four pillars: first, the strategic role of bridging concepts for integration; second, the innovation niches identified in the peripheral terms; third, the key features and differences found between Scope and Outcomes as indicators of Horizon Europe’s internal policy logic to be considered in future work programme designs; and fourth, guidance for applicants to increase competitiveness and thematic alignment in their proposal writing.
5.1. Bridging Keywords as Policy Instruments
According to the betweenness centrality analysis, a handful of terms in both the Scope and Outcomes networks act as conceptual bridges and connect loose thematic clusters. Keywords such as “energy storage,” “emissions,” “climate change,” and “construction renovation” occupied structurally strategic positions. These terms are thematically rich and are used to integrate domains that are usually siloed; for instance, coupling renewable generation with urban planning, transport, and digitalisation.
From a policy design perspective, the prominence of these high-betweenness nodes suggests that Horizon Europe deliberately inserts certain cross-domain integrators to enhance its thematic coherence. For example, “energy storage”, a technical enabler, links renewable generation, grid flexibility, and sector coupling with mobility or buildings. Similarly, “digital twin” connects smart infrastructure development with data-driven monitoring and climate adaptation scenarios. Such bridging assumes that they contribute to translating high-level MOIP into implementable research calls. This points to an underlying strategy whereby certain keywords, selected for their thematic relevance and their bridging capacity, are included in call texts to promote integration across different energy and sustainability sectors.
With this approach, the Commission implicitly reinforces the idea that mission-oriented research cannot rely exclusively on disciplinary silos. Rather, it requires the orchestration of conceptual interfaces where distinct themes such as energy, digitalisation, mobility, and climate resilience intersect. This is consistent with the epistemic turn in MOIP described by Kattel and Mazzucato [
15] and stresses the European Commission’s intention to develop clear knowledge structures and systems capable of supporting coordination and learning across fragmented innovation ecosystems.
Additionally, a number of key concepts operate as semantic platforms across the Work Programme, in consonance with MOIP’s requirements of multi-sector integration. By facilitating systemic connections, certain bridging terms operationalise the principle of directionality that is central to MOIP. Notably, the case of “energy storage” illustrates this situation: it connects technological, infrastructural and application layers, while also reinforcing the priorities set out in the EU’s energy storage roadmap. In this sense, specific bridging keywords in funding calls are more than semantic artefacts; they function as instruments of policy coherence. Our results therefore show that bridging terms are not incidental but operate as designed connectors, aligning with MOIP’s requirement for cross-domain integration. This provides empirical evidence that textual choices in call drafting directly influence the systemic orientation of EU research policy.
5.2. Thematic Silos and Drafting of Funding Calls
In contrast to the strategic centrality of bridging terms, our analysis also identified a long tail of structurally marginal concepts and keywords. These low-degree, low-closeness, and low-eigenvector keywords (e.g., “rural transport”, “forest disturbance”, “drought events”, “ecological compensation”) were isolated across both networks.
At first glance, their marginality might suggest low thematic relevance and low priority to the EU Research Agenda. However, many of these concepts are tightly linked to recent policy efforts as well as to the five EU Missions, such as the Adaptation to Climate Change and the 100 Climate-Neutral and Smart Cities by 2030. Consequently, their structural isolation may point to a disconnection between the objectives of Horizon Europe and their operational implementation using funding calls. This is of utmost importance in mission-oriented research approaches, which heavily depend on a continuous coordination between high-level goals and programme design, and therefore it raises concerns about the extent of MOIP integration in Horizon Europe’s call architecture.
From the perspective of researchers, the transformation of these peripheral terms into meaningful components of their proposals can augment the depth and uniqueness of their proposed research. The inclusion of these keywords and concepts would demonstrate a sensitivity to underrepresented EU policy areas and introduce less explored problem-solution pairings that differentiate their proposal from the rest. As an example, researchers framing “drought resilience” in the “water-energy nexus”, or explicitly embedding “rural innovation” in calls addressing smart energy autonomy systems can reposition these concepts from the thematic peripheries to use them as hinges of their projects, which will help researchers stand out by proactively addressing thematic gaps in the programme.
In practical terms and in line with recent discussions advancing FP10, the drafting of funding calls should consider repositioning marginal keywords to strengthen the European research agenda and its MOIP approach. These isolated terms may become opportunities to better maximise EU funding; their inclusion through deliberate language choices in the funding calls would create a more conceptually inclusive innovation ecosystem. For instance, considering peripheral terms related to ecology and land use, this could be achieved by rephrasing funding calls to require applicants to consider territorial cohesion explicitly (e.g., project outcomes must include benefits for remote, rural or ecologically sensitive areas), or call for proposals that integrate these peripheral concepts through nexus framings (e.g., water–energy–biodiversity).
Consequently, while network metrics are mostly used for retrospective diagnostics, our study encourages policymakers to consider cognitive science as a tool for preparation of funding calls and Missions. In the design and preparation of upcoming Work Programmes, keyword metrics can help by early identifying thematic gaps, redundancies, and under-integrated terms. Incorporating these considerations into the drafting process of each individual call would allow for proactive alignment with mission objectives and would ensure that dominant and policy-relevant peripheral concepts are appropriately positioned to support the implementation of MOIP approaches.
Table 4 provides illustrative examples of extracts of call texts being adjusted to reposition peripheral keywords and encourage their integration with central terms. These examples were derived through a comparative analysis of the most typical Scope and Outcome phrasings used in Cluster 5 calls and modified using the conceptual logic derived from the centrality metrics in our study. The goal is to illustrate that simple yet deliberate changes in phrasing can break thematic silos, increase the centrality of marginalised terms, and align them with key central concepts to reduce policy fragmentation. In methodological terms, this illustrates how Textual Network Analysis can move beyond retrospective diagnostics and serve as a forward-looking design tool. The framework is transferable to other Work Programmes, clusters, or even national funding schemes, making it broadly applicable beyond the immediate dataset.
5.3. Implications for Future Work Programmes Design
Our analysis of the Scope and Outcomes networks and their comparison showed an evident asymmetry in their structures. The Scope network displayed higher assortativity and modularity, which indicated stronger thematic clustering and greater attachment among highly connected technical keywords. This reflects a bottom-up logic with the technical prospects coming together around shared research themes. In contrast, the Outcomes network displayed negative assortativity and concentrated density at the top of the distribution. A small group of keywords dominated the network structure, reflecting deliberate top-down coherence around impact-oriented objectives defined by the European Commission.
This dual architecture captures the hybrid nature of Horizon Europe and its attempt to reconcile the diversity and specialisation of bottom-up research with the coordinated efforts of mission-driven approaches. In MOIP terms, this reflects the balance between exploration and exploitation, between the technological specificity to be developed and the policy relevance to be achieved. The Scope is the exploratory layer, more diverse and decentralised, while the Outcome is focused on exploitation and their coordinated efforts with European policy agendas.
For programme designers, centrality metrics offer a useful tool for governance diagnostics. Additionally, cognitive science approaches can be used to monitor thematic coherence and to detect neglected areas in funding structures, which is of particular interest for their future iterations of the programmes. Policy-relevant terms that are currently under-integrated can be deliberately activated using minor adjustments with the aim of connecting marginal topics with central ones. Even subtle changes, such as pairing peripheral terms with commonly used flagship terms in phrasing, can increase their centrality. For example, our analysis shows that keywords such as “energy storage” act as high-betweenness bridges across renewable generation, grid infrastructure, and mobility. Policymakers could use this insight to deliberately frame future Work Programme calls around “energy storage” as a cross-sectoral enabler, thereby fostering proposals that integrate technical, infrastructural, and societal dimensions rather than treating them in silos.
These results make a case for the institutionalisation of network-informed diagnostics as part of the development of Funding and Work Programmes. The stratified nature of keyword centrality gives a better understanding of the conceptual structure of any funding scheme, from their overall objectives to their individual elements. Using network science to provide evidence of the coordination of technical innovations (Scope) with measurable contributions to wider policy objectives (Outcomes) would strengthen the upcoming internal coherence in FP10, as well as enhance its capacity to fulfil the ambitions of MOIP approaches. Based on our analysis,
Table 5 provides policy recommendations for future Work Programmes design. By institutionalising such network-informed diagnostics, EU policy actors could establish a systematic mechanism to test the coherence of funding calls before publication.
Finally, while our analysis is limited to the semantic architecture of Horizon Europe calls at the pre-implementation stage, the structural asymmetries we identified between Scope and Outcomes also raise questions regarding potential coordination challenges during policy implementation. The coexistence of a technically clustered Scope layer and a more centralised Outcomes layer may imply risks in lack of coordination when proposals are executed, particularly if the thematic diversity of technical research directions is not fully reconciled with the narrower set of impact anchors. This echoes broader concerns in the MOIP literature, in which implementation difficulties often arise from the need to coordinate heterogeneous actors and objectives under rigid programme structures [
13,
17].
Although assessing actual implementation outcomes requires project-level data and evaluative evidence that fall beyond the scope of this article, our findings suggest that such asymmetries could create frictions in translating research activities into policy impacts. Future research could therefore examine whether the semantic structures identified here correlate with coordination difficulties during project execution, which would offer a complementary line of analysis that bridges call design with policy implementation dynamics.
5.4. Guidance for Applicants and Proposal Writers
For researchers and applicants to Horizon Europe funding, our analysis highlights the value of explicitly integrating high-betweenness connectors into their proposal writing. If their research concepts build on these bridging terms, they can demonstrate cross-cutting relevance and improve the competitiveness of their proposals in the eyes of the evaluators of the European Commission. In the same way, this insight is also relevant for policymakers, who can consider the deliberate insertion of certain terms, as well as their omission, as a tool to shape EU research agendas and their consistency with MOIP principles.
Understanding the different roles of the Scope and Outcomes layers allows for better strategies in proposal writing: use hubs for credibility, bridges for integration, and peripheries for novelty. For grant writers and their consortia, this differentiation offers a data-driven roadmap for structuring proposal narratives and task distribution. Hubs (i.e., high-degree, high-eigenvector keywords) should guide the project idea in well-established and programmatically validated concepts, which increase the perceived legitimacy of the proposal. Bridging terms, typically with high betweenness centrality, are ideal for demonstrating interdisciplinary ambition and wider policy integration, potentially acting as pillars for the proposed work packages and complementary consortium roles that span different sectors and applications. Meanwhile, deliberate engagement with peripheral terms offers a controlled risk–reward opportunity: when smartly framed in the proposal idea and connected to central themes, these marginal concepts can demonstrate innovation, originality, and alignment with neglected or emerging EU priorities.
In addition to thematic convergence, proposal writers should also be aware that the conceptual architecture revealed by our analysis may influence evaluators’ mental models. The repeated presence of certain terms in funding calls contributes to shaping what is subconsciously perceived as credible, policy-relevant, or innovative in scope. Consequently, using central or bridging keywords may not only improve semantic coherence with the programme, but also echo the implicit expectations evaluators bring to the review process. This strengthens the value of such terms as structural connectors in the network and as rhetorical anchors during evaluation.
While central and bridging keywords can guide proposal structure and improve evaluation, they should not be used as a substitute for conceptual originality. Without a clearly innovative framing, proposals that rely too heavily on highly central keywords risk blending into a crowded field of superficially similar submissions. Therefore, applicants should aim to match semantic proximity with narrative uniqueness, employing central terms as the structural framework but ensuring their proposal offers a distinct and genuinely innovative contribution to the field. Achieving this balance, however, is rarely straightforward. Our analysis provides a roadmap for applicants: use hubs for legitimacy, bridges for integration, and peripheries for novelty. This actionable guidance exemplifies how methodological advances in TNA can be translated into practical benefits for research communities engaging with mission-oriented policy frameworks.
6. Conclusions
Our study has presented a comprehensive textual network analysis of the “Climate, Energy and Mobility” calls in Horizon Europe’s 2021–2022 Work Programme, focusing specifically on their Scope and Expected Outcomes descriptions. Extracting and analysing keyword co-occurrences, we mapped the conceptual architecture inherent to the programme and assessed the internal coherence of their thematic structure. The results provide a novel, data-driven approach to better understand how research priorities are framed at call-design stage, before any funding is allocated. Our analysis of the wording of calls as cognitive artefacts, rather than of projects or consortia, offers an upstream perspective that extends existing evaluations of EU funding.
At the same time, our research provides evidence that Textual Network Analysis serves as a powerful diagnostic tool for assessing the internal logic of mission-oriented funding frameworks. Centrality metrics, traditionally used in scientometrics, can effectively identify the roles that different concepts play in the programme’s policy narrative. High-degree and high-eigenvector keywords represent consolidated, programmatically endorsed themes, and high-betweenness terms reveal bridging concepts that support cross-domain integration. On the other hand, keywords with low centrality scores point out thematic blind spots and areas of potential innovation that remain under-integrated or underdeveloped. This demonstrates the methodological value of applying TNA to policy design, providing replicable and scalable tools for both scholars and practitioners.
The network structure we uncovered reflects Horizon Europe’s hybrid design. The Scope layer reveals a modular, technically oriented structure mirroring bottom-up research development, while the Outcomes layer displays a more centralised, impact-driven logic consistent with top-down mission framing. This asymmetry highlights the dual challenge faced by researchers and grant applicants, who need to balance the exploratory diversity of emerging and innovative technologies with the priorities of EU policy objectives.
Taken together, the results support our initial hypothesis H1 and confirm that Scope sections exhibit modular and assortative clustering while Outcomes sections are more centralised and policy-driven. The analysis also validates H2 and H3, as keywords such as “energy storage” and “emissions” play a bridging role across otherwise disconnected communities. Finally, H4 is corroborated through the identification of structurally marginal terms that correspond to policy-relevant but under-integrated themes. These outcomes collectively reinforce our main hypothesis that Horizon Europe’s Work Programme embodies a hybrid architecture, which aims to balance exploratory research diversity with mission-driven directionality.
Using centrality metrics, we provide policymakers and applicants with a framework to refine their future programme design and proposal writing strategies. For policymakers, network-based analysis may help identify neglected areas and strengthen semantic consistency in funding call design. Concretely, our results suggest that policymakers drafting future Horizon Europe or FP10 calls could increase coherence by pairing under-integrated concepts (e.g., drought resilience, rural transport) with central hubs such as “renewable energy” or “emissions”. This would prevent thematic blind spots while encouraging projects that align technical innovation with wider territorial and social priorities of the EU. For applicants, purposefully incorporating bridging concepts and peripheral elements can increase thematic coherence and improve the competitiveness of their proposals. Together, these practical implications show how our approach generates direct value for both programme designers and the research community.
Future work could build on this framework by treating entire calls as nodes within the network to trace how thematic clusters evolve across Work Programmes. Extending the analysis to other clusters and subsequent Work Programmes (e.g., 2023–2025) would also provide a clearer view of thematic trajectories in European research funding over time. Additionally, incorporating post-funding data, such as project abstracts available in CORDIS or impact evaluations, would offer a longitudinal perspective on how semantic design decisions influence actual funding outcomes. Furthermore, cross-referencing call texts with national energy strategies could yield insights into how EU-level framing interacts with member state contexts. These potential extensions, although beyond the scope of the present article, underscore the value of combining textual network analysis with complementary perspectives in future research.
Table 6 summarises the conceptual, methodological, and practical contributions of this research, highlighting how our work extends existing studies while generating direct value for both policy and practice. Ultimately, our study contributes to a growing body of work that views funding calls not merely as administrative instruments, but as deliberate textual elements that shape the future research and innovation landscape. Conceptually, we reframe calls as artefacts of mission-oriented policy design; methodologically, we demonstrate the utility of textual network analysis for diagnosing policy coherence; and practically, we derive actionable lessons for both policymakers and applicants. Through this deeper examination of Horizon Europe’s conceptual infrastructure, we provide a methodological and data-driven foundation for improving funding programme governance and guiding researchers within MOIP frameworks.