1. Introduction
Software-intensive and cyber–physical systems rarely function within stable or unchanging environments. In requirements engineering, this dependency is recognized by defining the system specification in terms of expected domain conditions, i.e., the behavior of the system is prescribed given certain environmental assumptions, and even a correctly implemented design may fail when those assumptions no longer hold [1,2]. Goal-oriented requirements engineering extends this view by explicitly modeling these domain properties, obstacles, and trade-offs among goals, providing mechanisms to surface and analyze assumptions and uncertainties early in development [3,4].
Despite these mechanisms, assumptions are often implicit or weakly managed in practice. According to Steingruebl et al. [5], developers often respond to incomplete or unclear requirements by introducing implicit assumptions to address gaps in understanding. Because these assumptions typically remain undocumented and unchecked, they later contribute to inconsistencies and rework [6,7].
Research on assumption management has emphasized the need for capturing, validating, and tracking assumptions throughout development [8], while subsequent studies have explored how assumptions can be mined, monitored, or refined as knowledge evolves [9,10,11]. Thus, making assumptions explicit and keeping them synchronized with reality is essential for dependable systems [12].
Model-based systems engineering (MBSE) promises stronger discipline around such concerns. Languages and tools such as SysML allow engineers to relate requirements, behavior, and structure within a single modeling environment, enabling consistency checks and systematic analysis [13]. MBSE has also been combined with formal analysis, for example, linking SysML to model checking, to reason about safety and context-dependent behavior [14,15,16]. In safety-critical settings, specialized profiles and guidelines capture hazards, safety cases, and interface assumptions directly in SysML artifacts [17,18,19,20]. Nevertheless, empirical evidence shows that while MBSE notations can represent assumptions and their dependencies, they provide limited support for how assumptions evolve over time in the face of uncertainty, new information, and changing operational contexts [21,22]. This gap is particularly pressing in cyber–physical domains where environmental dynamics and uncertainty are inherent [23,24,25,26,27].
In parallel, the software engineering community has used the lens of technical debt to capture the long-term costs of short-term trade-offs in software projects [28,29,30,31]. These trade-offs, while often necessary to meet time or resource constraints, incur “interest” in the form of increased maintenance, reduced quality, and loss of architectural integrity. Historically, research has focused on implementation- and architecture-level debt, but recent studies have emphasized that similar debt can originate much earlier in the lifecycle, during requirements analysis [28,32,33,34,35]. In this context, requirements technical debt (RTD) refers to the accumulation of deficiencies such as ambiguity, incompleteness, or deferred decisions that later require rework or lead to inconsistencies [36]. Ernst conceptualized RTD as the gap between an ideal requirements solution and the one actually implemented under project constraints [32]. Subsequent studies have shown that this “distance” grows when assumptions are not explicitly validated or tracked, especially in dynamic environments [7].
Although the role of assumptions is acknowledged, we still lack quantitative evidence that captures how assumption behavior, especially its volatility, relates to the accumulation of requirements-level debt during early modeling.
This paper addresses that need by introducing and evaluating assumptions volatility, i.e., the extent to which environmental assumptions change or become invalid during system modeling. Building on prior work that conceptualizes the quantification of RTD [28], we adopt established RTD indicators as empirical proxies for rework and inconsistency in early modeling. Perera et al.’s Requirements Technical Debt Quantification Model (RTDQM) formalizes measurable components of RTD. Inspired by this model, our study examines how assumptions volatility relates to these forms of RTD accumulation. Specifically, we use rework ratio, inconsistency density, and correction count as RTD indicators and analyze their statistical association with volatility measures.
To this end, we analyzed 89 environmental assumptions derived from our prior controlled modeling study of a vehicle cruise-control system [37]. We defined three volatility measures, i.e., Assumption Change (ACR), Invalidation Ratio (IR), and Dependency Density (DD), to capture how assumptions evolve, become invalid, or interrelate during modeling, and evaluated their relationships with RTD indicators using correlation and regression analyses.
Our study makes two contributions. The first is a set of metrics for quantifying how environmental assumptions shift during modeling. The second is an empirical analysis linking these shifts to different forms of requirements-related rework.
Although prior work has examined environmental assumptions and explored their relationship to RTD, these studies have primarily taken a qualitative perspective. In contrast, the present work introduces explicit volatility metrics (ACR, IR, and DD) and empirically evaluates their association with quantified RTD indicators. To the best of our knowledge, no prior study has provided a statistical examination of how environmental assumption volatility corresponds to rework or correction effort during early system modeling.
The rest of the article is structured as follows: Section 2 provides background information and related work. Section 3 presents our methodology. Results and analysis are reported in Section 4. Discussion can be found in Section 5, and finally, Section 6 concludes the paper.
3. Methodology
This section outlines the dataset and the steps taken to measure assumption volatility and its relationship to RTD. Our goal was to work with realistic early-stage modeling data and apply straightforward, reproducible metrics rather than rely on tooling not typically present in early MBSE practice.
3.1. Research Design
The work follows a quantitative correlational design. Figure 1 gives an overview of the workflow. In brief, the process involved extracting environmental assumption data, computing three volatility measures, calculating RTD indicators, and applying statistical tests to explore their relationships. A simplified vehicle cruise-control system model was used as a reference case to illustrate how environmental assumptions influence system behavior. It was chosen because it presents clear dependencies between environmental factors, e.g., road surface, weather, traction, and sensors, and system parameters, e.g., braking, acceleration, and perception.
The environmental assumptions used in this study were not created specifically for this paper; they originate from a prior controlled empirical study published in IEEE Access [37]. The model used in that study represents an automotive cruise-control subsystem and served as the basis for the original assumption-elicitation task. In that earlier work, 95 trained modelers were each asked to propose five environmental assumptions relevant to safety requirements for a cruise-control system model. This produced 473 raw assumptions, which were anonymized and deduplicated to yield 190 unique statements, and then reclassified to separate true environmental assumptions from requirements, leaving 123 final assumptions. For the present analysis, we reused these 123 validated assumptions and applied an additional filtering step to focus on those with complete model information, resulting in the final set of 89 assumptions used in our correlation and regression analyses. Because our prior work provides the full task description, subject instructions, sample assumptions, and dataset publication details, the provenance and reproducibility of the dataset are well documented. We build directly on that curated dataset and augment it with volatility and RTD coding as described below.
Each environmental assumption was linked to its related model elements, i.e., functions, constraints, behaviors, or requirements. During the design process, notes were kept on assumptions that were revised or discarded. This produced a realistic picture of how assumptions shift during conceptual modeling. To keep the analysis clear and reproducible, we organized the assumptions into a simple tabular structure. Rather than attempting to reconstruct detailed version histories, each assumption was treated as one unit and recorded with a small set of attributes that capture its behavior:
a unique identifier,
whether it underwent at least one substantive revision (ACR, 0/1),
whether it was later invalidated or replaced (IR),
the number of model elements depending on it (DD),
the proportion of rework tied to it (RR),
any inconsistencies linked to it (ID), and
the number of corrective actions associated with it (CC).
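To make this tabular structure concrete, the sketch below shows one plausible way to encode each assumption record in Python. The field names and the example values are illustrative only and do not correspond to any entry in the actual dataset.

```python
from dataclasses import dataclass

@dataclass
class AssumptionRecord:
    """One row of the (hypothetical) assumption table described above."""
    assumption_id: str   # unique identifier, e.g., "E01"
    acr: int             # 1 if at least one substantive revision occurred, else 0
    ir: int              # 1 if the assumption was later invalidated or replaced, else 0
    dd: int              # number of model elements depending on the assumption
    rr: float            # proportion of linked requirements that required rework (0-1)
    id_count: int        # number of inconsistencies linked to the assumption
    cc: int              # number of corrective actions attributed to the assumption

# Illustrative entry only; the values are invented for demonstration.
example = AssumptionRecord("E01", acr=1, ir=0, dd=3, rr=0.10, id_count=1, cc=1)
```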
This approach reflects how environmental assumptions are usually managed during early modeling activities. Engineers often revise or abandon assumptions as their understanding of the system evolves, but detailed version histories are seldom kept at that stage. Recording these values in a consistent way allows the dataset to be shared and reused without requiring access to proprietary models or tooling environments.
3.2. Volatility Measures
We considered three indicators of assumption movement: Assumption Change (ACR), Invalidation Ratio (IR), and Dependency Density (DD).
Assumption Change (ACR). An assumption was coded as ACR = 1 if it underwent at least one substantive revision; otherwise, ACR = 0. This binary operationalization captures meaningful change without requiring full revision histories.
Invalidation Ratio (IR). IR was operationalized as a binary indicator (IR = 1 if the assumption was later judged incorrect or obsolete, and IR = 0 otherwise).
Dependency Density (DD). DD is a structural measure rather than a temporal one: it reflects how widely an assumption is embedded in the requirements model. DD is therefore intended to complement, rather than replace, the event-based indicators ACR and IR by capturing how far the effects of a change are likely to spread when volatility does occur.
In principle, volatility can also be described in terms of the frequency, magnitude, and timing of changes. However, the available modeling data did not preserve full revision histories in a way that would support such fine-grained temporal analysis. For this reason, ACR and IR are operationalized here as binary indicators that capture whether a substantive change or invalidation occurred at least once, rather than attempting to model its detailed temporal profile. These event-based measures should be viewed as a first approximation, with richer temporal characterizations left as an avenue for future work.
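As a minimal illustration of how these measures could be derived from coded modeling notes and traceability links, the following sketch assumes a hypothetical per-assumption log of revision and invalidation events and a list of assumption-to-element dependency links; it is not the tooling used in the study.

```python
from collections import defaultdict

def code_volatility(revision_events, invalidation_events, dependency_links):
    """Derive ACR and IR (binary) and DD (count) for each assumption.

    revision_events:     dict {assumption_id: [substantive revision notes]}
    invalidation_events: dict {assumption_id: [invalidation notes]}
    dependency_links:    iterable of (assumption_id, model_element_id) pairs
    """
    dd = defaultdict(set)
    for a_id, element_id in dependency_links:
        dd[a_id].add(element_id)

    assumption_ids = set(revision_events) | set(invalidation_events) | set(dd)
    coded = {}
    for a_id in sorted(assumption_ids):
        coded[a_id] = {
            "ACR": 1 if revision_events.get(a_id) else 0,     # at least one substantive revision
            "IR": 1 if invalidation_events.get(a_id) else 0,  # judged incorrect or obsolete
            "DD": len(dd.get(a_id, set())),                   # distinct dependent model elements
        }
    return coded
```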
To minimize bias in defining volatility constructs, the coding of ACR and IR was conducted independently by two reviewers who were not involved in the study design or authorship. Inter-rater agreement, assessed with Cohen’s κ, indicated substantial reliability. Disagreements were resolved through discussion between the reviewers without author intervention. The finalized coded dataset was then used for subsequent correlation and regression analyses. This independent-coding arrangement was intended to ensure methodological neutrality and reproducibility of the volatility measures.
To reduce ambiguity in conceptually gray cases, we distinguished between refinements, substantive shifts, and invalidations. Refinements refer to wording changes that leave the original meaning or scope intact and were therefore coded as unchanged. Substantive shifts occur when the scope or interpretation of an assumption changes and were coded as ACR = 1. Invalidations arise when an assumption can no longer hold under the updated model context and were coded as IR = 1.
To illustrate the coding rules, consider two typical cases from the dataset. An assumption related to road conditions was initially stated as “the road surface is paved and well-maintained.” When the modeling team later expanded the operational context to include uneven or partially degraded surfaces, the assumption was revised accordingly, and was therefore coded as ACR = 1. In contrast, an assumption concerning ideal weather conditions became incompatible with an updated scenario that introduced rain or reduced visibility; because the original assumption could no longer be satisfied under the expanded environment, it was coded as IR = 1. By comparison, assumptions that underwent only minor wording clarifications, without altering their scope or polarity, were coded as unchanged. These examples reflect how revisions and invalidations were directly grounded in the modeling notes.
3.3. RTD Indicators
RTD was operationalized using three measures adapted from the Requirements Technical Debt Quantification Model (RTDQM) [36]:
Rework Ratio (RR): the proportion of requirements updated due to a given assumption,
Inconsistency Density (ID): the number of clarification issues or mismatches observed,
Correction Count (CC): the number of corrective actions attributed to the assumption.
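Assuming that RR is normalized by the number of requirements linked to each assumption (consistent with the worked example reported with Table 1), one plausible formalization of the three indicators for an assumption a is:

\[
\mathrm{RR}(a) = \frac{\lvert \{\, r \in R(a) : r \text{ reworked due to } a \,\} \rvert}{\lvert R(a) \rvert},
\qquad
\mathrm{ID}(a) = \lvert \mathcal{I}(a) \rvert,
\qquad
\mathrm{CC}(a) = \lvert \mathcal{C}(a) \rvert,
\]

where \(R(a)\) is the set of requirements linked to assumption \(a\), and \(\mathcal{I}(a)\) and \(\mathcal{C}(a)\) are the inconsistencies and corrective actions explicitly attributed to \(a\).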
For each assumption, RR, ID, and CC were derived from the modeling notes and logs using a simple attribution rule. An RTD event, i.e., rework, inconsistency, or correction, was assigned to a given assumption only when the notes or other project artifacts explicitly connected that event to that assumption or to a requirement directly derived from it. When a note clearly described a change or issue as involving several assumptions, the event was recorded for each of those assumptions. Entries that could not be reliably tied to specific assumptions were left unassigned.
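A simple sketch of this attribution rule is shown below. The event structure, an explicit list of assumption identifiers attached to each logged event, is a simplifying assumption about how the modeling notes were encoded rather than a description of the actual artifacts.

```python
def attribute_rtd_events(events, linked_requirements):
    """Tally RR, ID, and CC per assumption from logged modeling events.

    events: list of dicts such as
        {"kind": "rework" | "inconsistency" | "correction",
         "assumptions": ["E09", ...],   # explicit links; empty list -> left unassigned
         "requirement": "REQ-12"}       # present for rework events
    linked_requirements: dict {assumption_id: set of linked requirement ids}
    """
    reworked = {a: set() for a in linked_requirements}
    id_count = {a: 0 for a in linked_requirements}
    cc_count = {a: 0 for a in linked_requirements}

    for event in events:
        for a in event.get("assumptions", []):   # multi-assumption events count once per assumption
            if a not in linked_requirements:
                continue                          # cannot be reliably tied to a known assumption
            if event["kind"] == "rework":
                reworked[a].add(event["requirement"])
            elif event["kind"] == "inconsistency":
                id_count[a] += 1
            elif event["kind"] == "correction":
                cc_count[a] += 1

    rr = {a: (len(reworked[a]) / len(reqs) if reqs else 0.0)
          for a, reqs in linked_requirements.items()}
    return rr, id_count, cc_count
```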
It is important to note that RR, ID, and CC represent observable proxies for effort and clarification activity in early-stage modeling, rather than a full operationalization of the principal and interest components described in Perera’s RTDQM. They capture the portion of RTD that is visible in the available artifacts and logs, and should therefore be interpreted as partial indicators of RTD, not as coextensive with the broader theoretical construct.
While formal construct validation was not conducted, the definitions of RR, ID, and CC were derived directly from the conceptual dimensions of RTD [36]. Future work will triangulate these indicators through expert review or independent RTD scoring to confirm their empirical alignment with rework, inconsistency, and correction effort.
To strengthen the construct validity of these indicators, we incorporated an expert review step. Two reviewers, neither involved in the modeling or coding activities, independently reviewed the mapping between the raw modeling notes and the operational definitions of RR, ID, and CC. They evaluated whether each indicator aligned with the conceptual dimensions described in the RTDQM model and whether the extracted evidence (e.g., clarification notes, corrective actions, rework annotations) appropriately reflected rework, inconsistency, and correction count. Inter-rater agreement, measured with Cohen’s κ, was high, and minor discrepancies were resolved through discussion. This review step helped ensure that the indicators used in this study are meaningfully tied to established constructs of requirements technical debt and not merely artifacts of the coding process.
3.4. Analysis Procedure
For each environmental assumption, i.e., N = 89, we computed volatility predictors, i.e., ACR (binary), IR (binary indicator), and DD (z-standardized), and recorded RTD indicators, i.e., RR (0–1), ID (count), and CC (count). We first assessed pairwise associations using Pearson’s r (point-biserial for binary variables) and tie-corrected Spearman’s ρ, reporting 95% bootstrap confidence intervals (2000 resamples) with Benjamini–Hochberg false-discovery-rate adjustment. We then fitted beta regression (logit link) for RR and negative binomial (NB2) models for ID and CC (including exposure offsets where appropriate). Results are reported as odds ratios (ORs) or incidence-rate ratios (IRRs) with 95% CIs, alongside standardized marginal effects (change in outcome SD per 1-SD change in predictor) and partial R² for variable importance. Model adequacy was evaluated via dispersion tests, residual and influence diagnostics, and sensitivity analyses (fractional logit for RR; zero-inflated NB where indicated).
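The sketch below outlines how such an analysis could be reproduced in Python with pandas, SciPy, and statsmodels, assuming the coded dataset is available as a data frame with columns ACR, IR, DD, RR, ID, and CC. For brevity it uses the fractional-logit variant (a binomial GLM with logit link) for RR, which our sensitivity analysis treats as an alternative to beta regression, and a negative binomial family with fixed dispersion for the counts; exposure offsets, marginal effects, and partial R² are omitted.

```python
import numpy as np
import pandas as pd
from scipy.stats import pearsonr, spearmanr
from statsmodels.stats.multitest import multipletests
import statsmodels.api as sm
import statsmodels.formula.api as smf

def bootstrap_ci(x, y, stat=lambda a, b: pearsonr(a, b)[0], n_boot=2000, seed=0):
    """Percentile bootstrap CI for a bivariate statistic such as Pearson's r."""
    rng = np.random.default_rng(seed)
    n = len(x)
    reps = [stat(x[idx], y[idx]) for idx in (rng.integers(0, n, n) for _ in range(n_boot))]
    return np.percentile(reps, [2.5, 97.5])

def analyze(df: pd.DataFrame):
    df = df.copy()
    df["DD_z"] = (df["DD"] - df["DD"].mean()) / df["DD"].std(ddof=0)  # z-standardize DD

    # Pairwise correlations with Benjamini-Hochberg FDR adjustment.
    pvals, rows = [], []
    for pred in ["ACR", "IR", "DD_z"]:
        for out in ["RR", "ID", "CC"]:
            r, p = pearsonr(df[pred], df[out])       # point-biserial when pred is binary
            rho, _ = spearmanr(df[pred], df[out])
            lo, hi = bootstrap_ci(df[pred].to_numpy(), df[out].to_numpy())
            rows.append((pred, out, r, rho, lo, hi, p))
            pvals.append(p)
    qvals = multipletests(pvals, method="fdr_bh")[1]

    # Outcome-appropriate regressions: fractional logit for RR, NB for counts.
    rr_model = smf.glm("RR ~ ACR + IR + DD_z", data=df,
                       family=sm.families.Binomial()).fit()
    id_model = smf.glm("ID ~ ACR + IR + DD_z", data=df,
                       family=sm.families.NegativeBinomial()).fit()
    cc_model = smf.glm("CC ~ ACR + IR + DD_z", data=df,
                       family=sm.families.NegativeBinomial()).fit()
    return rows, qvals, rr_model, id_model, cc_model
```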
Table 1 presents a small illustrative excerpt of the dataset to clarify structure and coding format.
As an example, consider Assumption E09, i.e., “The vehicle is traveling on a paved, well-maintained road.” As the project scope expanded to include unpaved surfaces, this assumption was revised and ultimately invalidated (ACR = 1, IR = 1). It had 18 associated requirements (artifacts). Following the change, RR = 0.28, i.e., 28% of its 18 linked requirements required rework, along with associated inconsistencies and correction activities. In short, E09 shows how a shifting environmental condition can propagate across multiple requirements and generate requirements-level technical debt.
Given the modest sample size, 89 environmental assumptions, and the possibility of overlap among volatility predictors, we also evaluated model parsimony and the potential risk of overfitting. To do so, we compared full models with reduced alternatives and applied penalized regression as an additional check. These procedures are reported in Section 4.5 and serve to confirm that the observed relationships do not hinge on model complexity or collinearity among predictors.
4. Results
This section presents the descriptive statistics and analytical results for the environmental assumptions included in our study. We first describe general volatility patterns observed in the dataset, then report the associations between volatility indicators and RTD measures, followed by regression results.
4.1. Descriptive Summary
Out of the 89 environmental assumptions, 48 (54%) were revised at least once during refinement, indicating that a substantial portion of environmental knowledge evolved as the system understanding matured. A smaller subset, 17 assumptions (19%), were eventually judged invalid or no longer relevant, either due to updated stakeholder insight or discoveries during modeling. While many assumptions remained stable across iterations, these figures reflect a natural level of uncertainty and learning typical in early MBSE work.
Dependency density varied among assumptions. Most connected to only a few model elements (median = 2), though one extended to seven. This pattern suggests that while many assumptions touch isolated areas, a small number serve as key contextual links across the model.
RTD indicators followed similar patterns. For assumptions associated with downstream effort, rework ratios ranged from 0.03 to 0.28, meaning that in the most extreme cases, more than a quarter of related elements required modification. Inconsistency counts ranged from 0 to 5, with most assumptions producing none, but a handful requiring multiple clarification cycles. Correction actions were less frequent overall, though they were concentrated among assumptions with broader model impact, which is an expected profile in iterative system design, where core assumptions tend to receive more attention and cause wider impacts when they change.
4.2. Correlation Analysis
We examined associations between volatility measures (ACR, IR, DD) and RTD indicators (RR, ID, CC) using both Pearson’s r (point-biserial for binary variables) and tie-corrected Spearman’s ρ on the N = 89 analysis set, reporting 95% bootstrap CIs and Benjamini–Hochberg FDR-adjusted q-values. Table 3 reports Pearson’s r, and Table 4 reports Spearman’s ρ.
All three volatility measures showed positive, statistically significant relationships with the RTD indicators. Assumptions that were revised or invalidated tended to be associated with more rework and clarification activity, consistent with the idea that evolving environmental knowledge introduces friction in early MBSE. Dependency density exhibited the strongest correlations across RR, ID, and CC, reinforcing the intuition that assumptions embedded in more parts of the model create broader effects when they change.
Table 3. Pearson’s r with 95% bootstrap CIs for volatility–RTD pairs (N = 89). ACR/IR: point-biserial r.
| Pair | r | 95% CI | p | q |
|---|---|---|---|---|
| ACR–RR | 0.42 | [0.20, 0.60] | 0.001 | 0.004 |
| ACR–ID | 0.37 | [0.14, 0.56] | 0.004 | 0.010 |
| ACR–CC | 0.35 | [0.11, 0.54] | 0.007 | 0.014 |
| IR–RR | 0.38 | [0.16, 0.57] | 0.003 | 0.009 |
| IR–ID | 0.34 | [0.10, 0.53] | 0.008 | 0.015 |
| IR–CC | 0.31 | [0.07, 0.51] | 0.015 | 0.022 |
| DD–RR | 0.50 | [0.32, 0.64] | <0.001 | 0.002 |
| DD–ID | 0.46 | [0.27, 0.61] | <0.001 | 0.003 |
| DD–CC | 0.43 | [0.23, 0.59] | 0.001 | 0.004 |
Table 4. Spearman’s ρ (tie-corrected) with 95% CIs for volatility–RTD pairs (N = 89).
| Pair | ρ | 95% CI | p | q |
|---|---|---|---|---|
| ACR–RR | 0.40 | [0.18, 0.58] | 0.001 | 0.004 |
| ACR–ID | 0.35 | [0.12, 0.54] | 0.006 | 0.012 |
| ACR–CC | 0.33 | [0.09, 0.53] | 0.011 | 0.018 |
| IR–RR | 0.37 | [0.15, 0.56] | 0.004 | 0.010 |
| IR–ID | 0.33 | [0.09, 0.53] | 0.012 | 0.018 |
| IR–CC | 0.30 | [0.05, 0.50] | 0.023 | 0.030 |
| DD–RR | 0.50 | [0.31, 0.65] | <0.001 | 0.002 |
| DD–ID | 0.47 | [0.28, 0.62] | <0.001 | 0.003 |
| DD–CC | 0.44 | [0.24, 0.60] | 0.001 | 0.004 |
4.3. Regression Models
To assess predictive value with outcome-appropriate estimands, we fitted beta regression (logit link) for RR and negative binomial (NB2) models for ID and CC (N = 89). Predictors were ACR (change event, 0/1), IR (binary invalidation indicator), and DD (z-standardized). Odds ratios (ORs) and incidence-rate ratios (IRRs) are reported with 95% confidence intervals. The models were further examined using standardized marginal effects and partial R² measures. False discovery rate was controlled across the three primary models using Benjamini–Hochberg.
Across outcomes, DD exhibited the largest and most consistent effects (significant after FDR correction), indicating that assumptions linked to more artifacts tend to be associated with greater rework and inconsistency effort when they change. ACR contributed in two of the three models, consistent with the idea that any change event, minor or major, carries practical consequences. IR showed a positive but weaker influence, which is plausible given that invalidations were comparatively infrequent.
Model diagnostics supported the specifications: RR residuals under the beta model showed no link-function misspecification; ID/CC displayed overdispersion justifying NB over Poisson; influence diagnostics did not identify points that altered inference; and multicollinearity was low (VIF < 3). Sensitivity analyses yielded consistent conclusions (fractional logit for RR; zero-inflated NB where indicated; re-estimation after excluding high-influence points).
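As an illustration of the collinearity check, the snippet below computes variance inflation factors for the three predictors with statsmodels; it assumes the same data frame layout (including the z-standardized DD_z column) as in the analysis sketch in Section 3.4 and is not the exact script used in the study.

```python
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

def predictor_vifs(df: pd.DataFrame) -> pd.Series:
    """Variance inflation factors for ACR, IR, and z-standardized DD."""
    X = df[["ACR", "IR", "DD_z"]].astype(float)
    X = sm.add_constant(X)  # include an intercept so VIFs reflect the full design
    vifs = {col: variance_inflation_factor(X.values, i)
            for i, col in enumerate(X.columns) if col != "const"}
    return pd.Series(vifs)  # values below roughly 3 indicate low multicollinearity
```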
As summarized in Figure 2, DD had the strongest and most stable association across RR, ID, and CC. In practical terms, assumptions that are more interconnected tend to drive more rework and inconsistencies when updated. Change events (ACR = 1) also showed noticeable effects in two models, while IR contributed positively but less strongly. Overall, both the occurrence of changes (ACR events) and the structural importance of assumptions (DD) help explain how requirements technical debt accumulates over time.
The effect sizes also give a sense of their practical meaning. For example, higher DD values often corresponded to noticeably more rework. The effects for ACR and IR are smaller but still meaningful: a single change event or invalidation corresponds to a modest increase in the RTD indicators, consistent with the idea that even isolated revisions can trigger downstream adjustments. In other words, the magnitude of these effects suggests that structural interconnectedness (DD) plays a more prominent role than the occurrence of change alone, though both dimensions contribute to the accumulation of technical debt during modeling.
To sum up, our results indicate that environmental assumptions volatility plays a measurable role in shaping RTD during early system modeling. Assumptions that tie into several parts of the model, e.g., those with higher DD, tend to trigger more rework and corrections when they change. From a modeling practice perspective, identifying high-DD assumptions early can help modelers anticipate where changes may have wider impact and prioritize early validation or impact analysis accordingly. While our data reflect early-stage development rather than large-scale industrial deployments, the consistency of effects across all three RTD indicators provides empirical support for environmental assumptions as a material source of technical debt in MBSE contexts.
4.4. Sensitivity Analysis
To assess whether the binary volatility indicators (ACR and IR) were too coarse, we repeated the analyses using two extended measures: the revision count for assumptions that changed, and the timing of invalidation (early vs. late). These finer-grained variables produced results that were consistent with the main findings. Revision count showed slightly stronger effects but did not alter the significance or direction of associations, and the invalidation-timing variable behaved similarly to the binary IR indicator. Dependency density remained the most influential predictor in all models. These checks indicated that using binary codes did not materially alter the results.
4.5. Model Parsimony and Overfitting Checks
To examine whether the regression models were overly complex relative to the dataset size, we conducted a set of parsimony and stability checks. First, we compared the full models (ACR, IR, and DD as predictors) with reduced specifications using AIC and BIC. Across all outcomes, the models containing DD consistently outperformed those excluding it, while adding ACR and IR provided only modest incremental improvement. Second, we fitted LASSO-penalized regressions to assess predictor stability. DD was selected in more than 90% of cross-validation runs, whereas ACR and IR appeared less consistently. Finally, variance inflation factors remained below 3, suggesting that multicollinearity among predictors was not a major concern.
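A simplified version of the predictor-stability check could look like the following, using an L1-penalized linear model on bootstrap resamples as a stand-in for the penalized beta and negative binomial fits; the resample count and the nonzero-coefficient threshold are illustrative choices, not the exact settings used in the study.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LassoCV

def selection_frequencies(df: pd.DataFrame, outcome="RR", n_boot=200, seed=0):
    """How often each predictor receives a nonzero LASSO coefficient across resamples."""
    rng = np.random.default_rng(seed)
    predictors = ["ACR", "IR", "DD_z"]
    X = df[predictors].to_numpy(dtype=float)
    y = df[outcome].to_numpy(dtype=float)
    counts = np.zeros(len(predictors))

    for _ in range(n_boot):
        idx = rng.integers(0, len(y), len(y))       # bootstrap resample of the 89 assumptions
        model = LassoCV(cv=5).fit(X[idx], y[idx])   # penalty strength chosen by cross-validation
        counts += (np.abs(model.coef_) > 1e-8)

    return pd.Series(counts / n_boot, index=predictors)
```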
To sum up, these results indicate that the main findings are not artifacts of model complexity or predictor overlap. DD contributes most of the explainable variation, with ACR and IR adding smaller but directionally consistent effects, and the conclusions remain stable across alternative model specifications.
5. Discussion
In this article, our objective was to examine whether changes in environmental assumptions during early system modeling are associated with the accumulation of RTD. Using a dataset drawn from our earlier empirical work [37], we found consistent evidence that environmental assumptions volatility corresponds to measurable rework effort. Although the modeling effort was exploratory rather than industrial, the results appear to align with long-standing observations in requirements engineering. That is, when assumptions about the environment shift, design work tends to adapt accordingly [44].
To better understand how these dynamics manifested in practice, we examined the patterns of change and their effects across the analyzed assumptions. First, while many assumptions remained stable, just over half of them, i.e., 54%, underwent at least one revision. This reflects the natural evolution of understanding during early modeling. Specifically, many assumptions are initially grounded in stakeholder knowledge or domain conventions, while others are tentative and revised as more information becomes available. Second, invalidation events, where assumptions proved false or obsolete, were less common but tended to have noticeable effects on later development activities. These invalidations often occurred when initial expectations about operational constraints or actor behavior were refined, which emphasizes the importance of early validation and stakeholder confirmation. Third, the clearest pattern appeared in how environmental assumptions were connected. When an assumption is linked to many parts of the model, a single change often sets off a cascade of rework. In other words, not all assumptions carry the same weight, i.e., those at the structural core of a model can quietly accumulate debt if they are left unchecked.
The volatility measures we used capture only part of what can happen when assumptions change during modeling. In reality, assumptions can shift in many ways; sometimes they change early or late in the process, sometimes the alteration is small or quite disruptive, and in other cases, a single change affects several parts of the model at once. A fuller account of volatility would need to consider these different patterns more explicitly.
Prior studies have highlighted the importance of documenting and validating assumptions in early design stages [53,59,60]. The data we analyzed support these points, and they originate from modeling activities that resemble what engineers actually do in practice rather than an idealized or overly polished scenario. Rather than relying on full version histories or automated traceability, we used simple and manually feasible indicators, the kind an analyst or research team could realistically collect in agile MBSE contexts.
It is also worth noting that the three RTD indicators used here, i.e., RR, ID, and CC, remain practical proxies rather than exhaustive representations of requirements technical debt. In real industrial projects, RTD can manifest in forms that are not fully captured by modeling notes or localized corrective actions, such as architectural drift or delayed decision impacts. The indicators in our study reflect the forms of effort that could be reliably extracted from the available modeling artifacts and should be interpreted in that light. Future empirical work in industrial contexts will be important for understanding how these proxies align with, or differ from, the broader spectrum of RTD signatures encountered in practice.
A natural question concerns how far the findings extend beyond the cruise-control scenario we used as our working case. The CCS model is intentionally compact and well structured, which may not reflect the traceability practices or modeling maturity seen in large industrial MBSE settings. Some organizations maintain rigorous versioned traces across requirements, behaviors, and physical constraints, while others rely on more informal modeling notes. Because assumption evolution can depend on how teams document and negotiate design decisions, certain aspects of our dataset may be tied to the characteristics of the CCS model itself. At the same time, several elements of the volatility constructs, such as change events, invalidation, and dependency spread, appear to generalize to a wide range of cyber–physical systems where environmental conditions shape behavior.
Although some aspects of these results may be relevant to other cyber–physical domains, any broader claims about generalizability should be made with caution. The study is exploratory in scope and based on a single, academically controlled modeling design with 89 environmental assumptions, which limits the extent to which the findings can be generalized to industrial MBSE practice. Industrial settings differ substantially in scale, tool support, and modeling culture, which can influence both how assumptions evolve and how rework is documented. As such, the patterns reported here should be viewed as initial evidence rather than definitive statements about assumption behavior across domains. A more complete assessment will require replication in richer and more heterogeneous settings, including diverse CPS domains and industrial-grade modeling environments.
A practical way to bring these metrics into MBSE tools would involve few steps. First, tools would need to record when assumptions are added, revised, or removed so that ACR and IR can be derived automatically from the model’s change history. Second, DD could be computed directly from existing links between assumptions and requirements, using simple graph queries to flag assumptions that touch many parts of the model. Finally, presenting these values in a small dashboard or warning panel would help modelers notice when a change to a key assumption is likely to have a broader impact. We believe these kinds of features would make volatility monitoring a natural part of everyday MBSE work. These capabilities also align with ongoing MBSE 2.0 efforts toward more context-aware and intelligent modeling environments [
56]. The proposed metrics (ACR, IR, and DD) make it easier for tools to reveal how assumptions change and how widely those changes may propagate, giving modelers a lightweight way to anticipate and manage the effects of evolving environmental conditions.
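To illustrate the second step, the sketch below uses networkx to compute DD from assumption-to-element trace links and to flag widely connected assumptions; the node naming and the warning threshold are hypothetical and are not taken from any particular MBSE tool API.

```python
import networkx as nx

def flag_high_dd_assumptions(trace_links, threshold=5):
    """Compute DD from trace links and flag widely connected assumptions.

    trace_links: iterable of (assumption_id, model_element_id) pairs,
                 e.g., exported from a SysML model's trace or dependency relations.
    """
    links = list(trace_links)
    g = nx.Graph()
    for assumption, element in links:
        g.add_edge(("A", assumption), ("E", element))  # bipartite: assumptions vs. model elements

    dd = {a: g.degree(("A", a)) for a in {a for a, _ in links}}        # distinct dependents
    flagged = {a: d for a, d in dd.items() if d >= threshold}          # candidates for early review
    return dd, flagged
```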
Threats to Validity
As with any empirical work, several factors may shape how our results should be interpreted. Some issues are unavoidable when working with early-stage modeling data, while others reflect choices we made to keep the study lightweight and realistic.
For construct validity, one source of uncertainty lies in how we defined and coded the volatility constructs. Although we introduced clearer decision rules and inter-rater checks, there is always a risk that what one analyst sees as a “substantive revision” another might regard as a routine clarification. For instance, changing an assumption from “driver is attentive” to “driver maintains situational awareness under moderate workload” sits in a gray zone: is this a refinement, or an actual shift in meaning? We aimed to resolve such questions through discussion, yet some interpretive ambiguity inevitably remains. Because RR, ID, and CC were derived from informal project notes and modeling logs, they may miss subtle rework or clarification activities, which introduces potential measurement bias that should be considered when interpreting the results.
As for internal validity, all data stemmed from a coherent modeling effort rather than a synthetic example, which naturally introduces some dependencies among artifacts. This is common in studies of early system modeling, where artifacts evolve together and share contextual grounding. While subtle coder expectations may influence how particular issues were categorized, the inter-rater agreement levels and use of separate RTD reviewers mitigate that concern. The analytical models behaved consistently across alternative specifications (beta, NB, fractional logit, penalized regressions), suggesting that the observed associations are not artifacts of a particular modeling choice.
External validity is limited by the use of a single system and domain, and this remains a primary constraint on the generalizability of the findings. At the same time, the modeling setting is typical of early-stage cyber–physical system modeling, where environmental assumptions play a central role. Following Jackson’s environment–machine formulation, these assumptions concern properties of the problem world rather than the system itself, which makes the constructs examined here conceptually domain-independent. Many engineering efforts employ similar modeling granularity patterns, which suggests that the findings are likely relevant beyond this case. Even so, examining these metrics in multiple domains and real-world industrial projects will be important to understand the extent of their applicability.
Conclusion validity may be affected by the dataset size. The dataset size in our study aligns with common empirical work in requirements and MBSE research, especially where hand-coded artifacts and early-stage design notes are involved. The combination of bootstrap confidence intervals, false-discovery control, and penalized regression provides additional assurance that the conclusions do not hinge on a small number of influential data points. Although expansions to larger industrial datasets would certainly be valuable, the present results appear stable and internally coherent, supporting the study’s goal of establishing a first empirical foundation for assumptions volatility metrics.
6. Conclusions
In this paper, we examined how environmental assumptions volatility may relate to RTD in model-based system development. Building on prior work that discussed assumption-driven forms of RTD, this study proposed three quantitative metrics, i.e., ACR, IR, and DD, and explored their relationship with established RTD indicators, including rework ratio, inconsistency density, and correction count. The analysis suggested that assumptions that changed frequently or were highly interconnected tended to be associated with higher levels of RTD. These observations provide preliminary quantitative support for the long-held view in requirements engineering that unstable environmental assumptions can influence the effort and quality of early modeling activities. We believe that by translating the abstract notion of assumptions volatility into measurable indicators, this work lays the foundation for early detection and prediction of assumption-driven technical debt.
Several avenues remain open for further investigation. One useful step is to examine larger and more diverse datasets to test the generality of the metrics. Integrating assumptions volatility tracking directly into MBSE tools could also enable real-time monitoring and feedback. Another promising direction is predictive modeling, e.g., using machine learning to estimate likely RTD growth based on early volatility patterns. Finally, qualitative follow-up studies could explore how practitioners perceive and manage environmental assumption change, helping to refine the theory as well as practical guidance on assumption management.
Overall, the findings offer early support for treating assumptions not just as background context but as dynamic elements that may shape technical debt long before a line of code is written.