Next Article in Journal
Study on the Pore Pressure Coefficient of Saturated Sandy Silt Under Frozen Conditions
Previous Article in Journal
Dynamic Risk Evolution and Adaptive Synchronization Control for Human–Machine–Environment Coupled Nuclear Emergency System: Based on Comprehensive On-Site Emergency Drills of Nuclear Power Plants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cost Modeling and Configuration Optimization for Large-Scale VANET Co-Simulation

1
Department of Computer, North China Electric Power University, Baoding 071003, China
2
Engineering Research Center of Intelligent Computing for Complex Energy Systems, Ministry of Education, Baoding 071003, China
3
Hebei Key Laboratory of Knowledge Computing for Energy & Power, Baoding 071003, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(7), 3264; https://doi.org/10.3390/app16073264
Submission received: 4 March 2026 / Revised: 21 March 2026 / Accepted: 23 March 2026 / Published: 27 March 2026

Abstract

Vehicular Ad Hoc Network (VANET) traffic–network co-simulation is a foundational methodology for the engineering evaluation of vehicle-to-everything (V2X) protocols and cooperative Intelligent Transportation System (ITS) applications before field deployment. However, with research objectives and experimental conditions varying widely, existing studies still lack a systematic paradigm for parameter configuration and experimental workflows. As a result, researchers often rely on experience-based settings, which can bring high time and computational overhead, long experimental cycles, and limited reproducibility. To address these issues, this paper proposes a simulation cost modeling and configuration optimization methodology for traffic–network co-simulation. By profiling and structurally modeling key overheads, such as initialization and traffic- and network-side execution, we characterize how traffic, network, and control parameters jointly affect total simulation overhead. We formulate a minimum-cost configuration optimization model under constraints of statistical validity and experimental comparability. We further develop a configuration solving mechanism and a structured workflow for simulation experiment configuration to complement empirical tuning with a more systematic approach, thereby improving the reproducibility of simulation studies. The study is grounded in a representative urban road-network co-simulation scenario based on Simulation of Urban MObility (SUMO), Veins, and Objective Modular Network Testbed in C++ (OMNeT++). Simulation results show that the proposed method reduces simulation overhead while keeping conclusions on key performance metrics consistent, thereby providing a more efficient and statistically credible evaluation basis for application-oriented urban VANET studies related to traffic safety, transportation efficiency, and wireless-system performance.

1. Introduction

In vehicle-to-infrastructure environments, Vehicular Ad Hoc Networks (VANET) [1,2,3] are core enabling technologies, providing essential communication and coordination capabilities for Intelligent Transportation Systems (ITSs) [4,5,6] and autonomous driving applications. As a decentralized vehicle-to-everything (V2X) network, VANET support real-time exchange of status and event information, including position, speed, intent, and hazard warnings, through low-latency broadcasting and multi-hop forwarding. This capability enables proactive safety risk mitigation, such as collision avoidance, and supports cooperative driving. Because VANET nodes are physical vehicles naturally distributed across the road network, the system can be deployed with low marginal cost, mainly by using onboard communication units with limited dependence on roadside infrastructure, and the network scale can expand naturally with traffic flow. At the same time, VANET form self-organizing networks by treating vehicles as network nodes. This architecture enables flexible deployment and reduces infrastructure investment. The node population grows naturally with traffic flow. As a result, VANET are well suited to road environments where roadside unit (RSU) deployment is sparse or coverage is intermittent. Direct communication and information exchange among vehicles enable vehicle applications that support key functions such as collision warning and cooperative lane changes. These applications support cooperative perception and route guidance. These capabilities improve traffic safety, transport efficiency, and operational manageability. In the present study, the main focus is placed on vehicular nodes, inter-vehicle communication, and the associated simulation overhead, while the road–network environment mainly provides the traffic and mobility background. More detailed roadside facilities and broader deployment-side supporting infrastructure are not the primary focus of this work.
In theory, VANET can substantially improve traffic operations. However, validating them in complex urban environments still faces major technical and practical barriers. In this study, the traffic environment of interest is an urban road network within a city. Real-world testing in such urban environments is often prohibitively expensive and limited in coverage. In addition, boundary conditions are difficult to control in a strict and repeatable manner because urban traffic is shaped by factors such as intersections, local traffic interactions, and heterogeneous operating states. Such efforts are also constrained by regulatory requirements, equipment availability, and traffic management conditions. As a result, controlled experimentation at representative urban scale remains a major challenge. In addition, many cooperative mechanisms, such as roadside-assisted decision making, inter-vehicle information sharing, and dynamic reconfiguration, often struggle to reproduce consistent experimental conditions on real urban roads. For this reason, traffic–network co-simulation has become an important engineering tool for evaluating V2X protocols and cooperative ITS applications before field deployment, especially when researchers need to test performance under controllable and repeatable urban traffic conditions.
With this background, simulation experiments are a critical pillar for the design, verification, and comparative assessment of cooperative mechanisms. Simulations can reproduce complex traffic flows and wireless channel environments under controlled conditions. In addition, they support large-scale parameter sweeps and ablation analysis. As a result, these capabilities turn theoretical feasibility into evidence that can be verified through engineering practice. In terms of implementation, the evaluation of vehicular network systems has evolved from sequential asynchronous simulation to synchronous co-simulation. Early approaches typically followed a pipeline with two stages that ran traffic simulation first and network simulation second. Traffic simulators such as Simulation of Urban MObility (SUMO) and Verkehr In Städten-SIMulationsmodell (VISSIM) [7] generated microscopic vehicle motion and interaction behaviors and exported them as trajectory files. These trajectories were then imported into network simulators such as Network Simulator 3 (NS-3) and Objective Modular Network Testbed in C++ (OMNeT++) to emulate communication processes, including link setup, neighbor maintenance, channel contention, and packet forwarding. This decoupled workflow is easy to implement and allows trajectory data to be reused. However, it breaks the feedback loop between traffic and communication. As a result, it is hard to capture how communication latency, congestion, and packet loss affect cooperative decision making. In addition, this approach can introduce bias in time-step alignment, event scheduling, and state updates. To improve cross-domain consistency and closed loop interpretability, researchers have developed co-simulation frameworks. Traffic and network modules interact online and run with synchronous scheduling through a unified simulation clock and interface. As a result, vehicle mobility, communication events, and upper layer mechanisms can evolve jointly on the same timeline. Compared with sequential workflows, co-simulation more faithfully captures how traffic communication coupling affects overall system performance. This makes co-simulation not only a technical simulation framework, but also a practical evaluation basis for broader urban ITS studies in which traffic safety, transportation efficiency, and communication performance need to be jointly assessed.
Although this workflow is closer to real-world system logic, it greatly increases engineering implementation complexity. The strong coupling between traffic and communication causes frequent state updates and mutual feedback. This interdependence creates substantial overhead in cross-platform synchronization and event scheduling. As the simulation scale grows, message load concentration, synchronization blocking, and performance degradation can occur. As a result, it becomes increasingly difficult to run large-scale, long-duration, and reproducible controlled experiments. Beyond longer cross-layer chains, the difficulty of co-simulation also comes from the wide range of tunable settings. These settings cover several aspects, including traffic demand and driving rules, communication links and protocol stack options, and data collection and logging choices. Because these aspects are tightly connected, changing one parameter can trigger ripple effects and simultaneously increase traffic interaction intensity and communication load. For instance, adjusting vehicle density influences both traffic interaction intensity and the size of each communication neighborhood, which then reshapes channel contention and packet forwarding load. Likewise, a finer sampling granularity can reveal more detailed dynamics, yet it also raises the overall system burden and lengthens the experimental cycle. Finally, the simulation horizon and the number of repetitions govern statistical reliability, but they can drive the total cost up in a roughly linear manner, and in some cases even superlinearly. Because of cross-layer coupling and non-linear amplification, researchers often find it hard to balance fidelity, statistical validity, and total cost in experimental practice. Too few samples can produce unstable findings and overly wide confidence intervals. In contrast, overly cautious settings may trigger many redundant runs. As a result, large-scale comparisons across parameter schemes often become impractical due to both time and computing costs.
Currently, co-simulation practices largely rely on empirical parameter tuning. Specifically, researchers frequently adjust key configurations such as simulation duration, sampling settings, and communication loads to mitigate performance bottlenecks or result fluctuations. This strategy driven by experience carries two common risks. First, statistical insufficiency arises when statistical targets are not explicitly specified and enforced, leaving conclusions sensitive to randomness and scenario disturbances and weakening comparability across configurations and studies. Second, unnecessary increases in simulation overhead can occur when robustness is pursued without restraint, so researchers may extend the simulation horizon, raise the sampling rate, or increase the number of repetitions, which can sharply inflate the overall time and computational burden of the experiments. These issues obstruct the development of a stable, reproducible, and comparable experimental paradigm for large-scale co-simulation. As a result, they restrict systematic evaluation and engineering validation of V2X collaborative mechanisms.
Against this backdrop, this paper revisits the parameter configuration problem in VANET traffic–network co-simulation. The objective is not to replace experience-based configuration, but to complement it with a more explicit, structured, and reproducible methodological basis when experiments become larger in scale, more repetitive, and more resource-intensive. In addition, the proposed framework helps abstract effective practical experience into reusable methodological guidance, so that subsequent experiments can be designed more efficiently, repeated trial-and-error effort can be reduced, and sustained research output can be accelerated to some extent. To this end, we first perform a structured decomposition of end-to-end resource consumption in co-simulation to derive cost components that are measurable and interpretable. Furthermore, we explicitly model experimental design requirements, such as statistical validity, as hard constraints in parameter configuration. In this way, configuration selection can move beyond a mainly experience-based trial-and-error process toward a more systematic basis under cost and validity constraints.
The contributions of this paper can be summarized as follows:
  • Structured cost modeling for traffic–network co-simulation. This paper develops a structured decomposition of end-to-end resource consumption in traffic–network co-simulation, so that the overhead of large-scale experiments is represented by measurable and interpretable components. Unified observation metrics and statistical aggregation rules are further introduced to support consistent cost comparison across different scenarios and parameter settings.
  • Statistically constrained configuration optimization. This paper incorporates statistical validity requirements, including effective sample size and statistical power, directly into the simulation-configuration process. In this way, key parameters such as simulation horizon, repetition count, and sampling settings are no longer determined mainly by empirical experience, but by explicit and verifiable criteria, enabling a more controlled tradeoff among result credibility, simulation fidelity, and overall overhead.
  • A structured workflow for simulation-experiment configuration. This paper proposes a structured workflow that organizes simulation-experiment configuration into a clear sequence of executable steps. By formalizing configuration formulation, calibration, and parameter optimization within one unified process, the workflow enables large-scale VANET co-simulation studies to proceed along a clearer configuration path across different scenarios.
The remainder of this paper is organized as follows.
Section 2 reviews related work on VANET co-simulation, experimental overhead analysis, and simulation configuration optimization. Section 3 presents the materials and methods, including the problem definition, overhead modeling, validity analysis, the minimum-cost configuration model, and the simulation setup and evaluation protocol. Section 4 provides the simulation-based evaluation and results. The final section, Section 5, concludes the paper and discusses future work.

2. Related Work

In recent years, simulation has become a key tool for quantitatively evaluating routing protocols and control algorithms in VANET. This section reviews representative progress in related research and groups it into three areas: (i) simulation theory and techniques, (ii) synchronized traffic–network co-simulation, and (iii) lightweight deployment with runtime optimization.
In research on simulation theories and techniques, Ormándi and Varga [8] conducted an in-depth comparison of intersection control algorithms by introducing higher fidelity communication simulation. They showed that if the authenticity of the communication layer, such as latency, packet loss, and link fluctuations, is ignored, the judgment of algorithm superiority or inferiority may exhibit systematic bias, thereby emphasizing the strong coupling between detailed simulation and the credibility of control conclusions. Veins, OMNeT++, NS-3, SUMO, and VISSIM are among the most widely used simulation tools in VANET research, providing essential support for modeling wireless communication protocols, vehicle mobility, and microscopic traffic dynamics [9,10,11]. Raviglione et al. [12] developed the MS-VAN3T co-simulation platform, based on NS-3 and SUMO, and implemented a complete ETSI ITS G5 protocol stack, thereby supporting end-to-end virtual validation of V2X communications and applications. Ormándi et al. [13] proposed a mesoscopic V2X co-simulation method based on SUMO and OMNeT++, which aggregates far-field vehicle communications to accelerate simulation while preserving key accuracy. Weber et al. [14] provided an updated survey of VANET simulators, systematically examining their support for emerging technologies such as 5G and edge computing as well as security and privacy features, and pointed out that building more realistic and comprehensive VANET simulation tools still faces several open challenges.
In synchronized traffic–network co-simulation, Araujo et al. [15] built an integrated configurable simulation environment for vehicular named data networking (NDN), and further proposed the NDN4IVC co-simulation framework that integrates NS-3 and SUMO, achieving realistic modeling of vehicular NDN through bidirectional coupling between the network and traffic simulators. Wang et al. [16] developed a bidirectionally coupled co-simulation platform linking SUMO and NS-3, enabling more realistic evaluation of connected vehicle traffic management and edge computing-based Intelligent Transportation System solutions in smart city settings. Shi et al. [17] emphasized that simulation-based testing is essential for the development and optimization of connected and automated vehicles (CAVs), and proposed a co-simulation framework that combines traffic simulation with vehicle dynamics to support more realistic CAV evaluation. Alupoaei and Caruntu [18] conducted V2X simulation studies on the Eclipse MOSAIC platform, illustrating a research pathway that uses integrated traffic and communication platforms to evaluate cooperative optimization strategies for efficiency and safety. Tomás et al. (2023) [19] proposed an agent-based multi layer co-simulation platform for scenario-oriented integrated simulation, which integrates communication and behavior decision mechanisms so that vehicles can autonomously choose routes based on received traffic information. OpenCDA (Xu et al., 2023) [20] wraps SUMO, CARLA, and NS-3 and provides a scenario generator and a library of cooperative driving algorithms, greatly reducing orchestration cost and enabling researchers to quickly perform benchmark evaluations for tasks such as platooning, cooperative perception, and edge offloading. Meanwhile, Jia et al. (2021) [21] interconnected SUMO, OMNeT++, and Webots within a network and physical closed loop with network congestion feedback, enabling a more realistic evaluation of an eco driving controller. Wang et al. (2025) [22] proposed a hardware-in-the-loop (HIL) V2X test platform, where scaled vehicles equipped with onboard sensors exchange packets with an online network simulator, offloading part of the computation to real hardware and providing visual feedback in real-time. Liang et al. (2025) [23] proposed the DTTF Sim digital twin framework, which replays real sensor trajectories into a virtual model, enabling continuous regression testing that covers safety critical boundary cases without rerunning full Monte Carlo batches. Schrab et al. (2023) [24] proposed the Eclipse MOSAIC multi-domain simulation framework, which can couple advanced simulators across traffic, application, and communication domains and supports integration with external models and local codebases, thereby improving extensibility and flexibility. From a broader cyber-physical systems (CPSs) simulation perspective, Tampouratzis et al. [25] presented the APOLLON framework, a fully distributed integrated simulator that holistically models processors, peripherals, networks, and physical processes within a synchronized CPS environment. This work is oriented toward general CPS simulation and further highlights the importance of accurate cross-domain synchronization and holistic subsystem integration in complex co-simulation studies.
In lightweight deployment and runtime optimization strategies, Stepanyants et al. [26] proposed a parallel multi-level simulation framework that performs cooperative computation across models at different levels to improve the feasibility of fine-grained large-scale ITS modeling. Tangirala et al. [27] studied data flow simulation for ultra large-scale ITS and investigated how to efficiently simulate large-scale data movement and processing within a discrete event framework, enabling experimental evaluation at larger scales and over longer time horizons. Häfner et al. (2022) [28] added merging control and platoon control modules on top of the open VENTOS core and supported hardware-in-the-loop (HIL), showing that under high density traffic, stress testing of adaptive gap creation logic can be completed in minutes rather than hours. Regragui et al. [29] proposed a simplified microscopic mobility modeling method based on cellular automata (CA) for large-scale urban traffic simulation, where the grid-based framework includes three motion models and is used to analyze how different mobility designs affect VANET connectivity dynamics. Keramidi et al. (2022) [30] proposed VANET performance evaluation methods based on M/M/c and M/N/M/c queueing models. Hu et al. (2022) [31] developed a semi Markov process (SMP) model for DSRC safety message broadcasting, enabling analytical computation of collision probability and network throughput. Yeferny and Ben Yahia (2021) [32] proposed a mathematical model of vehicle mobility on road networks and a fully distributed geographic broadcast protocol that can efficiently compute the zone of relevance (ZOR) for events. Badole and Thakare [33] proposed a VANET routing framework combined with network digital twins, using a hybrid Honey Badger optimization model and data synchronization in real-time to significantly improve multi-hop routing performance.
Overall, existing studies have developed a substantial set of tools and methods for co-simulation framework construction, platform reuse, and runtime acceleration. Yet a major gap still remains in configuration methodology. Existing studies mainly aim to improve realism or scalability, but they lack an interpretable decomposition of resource consumption and unified measurement metrics. This makes it hard to perform quantifiable cost comparisons across different configurations. Lightweight and acceleration strategies usually target lower overhead in a single run, but they often do not treat statistical validity as a hard constraint in experimental design to systematically set the simulation horizon and repetition count. As a result, simulation studies may still face either too little statistical support or unnecessary execution. At the same time, existing platforms often stress ease of use and functional coverage, but they offer limited support for turning parameter configuration from trial and error guided by experience into a verifiable and reproducible systematic workflow. More importantly, they rarely provide actionable, cost-reducing parameter recommendations and deployable configuration schemes tailored to concrete experimental goals and resource budgets. To close these gaps, this paper uses calibrated modeling of resource consumption as the starting point and builds a minimum-cost configuration optimization framework guided by statistical reliability.

3. Materials and Methods

3.1. Problem Definition and Basic Assumptions

Based on an urban VANET traffic–network co-simulation environment, this paper conducts a comprehensive and systematic analysis of the overall resource overhead of simulation experiments and summarizes the roles and impacts of the constituent components. Building on this analysis, we establish an overall overhead modeling framework for configuration design and further formulate a corresponding optimization model.
To achieve the research objectives, this paper adopts the following assumptions. First, traffic flow is approximated by a Poisson distribution at the aggregate level. Second, the evaluation focuses on an urban road–network VANET scenario within a city, rather than on highway tolling conditions, and does not consider complex hybrid environments that involve cellular networks, UAVs, and RSUs. Third, this paper focuses on a parameter configuration methodology rather than drawing definitive conclusions about the performance of specific protocol mechanisms. Researchers can use the simulation optimization mechanisms proposed here for further secondary optimization. The key parameters involved in this paper are listed in Table 1.
As illustrated in Figure 1, vehicles exchange safety and status information via broadcast, unicast, and multi-hop forwarding in an urban VANET deployment, which motivates our subsequent overhead decomposition and configuration-oriented modeling.

3.2. Overhead Analysis and Modeling for VANET Simulation Experiments

3.2.1. Simulation Experiments Overhead Decomposition

The overall experimental overhead can be decomposed into two broad categories, comprising six components. For clarity, we summarize their taxonomy using a two-dimensional layout where the horizontal axis indicates the phase (Init/Run), the vertical axis indicates the simulation domain (Traffic/Network), and the hatch/border style indicates the scope (General Platform vs. Target Experiment), as shown in Figure 2.
(1) Road Network Initialization Overhead C road : This component parses raw road network data, such as OSM and NET files, into computable topology and indexing structures. It covers road and lane graph construction, adjacency generation between nodes and edges, spatial index building, such as grid- or tree-based indices, and loading of traffic light logic. It is a one-time preparation cost that mainly grows with road network scale, including the number of road segments and nodes.
(2) Vehicle Initialization Overhead C veh : This component generates and injects vehicle entities and completes initial route and queue configuration and state initialization. It includes origin destination pair allocation, route loading, inserting vehicles into lane queues, and establishing the initial traffic state. This cost usually grows with vehicle count and is also affected by lane organization and initialization strategy.
(3) Network Initialization Overhead C net , init : This component creates network-side objects and initial structures for vehicles and infrastructure. It includes protocol stack and buffer queue initialization, setup of neighbor and link maintenance structures, registration of communication event schedulers, and binding of initialization related to the unified simulation clock. This cost grows with network node scale, and the initial neighborhood size is affected by parameters such as communication radius.
(4) Vehicle Runtime Simulation Overhead C traf-run : This component represents the cumulative cost of step-by-step traffic-side updates during steady-state operation. It covers microscopic behavior computation such as car following, lane changing, conflict checking, traffic signal response, and necessary rerouting. This cost increases with both simulation horizon and vehicle scale, and it is the primary source of runtime overhead at the traffic layer.
(5) Network Routine Runtime Overhead C net-run : This component represents the cumulative cost of protocol processing and cross-layer synchronization progress on the network side during steady-state operation. It covers message transmission, reception, and forwarding, channel contention and queuing, neighbor table maintenance, and runtime synchronization overhead caused by traffic and communication state exchange and event scheduling in co-simulation. This cost is highly sensitive to packet injection intensity, node count, and neighborhood size, and it is the primary source of runtime overhead on the communication side.
(6) Performance Sampling Overhead C PS : This component captures the overhead incurred by the sampling and measurement procedures required to produce experimental outcomes for performance evaluation. It includes observing system states and packet-level events at a chosen sampling rate, computing key performance indicators (e.g., delivery ratio, delay, throughput, and coverage) over time windows, and aggregating statistics across vehicles, flows, or grids. This cost typically grows approximately linearly with the sampling rate and simulation horizon, and it further increases with the monitored scale and the granularity and dimensionality of the collected metrics.
The six components above describe the one-time preparation overhead and the runtime overhead in co-simulation. In this paper, cost refers to the time and computational resource overhead required by simulation experiments, rather than monetary expenditure in any specific currency. For convenience in the following discussion, the wall clock time of a single simulation run is denoted by C run . The repetition count R is further introduced as an outer-level accumulation factor, which leads to the total simulation overhead.

3.2.2. Parameter–Cost Impact Relationship

To characterize the mechanism by which configuration factors act on the six cost components, this paper analyzes, based on the above six-part decomposition, the coupling relationships between commonly used factors and C road C PS . The next section summarizes how each factor affects the corresponding overhead components, including the direction of influence and the dominant scaling terms. We categorize these factors into the following categories.
(1) Static Road Network Parameters
  • Total Road Network Length L. The longer the road network and the larger the topology scale, the greater the workload for map parsing, graph construction, and index building. Therefore, the road network initialization overhead satisfies
    C road L 0 .
    Under a fixed communication radius and protocol settings, the neighborhood size can be approximately characterized by the linear density ρ L and the average neighbor count d ¯ ( r ) under the linear density approximation (defined in Equation (34)). The neighborhood event-driven part of the network runtime overhead can be written as
    C net-run C ¯ 5 + κ 5 d V d ¯ ( r ) f tx T ,
    where C ¯ 5 is a baseline constant term and κ 5 d is a calibrated coefficient capturing the per-neighbor event cost. Therefore, when this part dominates and V , r , f tx , T are fixed, we have C net-run L 0 .
  • Number of Parallel Lanes n lane . Vehicle injection and lane changing require gap checking and queue maintenance on candidate lanes, and higher lane parallelism increases vehicle initialization and traffic runtime overhead. This implies
    C veh n lane 0 ,
    C traf-run n lane 0 .
  • Maximum Speed Limit v max . The parameter v max mainly affects runtime overhead by changing the steady-state average speed v ¯ and the frequency of event triggers, such as rerouting. This influence can be summarized as
    v max v ¯ K ( v ¯ , L , T ) C traf-run , C net-run ,
    where K is the expected number of route selection or rerouting triggers for a single vehicle over the horizon T (see below).
(2) Dynamic Traffic Flow Parameters
  • Vehicle Count V. Each additional vehicle typically adds a traffic control loop, a wireless node, and a sampling record stream. Therefore, except for road network initialization, the other five components depend monotonically on V, which implies
    C k V 0 , k { 2 , 3 , 4 , 5 , 6 } .
    In our experiments (with r , B , S pkt , f tx fixed), the dominant scaling terms during runtime can be approximated as
    C traf-run V T ,
    C net-run V f tx T ,
    C PS V f s T .
  • Average Speed v ¯ . Higher vehicle speed leads to more road segment or intersection traversals per unit time and more frequent rerouting triggers. Here, v ¯ denotes the steady-state average speed of vehicles in the traffic simulation. Let K be the expected number of rerouting triggers per vehicle over the horizon T. Then
    K v ¯ 0 ,
    K κ K v ¯ T .
    where κ K is a calibrated proportionality coefficient that maps the average speed to the expected rerouting trigger rate.
  • Rerouting Trigger Count K. Higher rerouting intensity increases the event processing burden on the traffic side and the route update burden on the network side. The coupling terms can be written explicitly as
    C traf-run = C ¯ 4 + κ 4 K V K ,
    C net-run = C ¯ 5 + κ 5 K V K .
    where C ¯ 4 and C ¯ 5 are baseline terms, and  κ 4 K and κ 5 K are calibrated coefficients for rerouting-related processing. Therefore,
    C traf-run K 0 ,
    C net-run K 0 .
(3) Network Simulation Parameters
  • Communication Radius r. The radius r determines neighborhood size. Increasing r increases potential links and contention intensity, thereby raising network initialization and runtime overhead. This implies
    C net , init r 0 ,
    C net-run r 0 .
  • Bandwidth B, Packet Size S pkt , and Packet Generation Rate f tx . These three parameters jointly determine communication load intensity. Define the load factor
    ϕ = V f tx S pkt B .
    In the high load regime, increasing S pkt or f tx amplifies network processing and queuing overhead, whereas increasing B alleviates congestion. The directional relationships can be written as
    C net-run S pkt 0 ,
    C net-run f tx 0 ,
    C net-run B 0 .
    where the effect is more pronounced for high- ϕ regimes. The following communication feasibility constraint should also be satisfied:
    f tx S pkt B .
    Furthermore, let the expected total number of packets be
    N pkt V f tx T .
    Then the network runtime overhead is often written as
    C net-run = C ¯ 5 + κ 5 p V f tx T C net-run f tx 0 .
    where κ 5 p is a calibrated coefficient representing the per-packet protocol processing cost. When logging is per-packet (or strongly correlated with per-packet events), larger packet size and higher packet injection intensity also increase I/O, which implies
    C PS S pkt 0 ,
    C PS f tx 0 .
(4) Simulation Control Parameters
  • Sampling Frequency f s . The number of sampled records scales linearly with f s , and the I/O overhead can be written as
    C PS = C ¯ 6 + κ 6 V f s T C PS f s 0 .
    where C ¯ 6 is a baseline term and κ 6 is a calibrated coefficient for per-sample measurement and aggregation. If sampling triggers uploading or broadcasting (that is, sampling also produces network events), then we further have
    C net-run f s 0 .
  • Steady-State Simulation Horizon T. The three runtime components scale with T, which implies
    C traf-run T 0 ,
    C net-run T 0 ,
    C PS T 0 .
  • Repetition Count R. We do not treat repeated experiments as an independent cost component. Instead, the total overhead is written as an outer level summation:
    C tot ( x , R ) = j = 1 R C run ( j ) ( x ) R · E C run ( x ) .

3.2.3. Parameterized Cost Modeling

This subsection consolidates the parameterized representation of the six cost components and prepares the feature quantities and optional constraints used in the subsequent optimization model.
(1) Parameterized Summary of Component Costs. Based on the directional analysis above, and to support the subsequent calibration and model solving, this subsection further writes the six cost components in an explicit form of feature terms derived from parameter combinations plus coefficients to be calibrated. Let the configuration vector be
x = L , n lane , r , B , S pkt , f tx ; V , T , f s , R ,
where L , S , r , B , S pkt , f tx are scenario or protocol settings (fixed in our experiments), V is the workload intensity, and  f s , T, and R are controllable simulation parameters used to balance statistical validity and total cost.
(2) Derived Feature Quantities. The auxiliary quantities are defined as
ρ L V L , d ¯ ( r ) 2 r ρ L = 2 r V L , K K ( v ¯ , L , T ) ,
where ρ L is linear density, d ¯ ( r ) is the linear density approximation of the average neighborhood size, and K is the expected number of rerouting or route selection triggers for a single vehicle over the horizon T (which can be obtained from logs and treated as an observable feature in calibration).
(3) Parametric Expressions of the Six Cost Components. Summarizing the above, the six-part decomposition can be written as follows, where the coefficients α · , β · , γ · , δ · , η · , ζ · 0 are to be calibrated:
C road ( L ) = α 1 + β 1 L ,
C veh ( V , n lane ) = α 2 + β 2 V + γ 2 V n lane ,
C net , init ( V , r , L ) = α 3 + β 3 V + γ 3 V d ¯ ( r ) = α 3 + β 3 V + 2 γ 3 r V 2 L ,
C traf-run ( V , T , n lane , K ) = α 4 + β 4 V T + γ 4 V T n lane + δ 4 V K ,
C net-run ( V , T , r , L , f tx , S pkt , K ) = α 5 + β 5 V f tx T + γ 5 V d ¯ ( r ) f tx T + δ 5 V f tx T S pkt + η 5 V K ,
C PS ( V , T , f s , f tx , S pkt ) = α 6 + β 6 V f s T + γ 6 V f tx T + δ 6 f s T + ζ 6 V f tx T S pkt .
If road network parsing can be reused, the total overhead of R independent repetitions under different random seeds can be written as
C tot ( x ) = C road ( L ) + j = 1 R C veh + C net , init + C traf-run + C net-run + C PS .
(4) Optional Resource Capacity Constraints. When the platform is sensitive to memory or storage, monotonic surrogates can be constructed for peak memory and log volume and incorporated into the optimization as hard constraints or as a weighted objective, for example:
m peak μ 0 + μ 1 V + μ 2 V 2 S ,
s log λ 0 + λ 1 V f s T .
For instance, a weighted objective can be constructed as follows:
C ˜ tot = w t C tot + w m m peak + w s s log ,
or budget constraints can be imposed as
m peak m max ,         s log s max .

3.3. Threats to Validity and Mitigation Strategies

The experimental guidance methodology proposed in this paper is built around a cost proxy model and statistical power constraints. It seeks to reduce overall experimental cost without changing conclusions about network performance (e.g., packet delivery ratio (PDR), throughput, and latency). To strengthen the credibility of these conclusions, this section identifies potential threats in three dimensions, internal validity, construct validity, and external validity, and presents actionable mitigation strategies.

3.3.1. Internal Validity

Synchronization Errors and Time Advancement Conventions. Inconsistencies in time advancement granularity and callback ordering between the traffic and communication layers can introduce event ordering bias, which can in turn degrade the accuracy of end-to-end latency and packet loss statistics. Mitigation: Use a unified simulation clock and deterministic coupling callback order. Log timestamps for critical events, and align time advancement granularity across configurations through regression testing.
Insufficient Warm Up and Non-Steady-State Bias. Transient effects before route convergence and traffic flow stabilization during the start-up phase can distort steady-state statistics. Mitigation: Set a warm-up duration T warm and compute statistics over the steady-state interval [ T warm , T ) . In addition, validate the adequacy of T warm using convergence curves of key metrics.

3.3.2. Measurement Validity

Sampling Granularity Changes Metric Definitions. If reducing the sampling frequency f s causes missed detections of critical events, the statistical definitions of PDR, throughput, and latency may change, thereby affecting the assessment of conclusion consistency. Mitigation: The value of f s is not chosen arbitrarily. Instead, it is constrained by a lower bound derived from typical relative motion conditions (see), which ensures that the sampling resolution is sufficient to capture the shortest contact process. In addition, we recheck the consistency of the main conclusions in the results section using a reference configuration.
Correlation Can Overestimate the Effective Sample Size. Time window-based statistics exhibit temporal correlation. If they are naively treated as independent samples, the effective sample size may be overestimated, leading to overly optimistic power planning and an underestimated recommended R .
Mitigation: Introduce an effective sample coefficient η ( L , V ) ( 0 , 1 ] and apply the following correction
N eff R η ( L , V ) T T warm Δ T win ,
where η ( L , V ) is estimated from pilot experiments. When necessary, we further verify robustness using block-bootstrap.

3.3.3. External Validity

The surrogate model coefficients may vary with road topology, wireless propagation, protocol stack implementations, traffic pattern characteristics, and application-level behaviors. In particular, if the network becomes denser, the traffic pattern becomes more heterogeneous, or signalized intersections induce stronger vehicle platooning, the calibrated values of some traffic- and network-related cost components may change. Differences in protocol stack design may likewise change the communication pattern and the relative contribution of some network-side cost terms. Nevertheless, such differences do not alter the general applicability of the proposed workflow, whose methodological logic is formulated at the framework level rather than tied to one specific routing protocol. Mitigation: We perform sensitivity analyses across multiple traffic densities and key communication parameters, and retain the workflow structure of pilot runs, profiling, and power planning so that it can be transferred to other scenarios. Therefore, when traffic patterns exhibit stronger heterogeneity, denser interactions, or more complex protocol and application behaviors, the overall workflow remains applicable, while the corresponding surrogate model coefficients should be recalibrated through pilot runs for the target scenario.
These threats and the corresponding mitigation strategies are summarized in Table 2.

3.4. Minimum-Cost Model and Structured Workflow

This section presents a structured workflow for solving the minimum-cost configuration model, aiming to minimize experimental overhead while ensuring statistical reliability and fair comparisons. We treat the workload level V and the simulation-control variables y = { f s , T , R } as adjustable, while scenario- and protocol-dependent parameters are fixed across configurations. Following the cost decomposition in Section 3, we focus on four classes of factors: (i) sampling frequency f s ; (ii) simulation horizon T and warm-up duration T warm ; (iii) repetition count R; and (iv) vehicle count V.
Throughout, y Y with R Z + , and V is treated as an experimental independent variable whose values are specified by the research objective or traffic demand. The remaining scenario and protocol parameters (e.g., L , S , r , B , S pkt , f tx ) are fixed in controlled experiments to ensure comparability; if any of them need to be tuned, they can be included as additional independent variables.

3.4.1. Minimum-Cost Configuration Model

Given the co-simulation platform and scenario settings ( L , S , r , B , S pkt , f tx ) , we select simulation control parameters under workload level V to minimize the total cost. As discussed in Section 3, the total cost follows a six-part decomposition and admits a parametric surrogate form in Equations (35)–(41). We therefore optimize y = { f s , T , R } for each given V.
Given ( L , S , V ) and fixed protocol parameters, we optimize the configurable parameter vector y = { f s , T , R } . The corresponding minimum-cost configuration is formulated as
min y C tot ( L , S , V ; f s , T , R )
s . t . T T warm + N min Δ T win R η ( L , V ) ,
f s , min f s f s , max , R Z + .
The key constraints and their practical interpretations are summarized as follows.
(1) Sampling Frequency f s . The sampling frequency determines how many states and metrics are recorded per unit time. It therefore directly affects I/O and serialization overhead, and it may also amplify the network-side load indirectly through sampling-triggered reporting or broadcasting. An excessively high f s introduces redundancy and increases C PS , whereas an overly low f s may miss short contacts or transient congestion and thus harm observability and the credibility of conclusions.
Let v rel denote the magnitude of the relative speed between two nodes. The approximate contact duration is
τ 2 r v rel .
In the most adverse case (shortest contact), to guarantee at least m samples, we recommend
f s m v rel , max 2 r ,
Here v rel , max can be approximated by an upper bound on the relative speed of two vehicles traveling in opposite directions, and m (e.g., 5–10) controls the desired time resolution. This rule provides a lower bound for f s . Its upper bound is typically constrained by the platform I/O throughput and disk budget (or an upper bound on s log ).
(2) Statistical Power and Minimum Sample Size. Setting T and R empirically can lead to either insufficient samples or overly conservative designs, which weakens reproducibility and increases cost. We adopt an approximate sample-size plan based on a two-sample mean-difference test. Let σ be the standard deviation of the target metric (estimated from calibration runs), δ be the minimum detectable effect, α be the significance level, and  1 β be the statistical power. Then the minimum effective sample size is approximated by
N min = 2 z 1 α / 2 + z 1 β δ / σ 2 .
When multiple metrics (e.g., PDR, throughput, and latency) must be satisfied simultaneously, we take
N min = max M M N min ( M ) .
(3) Effective Sample Size and Horizon Constraint. Because time-series sampling typically exhibits correlation, counting raw sample points may overestimate the effective sample size. We compute windowed statistics over the steady-state interval [ T warm , T ) with window length Δ T win , and introduce an effective-sample coefficient η ( L , V ) ( 0 , 1 ] to correct for correlation, yielding
N eff R η ( L , V ) T T warm Δ T win .
From N eff N min , we obtain the minimum simulation-horizon constraint
T T warm + N min Δ T win R η ( L , V ) .
Equation (55) explicitly characterizes the substitutability between T and R. With other conditions fixed, increasing R linearly reduces the required steady-state duration T T warm . Conversely, if R cannot be increased due to limited parallel resources, T must be extended to compensate for the effective sample size.
(4) Optional Feasibility Constraints. The communication feasibility constraint f tx S pkt B is ensured by fixing protocol parameters and checking the condition beforehand; if f tx is also optimized, it can be incorporated directly as a constraint.
When platform resources are limited, the memory and storage budgets can further be incorporated using the surrogates in Equations (42) and (43), either as hard constraints ( m peak m max and s log s max ) or as a weighted objective in Equation (44).

3.4.2. Complexity and Solving Analysis

The core tension of the minimum-cost model in Equations (48) and (49) is that meeting the power requirement increases N eff by enlarging the steady-state duration T T warm or the repetition count R. However, Section 3 shows that increasing T, R, or  f s monotonically increases C traf-run , C net-run , and  C PS , and thus raises the total cost C tot . With additional budgets ( m peak m max and s log s max ), the feasible region further shrinks, making the cost–reliability tradeoff more explicit.
Decision version: Given a threshold C 0 , determine whether there exists y = { f s , T , R } such that constraints (48) and (49) hold and C tot ( L , S , V ; f s , T , R ) C 0 .
Proposition 1 (Optimal repetition count attains the feasible lower bound).
For given ( L , S , V ) and fixed ( f s , T ) , if the total cost is nondecreasing in R (the form in Equation (41) used in this paper satisfies this property), then any optimal solution must take the minimum feasible repetition count
R ( T ; V ) = N min Δ T win η ( L , V ) ( T T warm ) .
Proof. 
For fixed ( f s , T ) , constraint (48) is equivalent to R N min Δ T win η ( L , V ) ( T T warm ) , so the smallest feasible integer is given by Equation (56). Because  C tot is nondecreasing in R, choosing a larger R cannot reduce the objective value. Therefore the optimum must attain this lower bound.    □
Proposition 2 (Polynomial-time solvability (thus not NP-hard)).
Under the experimental configuration in this paper, f s and T are chosen from explicit finite candidate sets F and T , respectively (e.g., T can be restricted to integer multiples of Δ T win ). If each cost evaluation C tot ( L , S , V ; f s , T , R ) can be computed in polynomial time (and is O ( 1 ) for the surrogate model), then the model in Equations (48) and (49) can be solved exactly in O ( | F | | T | ) time, so its decision version belongs to P . Hence, under our setting, the model is not NP-hard (under the standard assumption P NP ).
Proof. 
By Proposition 1, for any candidate pair ( f s , T ) , the optimal and feasible repetition count is uniquely determined as R ( T ; V ) . Hence, it suffices to search over all ( f s , T ) F × T , compute R ( T ; V ) , evaluate C tot , and optionally check m peak and s log , and then select the minimum. This finite search involves | F | | T | evaluations, each in polynomial time, so the overall procedure runs in polynomial time.    □

3.4.3. Model Solving

The model solving procedure and workflow are illustrated in Figure 3.
(1) Profiling and Calibration Steps.
We recommend calibrating the surrogate model with a small number of calibration runs. Choose representative configuration points that cover the feasible range (constructed from combinations of L , V , f tx , f s , T ). For each run, record ( t wall , m peak , s log ) and, when needed, the finer breakdown of traffic-side, network-side, and I/O overhead. Then fit the six overhead surrogates in Equations (35)–(40) and validate the fitting error, and estimate σ ^ and η ( L , V ) from pilot runs for power planning.
(2) Solving Method and Complexity Analysis.
The overall configuration solving procedure is summarized in Algorithm 1.
Algorithm 1 Minimum-cost configuration solving procedure (variable elimination + finite search)
Require: 
Workload set V ; candidate sets F and T ; calibration design D ; statistical targets ( α , β , δ ) ; (optional) resource limits ( m max , s max )
Ensure: 
For each V V , the recommended configuration y ( V ) = { f s , T , R }
  1:
Run the calibration set D , collect ( t wall , m peak , s log ) , and fit the overhead surrogates (Section 3)
  2:
Estimate the metric variance σ and the effective-sample coefficient η ( L , V ) , and compute N min via
  3:
for all V V do
  4:
    C min +
  5:
   for all  f s F  do
  6:
       for all  T T and T > T warm  do
  7:
           Compute R R ( T ; V ) ()
  8:
           Evaluate the cost C C tot ( L , S , V ; f s , T , R )
  9:
           if (optional budgets) m peak m max and s log s max and C < C min  then
10:
                   C min C , y ( V ) { f s , T , R }
11:
return { y ( V ) } V V
Let | V | = M , | F | = F , and | T | = K . If each cost evaluation is constant-time (algebraic operations plus surrogate calls), then the time complexity of the solving stage is O ( M F K ) . The space complexity is O ( 1 ) (or O ( M ) if storing all y ( V ) ). This complexity is decoupled from the cost of running simulations, enabling fast configuration screening before experiments.
Equation (56) indicates that the statistical constraint is translated into a lower bound on the product of the repetition count R and the steady-state duration ( T T warm ) , and that the smallest feasible repetition count is determined accordingly. We therefore adopt a deterministic strategy of variable elimination plus finite search. For each workload point V, we search ( f s , T ) F × T , compute R ( T ; V ) , evaluate C tot using the Section 3 surrogate model, and, when needed, check the budgets on m peak and s log . In this evaluation, the traffic runtime and the network runtime are dynamically recomputed under the current candidate setting together with the other cost terms, and the resulting total overhead is then used for candidate comparison and final selection. The candidate with the smallest cost is selected as the recommended configuration.

3.5. Simulation Setup and Evaluation Protocol

Table 3 summarizes the hardware and software stack used in the experiments.
We adopt the SUMO-Veins-OMNeT++ co-simulation stack to enable cross-layer consistent simulation. The traffic layer uses SUMO to generate microscopic vehicle mobility under road–network constraints. The network layer relies on OMNeT++ as a discrete-event simulation kernel to schedule wireless communication and protocol events. Veins couples the two simulators under a unified clock so that traffic states (e.g., position and speed) can drive the communication process in real-time, and network decisions can be fed back to traffic dynamics when needed. In the adopted co-simulation setting, the interaction between the traffic and communication layers is performed with fixed synchronization granularity. At each synchronization point, SUMO first advances by one configured simulation step and updates vehicle mobility states, after which the updated states are exchanged through the Veins/Traffic Control Interface (TraCI) and the corresponding network-side events are processed in OMNeT++ based on the synchronized traffic state.
For the network protocol, we use the classic ad hoc on-demand distance vector (AODV) routing protocol as a representative communication setting in the present experiments. Message delivery among vehicles and RSUs is realized via AODV multi-hop forwarding and route maintenance. Control overhead for route discovery and route maintenance (e.g., route request (RREQ), route reply (RREP), and route error (RERR)) is counted together with application messages to form the observations required by the subsequent cost modeling and minimum-cost configuration solving. Although AODV is adopted in the current evaluation, the proposed framework is not restricted to AODV itself. When other protocol stacks such as the European Telecommunications Standards Institute (ETSI) ITS G5 are considered, the overall workflow can still be retained, while the corresponding network-side cost terms and fitted coefficients should be recalibrated under the target protocol setting.
Table 4 lists the scenario settings, fixed parameters, and runtime configurations.
Based on Table 4, we conduct comparative experiments over the traffic load V, simulation horizon T, and repetition count R. The custom scripts used for cost-model fitting, configuration solving, and result post-processing are not publicly archived at this stage, but can be made available from the corresponding author upon reasonable request.

4. Simulation-Based Evaluation and Results

This section evaluates the proposed guidance method for simulation experiments (i.e., the minimum-cost configuration solving and workflow) rather than the performance superiority of a specific protocol/algorithm. Our goal is to determine a configuration for VANET traffic–network co-simulation that minimizes the overall resource overhead while meeting fidelity requirements and statistical validity. Accordingly, we validate two key questions: (i) under statistical constraints such as power, whether the configuration returned by the optimization model in Section 3 attains the minimum cost within the feasible region; and (ii) after adopting this configuration, whether the conclusions on network performance metrics (PDR, throughput, and end-to-end delay) remain consistent, demonstrating that cost reduction does not come at the expense of conclusion credibility.

4.1. Validating Minimum Cost Under Power Constraints and Consistent Conclusions

4.1.1. Minimum-Cost Verification Within the Feasible Region

This subsection validates that, under statistical constraints such as power, the configuration returned by the optimization model in Section 3 falls into the low-cost region of the feasible set, and clarifies the main sources of cost reduction. We consider a representative workload ( V = 400 ) and perform the following observations: (i) decompose the total overhead of a single co-simulation run to identify dominant cost components; and (ii) characterize the substitution between T and the minimum repetition count R min ( T ) under the power constraint. We further report platform-side energy results for candidate configurations to corroborate the resource-saving effect of the recommended configuration.
Figure 4 shows that the total overhead is dominated by runtime components: the traffic runtime accounts for 50% and the network runtime accounts for 37%, summing to 87%, while the remaining components contribute only a small fraction. This structure indicates that, once the power constraint is satisfied, reducing redundant simulation time and repetitions directly compresses the largest cost components and therefore yields the most visible reduction in the overall overhead. The dominance of runtime cost is expected because co-simulation continuously performs mobility updates, wireless event scheduling, and traffic–network synchronization, whereas initialization and post-processing are one-off operations amortized by long runs.
Figure 5 shows that R min ( T ) decreases rapidly with increasing T and then enters a plateau. In this example, the repetition demand drops to the order of tens by T = 1200 s, and reaches R = 20 at T = 1740 s, after which further changes are mild. This turning point suggests that the recommended configuration should be selected near the front edge of the plateau, avoiding both the repetition stacking of “short T + large R” and the accumulated single-run cost of overly long T. The plateau arises from diminishing returns in effective-sample growth: extending T adds steady-state windows and reduces the required R, but once variance and correlation corrections stabilize, further increasing T provides limited reduction in R, while the runtime cost continues to grow linearly with T.
Figure 6 shows a typical “short start-up peak + steady-state fluctuation” pattern: a start-up peak of 160–170 W, followed by a long steady period mostly within 70–95 W, with a slight step-up later. This indicates that the platform-side load is overall stable; hence, the benefit of the recommended configuration mainly comes from reducing total runtime and repetitions rather than being driven by sporadic hardware jitter, making the energy comparison more interpretable. The start-up peak is commonly associated with initialization and route establishment when events are dense, whereas step changes are often related to variations in event-queue density, batch logging, or accumulated control messages—all typical phase-dependent load patterns in co-simulation.
Figure 7 shows that the candidate configurations span 2.44–2.71 kWh in total energy consumption. The minimum occurs at C 8 ( T = 1740 s, R = 20 ) with 2.44 kWh, whereas a more conservative combination (e.g., C 1 : T = 1380 s, R = 27 ) consumes 2.71 kWh. The recommended configuration coincides with the minimum-energy point, indicating that the solved ( T , R ) attains the minimum among the evaluated candidates in both wall-clock overhead and platform cost, which is beneficial for batch experiments and reproducibility. When steady-state power levels are similar, total energy is mainly determined by T × R : large R linearly amplifies the total runtime, and large T increases the per-run integral; candidates near the turning point avoid both effects and therefore more easily achieve the minimum.

4.1.2. Consistency of Network-Performance Conclusions

After obtaining the low-cost recommended configuration, we further examine whether it changes the conclusions on network performance. Specifically, under a load sweep V { 200 , 300 , 400 , 500 , 600 } , we run co-simulation using a baseline configuration and the recommended configuration, and compute steady-state statistics for PDR, throughput, and end-to-end delay. For comparison, the baseline uses longer runs and more repetitions (as shown in legends, T = 5000 s, R = 50 ) and serves as a stable reference, while the recommended configuration uses a lower-cost combination that satisfies the power constraint (e.g., T = 1740 s, R = 20 ).
Figure 8 shows that PDR decreases as the traffic load V increases, and the two curves overlap at all load points. This indicates that, after the overhead is reduced substantially, the recommended configuration still preserves both the level and the trend of delivery performance, supporting low-cost reproduction and fair comparisons.
Figure 9 shows that throughput increases with load at first and then enters saturation; the key turning point is at V = 400 , after which throughput stabilizes under higher load. The two configurations agree in both the turning-point location and the magnitude. This confirms that the recommended configuration does not alter the essential throughput judgment (including the turning point), supporting the goal of “cost reduction without sacrificing conclusions.” This shape is primarily determined by the protocol and load: more active flows improve throughput under moderate load, while collisions, backoff, and retransmissions drive saturation under high load.
Figure 10 shows that end-to-end delay increases monotonically with load, indicating that heavier congestion lengthens transmission waiting and queuing time. At the highest load V = 600 , the baseline yields 125 ms and the recommended configuration yields 121 ms, remaining in the same magnitude. This shows that the recommended configuration preserves the conclusion and trend that “higher load leads to larger delay” while reducing overhead. The delay increase is mainly caused by Medium Access Control (MAC) backoff, queue buildup, and retransmissions under high load; with power ensuring sufficient sample size, small differences at individual points are attributed to stochastic variability rather than systematic bias.
Under the power constraint, the recommended configuration concentrates cost reduction on the dominant runtime components and preserves the consistency of network-performance conclusions across different loads, validating “cost reduction without sacrificing credibility”.

4.1.3. Sensitivity Analysis of Calibrated Cost Coefficients

This subsection evaluates the robustness of the calibrated overhead model with respect to coefficient uncertainty. We first define the (dimensionless) normalized coefficient sensitivity S k = ( θ k / C run ) · ( C run / θ k ) , which quantifies the relative change in C run induced by a relative perturbation in coefficient θ k . For importance ranking we report | S k | to focus on the magnitude of impact (regardless of direction). We then perform two complementary analyses: (i) a coefficient-level sensitivity ranking based on | S k | , which identifies the coefficients that most strongly affect the predicted single-run overhead; and (ii) an uncertainty-propagation experiment that perturbs calibrated coefficients and examines the distribution of the normalized cost ratio C run / C ^ run under N = 1000 trials.
As shown in Figure 11, the tornado plot provides an intuitive view of how strongly each calibrated coefficient influences the predicted single-run overhead: the x-axis is the absolute normalized sensitivity | S k | , the y-axis lists coefficient names, and a longer bar means that, under the same relative perturbation magnitude, the coefficient induces a larger relative change in the total overhead; numerically, the three most influential coefficients are β 4 , γ 5 , and γ 4 , followed by δ 5 , α 4 , and β 6 , while all remaining coefficients have | S k |   < 0.052 ; because the total overhead aggregates multiple cost components, under the representative workload a small number of features associated with dominant runtime paths occupy a larger share in the sum and therefore yield higher sensitivities, whereas coefficients linked to amortized one-time costs or low-frequency trigger events contribute less and appear as short bars. For readability and to emphasize the factors that are most critical for configuration comparison, we only visualize coefficients with relatively large | S k | (i.e., terms that contribute more significantly to the relative change of C run ), and we group the remaining ones into a “low-sensitivity” set without enumerating them; notably, these coefficients are not omitted in the model computation, but their | S k | values under the representative workload and nominal configuration are all below the threshold (0.052), implying a limited perturbation response in the predicted total overhead and thus not altering the main robustness conclusions; moreover, some of these coefficients correspond to amortized one-time costs, low-frequency trigger events, or behave almost as a constant offset across candidate configurations under the given scenario and vehicle scale, so their uncertainty tends to affect the absolute overhead level more than the relative comparison and ranking among configurations.
As shown in Figure 12, the histogram shows how coefficient perturbations propagate to the predicted overhead: the x-axis is the normalized ratio C run / C ^ run , representing the perturbed prediction relative to the nominal prediction, and the y-axis is the frequency of occurrence across trials. Under N = 1000 perturbation trials, the standard deviation is approximately 0.0346, indicating that the ratio fluctuates mildly around 1; meanwhile, the 5–95% percentile interval is approximately [0.945, 1.057], meaning that about 90% of trials fall within this range. This result validates that the proposed cost model remains practically robust under moderate coefficient uncertainty: even if calibrated coefficients drift to some extent, configuration comparison and recommendation based on the cost model can still produce stable predicted outputs.

5. Conclusions

This paper investigated the problem of cost-aware parameter configuration in large-scale VANET traffic–network co-simulation. To address the limitations of conventional parameter setting methods that rely heavily on empirical trial-and-error and often struggle to balance overhead control, statistical validity, and configuration reproducibility, a structured configuration methodology was proposed. The method decomposes the overall experimental overhead, establishes a parameterized overhead model, and constructs a minimum-overhead configuration model under statistical reliability constraints, so that key parameters such as simulation horizon, repetition count, and sampling frequency can be selected according to explicit criteria. The proposed framework was validated in a representative urban road–network co-simulation scenario based on a SUMO-Veins-OMNeT++ workflow. Comparative experiments conducted under the baseline reference configuration and the recommended configuration showed that, while satisfying the statistical validity requirement, the recommended configuration reduced platform-side energy consumption from 2.71 kWh to 2.44 kWh. At the same time, the main performance conclusions remained consistent under both configurations: the PDR curves were largely overlapped across all tested traffic loads, and at the highest tested load of V = 600 , the end-to-end delay was 125 ms and 121 ms, respectively. These results indicate that the proposed method can effectively reduce the overall time and computational resource overhead of simulation experiments while preserving result credibility and fairness of comparison.
This work provides a more efficient, reproducible, and statistically credible experimental basis for the engineering evaluation of V2X protocols and cooperative ITS applications before field deployment. It can also help relevant researchers and practitioners shorten the cycle of simulation experiments and performance evaluation, thereby improving the efficiency of research output. Future work will further validate the proposed method under more diverse road–network topologies, traffic-demand patterns, and communication settings, and extend it to support adaptive configuration across different protocol stacks and application-oriented workloads, so as to enhance its generality and applicability.

Author Contributions

Conceptualization, Y.X. and Z.C.; methodology, Y.X.; software, H.H.; validation, Y.X., Z.C. and H.H.; formal analysis, Y.X.; investigation, Y.X.; resources, X.S.; data curation, Y.X.; writing—original draft preparation, Y.X. and H.H.; writing—review and editing, Y.X., Z.C. and X.S.; visualization, Z.C.; supervision, Z.C.; project administration, Z.C.; funding acquisition, X.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 42577504) and the S&T Program of Hebei (Grant No. 25351804K).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data and custom scripts supporting this study are available from the corresponding author upon reasonable request. They are not publicly archived at this stage.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Dutta, A.; Samaniego Campoverde, L.M.; Tropea, M.; De Rango, F. A Comprehensive Review of Recent Developments in VANET for Traffic, Safety & Remote Monitoring Applications. J. Netw. Syst. Manag. 2024, 32, 73. [Google Scholar] [CrossRef]
  2. Borah, A.; Paranjothi, A.; Thomas, J.P. A survey on distributed approaches for security enhancement in vehicular ad-hoc networks. Comput. Netw. 2025, 261, 111140. [Google Scholar] [CrossRef]
  3. Sohail, M.; Latif, Z.; Javed, S.; Biswas, S.; Ajmal, S.; Iqbal, U.; Raza, M.; Khan, A.U. Routing protocols in Vehicular Adhoc Networks (VANETs): A comprehensive survey. Internet Things 2023, 23, 100837. [Google Scholar] [CrossRef]
  4. Sutradhar, K.; Pillai, B.G.; Amin, R.; Narayan, D.L. A survey on privacy-preserving authentication protocols for secure vehicular communication. Comput. Commun. 2024, 219, 1–18. [Google Scholar] [CrossRef]
  5. Gan, L.; Chu, W.; Li, G.; Tang, X.; Li, K. Large models for intelligent transportation systems and autonomous vehicles: A survey. Adv. Eng. Inform. 2024, 62, 102786. [Google Scholar] [CrossRef]
  6. Bochenina, K.; Agriesti, S.; Roncoli, C.; Ruotsalainen, L. From Urban Data to City-Scale Models: A Review of Traffic Simulation Case Studies. IET Intell. Transp. Syst. 2025, 19, e70021. [Google Scholar] [CrossRef]
  7. Al-Msari, H.; Koting, S.; Ahmed, A.N.; El-Shafie, A. Review of driving-behaviour simulation: VISSIM and artificial intelligence approach. Heliyon 2024, 10, e25936. [Google Scholar] [CrossRef]
  8. Ormándi, T.; Varga, B. The importance of V2X simulation: An in-depth comparison of intersection control algorithms using a high-fidelity communication simulation. Veh. Commun. 2023, 44, 100676. [Google Scholar] [CrossRef]
  9. Stanica, R.; Chaput, E.; Beylot, A.L. Simulation of vehicular ad-hoc networks: Challenges, review of tools and recommendations. Comput. Netw. 2011, 55, 3179–3188. [Google Scholar] [CrossRef]
  10. Babu, S.; Kumar, A.R. A comprehensive survey on simulators, emulators, and testbeds for VANETs. Int. J. Commun. Syst. 2022, 35, e5123. [Google Scholar] [CrossRef]
  11. Songchitruksa, P.; Sunkari, S.; Ugalde, I.; Rosca, J.; Ojea, J.A. Interlinking VISSIM and ns-3 for Connected Vehicle Simulation: Case Study of Intelligent Dilemma Zone Avoidance. Transp. Res. Rec. 2017, 2619, 36–43. [Google Scholar] [CrossRef]
  12. Raviglione, F.; Risma Carletti, C.M.; Malinverno, M.; Casetti, C.; Chiasserini, C.F. ms-van3t: An integrated multi-stack framework for virtual validation of V2X communication and services. Comput. Commun. 2024, 217, 70–86. [Google Scholar] [CrossRef]
  13. Ormándi, T.; Varga, B. Mesoscopic V2X simulation framework to enhance simulation performance. Simul. Model. Pract. Theory 2024, 132, 103003. [Google Scholar] [CrossRef]
  14. Weber, J.S.; Neves, M.; Ferreto, T. VANET simulators: An updated review. J. Braz. Comput. Soc. 2021, 27, 8. [Google Scholar] [CrossRef]
  15. Araujo, G.; Peixoto, M.; Sampaio, L. A comprehensive and configurable simulation environment for supporting vehicular named-data networking applications. Comput. Netw. 2023, 235, 109949. [Google Scholar] [CrossRef]
  16. Wang, T.; Hussain, A.; Bhutta, M.N.M.; Cao, Y. Enabling bidirectional traffic mobility for ITS simulation in smart city environments. Future Gener. Comput. Syst. 2019, 92, 342–356. [Google Scholar] [CrossRef]
  17. Shi, Y.; Liu, Z.; Wang, Z.; Ye, J.; Tong, W.; Liu, Z. An integrated traffic and vehicle co-simulation testing framework for connected and autonomous vehicles. IEEE Intell. Transp. Syst. Mag. 2022, 14, 26–40. [Google Scholar] [CrossRef]
  18. Alupoaei, S.I.; Caruntu, C.F. Optimizing urban traffic efficiency and safety via V2X: A simulation study using the MOSAIC platform. Sensors 2025, 25, 5418. [Google Scholar] [CrossRef]
  19. Tomás, V.R.; García, L.A.; Alonso, L.A. An Agent-Based Platform to Evaluate V2X Routing Road Traffic Scenarios. Simul. Model. Pract. Theory 2023, 125, 102750. [Google Scholar] [CrossRef]
  20. Xu, R.; Xiang, H.; Han, X.; Xia, X.; Meng, Z.; Chen, C.J.; Correa-Jullian, C.; Ma, J. The OpenCDA Open-Source Ecosystem for Cooperative Driving Automation Research. IEEE Trans. Intell. Veh. 2023, 8, 2698–2711. [Google Scholar] [CrossRef]
  21. Jia, D.; Sun, J.; Sharma, A.; Zheng, Z.; Liu, B. Integrated simulation platform for conventional, connected and automated driving: A design from cyber–physical systems perspective. Transp. Res. Part C 2021, 124, 102984. [Google Scholar] [CrossRef]
  22. Wang, L.; Zhang, H.; Huang, Y.; Liu, J.; Ji, K.; Shi, B. Application Research of a V2X Semi-Physical Simulation Platform in Vehicle–Road Collaboration Experiments. World Electr. Veh. J. 2025, 16, 304. [Google Scholar] [CrossRef]
  23. Liang, Z.; Wang, J.; Zhang, T.; Yong, X. DTTF-Sim: A Digital Twin-Based Simulation System for Continuous Autonomous Driving Testing. Sensors 2025, 25, 3447. [Google Scholar] [CrossRef] [PubMed]
  24. Schrab, K.; Neubauer, M.; Protzmann, R.; Radusch, I.; Manganiaris, S.; Lytrivis, P.; Amditis, A.J. Modeling an ITS Management Solution for Mixed Highway Traffic With Eclipse MOSAIC. IEEE Trans. Intell. Transp. Syst. 2023, 24, 6575–6585. [Google Scholar] [CrossRef]
  25. Tampouratzis, N.; Mousouliotis, P.; Papaefstathiou, I. A Novel Integrated Simulation Framework for Cyber-Physical Systems Modelling. IEEE Trans. Parallel Distrib. Syst. 2023, 34, 2684–2698. [Google Scholar] [CrossRef]
  26. Stepanyants, V.; Karpov, A.; Margaryan, A.; Amerikanov, A.; Telpukhov, D.; Solovyev, R.; Romanov, A. Parallel multi-level simulation for large-scale detailed intelligent transportation system modeling. Future Transp. 2025, 5, 141. [Google Scholar] [CrossRef]
  27. Tangirala, N.T.; Sommer, C.; Knoll, A. Simulating Data Flows of Very Large Scale Intelligent Transportation Systems. In Proceedings of the SIGSIM-PADS ’24: Proceedings of the 38th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, Atlanta, GA, USA, 24–26 June 2024; pp. 98–107. [Google Scholar] [CrossRef]
  28. Häfner, B.; Jiru, J.; Schepker, H.; Schmitt, G.A.; Ott, J. Preventing failures of cooperative maneuvers among connected and automated vehicles. Comput. Commun. 2022, 193, 234–245. [Google Scholar] [CrossRef]
  29. Regragui, Y.; Moussa, N. A cellular automata model for urban traffic with multiple roundabouts. Chin. J. Phys. 2018, 56, 1273–1285. [Google Scholar] [CrossRef]
  30. Keramidi, I.P.; Uzunidis, D.; Moscholios, I.D.; Sarigiannidis, P.G. On Queueing Models for the Performance Analysis of a Vehicular Ad Hoc Network. In Proceedings of the 2022 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), Split, Croatia, 22–24 September 2022; pp. 1–6. [Google Scholar]
  31. Hu, L.; Liu, X.; Zhou, K.; Yao, Z. A Semi-Markov Process Model for Performance Evaluation of DSRC Vehicular Safety Communication. Math. Probl. Eng. 2022, 2022, 7548608. [Google Scholar] [CrossRef]
  32. Yeferny, T.; Ben Yahia, S.B. A Markov chain-based data dissemination protocol for vehicular ad hoc networks. Comput. Commun. 2021, 180, 303–314. [Google Scholar] [CrossRef]
  33. Badole, M.H.; Thakare, A.D. An optimized framework for VANET routing: A multi-objective hybrid model for data synchronization with digital twin. Int. J. Intell. Netw. 2023, 4, 272–282. [Google Scholar] [CrossRef]
Figure 1. Urban V2X communication scenario in traffic–network co-simulation (V2V and V2I links).
Figure 1. Urban V2X communication scenario in traffic–network co-simulation (V2V and V2I links).
Applsci 16 03264 g001
Figure 2. Taxonomy of the six overhead components in VANET traffic–network co-simulation. Phase: Initialization (Init) vs. Running (Run). Domain: Traffic vs. Network. Scope: Hatch pattern denotes the General Platform components, while a thick border denotes the Target Experiment component.
Figure 2. Taxonomy of the six overhead components in VANET traffic–network co-simulation. Phase: Initialization (Init) vs. Running (Run). Domain: Traffic vs. Network. Scope: Hatch pattern denotes the General Platform components, while a thick border denotes the Target Experiment component.
Applsci 16 03264 g002
Figure 3. Workflow for solving the minimum-cost configuration model: pilot runs and calibration construct candidate sets, and a finite search under power constraints yields the recommended configuration.
Figure 3. Workflow for solving the minimum-cost configuration model: pilot runs and calibration construct candidate sets, and a finite search under power constraints yields the recommended configuration.
Applsci 16 03264 g003
Figure 4. Cost composition of the total overhead ( V = 400 ).
Figure 4. Cost composition of the total overhead ( V = 400 ).
Applsci 16 03264 g004
Figure 5. Substitution between R min ( T ) and T under the power constraint (example at V = 400 ).
Figure 5. Substitution between R min ( T ) and T under the power constraint (example at V = 400 ).
Applsci 16 03264 g005
Figure 6. CPU package power trace in a representative single run.
Figure 6. CPU package power trace in a representative single run.
Applsci 16 03264 g006
Figure 7. Total energy consumption (kWh) of candidate configurations: differences across ( T , R ) combinations on the same platform.
Figure 7. Total energy consumption (kWh) of candidate configurations: differences across ( T , R ) combinations on the same platform.
Applsci 16 03264 g007
Figure 8. Network performance: PDR versus traffic load V (baseline vs. recommended configuration).
Figure 8. Network performance: PDR versus traffic load V (baseline vs. recommended configuration).
Applsci 16 03264 g008
Figure 9. Network performance: throughput x thr versus traffic load V (baseline vs. recommended configuration).
Figure 9. Network performance: throughput x thr versus traffic load V (baseline vs. recommended configuration).
Applsci 16 03264 g009
Figure 10. Network performance: end-to-end delay d e 2 e versus traffic load V (baseline vs. recommended configuration).
Figure 10. Network performance: end-to-end delay d e 2 e versus traffic load V (baseline vs. recommended configuration).
Applsci 16 03264 g010
Figure 11. Coefficient sensitivity of the calibrated overhead model (tornado plot; representative workload V = 400 ). Bar length indicates the absolute normalized sensitivity | S k | .
Figure 11. Coefficient sensitivity of the calibrated overhead model (tornado plot; representative workload V = 400 ). Bar length indicates the absolute normalized sensitivity | S k | .
Applsci 16 03264 g011
Figure 12. Uncertainty propagation from coefficient perturbations to the predicted single-run overhead. The histogram reports the distribution of the normalized ratio C run / C ^ run under N = 1000 independent coefficient-perturbation trials; the dashed line marks the nominal prediction (ratio equals 1).
Figure 12. Uncertainty propagation from coefficient perturbations to the predicted single-run overhead. The histogram reports the distribution of the normalized ratio C run / C ^ run under N = 1000 independent coefficient-perturbation trials; the dashed line marks the nominal prediction (ratio equals 1).
Applsci 16 03264 g012
Table 1. List of symbols.
Table 1. List of symbols.
SymbolDescription
SSimulation area.
LTotal road length, sum of lane centerlines.
VNumber of vehicles, traffic load.
v ¯ Average vehicle speed, steady-state expectation.
rCommunication radius.
TSimulation horizon per run, including warm-up.
Δ T win Steady-state window length.
τ Approximate contact duration.
v rel , v rel , max Relative-speed magnitude and its upper bound.
mRequired number of samples within the shortest contact.
f s Sampling and logging frequency.
RRepetition count, independent random seeds.
xConfiguration vector, e.g.,  x = ( f s , T , R , ) .
C run ( x ) Total overhead of a single simulation run.
C init ( x ) Initialization overhead.
C op ( x ) Steady-state operation overhead.
C road Road–network initialization overhead.
C veh Vehicle initialization overhead.
C net , init Network initialization overhead.
C traf-run Traffic-side runtime overhead.
C net-run Network-side runtime overhead.
C PS Sampling, logging, and I/O overhead.
ρ L Linear density ρ L = V L .
d ¯ ( r ) Approximation of average neighbor count or neighborhood size.
KNumber of rerouting or reconfiguration triggers.
F Candidate set of sampling frequencies.
T Candidate set of simulation horizons.
R ( T ; V ) Minimum repetitions satisfying the power constraint for given T and V.
y ( V ) Recommended minimum-cost configuration y = { f s , T , R } .
C ˜ tot Weighted total-cost objective.
Table 2. Summary of threats to validity and mitigation strategies.
Table 2. Summary of threats to validity and mitigation strategies.
ThreatMitigation Strategy (Key Points)
Synchronization errorsUnified clock; deterministic event order; timestamp checks.
Non-steady-state biasWarm-up removal; windowed statistics; report steady-state interval.
Sampling granularity effectsBound f s from motion cases; verify with a reference configuration.
Correlation and sample-size overestimationEffective-sample correction via η ( L , V ) ; block bootstrap if needed.
Scenario dependenceCross-load validation; report fit error; recalibrate for transfer.
Table 3. Hardware and software environment used in experiments.
Table 3. Hardware and software environment used in experiments.
ItemConfiguration
CPUi9-12900 (2.40 GHz)
Memory64.0 GB (4800 MHz)
Operating systemWindows 10
Co-simulation stackSUMO 1.8.0, OMNeT++ 5.6.2, Veins 5.2
Scripting languagePython 3.10
Table 4. Simulation scenario and runtime parameters.
Table 4. Simulation scenario and runtime parameters.
EntryValue
Scenario area S8.67 km2
Total road length L32 km
Simulation horizon T1200 s; warm-up 450 s; steady-state statistics in 60 s windows
Traffic load V200–500 (step 50) for load sensitivity analysis
Communication range r250 m
Sampling rate f s 1 Hz
Repetitions Rrepeated with multiple random seeds per configuration for stable statistics
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, Y.; Cai, Z.; Han, H.; Shao, X. Cost Modeling and Configuration Optimization for Large-Scale VANET Co-Simulation. Appl. Sci. 2026, 16, 3264. https://doi.org/10.3390/app16073264

AMA Style

Xu Y, Cai Z, Han H, Shao X. Cost Modeling and Configuration Optimization for Large-Scale VANET Co-Simulation. Applied Sciences. 2026; 16(7):3264. https://doi.org/10.3390/app16073264

Chicago/Turabian Style

Xu, Yang, Zhen Cai, Haozheng Han, and Xuqiang Shao. 2026. "Cost Modeling and Configuration Optimization for Large-Scale VANET Co-Simulation" Applied Sciences 16, no. 7: 3264. https://doi.org/10.3390/app16073264

APA Style

Xu, Y., Cai, Z., Han, H., & Shao, X. (2026). Cost Modeling and Configuration Optimization for Large-Scale VANET Co-Simulation. Applied Sciences, 16(7), 3264. https://doi.org/10.3390/app16073264

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop