Next Article in Journal
Impact of Leadership Relations and Mature Technology on Digital Technology Complementary Innovation
Previous Article in Journal
Integrating Carbon Tax and Subsidies: An Evolutionary Game Theory-Based Shore Power Promotional Strategy Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Stage Simulation of Residents’ Disaster Risk Perception and Decision-Making Behavior: An Exploratory Study on Large Language Model-Driven Social–Cognitive Agent Framework

by
Xinjie Zhao
1,†,
Hao Wang
2,*,†,
Chengxiao Dai
3,†,
Jiacheng Tang
4,†,
Kaixin Deng
5,
Zhihua Zhong
6,
Fanying Kong
7,
Shiyun Wang
8 and
So Morikawa
1,*
1
Department of Civil Engineering, The University of Tokyo, Tokyo 113-8656, Japan
2
College of Humanities and Development Studies, China Agricultural University, Beijing 100083, China
3
Faculty of Information and Communication Technology, Universiti Tunku Abdul Rahman, Kampar 31900, Malaysia
4
School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China
5
College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu 610059, China
6
School of Computing, Institute of Science Tokyo, Tokyo 152-8550, Japan
7
School of Aulin, Northeast Forestry University, Harbin 150040, China
8
Faculty of Science, The University of Copenhagen, 1870 Copenhagen, Denmark
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Systems 2025, 13(4), 240; https://doi.org/10.3390/systems13040240
Submission received: 10 November 2024 / Revised: 16 March 2025 / Accepted: 20 March 2025 / Published: 31 March 2025
(This article belongs to the Section Artificial Intelligence and Digital Systems Engineering)

Abstract

:
The escalating frequency and complexity of natural disasters highlight the urgent need for deeper insights into how individuals and communities perceive and respond to risk information. Yet, conventional research methods—such as surveys, laboratory experiments, and field observations—often struggle with limited sample sizes, external validity concerns, and difficulties in controlling for confounding variables. These constraints hinder our ability to develop comprehensive models that capture the dynamic, context-sensitive nature of disaster decision-making. To address these challenges, we present a novel multi-stage simulation framework that integrates Large Language Model (LLM)-driven social–cognitive agents with well-established theoretical perspectives from psychology, sociology, and decision science. This framework enables the simulation of three critical phases—information perception, cognitive processing, and decision-making—providing a granular analysis of how demographic attributes, situational factors, and social influences interact to shape behavior under uncertain and evolving disaster conditions. A case study focusing on pre-disaster preventive measures demonstrates its effectiveness. By aligning agent demographics with real-world survey data across 5864 simulated scenarios, we reveal nuanced behavioral patterns closely mirroring human responses, underscoring the potential to overcome longstanding methodological limitations and offer improved ecological validity and flexibility to explore diverse disaster environments and policy interventions. While acknowledging the current constraints, such as the need for enhanced emotional modeling and multimodal inputs, our framework lays a foundation for more nuanced, empirically grounded analyses of risk perception and response patterns. By seamlessly blending theory, advanced LLM capabilities, and empirical alignment strategies, this research not only advances the state of computational social simulation but also provides valuable guidance for developing more context-sensitive and targeted disaster management strategies.

1. Introduction

Natural hazard events—ranging from floods and hurricanes to earthquakes and pandemics—have intensified in frequency and severity, posing unprecedented challenges for societies worldwide. This escalating threat underscores a critical need to understand and predict human behavior in response to risk information [1]. How individuals and communities perceive, process, and act upon evolving disaster cues before, during, and after crisis events not only determines personal safety but also influences the collective resilience of entire populations [2]. Despite extensive research, understanding these dynamic processes remains limited due to constraints in capturing the cognitive complexity, cultural diversity, and temporal fluidity inherent in uncertain, high-stakes situations [3].
Traditional social science methodologies—such as surveys, interviews, field observations, and laboratory experiments—have provided valuable insights into disaster-related behaviors [4,5,6]. Yet, they face inherent limitations. Sample sizes are often small, ecological validity may be compromised, and controlling for confounding variables is challenging in real-time, unfolding crises [7]. As a result, it remains difficult to build comprehensive, scalable models that fully capture how individuals continuously update their perceptions and decisions as new information emerges and conditions change. Recent advances in computational social science have opened pathways to transcend these limitations by modeling complex social phenomena at unprecedented scales and fidelity [8,9,10]. Among these computational innovations, the rise of Large Language Models (LLMs) stands out as a transformative development. LLMs have exhibited remarkable capabilities in interpreting, generating, and reasoning over human language, grounded in the immense corpora of text-based data [11]. As a result, they implicitly learn cultural norms, social heuristics, and context-sensitive reasoning patterns [12,13]. These attributes are critical for simulating human-like decision-making, especially under the uncertainty and time pressures characteristic of disaster contexts.
Capitalizing on these breakthroughs, we propose an innovative multi-stage simulation framework driven by LLM-powered social–cognitive agents. Unlike traditional agent-based models that rely solely on predefined rules or static decision trees, LLM-driven agents dynamically interpret evolving information, apply theoretical concepts, and demonstrate human-like cognitive flexibility, which is anchored in a combination of established social–cognitive theories—such as risk perception and communication [14], protection motivation [15], social influence [16], bounded rationality [17], heuristics and biases [18], and planned behavior [19]—ensuring that the agents’ internal logic and external behaviors align with empirically grounded principles of human decision-making [20]. By examining how agents perceive risk, process conflicting cues, and execute protective measures at each stage, the framework provides granular insights into the interplay of individual characteristics, situational factors, and social influences. Advanced prompt engineering and cognitive architectures ensure that agents can incorporate evolving information streams and adapt to shifting conditions. Crucially, our framework addresses a significant gap in existing research: the integration of theoretical rigor with empirical grounding. We introduce a novel alignment strategy which helps ensure that the agent population’s demographic and cognitive attributes mirror those found in real-world survey data. By calibrating agent profiles to align with demographic distributions and questionnaire findings derived from a large-scale, demographically representative real-world survey (N = 1500) [21], we bridge the gap between simulated and real human populations. This empirical alignment enhances external validity, allowing the simulated agents to not only exhibit theoretically plausible behaviors but also produce survey response patterns statistically similar to those observed in actual human respondents. Thus, the model moves beyond abstract conceptions of cognition and behavior, offering a closer approximation to real-world decision-making under duress.
To demonstrate the practical utility and methodological strength of this approach, we present a case study on pre-disaster preventive measures for an impending flood. By analyzing 5864 simulated scenarios, we reveal nuanced behavioral patterns based on demographic characteristics and information environments, which highlights how integrating LLM technology, theory-driven cognitive modeling, and empirical alignment strategies can yield simulations that closely mimic human behavior patterns and illuminate the role of social amplification and information credibility in shaping decisions.
The contributions of this research are threefold:
1.
A Theoretically Grounded Simulation Framework: We develop a comprehensive model that weaves established psychological, sociological, and decision science theories into agent-based disaster simulations, surpassing the limitations of simplistic, rule-based models.
2.
Empirical Alignment with Real-World Data: By aligning the simulated agent population to actual demographic distributions and survey metrics, we enhance external validity, bridging the gap between theoretical models and observed human responses.
3.
Rich Behavioral Insights for Disaster Management: Our multi-stage, information-centric approach reveals how varied demographics, social networks, and cognitive heuristics interact with evolving information environments to shape preventive decisions, which can guide risk communication strategies, inform policy design, and ultimately improve community resilience.
By acknowledging the current limitations—such as the need for integrating emotional factors, cultural nuances, and non-verbal communication modalities—we chart a path for future research expansions. The framework also suggests avenues for incorporating multimodal data streams, strengthening ethical safeguards, and refining parameter tuning through real-time observational data. This research lays the groundwork for a new generation of computational social simulations that blend robust theoretical foundations, cutting-edge LLM capabilities, and empirical validations. By doing so, it strives to produce more credible, adaptable, and context-sensitive insights into the intricate decision-making processes that define human responses to disasters.

2. Related Works

Understanding and predicting human behavior in disaster contexts calls for a multidisciplinary integration of insights from risk perception theories and their evolution, computational modeling approaches for scenario simulation, and the emerging capacity of LLMs to model complex social–cognitive processes. This section examines these three areas in detail, elucidating the conceptual underpinnings and methodological innovations that collectively shape our proposed framework.

2.1. Evolution of Disaster Risk Information Perception Research

Over the past few decades, research on disaster risk information perception has evolved from portraying individuals as isolated, rational decision-makers to highlighting the social, cultural, and psychological roots of risk perception and response [22,23,24,25,26]. Early cognitive-processing theories posited that people evaluate risk primarily based on rational appraisals of severity and probability [27]. However, empirical studies across diverse disaster contexts have demonstrated that purely rationalist perspectives fail to fully explain observed behaviors, such as the under- or overestimation of threats, delayed responses, and strong emotional influences [28]. A critical turning point in understanding disaster risk emerged with the introduction of the social amplification of risk framework [29]. This framework illustrated that risk perception extends beyond individual cognition, encompassing collective processes shaped by social networks, media, and cultural systems [30,31]. As risk information travels through both formal and informal channels, it can be amplified or attenuated through interpretation, discussion, and communal transformation. This cyclical feedback loop—linking information flow, cultural interpretation, and group dynamics—profoundly influences how individuals perceive and respond to disasters. Meanwhile, the digital transformation of communication has given rise to multi-layered information ecosystems [32], where official alerts intermix with user-generated content on social media and other online platforms. These environments heighten the complexity of risk perception, as people navigate abundant—often conflicting—signals [33,34]. Longitudinal research further underscores the dynamic nature of risk perception, showing that individuals’ risk assessments evolve over time to incorporate prior experiences, cultural memory, and unfolding environmental changes [35,36,37]. This holistic perspective, integrating cognitive, social, and temporal dimensions, underscores the necessity for models that can accommodate multiple theoretical constructs and adapt to shifting conditions. Drawing on these insights, our proposed framework embeds social–cognitive theories into agent architectures, thereby capturing the evolving, socially influenced nature of disaster risk perception with greater fidelity.

2.2. Computational Approaches for Disaster Scenario Simulation

Computational modeling has long been integral to understanding human behavior and decision-making in disaster contexts. Early agent-based models (ABMs) largely concentrated on simulating evacuation patterns and emergency responses using simplified behavioral rules [38,39,40,41,42,43]. While these pioneering efforts laid important groundwork, they frequently failed to capture the psychological and social complexities behind real-world decision-making [44]. In more recent years, researchers have integrated human mobility data and social network analysis, drawing on diverse streams such as mobile phone records and social media trajectories to explore the interaction among infrastructure, communication networks, and population movements [45,46,47]. As computational capabilities and data availability have surged, hybrid models have emerged that combine ABMs with machine learning or optimization methods to reflect both individual-level decision processes and emergent group behaviors [48,49]. By incorporating real-time data feeds and adaptive learning mechanisms, these models can dynamically update behavioral parameters in response to changing conditions [50,51,52]. The resulting simulations are not only more nuanced but also better equipped to predict population behavior, thereby enhancing disaster preparedness, emergency responses, and policy interventions [53].

2.3. LLM-Driven Social Simulation

The emergence of LLMs has ushered in a significant shift in social simulation research [54,55,56,57,58]. Unlike conventional agent-based models (ABMs) that rely on fixed, rule-based behavioral definitions, LLM-based agents can process and generate human-like language, embody culturally rooted norms, and adapt to varied contexts [59,60]. Through mechanisms such as chain-of-thought reasoning and multi-step cognitive processing, these LLM Agents transcend simple stimulus–response paradigms, thereby enabling more nuanced simulations of mental states and decision-making pathways [61,62,63]. Recent experimental applications, notably the Stanford Smallville project, illustrate the potential of LLM Agents to exhibit rich social dynamics, develop relationships, and pursue goal-oriented behaviors within persistent environments [64]. These advancements stem from the synergy between structured cognitive architectures and sophisticated prompt engineering [65,66], which combine to enable agents to maintain coherent personalities, active memory states, and adaptive social interactions [67,68].
By modeling complex phenomena such as opinion formation, the emergence of social norms, and collective decision-making [9,69,70], LLM-driven simulations offer unprecedented insights into how individual cognition intersects with network structures to influence group behavior. This includes key applications in disaster contexts, where emotional contagion, rumor propagation, and trust dynamics can critically affect the effectiveness of risk communication and emergency interventions [71]. However, harnessing LLMs for disaster-focused social simulation is not without challenges. Ongoing debates concern the models’ fidelity in replicating human emotional and stress responses under extreme conditions [72], their vulnerability to biases, and their capacity to generalize across diverse cultural settings [73]. In response, recent work concentrates on anchoring these models in empirical data and established theoretical constructs [74,75], while also exploring hybrid approaches that integrate LLMs with other computational methods [76]. Building upon these emerging best practices, our framework links LLM-driven agents to robust theoretical underpinnings and carefully designed experimental protocols, with the aim of more accurately capturing the complex, context-dependent dynamics of disaster responses.

3. LLM-Driven Social–Cognitive Simulation Framework for Disaster Risk Perception and Decision-Making

As illustrated in Figure 1, our proposed framework integrates a comprehensive disaster-related information ecosystem with a theoretically grounded, multi-modular approach. Subsequently, Figure 2 highlights the construction of LLM-driven social–cognitive agents designed to simulate residents’ perception, cognitive processing, and decision-making under evolving disaster conditions. This framework provides a flexible, empirically informed platform adaptable to diverse disaster scenarios, laying the groundwork for effective risk communication and management strategies in the sections that follow.

3.1. Theoretical Foundations

Decision-making in disaster contexts emerges from a complex interplay of uncertainty, limited information, urgent time constraints, and social influence [77]. To capture these intricacies, our framework builds upon a suite of well-established theories from psychology, sociology, and decision science [78]. These theoretical underpinnings guide the conceptual design of each module in the simulation pipeline, ensuring that cognitive and behavioral processes are modeled in a scientifically consistent manner [79,80].
  • Risk Perception and Communication [14]: Risk perception is shaped by subjective judgments and can be amplified or attenuated by social factors. Incorporating this perspective allows us to simulate how agents process heterogeneous and sometimes conflicting disaster-related information, laying the groundwork for initial cognitive reactions.
  • Protection Motivation [15]: Centered on threat and coping appraisals, this theory helps model how agents assess the severity and probability of disasters in relation to their perceived coping capabilities—ultimately guiding the formation of protective intentions.
  • Social Influence [16]: Peer norms, cultural values, and information cascades play a significant role in shaping individual and collective behaviors. Integrating social influence mechanisms enables us to model how agent networks propagate and filter risk information, resulting in convergence or divergence in group decision-making.
  • Bounded Rationality [17]: Under time constraints and limited cognitive capacity, individuals rely on heuristics. Modeling bounded rationality allows the simulation of more realistic, and often suboptimal, decision outcomes under crisis conditions.
  • Heuristics and Biases [18]: Agents frequently resort to cognitive shortcuts that introduce biases into judgment and decision processes. Accounting for these tendencies captures the non-linear and sometimes counterintuitive responses individuals exhibit under uncertainty.
  • Planned Behavior [19]: Attitudes, subjective norms, and perceived behavioral control critically affect long-term preparedness and recovery strategies. Incorporating these constructs ensures our framework can simulate enduring behavioral adjustments beyond the immediate disaster event.
Together, these theories form the conceptual backbone of our LLM-driven framework, ensuring that agent behaviors reflect both the cognitive and social realities of disaster decision-making.

3.2. LLM-Driven Social–Cognitive Resident Agent

At the core of our framework lies the social–cognitive resident agent—a computational entity endowed with demographic attributes, personality traits, risk attitudes, and social network features (see Table 1). Each agent functions as an LLM-driven decision-maker, continuously interpreting incoming information and updating its cognitive and behavioral states over time [65]. By equipping agents with diverse backgrounds and cognitive styles, the simulation can capture heterogeneous behavioral patterns across various disaster scenarios. This bottom–up approach ensures that higher-level, system-wide behaviors naturally emerge from individual interactions, thereby enhancing the ecological validity of our model [81,82,83].

3.3. Multi-Module Approach for Modeling Social Cognition

Our framework adopts a multi-module design that partitions the decision-making process into three interconnected modules: information perception, cognitive processing, and decision-making. By delineating these stages, the model facilitates a more precise examination of how information evolves into action and how theoretical constructs inform each phase of the decision process. Table 2 details the components and processes of each module. The theories outlined earlier guide the flow of data and the cognitive transformations across all three stages. Table 3 then summarizes how each theoretical perspective aligns with these core modules, ensuring a cohesive, theory-driven modeling environment.

3.4. Multi-Stage Experiment Design and Implementation

We employ a multi-phase simulation that follows agents through the pre-disaster, during-disaster, and post-disaster stages, each with distinct tasks and informational inputs.

3.4.1. Disaster Contextual Modeling

The disaster environment is modeled by integrating three principal data dimensions (Figure 1):
1.
Regional Characteristics: Geographical, socio-economic, and infrastructural variables are included in agent initialization and scenario parameterization.
2.
Disaster-Specific Parameters: Event attributes (e.g., type, intensity, duration, and onset speed) dynamically update throughout the simulation.
3.
Information Ecosystem: Heterogeneous data streams (official alerts, media reports, and social media posts) are iteratively provided to agents, reflecting the real-time evolution of disaster conditions.
By encoding this complexity, the model captures agent responses under realistic constraints, enabling a robust evaluation of information-processing strategies [87].

3.4.2. Experimental Procedure and Data Flow

To enhance clarity and reproducibility, Table 4 presents a structured experimental workflow, highlighting key stages, input/output formats, and analytical methods. This pipeline demonstrates how theoretical constructs shape the input prompts, how LLMs interpret them, and how agent decisions develop over time. Throughout these phases, the following elements are emphasized:
  • Data Quality Control: Systematic preprocessing and filtering to ensure agents receive plausible, context-sensitive information.
  • Parameter Configuration and Tuning: Detailed configuration of LLM parameters (e.g., temperature and maximum tokens) and prompt engineering strategies (e.g., chain-of-thought reasoning), supplemented by fine-tuning with domain-specific texts to improve realism.
  • Validation Techniques: Cross-verification using synthetic test cases and, where possible, partial empirical datasets from historical disasters to verify model plausibility and outcome patterns.
  • Comparison and Control Experiments: Introduction of baseline conditions (e.g., agents without social influence or without heuristic decision-making) for benchmarking and isolating the effects of specific theoretical components.

3.4.3. Multi-Stage Task Design

Each phase of the disaster scenario is associated with distinct experimental tasks, aligned with relevant theoretical constructs:
1.
Pre-Disaster Phase: Agents implement preventive measures, interpret early warnings, and filter low-credibility sources. This phase examines how risk perception and protection motivation theories predict baseline levels of preparedness [88,89].
2.
During-Disaster Phase: Under time pressure and volatile conditions, agents must make rapid decisions (e.g., evacuation vs. shelter-in-place). Social influence, bounded rationality, and heuristic biases are particularly salient. Data analysis focuses on the diversity of chosen actions and their correlation with agent attributes [90].
3.
Post-Disaster Phase: Agents modify their long-term attitudes based on outcomes and feedback. This stage evaluates how the theory of planned behavior influences sustained changes in risk mitigation behaviors, offering insights into resilience and adaptation [91].

4. Framework Implementation

4.1. Disaster Contextual Modeling

In real-world disasters, information streams are diverse, rapidly changing, and often uncertain [87]. To emulate this complexity, we employ a dynamic contextual modeling system that continuously updates the disaster environment and information ecosystem. As shown in Figure 3, this system integrates three core domains:
1.
Physical Environment: Periodically updates hazard characteristics (e.g., flood extent and infrastructure damage) and resource availability. These updates reflect changes in vulnerability over time and influence agents’ risk assessments and coping strategies.
2.
Information Ecosystem: Collects and processes multi-source inputs, including official alerts, sensor data, and social media posts. It assigns credibility scores based on source reliability and consistency, ensuring that agents are exposed to a realistic mix of reliable and unreliable information.
3.
Social System: Models the evolution of social networks, trust relationships, and collective behaviors as the disaster unfolds. This domain captures how rumors spread, how community leaders emerge, and how norms evolve, all of which shape agent decision-making.

4.2. Theory-Driven Multimodal Agent Architecture

Our agent architecture, informed by the theoretical underpinnings outlined in Section 3, is designed to translate complex psychological constructs into computational steps that guide agent decisions. This architecture leverages prompt engineering techniques, ensuring that each decision-making layer aligns with the established theories of risk perception, protection motivation, social influence, bounded rationality, heuristics and biases, and planned behavior.
The core of this architecture is a multi-layered processing pipeline:
  • Base Layer: Initializes agent attributes (personality traits, demographics, and risk attitudes) and sets their initial cognitive states. This layer ensures that agents differ in their baseline responses to incoming information.
  • Intermediate Layer: Applies theoretical constructs to incoming stimuli. For example, risk perception informs threat judgments; social influence modulates how peer behaviors shape agent norms; and bounded rationality introduces the heuristic-based filtering of options.
  • Executive Layer: Synthesizes evaluated information into concrete actions. Here, planned behavior theory aids in translating intentions into strategies, while heuristics and biases may cause agents to deviate from purely rational choices.

4.3. Multimodal Integration

Figure 4 illustrates how multiple modules interact. Each module—information perception, cognitive processing, and decision formation—functions as a distinct unit with well-defined inputs and outputs, yet they operate in tandem. The interplay of these modules ensures that complex human-like decision patterns emerge organically. We introduce a control mechanism for scenario parameters through a structured configuration file. This file details scenario-specific variables such as disaster intensity thresholds, resource distribution limits, and social network density parameters. Table 5 provides an example configuration snippet, illustrating how scenario initialization and dynamic updates are standardized. Such an approach ensures that experiments are easily replicable and that variations in input conditions are transparently documented.

5. Case Study: Decision-Making in Pre-Disaster Preventive Measures

To evaluate the capacity of our multistage social–cognitive LLM-agents framework to capture human-like decision-making in complex and uncertain disaster contexts, we conduct a case study focused on pre-disaster preventive measures for an impending flood. Building on a historically significant flood event, this scenario integrates empirical survey data through an alignment strategy to enhance the external validity of the simulation. By combining the original experimental design with the newly implemented alignment method, we provide a comprehensive demonstration of the framework’s capabilities.

5.1. Real-World Context and Empirical Data Alignment

The case study is grounded in the catastrophic flooding that struck Henan Province, China, in July 2021. In the provincial capital, Zhengzhou, extreme rainfall exceeding 600 mm over a short period led to severe urban inundation, reservoir failures, and widespread infrastructure damage [92]. More than 14.78 million people were affected, with direct economic losses exceeding RMB 120.06 billion. Approximately 95.5% of the casualties occurred in Zhengzhou alone, underscoring the localized severity of the disaster. During this crisis, residents relied on multiple sources of information—official weather alerts, government broadcasts, mainstream media, and social media—often characterized by delays, inconsistencies, or uncertain credibility. Building upon this, we devise a virtual flood scenario mirroring the complex information ecosystem. Such complexity, as is shown in Figure 5, marked by uncertain information flows, credibility dilemmas, and high-stakes decisions, provides an ideal testbed for examining whether our LLM-driven agents can emulate the cognitive and social nuances observed in human responses to emergencies.
To strengthen the ecological validity of our simulation, we employ an alignment strategy that incorporates empirical survey data from a doctoral study conducted in Zhengzhou [21]. This original survey, involving approximately 1500 respondents from the real world, offers detailed demographic distributions (e.g., gender, age, income, and education) and validates the questionnaire measures related to disaster risk perception and preparedness behaviors. Rather than arbitrarily assigning the LLM-driven agent demographics, we create a subset of 300 agents whose demographic proportions closely match those in the real-world sample. Table 6 summarizes how the simulated agent population (N = 300) aligns with the real-world sample (N = 1500) across key demographic attributes.
To further validate our simulation outcomes, we administered the same questionnaire from [21] to an aligned subset of 300 agents. This instrument measures disaster familiarity, perceived severity, manageability, and basic knowledge on a 1-to-5 Likert scale. By reproducing the contextual conditions faced by human respondents, these agents provided responses directly comparable to real-world data. Matching the demographic distributions and survey items ensured that the agent cohort closely paralleled the actual sample, thereby strengthening the validity of subsequent statistical comparisons.
For this 300-agent subset, the alignment strategy minimized bias by preserving essential demographic features. This approach yielded a representative simulated population, capable of mirroring the community surveyed in the real study. Table 7 shows that discrepancies between mean scores for both groups remained within ±0.13, indicating that aggregated agent responses approximate human patterns across constructs such as disaster familiarity, perceived severity, manageability, and knowledge. These close alignments highlight the effectiveness of anchoring agent attributes to empirically observed distributions, thereby enhancing external validity.

5.2. Experimental Results and Analysis

Building on these findings, we expand the aligned agent population to 6400 to capture a broader range of cognitive and social dynamics. Each agent is exposed to multiple information streams and tasked with selecting from four disaster preparedness options (see Table 8). They also provide a 0 to 10 confidence score to gauge subjective certainty in their chosen action, enabling the exploration of how the perceived and actual decision quality interrelate.
By integrating the alignment strategy that reflects observed demographic distributions and replicates validated survey measures, this framework not only offers large-scale insights into collective decision-making but also ensures consistency with the real-world data. The 300-agent subset validates the approach through close statistical alignment, and the 6400-agent population extends these observations to a wider spectrum of behaviors, further solidifying the platform’s applicability to disaster decision-making research.
In the large-scale experiment of 6400 agents, we observe nuanced decision-making patterns shaped by demographic factors, information credibility, and cognitive biases. Figure 6 summarizes the distribution of selected options and corresponding confidence scores. When confronted with conflicting or uncertain data, agents frequently display risk aversion (e.g., reducing or halting preparations) and at times exhibit overconfidence in potentially suboptimal choices.
As illustrated in Figure 7, decision tendencies vary across demographic lines: older or more analytically inclined agents tended to favor caution (Options A or D), whereas younger or more risk-tolerant agents more readily alter plans (Options B or C). Figure 8 further indicates that positive official updates encourage proactive decision-making (often Option D), whereas negative official signals reinforce conservative behaviors (Options A or B). Conflicting signals between official and personal sources introduce cognitive dissonance, leading to lower consensus and reduced confidence.

6. Discussion

This study introduced a novel simulation framework that leverages LLM-driven social–cognitive agents to model dynamic, multi-stage decision-making in disaster contexts. Building on well-established theories of risk perception, social influence, and planned behavior, the framework offers a more nuanced perspective on how individuals and communities respond to evolving threats, uncertain information, and diverse social pressures. Our experimental scenarios demonstrated the framework’s ability to replicate recognizable cognitive patterns, such as increased risk aversion and heightened sensitivity to information credibility. Incorporating heuristics, bounded rationality, and social amplification mechanisms helped align simulated agent behaviors with observed human tendencies. Nonetheless, bridging the gap between simulated outcomes and real-world decisions remains challenging. Factors like emotional contagion, cultural norms, and non-verbal communication—dimensions not fully captured in the current model—are key to a more comprehensive understanding of disaster responses.
Another notable strength is the framework’s adaptability. By integrating diverse theoretical constructs within a flexible agent architecture, it can be tailored to various disaster contexts and population characteristics. However, this adaptability comes with challenges. Generalizing across different socio-economic and geographical settings requires extensive data collection, careful parameter calibration, and iterative testing. The current reliance on synthetic or simulated scenarios underscores the importance of empirical validation through real-world datasets and case studies. Such efforts will strengthen model credibility and increase its practical utility for disaster management.
Moreover, while LLM-driven agents enable context-aware and adaptive simulations, they also present concerns regarding data security, privacy, and ethics. The simulation must incorporate robust data governance protocols, encryption methods, and adversarial detection strategies to prevent manipulation or misuse. Handling sensitive information—such as geographic locations or psychological profiles—requires stringent safeguards to maintain both user privacy and model integrity.
Ultimately, this framework serves as an initial step toward capturing the complexity and fluidity of disaster decision-making. Recognizing limitations—such as the absence of nuanced emotional factors, the reliance on textual inputs, and constraints in modeling long-term cultural shifts—provides a roadmap for targeted enhancements. Rather than a final solution, the proposed framework should be viewed as a foundation for ongoing refinement, empirical testing, and theoretical expansion. Through ongoing interdisciplinary collaboration, it can evolve into a more comprehensive tool for elucidating and supporting human responses in crisis scenarios.

7. Future Work

Building on the current framework, several research directions can further strengthen its empirical grounding, theoretical depth, and practical applicability:
1.
Empirical Validation and Cultural Integration: Future efforts should incorporate real-world datasets from historical disasters or controlled field studies to calibrate and validate model parameters. Extending agent profiles to reflect cultural, linguistic, and socio-economic variations will improve generalizability across diverse regions and populations.
2.
Ethical and Security Measures: As simulations increasingly involve sensitive information, developing rigorous privacy protections, anonymization approaches, and secure data storage becomes paramount. Adversarial testing protocols are needed to ensure model outputs remain robust and resistant to manipulative inputs.
3.
Adaptive, Multimodal Information Processing: Enriching agent perceptions with multimedia data—such as satellite imagery, sensor feeds, and live video—can more closely approximate real-time environmental cues. Additionally, adaptive feedback mechanisms that refine decision rules based on newly acquired data may enhance system resilience under rapidly changing conditions.
4.
Collective Behavior and Social Network Dynamics: Future studies can incorporate more granular social network structures, community-driven feedback loops, and emergent social norms to capture complex collective behaviors under stress and uncertainty. Modeling phenomena like rumor diffusion and group-based resource sharing will offer deeper insights into crowd dynamics.
By pursuing these directions, the framework can evolve from an exploratory platform into a robust decision-support system, aiding stakeholders in devising targeted communication strategies, evaluating policy interventions, and bolstering community resilience.

8. Conclusions

This study introduced an integrative simulation framework that merges state-of-the-art Large Language Model (LLM)-based agents with social–cognitive theories and a multi-phase disaster scenario. By coupling theoretical rigor with computational innovation, the proposed approach lays the groundwork for more comprehensive, data-driven decision support. It offers a novel perspective on how heterogeneous populations interpret risk information, form intentions, and take action under uncertain and rapidly changing conditions. While the results underscore the potential for elucidating nuanced behavioral patterns and informing disaster management strategies, they also reveal several limitations, including the insufficient treatment of emotional factors, limited cultural diversity, and a narrow range of data modalities. Despite these challenges, the framework constitutes a significant advance toward more holistic and context-sensitive disaster simulations, ultimately aimed at improving human responses to disasters.

Author Contributions

Conceptualization, X.Z., H.W., S.M., S.W. and Z.Z.; methodology, X.Z., C.D., H.W., S.W., K.D., Z.Z. and F.K.; validation, X.Z., S.M., H.W. and K.D.; formal analysis, S.M. and H.W.; data curation, X.Z., C.D. and F.K.; writing—original draft preparation, X.Z., C.D., S.W., K.D. and F.K.; writing—review and editing, X.Z., C.D., S.W., H.W., S.M., K.D., Z.Z. and F.K.; visualization, X.Z., C.D., J.T., S.W., K.D. and F.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the Youth Project of the National Social Science Foundation of China “Research on Unbalanced and Insufficient Development of Social Organizations Based on Big Data Method” (20CSH089).

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Sitkin, S.B.; Pablo, A.L. Reconceptualizing the determinants of risk behavior. Acad. Manag. Rev. 1992, 17, 9–38. [Google Scholar]
  2. Parsons, M.; Glavac, S.; Hastings, P.; Marshall, G.; McGregor, J.; McNeill, J.; Morley, P.; Reeve, I.; Stayner, R. Top-down assessment of disaster resilience: A conceptual framework using coping and adaptive capacities. Int. J. Disaster Risk Reduct. 2016, 19, 1–11. [Google Scholar]
  3. Buchanan, D.A.; Denyer, D. Researching tomorrow’s crisis: Methodological innovations and wider implications. Int. J. Manag. Rev. 2013, 15, 205–224. [Google Scholar] [CrossRef]
  4. Mukherji, A.; Ganapati, N.E.; Rahill, G. Expecting the unexpected: Field research in post-disaster settings. Nat. Hazards 2014, 73, 805–828. [Google Scholar]
  5. Burger, A.; Oz, T.; Kennedy, W.G.; Crooks, A.T. Computational social science of disasters: Opportunities and challenges. Future Internet 2019, 11, 103. [Google Scholar] [CrossRef]
  6. Cai, J.; Hu, S.; Que, T.; Li, H.; Xing, H.; Li, H. Influences of social environment and psychological cognition on individuals’ behavioral intentions to reduce disaster risk in geological hazard-prone areas: An application of social cognitive theory. Int. J. Disaster Risk Reduct. 2023, 86, 103546. [Google Scholar]
  7. Feng, Y.; Duives, D.; Daamen, W.; Hoogendoorn, S. Data collection methods for studying pedestrian behaviour: A systematic review. Build. Environ. 2021, 187, 107329. [Google Scholar]
  8. Henderson, J.C.; Nutt, P.C. The influence of decision style on decision making behavior. Manag. Sci. 1980, 26, 371–386. [Google Scholar] [CrossRef]
  9. Zhao, X.; Wang, S.; Wang, H. Organizational geosocial network: A graph machine learning approach integrating geographic and public policy information for studying the development of social organizations in China. ISPRS Int. J. Geo-Inf. 2022, 11, 318. [Google Scholar] [CrossRef]
  10. Ziems, C.; Held, W.; Shaikh, O.; Chen, J.; Zhang, Z.; Yang, D. Can large language models transform computational social science? Comput. Linguist. 2024, 50, 237–291. [Google Scholar]
  11. Chen, J.; Liu, Z.; Huang, X.; Wu, C.; Liu, Q.; Jiang, G.; Pu, Y.; Lei, Y.; Chen, X.; Wang, X.; et al. When large language models meet personalization: Perspectives of challenges and opportunities. World Wide Web 2024, 27, 42. [Google Scholar] [CrossRef]
  12. Ma, J. Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games. arXiv 2024, arXiv:2410.21359. [Google Scholar]
  13. Zhao, X.; Blum, M.; Yang, R.; Yang, B.; Carpintero, L.M.; Pina-Navarro, M.; Wang, T.; Li, X.; Li, H.; Fu, Y.; et al. AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data. arXiv 2024, arXiv:2410.11531. [Google Scholar]
  14. Fischhoff, B.; Bostrom, A.; Quadrel, M.J. Risk perception and communication. Annu. Rev. Public Health 1993, 14, 183–203. [Google Scholar] [CrossRef]
  15. Boer, H.; Seydel, E.R. Protection motivation theory. In Predicting Health Behaviour: Research and Practice with Social Cognition Models; Conner, M., Norman, P., Eds.; Open University Press: Maidenhead, UK, 1996; pp. 95–120. [Google Scholar]
  16. Cialdini, R.B.; Goldstein, N.J. Social influence: Compliance and conformity. Annu. Rev. Psychol. 2004, 55, 591–621. [Google Scholar] [CrossRef]
  17. Simon, H.A. Models of Bounded Rationality: Empirically Grounded Economic Reason; MIT Press: Cambridge, MA, USA, 1997; Volume 3. [Google Scholar]
  18. Tversky, A.; Kahneman, D.; Slovic, P. Judgment Under Uncertainty: Heuristics and Biases; Cambridge University Press: Cambridge, UK, 1982. [Google Scholar]
  19. Ajzen, I. From Intentions to Actions: A Theory of Planned Behavior; Springer: Berlin/Heidelberg, Germany, 1985; pp. 11–39. [Google Scholar]
  20. Beratan, K.K. A cognition-based view of decision processes in complex social–ecological systems. Ecol. Soc. 2007, 12, 27. [Google Scholar] [CrossRef]
  21. Liu, L. Study on Factors Influencing Public Risk Perception in Urban Heavy Rain and Flood Disasters. Ph.D. Dissertation, Party School of the Central Committee of the C.P.C. (National Academy of Governance), Beijing, China, 2023. [Google Scholar] [CrossRef]
  22. Williams, D.J.; Noyes, J.M. How does our perception of risk influence decision-making? Implications for the design of risk information. Theor. Issues Ergon. Sci. 2007, 8, 1–35. [Google Scholar] [CrossRef]
  23. Giardini, F.; Vilone, D. Opinion dynamics and collective risk perception: An agent-based model of institutional and media communication about disasters. JASSS—J. Artif. Soc. Soc. Simul. 2021, 24, 4. [Google Scholar] [CrossRef]
  24. Wei, J.; Wang, F.; Lindell, M.K. The evolution of stakeholders’ perceptions of disaster: A model of information flow. J. Assoc. Inf. Sci. Technol. 2016, 67, 441–453. [Google Scholar] [CrossRef]
  25. Xu, D.; Peng, L.; Su, C.; Liu, S.; Wang, X.; Chen, T. Influences of mass monitoring and mass prevention systems on peasant households’ disaster risk perception in the landslide-threatened Three Gorges Reservoir area, China. Habitat Int. 2016, 58, 23–33. [Google Scholar] [CrossRef]
  26. Antronico, L.; Coscarelli, R.; De Pascale, F.; Condino, F. Social perception of geo-hydrological risk in the context of urban disaster risk reduction: A comparison between experts and population in an area of Southern Italy. Sustainability 2019, 11, 2061. [Google Scholar] [CrossRef]
  27. Sampson, J.P., Jr.; Lenz, J.G.; Reardon, R.C.; Peterson, G.W. A cognitive information processing approach to employment problem solving and decision making. Career Dev. Q. 1999, 48, 3–18. [Google Scholar] [CrossRef]
  28. Suarez, D.; Gomez, C.; Medaglia, A.L.; Akhavan-Tabatabaei, R.; Grajales, S. Integrated decision support for disaster risk management: Aiding preparedness and response decisions in wildfire management. Inf. Syst. Res. 2024, 35, 609–628. [Google Scholar] [CrossRef]
  29. Rühlemann, A.; Jordan, J.C. Risk perception and culture: Implications for vulnerability and adaptation to climate change. Disasters 2021, 45, 424–452. [Google Scholar] [CrossRef] [PubMed]
  30. Kasperson, R.E.; Renn, O.; Slovic, P.; Brown, H.S.; Emel, J.; Goble, R.; Kasperson, J.X.; Ratick, S. The social amplification of risk: A conceptual framework. Risk Anal. 1988, 8, 177–187. [Google Scholar] [CrossRef]
  31. Bentley, R.A.; O’Brien, M.J.; Brock, W.A. Mapping collective behavior in the big-data era. Behav. Brain Sci. 2014, 37, 63–76. [Google Scholar] [CrossRef]
  32. Fekete, A.; Rhyner, J. Sustainable digital transformation of disaster risk—Integrating new types of digital social vulnerability and interdependencies with critical infrastructure. Sustainability 2020, 12, 9324. [Google Scholar] [CrossRef]
  33. Abdulhamid, N.G.; Ayoung, D.A.; Kashefi, A.; Sigweni, B. A survey of social media use in emergency situations: A literature review. Inf. Dev. 2021, 37, 274–291. [Google Scholar] [CrossRef]
  34. Liu, W.; Xu, W.; John, B. Organizational disaster communication ecology: Examining interagency coordination on social media during the onset of the COVID-19 pandemic. Am. Behav. Sci. 2021, 65, 914–933. [Google Scholar] [CrossRef]
  35. Aerts, J.C.; Botzen, W.J.; Clarke, K.C.; Cutter, S.L.; Hall, J.W.; Merz, B.; Michel-Kerjan, E.; Mysiak, J.; Surminski, S.; Kunreuther, H. Integrating human behaviour dynamics into flood disaster risk assessment. Nat. Clim. Change 2018, 8, 193–199. [Google Scholar] [CrossRef]
  36. Walkling, B.; Haworth, B.T. Flood risk perceptions and coping capacities among the retired population, with implications for risk communication: A study of residents in a north Wales coastal town, UK. Int. J. Disaster Risk Reduct. 2020, 51, 101793. [Google Scholar] [CrossRef] [PubMed]
  37. Osberghaus, D.; Finkel, E.; Pohl, M. Individual Adaptation to Climate Change: The Role of Information and Perceived Risk; Discussion Paper, No. 10-061; ZEW—Centre for European Economic Research: Mannheim, Germany, 2010. [Google Scholar] [CrossRef]
  38. Maggi, E.; Vallino, E. Understanding urban mobility and the impact of public policies: The role of the agent-based models. Res. Transp. Econ. 2016, 55, 50–59. [Google Scholar] [CrossRef]
  39. Senanayake, G.P.; Kieu, M.; Zou, Y.; Dirks, K. Agent-based simulation for pedestrian evacuation: A systematic literature review. Int. J. Disaster Risk Reduct. 2024, 111, 104705. [Google Scholar] [CrossRef]
  40. Shibuya, K. A framework of multi-agent-based modeling, simulation, and computational assistance in an ubiquitous environment. Simulation 2004, 80, 367–380. [Google Scholar] [CrossRef]
  41. Serena, L.; Marzolla, M.; D’Angelo, G.; Ferretti, S. A review of multilevel modeling and simulation for human mobility and behavior. Simul. Model. Pract. Theory 2023, 127, 102780. [Google Scholar] [CrossRef]
  42. Tian, Y.; Zhou, T.S.; Yao, Q.; Zhang, M.; Li, J.S. Use of an agent-based simulation model to evaluate a mobile-based system for supporting emergency evacuation decision making. J. Med. Syst. 2014, 38, 149. [Google Scholar] [CrossRef]
  43. Yi, W.; Özdamar, L. A dynamic logistics coordination model for evacuation and support in disaster response activities. Eur. J. Oper. Res. 2007, 179, 1177–1193. [Google Scholar] [CrossRef]
  44. Savage, D.A. Towards a complex model of disaster behaviour. Disasters 2019, 43, 771–798. [Google Scholar] [CrossRef]
  45. Haraguchi, M.; Nishino, A.; Kodaka, A.; Allaire, M.; Lall, U.; Kuei-Hsien, L.; Onda, K.; Tsubouchi, K.; Kohtake, N. Human mobility data and analysis for urban resilience: A systematic review. Environ. Plan. B Urban Anal. City Sci. 2022, 49, 1507–1535. [Google Scholar] [CrossRef]
  46. Bellomo, N.; Clarke, D.; Gibelli, L.; Townsend, P.; Vreugdenhil, B. Human behaviours in evacuation crowd dynamics: From modelling to “big data” toward crisis management. Phys. Life Rev. 2016, 18, 1–21. [Google Scholar] [CrossRef]
  47. Kim, J.; Hastak, M. Social network analysis: Characteristics of online social networks after a disaster. Int. J. Inf. Manag. 2018, 38, 86–96. [Google Scholar] [CrossRef]
  48. Li, T.; Xie, N.; Zeng, C.; Zhou, W.; Zheng, L.; Jiang, Y.; Yang, Y.; Ha, H.Y.; Xue, W.; Huang, Y.; et al. Data-driven techniques in disaster information management. ACM Comput. Surv. (CSUR) 2017, 50, 1–45. [Google Scholar] [CrossRef]
  49. DeAngelis, D.L.; Diaz, S.G. Decision-making in agent-based modeling: A current review and future prospectus. Front. Ecol. Evol. 2019, 6, 237. [Google Scholar] [CrossRef]
  50. Du, E.; Cai, X.; Sun, Z.; Minsker, B. Exploring the role of social media and individual behaviors in flood evacuation processes: An agent-based modeling approach. Water Resour. Res. 2017, 53, 9164–9180. [Google Scholar] [CrossRef]
  51. Boppiniti, S.T. Real-Time Data Analytics with AI: Leveraging Stream Processing for Dynamic Decision Support. Int. J. Manag. Educ. Sustain. Dev. 2021, 4, 746. [Google Scholar] [CrossRef]
  52. Sakaguchi, Y.; Tanaka, M.; Inoue, Y. Adaptive intermittent control: A computational model explaining motor intermittency observed in human behavior. Neural Netw. 2015, 67, 92–109. [Google Scholar] [CrossRef]
  53. Zhai, L.; Lee, J.E. Analyzing the disaster preparedness capability of local government using AHP: Zhengzhou 7.20 rainstorm disaster. Int. J. Environ. Res. Public Health 2023, 20, 952. [Google Scholar] [CrossRef]
  54. Eloundou, T.; Beutel, A.; Robinson, D.G.; Gu-Lemberg, K.; Brakman, A.L.; Mishkin, P.; Shah, M.; Heidecke, J.; Weng, L.; Kalai, A.T. First-Person Fairness in Chatbots. arXiv 2024, arXiv:2410.19803. [Google Scholar] [CrossRef]
  55. Kirchner, J.H.; Chen, Y.; Edwards, H.; Leike, J.; McAleese, N.; Burda, Y. Prover-Verifier Games improve legibility of LLM outputs. arXiv 2024, arXiv:2407.13692. [Google Scholar] [CrossRef]
  56. Ye, Z.; Agarwal, R.; Liu, T.; Joshi, R.; Velury, S.; Le, Q.V.; Tan, Q.; Liu, Y. Evolving Alignment via Asymmetric Self-Play. arXiv 2024, arXiv:2411.00062. [Google Scholar] [CrossRef]
  57. Ruoss, A.; Delétang, G.; Medapati, S.; Grau-Moya, J.; Wenliang, L.K.; Catt, E.; Reid, J.; Lewis, C.A.; Veness, J.; Genewein, T. Amortized Planning with Large-Scale Transformers: A Case Study on Chess. arXiv 2024, arXiv:2402.04494. [Google Scholar] [CrossRef]
  58. Niu, Q.; Liu, J.; Bi, Z.; Feng, P.; Peng, B.; Chen, K.; Li, M.; Yan, L.K.; Zhang, Y.; Yin, C.H.; et al. Large Language Models and Cognitive Science: A Comprehensive Review of Similarities, Differences, and Challenges. arXiv 2024, arXiv:2409.02387. [Google Scholar] [CrossRef]
  59. Xi, Z.; Chen, W.; Guo, X.; He, W.; Ding, Y.; Hong, B.; Zhang, M.; Wang, J.; Jin, S.; Zhou, E.; et al. The rise and potential of large language model based agents: A survey. arXiv 2023, arXiv:2309.07864. [Google Scholar] [CrossRef]
  60. Zhang, Z.; Rossi, R.A.; Kveton, B.; Shao, Y.; Yang, D.; Zamani, H.; Dernoncourt, F.; Barrow, J.; Yu, T.; Kim, S.; et al. Personalization of Large Language Models: A Survey. arXiv 2024, arXiv:2411.00027. [Google Scholar] [CrossRef]
  61. Kamruzzaman, M.; Kim, G.L. Prompting Techniques for Reducing Social Bias in LLMs through System 1 and System 2 Cognitive Processes. arXiv 2024, arXiv:2404.17218. [Google Scholar] [CrossRef]
  62. Chu, Z.; Chen, J.; Chen, Q.; Yu, W.; He, T.; Wang, H.; Peng, W.; Liu, M.; Qin, B.; Liu, T. Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future. arXiv 2024, arXiv:2309.15402. [Google Scholar] [CrossRef]
  63. Zhang, Y.; Mao, S.; Ge, T.; Wang, X.; de Wynter, A.; Xia, Y.; Wu, W.; Song, T.; Lan, M.; Wei, F. LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models. arXiv 2024, arXiv:2404.01230. [Google Scholar] [CrossRef]
  64. Park, J.S.; O’Brien, J.; Cai, C.J.; Morris, M.R.; Liang, P.; Bernstein, M.S. Generative agents: Interactive simulacra of human behavior. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, San Francisco, CA, USA, 29 October–1 November 2023; pp. 1–22. [Google Scholar] [CrossRef]
  65. Gürcan, Ö. LLM-Augmented Agent-Based Modelling for Social Simulations: Challenges and Opportunities. HHAI 2024: Hybrid Human AI Systems for the Social Good; IOS Press: Amsterdam, The Netherlands, 2024; pp. 134–144. [Google Scholar]
  66. Li, Y.; Yu, Y.; Li, H.; Chen, Z.; Khashanah, K. TradingGPT: Multi-Agent System with Layered Memory and Distinct Characters for Enhanced Financial Trading Performance. arXiv 2023, arXiv:2309.03736. [Google Scholar] [CrossRef]
  67. Merritt, H.; Severino, G.J.; Izquierdo, E.J. The dynamics of social interaction among evolved model agents. Artif. Life 2024, 30, 216–239. [Google Scholar] [CrossRef]
  68. Wang, Q.; Li, W.; Mohajer, A. Load-aware continuous-time optimization for multi-agent systems: Toward dynamic resource allocation and real-time adaptability. Comput. Netw. 2024, 250, 110526. [Google Scholar] [CrossRef]
  69. Gao, C.; Lan, X.; Lu, Z.; Mao, J.; Piao, J.; Wang, H.; Jin, D.; Li, Y. S3: Social-network Simulation System with Large Language Model-Empowered Agents. arXiv 2023, arXiv:2307.14984. [Google Scholar] [CrossRef]
  70. Leng, Y.; Yuan, Y. Do LLM Agents Exhibit Social Behavior? arXiv 2024, arXiv:2312.15198. [Google Scholar] [CrossRef]
  71. Guo, X.; Huang, K.; Liu, J.; Fan, W.; Vélez, N.; Wu, Q.; Wang, H.; Griffiths, T.L.; Wang, M. Embodied LLM Agents Learn to Cooperate in Organized Teams. arXiv 2024, arXiv:2403.12482. [Google Scholar] [CrossRef]
  72. Huang, J.-t.; Lam, M.H.; Li, E.J.; Ren, S.; Wang, W.; Jiao, W.; Tu, Z.; Lyu, M.R. Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench. arXiv 2024, arXiv:2308.03656. [Google Scholar] [CrossRef]
  73. Gao, C.; Lan, X.; Li, N.; Yuan, Y.; Ding, J.; Zhou, Z.; Xu, F.; Li, Y. Large language models empowered agent-based modeling and simulation: A survey and perspectives. Humanit. Soc. Sci. Commun. 2024, 11, 1259. [Google Scholar] [CrossRef]
  74. Chiu, Y.Y.; Sharma, A.; Lin, I.W.; Althoff, T. A Computational Framework for Behavioral Assessment of LLM Therapists. arXiv 2024, arXiv:2401.00820. [Google Scholar] [CrossRef]
  75. Yan, B.; Mai, F.; Wu, C.; Chen, R.; Li, X. A computational framework for understanding firm communication during disasters. Inf. Syst. Res. 2024, 35, 590–608. [Google Scholar] [CrossRef]
  76. Ma, Q.; Xue, X.; Zhou, D.; Yu, X.; Liu, D.; Zhang, X.; Zhao, Z.; Shen, Y.; Ji, P.; Li, J.; et al. Computational Experiments Meet Large Language Model Based Agents: A Survey and Perspective. arXiv 2024, arXiv:2402.00262. [Google Scholar] [CrossRef]
  77. Zhou, L.; Wu, X.; Xu, Z.; Fujita, H. Emergency decision making for natural disasters: An overview. Int. J. Disaster Risk Reduct. 2018, 27, 567–576. [Google Scholar] [CrossRef]
  78. Sarker, I.H. Data science and analytics: An overview from data-driven smart computing, decision-making and applications perspective. SN Comput. Sci. 2021, 2, 377. [Google Scholar] [CrossRef]
  79. Forkosh, O.; Karamihalev, S.; Roeh, S.; Alon, U.; Anpilov, S.; Touma, C.; Nussbaumer, M.; Flachskamm, C.; Kaplick, P.M.; Shemesh, Y.; et al. Identity domains capture individual differences from across the behavioral repertoire. Nat. Neurosci. 2019, 22, 2023–2028. [Google Scholar] [CrossRef] [PubMed]
  80. Azzopardi, L. Cognitive biases in search: A review and reflection of cognitive biases in Information Retrieval. In Proceedings of the ACM SIGIR Conference on Human Information Interaction and Retrieval, Canberra, ACT, Australia, 14–19 March 2021; pp. 27–37. [Google Scholar] [CrossRef]
  81. Bonanno, G.A.; Brewin, C.R.; Kaniasty, K.; Greca, A.M.L. Weighing the costs of disaster: Consequences, risks, and resilience in individuals, families, and communities. Psychol. Sci. Public Interest 2010, 11, 1–49. [Google Scholar] [CrossRef] [PubMed]
  82. Podolny, J.M.; Baron, J.N. Resources and relationships: Social networks and mobility in the workplace. Am. Sociol. Rev. 1997, 62, 673–693. [Google Scholar]
  83. Shelke, S.; Attar, V. Source detection of rumor in social network—A review. Online Soc. Netw. Media 2019, 9, 30–42. [Google Scholar]
  84. Oshio, A.; Taku, K.; Hirano, M.; Saeed, G. Resilience and Big Five personality traits: A meta-analysis. Personal. Individ. Differ. 2018, 127, 54–60. [Google Scholar]
  85. Wachinger, G.; Renn, O.; Begg, C.; Kuhlicke, C. The risk perception paradox—Implications for governance and communication of natural hazards. Risk Anal. 2013, 33, 1049–1065. [Google Scholar] [CrossRef]
  86. Lindsay, B.R. Social Media and Disasters: Current Uses, Future Options, and Policy Considerations. 2011. Available online: https://mirror.explodie.org/CRS-Report-SocialMediaDisasters-Lindsay-SEP2011.pdf (accessed on 6 November 2024).
  87. Meechang, K.; Leelawat, N.; Tang, J.; Kodaka, A.; Chintanapakdee, C. The acceptance of using information technology for disaster risk management: A systematic review. Eng. J. 2020, 24, 111–132. [Google Scholar]
  88. Raikes, J.; Smith, T.F.; Jacobson, C.; Baldwin, C. Pre-disaster planning and preparedness for floods and droughts: A systematic review. Int. J. Disaster Risk Reduct. 2019, 38, 101207. [Google Scholar] [CrossRef]
  89. Lai, B.S.; La Greca, A.M.; Brincks, A.; Colgan, C.A.; D’Amico, M.P.; Lowe, S.; Kelley, M.L. Trajectories of Posttraumatic Stress in Youths After Natural Disasters. JAMA Netw. Open 2021, 4, e2036682. [Google Scholar] [CrossRef]
  90. Sutton, J.; Spiro, E.S.; Johnson, B.; Fitzhugh, S.; Gibson, B.; Butts, C.T. Warning tweets: Serial transmission of messages during the warning phase of a disaster event. Inf. Commun. Soc. 2014, 17, 765–787. [Google Scholar] [CrossRef]
  91. Hayles, C.S. An examination of decision making in post disaster housing reconstruction. Int. J. Disaster Resil. Built Environ. 2010, 1, 103–122. [Google Scholar]
  92. The State Council of the People’s Republic of China. Summary of the Henan Provincial Flooding in 2021: Analysis and Response Measures. 2022. Available online: https://www.gov.cn/xinwen/2022-01/21/content_5669723.htm (accessed on 7 November 2024).
Figure 1. Overview of the multi-stage disaster response simulation and experimentation framework.
Figure 1. Overview of the multi-stage disaster response simulation and experimentation framework.
Systems 13 00240 g001
Figure 2. Construction of the social–cognitive LLM Agents framework. The framework links theoretical constructs to agent-based modeling, enabling the simulation of perception, cognition, and decision-making under evolving disaster conditions.
Figure 2. Construction of the social–cognitive LLM Agents framework. The framework links theoretical constructs to agent-based modeling, enabling the simulation of perception, cognition, and decision-making under evolving disaster conditions.
Systems 13 00240 g002
Figure 3. Experimental design and implementation of the multimodal social–cognitive simulation framework.
Figure 3. Experimental design and implementation of the multimodal social–cognitive simulation framework.
Systems 13 00240 g003
Figure 4. Prompt engineering strategy of the theory-integrated cognitive processing pipeline.
Figure 4. Prompt engineering strategy of the theory-integrated cognitive processing pipeline.
Systems 13 00240 g004
Figure 5. Multi-source information ecosystem in the early stages of the July 2021 Henan Flood.
Figure 5. Multi-source information ecosystem in the early stages of the July 2021 Henan Flood.
Systems 13 00240 g005
Figure 6. Decision-making patterns: choice distribution and confidence scores.
Figure 6. Decision-making patterns: choice distribution and confidence scores.
Systems 13 00240 g006
Figure 7. Demographic variations in decision-making: age, occupation, and gender.
Figure 7. Demographic variations in decision-making: age, occupation, and gender.
Systems 13 00240 g007
Figure 8. Impact of information stance on decision-making and confidence.
Figure 8. Impact of information stance on decision-making and confidence.
Systems 13 00240 g008
Table 1. Key components of resident LLM Agent profiles.
Table 1. Key components of resident LLM Agent profiles.
DimensionCore ElementsImpact on Agent’s Behavior and Decision-Making
DemographicsAge, gender, occupation, income, family structureProvides baseline attributes influencing initial preferences, communication channels, and evacuation options. These conditions shape initial risk appraisal and resource availability.
Personality TraitsBig Five traits [84]: Extraversion, Openness, Conscientiousness, Agreeableness, NeuroticismDetermines cognitive depth, openness to new information, and social conformity. Influences how quickly agents update beliefs and adopt protective measures.
Risk AttitudesGeneral risk perception, disaster experience, focus on specific risks [85]Modulates threat sensitivity, influencing preparedness and information-seeking behavior. Alters the threshold for initiating protective actions.
Social NetworksLiving/working environment, info sources, social media usage [86]Shapes information flow, trust in sources, and susceptibility to misinformation. Drives the emergent collective patterns in perception and response.
Table 2. Components and processes of the multi-module framework.
Table 2. Components and processes of the multi-module framework.
Core ModuleSub-ProcessesDesign and Implementation
Information PerceptionDisaster Information ConstructionAgents collect and process data from diverse sources (e.g., official warnings, media outlets, and social networks), evaluating the credibility and relevance of each. Prompts to the LLM incorporate the agent’s background, shaping initial risk comprehension.
Role PlayingAgents simulate different roles (e.g., community leaders and individual households) to assess how varied social positions affect message interpretation and trust.
Source Credibility AnalysisAgents weigh the reliability of each information source, applying heuristic rules to filter out noise and resolve conflicting signals.
Cognitive ProcessingThreat Perception AssessmentGuided by perceived credibility and situational cues, agents form threat assessments and invoke protection motivation to evaluate severity and vulnerability.
Coping Ability AnalysisAgents appraise internal and external resources (e.g., skills and community support), under the influence of bounded rationality and social norms, to determine feasible protective actions.
Attitude FormationIntegrating threat and coping assessments leads to either stable or shifting attitudes toward risk mitigation measures, in accordance with the theory of planned behavior.
Decision-MakingOption EvaluationAgents compare alternative courses of action (e.g., evacuation, resource stockpiling, and ignoring warnings) based on perceived effectiveness, cost, and time constraints.
Decision SelectionAgents select the most viable action under uncertainty, reflecting heuristics and potential biases.
Action ExecutionFinal decisions are implemented, and the simulation tracks outcomes (success or failure), feedback loops, and subsequent adjustments in attitudes.
Table 3. Interaction between theories and modules.
Table 3. Interaction between theories and modules.
TheoryInformation PerceptionCognitive ProcessingDecision-Making
Risk PerceptionSubjective interpretation of diverse risk inputsPreliminary threat appraisalReinforces outcome expectations
Protection MotivationThreat versus coping appraisalGuides the selection of protective strategies
Social InfluenceFiltering cues from peer networksNorm-driven attitude formationPeer pressure influencing the final choice
Bounded RationalityHeuristic filtering of informationSimplified cognitive evaluationsSelection of satisficing (rather than optimal) actions
Heuristics/BiasesShortcut credibility judgmentsBiased interpretation of threatDeviations from purely rational action
Planned BehaviorFormation of behavioral intentionTranslating intentions into concrete actions
Table 4. Refined experimental design pipeline.
Table 4. Refined experimental design pipeline.
PhaseKey ActionsDetails and Analysis Methods
Pre-DisasterAgent InitializationAssign demographic, personality, risk attitudes, and social network parameters. Introduce early warnings and heterogeneous-quality information. Conduct initial credibility assessments. Statistical sampling ensures a diverse agent population.
During-DisasterReal-time Decision-MakingAgents receive rapidly updating information. LLM-based prompts incorporate uncertain cues, conflicting advice, and evolving social norms. Track decision latency, selected actions, and resource usage. Apply sensitivity analysis to assess the robustness of outcomes.
Post-DisasterLong-Term AdjustmentsAgents revise attitudes and preparedness plans based on actual disaster outcomes. Evaluate how prior successes or failures shape new coping strategies. Use comparative metrics (e.g., the proportion of agents that adopt more cautious behavior) to measure adaptation.
Table 5. Example scenario parameter configuration.
Table 5. Example scenario parameter configuration.
ParameterValue/RangeDescription
Disaster IntensityLow/Moderate/HighDefines initial hazard severity; updated as simulation progresses (e.g., rainfall intensity increments).
Resource Availability0.5–1.0 (normalized)Relative measure indicating the proportion of critical supplies (water, medicine) available. Adjusted over time based on agent actions.
Social Network Density0.1–0.5Determines how interconnected agents are. Higher density implies more rapid information spread. Adjusted to simulate community shifts.
Information Credibility Threshold0.6Sets a minimum credibility score for agents to accept information without skepticism. Lowering it increases agent susceptibility to misinformation.
Table 6. Comparison of demographic distributions between real-world sample (N = 1500) and simulated agents (N = 300).
Table 6. Comparison of demographic distributions between real-world sample (N = 1500) and simulated agents (N = 300).
ItemCategoryReal (%)Sim. (%)
GenderMale48.1348.0
Female51.8752.0
Age≤184.104.0
18–4565.0065.0
45–6021.7021.7
≥609.209.3
Income (CNY/mo)≤100012.012.0
1000–500023.623.7
5000–10,00027.827.7
10,000–20,00020.120.0
≥20,00018.018.0
EducationBelow HS11.411.3
HS20.620.7
Univ.35.936.0
Masters+33.332.0
Table 7. Comparative analysis of real-world and simulated data on disaster risk perception.
Table 7. Comparative analysis of real-world and simulated data on disaster risk perception.
Disaster Risk Perception DimensionsReal World (N = 1500)Sim. (N = 300)
MinMaxMeanS.D.MinMaxMeanS.D.Diff.p -Value
Disaster Familiarity153.300.950153.420.735+0.120.015 *
Perceived Disaster Severity153.320.968153.460.695+0.140.003 *
Perceived Disaster Likelihood153.690.841153.800.590+0.110.006 *
Self-Assessed Manageability153.231.233153.100.983−0.130.046 *
Note: (1) Diff . = M Sim M Real . (2) * p < 0.05 .
Table 8. Decision-Making Options for Disaster Preparedness.
Table 8. Decision-Making Options for Disaster Preparedness.
OptionDescription
Option A. Maintain previously planned disaster prevention measures.Continue existing preparations despite signals of reduced risk, ensuring readiness if the situation worsens.
Option B. Reduce preparation efforts due to perceived lower risk.Scale back disaster prevention to conserve resources based on forecasts indicating decreased threat levels.
Option C. Suspend all preparations based on unverified social media updates.Halt preparations entirely in response to unofficial claims that the flood risk has subsided, risking under-preparedness if those claims prove inaccurate.
Option D. Defer decision until more reliable official information is available.Delay action to avoid premature resource allocation or unnecessary panic, pending authoritative confirmation of threat severity.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhao, X.; Wang, H.; Dai, C.; Tang, J.; Deng, K.; Zhong, Z.; Kong, F.; Wang, S.; Morikawa, S. Multi-Stage Simulation of Residents’ Disaster Risk Perception and Decision-Making Behavior: An Exploratory Study on Large Language Model-Driven Social–Cognitive Agent Framework. Systems 2025, 13, 240. https://doi.org/10.3390/systems13040240

AMA Style

Zhao X, Wang H, Dai C, Tang J, Deng K, Zhong Z, Kong F, Wang S, Morikawa S. Multi-Stage Simulation of Residents’ Disaster Risk Perception and Decision-Making Behavior: An Exploratory Study on Large Language Model-Driven Social–Cognitive Agent Framework. Systems. 2025; 13(4):240. https://doi.org/10.3390/systems13040240

Chicago/Turabian Style

Zhao, Xinjie, Hao Wang, Chengxiao Dai, Jiacheng Tang, Kaixin Deng, Zhihua Zhong, Fanying Kong, Shiyun Wang, and So Morikawa. 2025. "Multi-Stage Simulation of Residents’ Disaster Risk Perception and Decision-Making Behavior: An Exploratory Study on Large Language Model-Driven Social–Cognitive Agent Framework" Systems 13, no. 4: 240. https://doi.org/10.3390/systems13040240

APA Style

Zhao, X., Wang, H., Dai, C., Tang, J., Deng, K., Zhong, Z., Kong, F., Wang, S., & Morikawa, S. (2025). Multi-Stage Simulation of Residents’ Disaster Risk Perception and Decision-Making Behavior: An Exploratory Study on Large Language Model-Driven Social–Cognitive Agent Framework. Systems, 13(4), 240. https://doi.org/10.3390/systems13040240

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop