AI Narrative Modeling: How Machines’ Intelligence Reproduces Archetypal Storytelling
Abstract
:1. Introduction
- The Hero—A protagonist who embarks on a transformative journey;
- The Wise Old Man—A mentor figure offering guidance and wisdom;
- The Shadow—The darker aspects of the self, often represented as an antagonist;
- The Anima/Animus—The inner feminine (Anima) in men and inner masculine (Animus) in women, representing psychological balance;
- The Trickster—A character who disrupts order, introducing chaos and transformation;
- The Everyman—A relatable, ordinary character representing common human experiences.
- To what extent do AI-generated stories naturally replicate archetypal patterns?
- Can AI capture the depth of human emotions, internal conflicts, and transformative character arcs associated with these archetypes?
- How do computational methods compare to expert literary evaluations in assessing AI’s storytelling capabilities?
- Proposing a hybrid evaluation framework combining NLP techniques with expert analysis;
- Conducting a cross-archetypal assessment of AI narratives involving six core Jungian figures;
- Evaluating psychological depth beyond structural coherence;
- Bridging computational linguistics and depth psychology to explore potential “machine intuition”;
- Outlining practical applications for AI-generated archetypal narratives in education, creative writing, and interactive media.
2. Materials and Methods
2.1. Research Framework
- Jungian theory is combined with current LLM research to form the conceptual basis;
- Narrative data are collected using carefully designed prompts to elicit archetypal structures without directly referencing Jung;
- Both NLP techniques and expert evaluations are applied to analyze linguistic and thematic patterns;
- AI and human-authored texts are compared to assess similarities and differences in archetypal expression.
2.2. Data Collection and Corpus
- 1.
- AI-generated narrative dataset
- Each AI model was prompted with a structured archetype-specific query (e.g., “Write a short story featuring a Hero who undergoes a transformative journey”). Prompts were refined to ensure alignment with archetypal themes while avoiding excessive specificity that might bias the model’s output.
- Multiple variations of prompts were used per archetype to encourage diverse story structures and settings. AI-generated outputs were manually reviewed to ensure linguistic coherence, structural consistency, and adherence to the expected archetypal pattern.
- After the quality assessment of initial AI-generated texts based on coherence, narrative structure, and relevance to the target archetype, the final dataset was reduced.
- 2.
- Human-written reference corpus
- Works were chosen based on clear alignment with Jungian archetypal structures, ensuring a representative dataset.
- Texts span diverse cultural and historical sources, including mythological, literary, and modern storytelling traditions. Human Corpus Sources were classic literary and mythological texts (e.g., The Odyssey, Beowulf, Faust, The Epic of Gilgamesh), modern fiction and film scripts (e.g., The Hero’s Journey in Star Wars, The Dark Knight, and The Matrix), and psychological and narrative theory texts discussing archetypes in storytelling (e.g., Joseph Campbell’s The Hero with a Thousand Faces).
- 3.
- Ensuring comparative consistency
- Word count normalization, for which texts were matched in length (~500–1000 words per narrative);
- Genre consistency, for which stories were categorized by literary genre, tone, and structure to avoid confounding variables;
- Controlled thematic prompts, for which AI-generated stories were crafted using prompts designed to reflect the essence of the archetypes found in human literature.
- 4.
- LLM Generation parameters and rationale
- Temperature (0.7)—This moderate value encourages creative variation in language and structure while avoiding incoherent or overly random outputs that can result from higher temperatures (>0.9).
- Top_p (0.9)—Used in nucleus sampling, this setting ensures that the model chooses from the top 90% of probable next-word candidates, maintaining diversity in output without drifting too far from logical or contextually relevant continuations.
- Frequency_penalty (0.0)—This value prevents penalizing repeated words or phrases, as repetition is sometimes necessary for rhetorical emphasis or structural cohesion in narrative storytelling.
- Presence_penalty (0.6)—This encourages the model to introduce new topics and elements by gently discouraging repetition of previously used tokens, which helps generate more dynamic and varied narrative content.
- Max_tokens (1024)—This length constraint ensures that each generated story is substantive enough (roughly 700–800 words) to contain narrative development, character evolution, and thematic expression, while keeping outputs manageable for analysis.
- Stop sequences (None applied)—None were applied to allow for natural narrative closure without artificial truncation, giving the model freedom to complete stories in a more human-like way.
2.3. Experimental Design for Archetypal Pattern Identification in AI-Generated Narratives
- Selecting archetypes: The Hero and The Wise Old Man.
- Prompt the LLM with a theme or topic that is likely to evoke archetypal content, such as “the hero’s journey” or “the wise old man storytelling”.
- Generate multiple text samples of varying lengths and styles.
2.4. Framework for Validation of Expert Evaluation and Computational Methods
- Computational NLP-based analysis: Metrics such as Cosine Similarity, Sentiment Analysis, Term Frequency-Inverse Document Frequency (TF-IDF) Feature Weighting, and Latent Dirichlet Allocation (LDA) Topic Modeling were used to evaluate AI narratives in terms of structural similarity, sentiment polarity, thematic coherence, and lexical variance.
- Expert human evaluation: A panel of 15 experts assessed AI-generated narratives on a 10-point scale, scoring them based on narrative coherence, emotional depth, character development, thematic complexity, and creativity/originality.
2.4.1. Computational NLP-Based Analysis
- Cosine Similarity Analysis.
- 2.
- Sentiment Analysis.
- 3.
- TF-IDF Feature Weighting.
- 4.
- LDA Topic Modeling.
2.4.2. Expert Evaluation
- Literary experts (5 participants)—scholars and researchers in literature, comparative mythology, and storytelling structures.
- Creative writing professionals (4 participants)—novelists, scriptwriters, and game writers specializing in character development and thematic complexity.
- Psychologists and cognitive scientists (3 participants)—experts in cognitive psychology, Jungian analysis, and human emotional response to narratives.
- AI and computational linguistics researchers (3 participants)—specialists in natural language processing (NLP) and AI-generated text evaluation.
2.5. Validation and Bias Considerations in AI-Generated Archetypal Analysis
- Cosine similarity effectively measures the structural resemblance between AI-generated texts and human-written stories, but it fails to assess deeper narrative meaning, symbolic richness, or character transformation.
- Sentiment analysis can capture basic emotional polarity, yet it struggles with subtle psychological depth, irony, or the multi-layered emotions found in archetypal storytelling.
- TF-IDF feature weighting is useful for identifying key lexical patterns, but it lacks the contextual sensitivity needed to recognize metaphorical language or thematic continuity.
- LDA topic modeling effectively detects dominant thematic elements in AI-generated narratives but fails to account for implicit meanings, archetypal symbolism, or narrative subtext.
- Cosine similarity was used to assess whether AI-generated narratives maintain structural coherence and align with archetypal storytelling conventions found in human-written texts.
- Sentiment analysis was applied to quantify emotional expression, offering insights into how AI replicates archetypal emotional dynamics, such as the Hero’s internal struggles or the Wise Mentor’s guidance.
- TF-IDF feature weighting helped identify unique lexical markers associated with different archetypes, allowing researchers to distinguish linguistic variations in AI-generated narratives.
- LDA topic modeling was employed to analyze the thematic structure of AI-generated texts, evaluating whether AI correctly represents Jungian archetypes through dominant thematic elements.
3. Results and Discussion
3.1. Overview of Experimental Approach and Key Findings
- AI demonstrated significant variance in performance across different archetypes. Structured, goal-oriented archetypes (Hero and Wise Old Man) consistently received higher scores in both computational and expert evaluations, with cosine similarity scores of 0.81 and 0.74, respectively. In contrast, psychologically complex archetypes (Shadow and Trickster) showed lower performance, with cosine similarity scores of 0.62 and 0.47, indicating AI’s difficulty with narratives involving ambiguity, moral complexity, and unpredictability.
- A clear divergence emerged between AI’s technical proficiency and its emotional/creative capabilities. Both computational and expert evaluations confirmed that AI-generated narratives maintain strong structural coherence and thematic alignment but exhibit reduced emotional range and creative originality. This pattern was particularly evident in sentiment analysis, where AI consistently demonstrated lower emotional variance than human-authored texts, especially in representing negative emotional states.
- Computational and expert evaluations showed strong agreement in assessing narrative structure and thematic alignment but diverged significantly in evaluating emotional depth and creativity. This suggests that current computational metrics effectively capture technical aspects of storytelling but struggle to quantify the more subjective, psychological dimensions that human experts readily identify.
3.2. Comparative Analysis of Computational and Expert Evaluations for AI-Generated Jungian Archetypal Narratives
- Computational NLP-based methods, which provided objective measurements of textual structure, sentiment, and thematic consistency.
- Expert human evaluations, where literary analysts, psychologists, and AI researchers assessed the depth and quality of AI-generated storytelling.
3.2.1. Comparative Evaluation Across Archetypes
3.2.2. Implications for AI-Generated Storytelling
- AI-generated narratives are structurally sound but emotionally and creatively limited. NLP-based metrics confirm strong alignment with human storytelling structures, but experts highlight a lack of deeper psychological engagement.
- Certain archetypes (Wise Old Man, Everyman) are easier for AI to replicate, while others (Trickster, Shadow) require deeper narrative modeling. AI can handle rational, structured storytelling, but humorous and emotionally complex narratives remain challenging.
- Future AI storytelling models should integrate hybrid evaluation frameworks, combining NLP techniques with cognitive emotion modeling and interactive human feedback.
- Enhancing AI’s ability to model psychological complexity in character development.
- Incorporating real-time human feedback to refine AI-generated stories dynamically.
- Developing cognitive emotion models that improve AI’s capacity for nuanced storytelling.
3.2.3. Analysis of Computational and Expert Evaluations for AI Storytelling
3.3. Comparative Analysis of AI-Based Texts and Human Texts
- AI generates narratives that structurally resemble human-authored texts (cosine similarity);
- AI prefers positive, resolution-oriented storytelling, avoiding deep psychological struggle (sentiment heatmap);
- AI relies on formulaic storytelling and lacks psychological nuance (word cloud and TF-IDF);
- AI-generated stories lack thematic variation and unpredictability, particularly in complex archetypes (KL-Divergence plot).
3.4. Possible Areas of Application of the Study
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Trichopoulos, G.; Alexandridis, G.; Caridakis, G. A Survey on Computational and Emergent Digital Storytelling. Heritage 2023, 6, 1227–1263. [Google Scholar] [CrossRef]
- Runco, M.A. AI can only produce artificial creativity. J. Creat. 2023, 33, 100063. [Google Scholar] [CrossRef]
- Jung, C.G. The Archetypes and the Collective Unconscious; Princeton University Press: Princeton, NJ, USA, 1967. [Google Scholar]
- Jung, C.G. Symbols of Transformation; Princeton University Press: Princeton, NJ, USA, 1956. [Google Scholar]
- Kim, J.; Heo, Y.; Yu, H.; Nang, J. A Multi-Modal Story Generation Framework with AI-Driven Storyline Guidance. Electronics 2023, 12, 1289. [Google Scholar] [CrossRef]
- Ren, Y.; Jin, R.; Zhang, T.; Xiong, D. Do Large Language Models Mirror Cognitive Language Processing? arXiv 2024, arXiv:2402.18023. [Google Scholar] [CrossRef]
- Placani, A. Anthropomorphism in AI: Hype and fallacy. AI Ethics 2024, 4, 691–698. [Google Scholar] [CrossRef]
- Messner, W. Improving the cross-cultural functioning of deep artificial neural networks through machine enculturation. Int. J. Inf. Manag. Data Insights 2022, 2, 100118. [Google Scholar] [CrossRef]
- Wang, Q.; Li, H. On Continually Tracing Origins of LLM-Generated Text and Its Application in Detecting Cheating in Student Coursework. Big Data Cogn. Comput. 2025, 9, 50. [Google Scholar] [CrossRef]
- Ji, Z.; Lee, N.; Frieske, R.; Yu, T.; Su, D.; Xu, Y.; Ishii, E.; Bang, Y.J.; Madotto, A.; Fung, P. Survey of Hallucination in Natural Language Generation. ACM Comput. Surv. 2024, 55, 1–38. [Google Scholar] [CrossRef]
- Biswas, P.; Pramanik, S.; Giri, B.C. Cosine Similarity Measure Based Multiattribute Decision-making with Trapezoidal Fuzzy Neutrosophic Numbers. Neutrosophic Sets Syst. 2014, 8, 46–56. Available online: https://fs.unm.edu/NSS/CosineSimilarityMeasureBasedMultiAttribute.pdf (accessed on 18 March 2025).
- Taherdoost, H.; Madanchian, M. Artificial Intelligence and Sentiment Analysis: A Review in Competitive Research. Computers 2023, 12, 37. [Google Scholar] [CrossRef]
- Alshehri, A.; Algarni, A. TF-TDA: A Novel Supervised Term Weighting Scheme for Sentiment Analysis. Electronics 2023, 12, 1632. [Google Scholar] [CrossRef]
- Farkhod, A.; Abdusalomov, A.; Makhmudov, F.; Cho, Y.I. LDA-Based Topic Modeling Sentiment Analysis Using Topic/Document/Sentence (TDS) Model. Appl. Sci. 2021, 11, 11091. [Google Scholar] [CrossRef]
Archetype | Sample Topic Keywords |
---|---|
Hero | journey, challenge, courage, battle, victory |
Wise Old Man | wisdom, guide, vision, knowledge, advice |
Shadow | fear, anger, hidden, conflict, darkness |
Trickster | mischief, chaos, joke, clever, confusion |
Everyman | daily, work, family, routine, decision |
Anima/Animus | feeling, dream, love, mirror, transformation |
Method | Objective |
---|---|
Cosine Similarity | Compare AI vs. human narrative structure |
Sentiment Analysis | Measure emotional tone in AI vs. human texts |
TF-IDF | Identify key lexical differences |
Topic Modeling (LDA) | Analyze dominant themes in storytelling |
Archetype | Observations |
---|---|
Trickster | AI maintains structure well but lacks deep humor and unpredictability. |
Shadow | AI captures dark themes but misses psychological nuance. |
Hero | Strong structure but predictable hero arcs; TF-IDF shows less originality. |
Wise Old Man | AI effectively replicates wisdom-based storytelling patterns. |
Everyman | Balanced, relatable storytelling; AI closely follows archetypal structure. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kabashkin, I.; Zervina, O.; Misnevs, B. AI Narrative Modeling: How Machines’ Intelligence Reproduces Archetypal Storytelling. Information 2025, 16, 319. https://doi.org/10.3390/info16040319
Kabashkin I, Zervina O, Misnevs B. AI Narrative Modeling: How Machines’ Intelligence Reproduces Archetypal Storytelling. Information. 2025; 16(4):319. https://doi.org/10.3390/info16040319
Chicago/Turabian StyleKabashkin, Igor, Olga Zervina, and Boriss Misnevs. 2025. "AI Narrative Modeling: How Machines’ Intelligence Reproduces Archetypal Storytelling" Information 16, no. 4: 319. https://doi.org/10.3390/info16040319
APA StyleKabashkin, I., Zervina, O., & Misnevs, B. (2025). AI Narrative Modeling: How Machines’ Intelligence Reproduces Archetypal Storytelling. Information, 16(4), 319. https://doi.org/10.3390/info16040319