Enhancing Migraine Trigger Surprisal Predictions: A Bayesian Approach to Establishing Prospective Expectations

Turner, Dana P.; Caplis, Emily; Patel, Twinkle; Houle, Timothy T.

doi:10.3390/e27111102


by Dana P. Turner ¹,*, Emily Caplis ¹, Twinkle Patel ¹,² and Timothy T. Houle ¹

¹ Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA

² Department of Neurology, UMass, Worcester, MA 01655, USA

* Author to whom correspondence should be addressed.

Entropy 2025, 27(11), 1102; https://doi.org/10.3390/e27111102 (registering DOI)

Submission received: 17 September 2025 / Revised: 16 October 2025 / Accepted: 22 October 2025 / Published: 25 October 2025

(This article belongs to the Section Information Theory, Probability and Statistics)


Abstract

Prior work has demonstrated that higher surprisal, a measure quantifying the unexpectedness of a trigger exposure, predicts headache onset over 12 to 24 h. However, these analyses relied on retrospective expectations of trigger exposure formed after extended data collection. To operationalize surprisal prospectively, Bayesian methods could update expectations dynamically over time. The objective of this study was to extend the application of surprisal theory for predicting migraine attack risk by developing methods to estimate trigger variable likelihood in real time, under conditions of limited personal observation. In a prospective daily diary study of individuals with migraine (N = 104), data were collected over 28 days, including stress, sleep, and exercise exposures. Bayesian models were applied to estimate daily expectations for each variable under uninformative and empirical priors derived from the sample. Stress was modeled using a hurdle-Gamma distribution, sleep using discrete outcomes from a Normal distribution, and exercise using a Bernoulli distribution. Surprisal was calculated based on the predictive distribution at each time point and compared to static empirical surprisal values obtained after full data collection. Dynamic Bayesian surprisal values systematically differed from retrospective empirical estimates, particularly early in the observation period. Divergence was larger and more variable under uninformative priors but attenuated over time. Empirically informed priors produced more stable, lower-bias surprisal trajectories. Substantial individual variability was observed across exposure types, especially for exercise behavior. Prospective surprisal modeling is feasible but highly sensitive to prior specification, especially in sparse data contexts (e.g., a binary exposure). Incorporating empirical or individually informed priors may improve early model calibration, though individual learning remains essential. 
These methods offer a foundation for real-time headache forecasting and dynamic modeling of brain–environment interactions.

Keywords:

surprisal; Bayesian; information theory; migraine; expectation

1. Introduction

Identifying the antecedents of migraine attacks remains a central challenge in headache science [1]. While many individuals engage in self-monitoring of putative triggers, empirical associations between specific environmental or psychosocial exposures and subsequent migraine onset are often weak or inconsistent [2,3,4,5,6,7]. In our previous work [8], we proposed a novel framework rooted in information theory, operationalizing surprisal, a measure that quantifies how unexpected or improbable an event is given recent prior experience, as a unifying construct for evaluating diverse antecedents or “triggers” of headache [9]. Across multiple datasets, we found that higher surprisal values, representing more unexpected experiences, were associated with increased likelihood of imminent headache onset within 12–24 h [8,10,11]. These results suggest that it is not merely the presence or absence of a trigger that matters but the extent to which the trigger deviates from prior expectations.

This conceptualization is consistent with frameworks in computational neuroscience, particularly predictive coding models [12] and the free energy principle [13]. These theories posit that the brain functions as a hierarchical Bayesian inference engine, continuously generating predictions about incoming sensory input and updating those predictions in response to mismatches, or prediction errors, between expected and actual input [14]. According to the free energy principle, the brain minimizes a variational bound on surprise (i.e., free energy) to maintain homeostasis and reduce uncertainty. Within this framework, surprisal may be interpreted as a measurable correlate of prediction error, reflecting the degree to which a given biopsychosocial event challenges the brain’s internal model of the world [15]. We posit that migraine attack risk may arise from the sum of such errors, reflecting the degree of threat originating from changes or challenges in the environment.

However, to render this theory actionable in applied settings with newly encountered individuals, an important issue must be addressed: how to estimate the likelihood of an event prior to extensive observation. Classical estimation of surprisal depends on well-sampled empirical distributions. Yet, in ecological contexts, individuals regularly encounter novel or infrequently repeated events. Estimating the surprisal of these events requires knowing how surprising an event is likely to be before having the benefit of weeks of data on which to base the calculation. In all our previous research into surprisal estimation [8,10,11], the information used to estimate surprisal was based on a retrospective expectation formed from the empirically observed distributions (i.e., we learned how surprising any exposure was after having observed it for weeks). For surprisal to serve as a practical tool for forecasting migraine attack risk or modeling real-time brain–environment interactions, it must be possible to approximate the expectedness of events under conditions of limited data. Accordingly, this study explores strategies for inferring event likelihood in low-data contexts, aiming to extend trigger surprisal theory from a retrospective analytic tool to a forward-facing predictive framework.

2. Materials and Methods

This study represents the third pre-specified primary analysis of a longitudinal dataset that has been detailed in two prior publications [10,11]. Following approval by the local Institutional Review Board, data collection took place between April 2021 and December 2024 using a prospective design with daily diaries. Recruitment was conducted through multiple channels, including an institution-hosted online research platform, advertisements on public transportation, and community flyers. Interested individuals completed a telephone screening to assess eligibility. While full methodological details have been published previously [10], a brief summary is provided here for context. Eligible participants were adults aged 18 to 65 years with a diagnosis of migraine with or without aura as defined by the International Classification of Headache Disorders, 3rd Edition (ICHD-3) [16] and who reported experiencing 4 to 14 headache days per month. Exclusion criteria included secondary headache disorders, chronic daily headache or medication overuse headache, recent changes in headache symptoms (within the past six weeks), insufficient English proficiency (below a sixth-grade level), unmanaged psychotic disorders, active substance dependence likely to affect headache activity or adherence to study procedures, and current or planned pregnancy during the course of the study.

Eligible participants completed an enrollment session, conducted either in person or virtually, during which informed consent was obtained via the electronic consent REDCap feature. Following consent, participants completed a battery of baseline questionnaires assessing demographics, headache characteristics, and migraine-related disability (e.g., Migraine Disability Assessment [MIDAS] [17]). They then received instruction in the at-home study procedures, which involved completing brief (5–10 min) electronic diary entries twice daily, once each morning and evening, for a period of 28 days. All entries were submitted through REDCap [18]. At study completion, participants attended a final session (in person or virtual) and completed a follow-up battery of questionnaires similar to those administered at enrollment. The twice-daily diary entries were designed to assess both exposure to a broad array of potential migraine triggers and headache activity. Details of the diary items and their variability have been published previously [10]. Distinct sets of triggers were measured in the morning and evening diaries to reflect differing contextual influences. The morning (AM) diary focused on sleep patterns (duration, quality, awakenings, bedtime, and wake time), late-night eating, weather, and mood state (Profile of Mood States Short Form [POMS-SF] [19]). The evening (PM) diary also included the POMS-SF but emphasized common food and drink triggers, environmental exposures, caffeine and alcohol use, balance disturbances, missed meals, and daily stressors (Daily Stress Inventory [DSI] [20]), in addition to weather influences.

2.1. Selection of Trigger Variables

Three biopsychosocial variables representing stress, sleep, and exercise were selected for dynamic surprisal modeling based on their theoretical relevance to headache risk [3], their practical feasibility for daily assessment, and their distinct measurement properties [2]. Stress (i.e., daily hassles) was assessed using the intensity scale of the Daily Stress Inventory [20], yielding a semi-continuous distribution with many zero values (i.e., no stress) and a positive skew among the nonzero values. Sleep was measured as the self-reported number of hours slept the previous night, a continuous variable typically heaped at whole numbers due to rounding by the participants (see [21]). Exercise was captured as a self-reported binary indicator of whether a formal exercise session was completed each day (i.e., yes or no). These variables differ not only in their underlying distributions but also in the amount and type of information each provides for updating the expectations used for estimating surprisal. Stress ratings offer a wide but irregular range of possible values, sleep reports are constrained and discretized by natural rounding behavior, and exercise events provide sparse, dichotomous data. Consequently, each novel trigger variable introduces distinct inferential challenges for real-time probability estimation, particularly early in the observation period when limited data are available (i.e., this unique individual has not yet been observed). Modeling all three allows for an examination of how surprisal behaves across variables with varying information, uncertainty structures, and learning dynamics, providing a broader examination of dynamic predictive frameworks in ecologically valid contexts.

2.2. Prospectively Estimating Surprisal from Daily Observations

To quantify how unexpected an event is for each selected variable in real time, we implemented a Bayesian framework (see [22,23]) to dynamically estimate surprisal (S), defined as the negative logarithm of the predictive probability [24]:

S(x_{t,i}) = -\log P(x_{t,i} \mid \theta_{t,i}) \qquad (1)

where x_{t,i} is the observed value of variable x on day t for individual i, and θ_{t,i} are the parameters of the predictive distribution at that time for that person. In what follows, each variable is defined separately, with the person subscript (i) dropped for convenience. By estimating surprisal for one variable at a time, we choose to ignore the joint distribution of the variables (i.e., the surprisal associated with combinations of more than one variable at each time). This choice greatly simplifies estimation when many variables are tracked, as would be the case in practice. Several methods for real-time probability estimation were considered, including kernel density estimation [25], Kalman filtering [26], Naïve Bayes, and Bayesian structural time series [27]. However, these approaches typically require extensive historical data and lose their usual advantages in sparse, person-specific contexts. We therefore adopted a simple Bayesian updating framework optimized for limited daily observations.
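As a minimal sketch of Equation (1), the following Python function converts a predictive probability into a surprisal value. The function name `surprisal` and the `base` argument are illustrative assumptions; the text does not state which logarithm base was used, so the natural logarithm (nats) is taken as the default here.

```python
import math

def surprisal(p, base=math.e):
    """Return -log(p) for a predictive probability p in (0, 1].

    The base is an assumption: math.e gives nats, 2 gives bits.
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("p must lie in (0, 1]")
    return -math.log(p) / math.log(base)

# An event predicted with probability 0.25 is more surprising
# than one predicted with probability 0.75.
rare = surprisal(0.25)
common = surprisal(0.75)
rare_bits = surprisal(0.25, base=2)  # same event expressed in bits
```

Whatever the base, lower predictive probability always maps to higher surprisal, and an event predicted with certainty carries zero surprisal.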

Exercise, a binary variable (i.e., present or absent), was modeled using a Bernoulli likelihood with a conjugate Beta prior. After observing k exposures across t days, the posterior distribution was

\mathrm{Beta}(\alpha + k, \beta + t - k) \qquad (2)

yielding a posterior predictive probability of

p(x_{t+1} = 1) = \frac{\alpha + k}{\alpha + \beta + t} \qquad (3)
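The sequential use of Equations (2) and (3) can be sketched as follows. This is an illustration rather than the authors' code: the function name and the example diary are hypothetical, and each day is scored using only the observations that preceded it.

```python
import math

def beta_bernoulli_surprisal(observations, alpha=1.0, beta=1.0):
    """Sequentially score binary observations under a Beta(alpha, beta) prior.

    Before each day, the predictive probability of x = 1 is
    (alpha + k) / (alpha + beta + t), where k exposures were seen
    in the t days observed so far. Returns (p_one, surprisal) per day.
    """
    k, results = 0, []
    for t, x in enumerate(observations):
        p_one = (alpha + k) / (alpha + beta + t)
        p_obs = p_one if x == 1 else 1.0 - p_one
        results.append((p_one, -math.log(p_obs)))
        k += x  # update the exposure count only after scoring the day
    return results

diary = [0, 0, 1, 0, 0, 0, 1, 0]  # hypothetical exercise diary (1 = exercised)
trajectory = beta_bernoulli_surprisal(diary)
```

With the flat Beta(1, 1) prior, the first prediction is 0.5, and two exercise-free days lower the predicted probability to 0.25 before the first exercise session is observed.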

Sleep duration, a continuous but heaped variable, was modeled using a Normal likelihood with a conjugate Normal prior, truncated at 0 to reflect its positive scale, and rounded to the nearest integer hour to reflect observed reporting behavior. The posterior predictive distribution after t observations was

x_{t+1} \sim N(\mu_{t}, \sigma_{t}^{2}) \qquad (4)

where μ_t is the updated posterior mean, and σ_t² reflects the combination of prior precision and accumulated evidence. To account for the common reporting behavior of rounding to the nearest whole hour, we computed the predictive probability of observing a specific integer value z as the difference in cumulative probability between the adjacent half-integer bounds:

p(x_{t+1} = z) = \Phi\left(\frac{z + 0.5 - \mu_{t}}{\sigma_{t}}\right) - \Phi\left(\frac{z - 0.5 - \mu_{t}}{\sigma_{t}}\right) \qquad (5)

where Φ(⋅) denotes the cumulative distribution function (CDF) of the standard normal distribution. Thus, the probability of observing a sleep report of z hours corresponds to the probability mass between (z − 0.5) and (z + 0.5) under the day-specific predictive distribution. This approach preserves the probabilistic integrity of the continuous Normal model while honoring the discretized nature of the observed data. Surprisal for each sleep observation was then calculated by taking the negative logarithm of the corresponding rounded predictive probability.
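Equations (4) and (5) can be illustrated with a short sketch. Several simplifications are assumptions made here and are not taken from the text: the night-to-night observation variance is treated as known (`OBS_VAR`), the predictive variance is the posterior variance of the mean plus that observation variance, and the truncation at 0 is omitted. All names and diary values are hypothetical.

```python
import math

def normal_cdf(z):
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def update_normal_mean(prior_mean, prior_var, data, obs_var):
    """Conjugate update of a Normal mean, assuming known observation variance."""
    n = len(data)
    post_var = 1.0 / (1.0 / prior_var + n / obs_var)
    post_mean = post_var * (prior_mean / prior_var + sum(data) / obs_var)
    return post_mean, post_var

def rounded_normal_surprisal(z, mu, sigma):
    """Surprisal of an integer report z: -log of the mass on (z - 0.5, z + 0.5)."""
    p = normal_cdf((z + 0.5 - mu) / sigma) - normal_cdf((z - 0.5 - mu) / sigma)
    return -math.log(p)

OBS_VAR = 1.5 ** 2    # assumed night-to-night variance (hours squared)
hours = [6, 7, 8, 7]  # hypothetical sleep diary
mu_t, var_mu = update_normal_mean(7.0, 100.0, hours, OBS_VAR)
sigma_t = math.sqrt(var_mu + OBS_VAR)  # predictive SD for the next night
s_usual = rounded_normal_surprisal(7, mu_t, sigma_t)
s_short = rounded_normal_surprisal(3, mu_t, sigma_t)
```

In this sketch a 3 h night yields a much larger surprisal than a typical 7 h night, because far less predictive mass falls between 2.5 and 3.5 h.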

Perceived stress was modeled using the hurdle-Gamma family of probability density functions. A Bernoulli model captured the probability of reporting zero stress, while a Gamma distribution characterized the magnitude of nonzero stress ratings. Given the lack of a true conjugate prior for the two-parameter Gamma distribution, we performed full Bayesian updating on each day by fitting a hierarchical hurdle-Gamma model using Hamiltonian Monte Carlo [28]. Posterior distributions for the hurdle probability, Gamma shape, and Gamma rate parameters were estimated from all observations accumulated through each day. The expected stress value on day t was computed as:

E(x_{t}) = (1 - p_{\mathrm{zero},t}) \times \frac{\mathrm{shape}_{t}}{\mathrm{rate}_{t}} \qquad (6)

where p_{zero,t} represents the posterior hurdle probability of reporting zero stress.
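The expectation in Equation (6) and one plausible way to score a stress observation can be sketched as below. This is a plug-in illustration under stated assumptions: it takes point values for the hurdle probability, shape, and rate as given (for example, posterior means), whereas the study fit the model by Hamiltonian Monte Carlo, and the text does not spell out how surprisal was computed for nonzero stress. The use of the Gamma density for nonzero values is therefore an assumption, and all names are hypothetical.

```python
import math

def gamma_log_pdf(x, shape, rate):
    """Log density of a Gamma(shape, rate) distribution at x > 0."""
    return (shape * math.log(rate) - math.lgamma(shape)
            + (shape - 1.0) * math.log(x) - rate * x)

def hurdle_gamma_expected(p_zero, shape, rate):
    """Expected value: (1 - p_zero) * shape / rate, as in Equation (6)."""
    return (1.0 - p_zero) * shape / rate

def hurdle_gamma_surprisal(x, p_zero, shape, rate):
    """Negative log of the hurdle-Gamma probability (x = 0) or density (x > 0)."""
    if x == 0:
        return -math.log(p_zero)
    return -(math.log(1.0 - p_zero) + gamma_log_pdf(x, shape, rate))

# Hypothetical plug-in parameter values for one participant-day.
expected_stress = hurdle_gamma_expected(p_zero=0.3, shape=2.0, rate=0.5)
s_zero = hurdle_gamma_surprisal(0, p_zero=0.3, shape=2.0, rate=0.5)
s_high = hurdle_gamma_surprisal(12.0, p_zero=0.3, shape=2.0, rate=0.5)
```

Because the nonzero branch uses a density rather than a probability mass, its values are not on the same scale as the zero branch and can be negative for sharply peaked densities; discretizing the rating scale would be one way to avoid this.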

2.3. Prior Probability

Daily surprisal scores were computed under two sets of prior distributions to examine how differing amounts of baseline information affect surprisal estimation. First, uninformative priors were specified to reflect minimal prior knowledge about the variable. For binary outcomes (e.g., exercise), we used a flat Beta prior:

\theta_{\mathrm{binary}} \sim \mathrm{Beta}(1, 1) \qquad (7)

For sleep duration, a diffuse Normal prior was placed on the mean to form a weakly informative prior:

\mu_{\mathrm{sleep}} \sim N(7, 100) \qquad (8)

assuming a mean of 7 h [29] and a very large prior variance of 100 h² (standard deviation = 10 h) to express broad uncertainty. For stress ratings, initial expectations were modeled with diffuse priors on the Gamma shape and rate parameters that matched our previous observations [30], each independently drawn from weakly informative Gamma distributions:

\mathrm{shape}_{\mathrm{stress}} \sim \mathrm{Gamma}(2, 0.1) \quad \text{and} \quad \mathrm{rate}_{\mathrm{stress}} \sim \mathrm{Gamma}(2, 0.1) \qquad (9)

Second, empirical priors were derived individually using a leave-one-out strategy: for each participant, prior distributions were estimated from the empirical data of the remaining N − 1 participants. The prior specification used the same distributional families as the uninformative priors, but for binary and sleep outcomes, empirical priors were centered on the observed means and variances of the sample excluding the target participant. For stress, empirical priors were based on maximum likelihood estimates of the Gamma distribution parameters fit to the leave-one-out sample. Posterior parameters were updated iteratively on each day for each individual, and surprisal values were calculated for every recorded observation, yielding a surprisal trajectory of up to 28 daily values per participant, variable, and prior type.
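A leave-one-out empirical prior for the binary exercise variable might be built as in the sketch below. The method-of-moments conversion from a mean and variance to Beta parameters is an assumption chosen for illustration; the text states only that the empirical priors were centered on the leave-one-out means and variances. Participant identifiers and rates are hypothetical.

```python
from statistics import mean, variance

def beta_from_moments(m, v):
    """Method-of-moments Beta(alpha, beta) matching mean m and variance v."""
    if not 0.0 < m < 1.0 or not 0.0 < v < m * (1.0 - m):
        raise ValueError("moments are incompatible with a Beta distribution")
    common = m * (1.0 - m) / v - 1.0
    return m * common, (1.0 - m) * common

def leave_one_out_priors(rates):
    """Map each participant id to a Beta prior fit to everyone else's rates."""
    priors = {}
    for pid in rates:
        others = [r for other, r in rates.items() if other != pid]
        priors[pid] = beta_from_moments(mean(others), variance(others))
    return priors

# Hypothetical person-level exercise rates (proportion of days exercised).
exercise_rates = {"p01": 0.10, "p02": 0.25, "p03": 0.40, "p04": 0.20}
priors = leave_one_out_priors(exercise_rates)
alpha_p01, beta_p01 = priors["p01"]
```

Each participant's prior mean, alpha / (alpha + beta), equals the average rate of the other participants, so no individual's own data leak into the prior that is later updated with that individual's diary.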

2.4. Statistical Analyses

All statistical analyses were conducted using R version 4.4.1 and RStudio 2025.09.1+401. All analyses were conducted with the available data, and no a priori statistical power calculations were conducted to guide sample size for this analysis. Descriptive statistics for participant characteristics were summarized using medians and interquartile ranges [25th, 75th] for continuous variables and frequencies and percentages for categorical variables. For dynamic modeling, Bayesian updating was applied iteratively to estimate daily expectations separately for each person and for each variable under both uninformative and empirically derived prior distributions. Daily expected values and surprisal scores were aggregated across participants to summarize group-level trajectories over the 28-day observation period. These values were summarized using means and frequentist 95% confidence intervals (CIs), which ignore the uncertainty attached to each individual surprisal value.

For exercise, surprisal values were calculated separately for each participant, day, and prior type, then compared to static empirical estimates derived from each individual’s full data record. Mean differences between dynamic and static surprisal estimates were summarized at each timepoint, with 95% confidence intervals calculated based on the standard error of the mean. Visualization of distribution fits, expectation trajectories, and surprisal differences was performed using the ggplot2 package [31], with smoothing applied to trajectory plots where noted. All analyses were descriptive and exploratory with no formal hypothesis tests conducted.
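The comparison of dynamic and static surprisal for exercise can be sketched as follows. This is an illustrative reconstruction, not the study's R code: the static estimate uses each person's full-record exercise rate, the dynamic estimate uses the Beta-Bernoulli predictive probability available before each day, and the interval is the mean ± 1.96 standard errors. The diaries are hypothetical.

```python
import math
from statistics import mean, stdev

def static_bernoulli_surprisal(observations):
    """Surprisal of each day using the person's full-record exposure rate."""
    rate = sum(observations) / len(observations)
    return [-math.log(rate if x == 1 else 1.0 - rate) for x in observations]

def dynamic_bernoulli_surprisal(observations, alpha=1.0, beta=1.0):
    """Surprisal of each day using only the data observed before that day."""
    k, out = 0, []
    for t, x in enumerate(observations):
        p_one = (alpha + k) / (alpha + beta + t)
        out.append(-math.log(p_one if x == 1 else 1.0 - p_one))
        k += x
    return out

def daily_difference_summary(diaries, alpha=1.0, beta=1.0):
    """Per-day mean of (dynamic - static) surprisal with a 95% CI from the SEM."""
    diffs = [
        [d - s for d, s in zip(dynamic_bernoulli_surprisal(obs, alpha, beta),
                               static_bernoulli_surprisal(obs))]
        for obs in diaries
    ]
    summary = []
    for day_values in zip(*diffs):  # stops at the shortest diary
        m = mean(day_values)
        se = stdev(day_values) / math.sqrt(len(day_values))
        summary.append((m, m - 1.96 * se, m + 1.96 * se))
    return summary

diaries = [[0, 0, 1, 0], [1, 0, 0, 0], [0, 1, 1, 0]]  # hypothetical participants
summary = daily_difference_summary(diaries)
```

Each tuple in `summary` holds the mean difference and its lower and upper interval bounds for one study day, which is the quantity a difference-by-day plot would display.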

3. Results

Sample characteristics have been described previously [10]. Briefly, the study included 109 individuals with migraine (median age = 35 years [IQR: 26–46]), 93.5% of whom were female and 83.5% of whom were White. Of these, 104 participants completed approximately 28 days of twice-daily electronic diaries, yielding 5176 total entries. The sample had a median headache frequency of 8 days per month [IQR: 5–12], a median intensity of 7/10 [IQR: 5.5–8.0], and a median MIDAS score of 24 [IQR: 13–35.5], indicating moderate to severe migraine-related disability.

3.1. Model Fit to Observed Distributions

Estimating surprisal in real time requires distribution-based expectations that reflect the empirical properties of each exposure variable. Stated differently, distributions that express the chances that someone encounters every level of the variable are required for the calculation. To allow this, observed distributions of daily stress, sleep, and exercise data were characterized by the proposed statistical distributions (Figure 1). Stress ratings exhibited a strong positive skew, with a high frequency of zero values. A hurdle-gamma model provided an appropriate fit, capturing both the probability of zero stress and the right-skewed distribution of nonzero values. The empirical cumulative distribution function (ECDF) aligned closely with the predicted cumulative distribution, indicating good model fit.

As expected, sleep duration was typically reported in whole-hour values. A rounded normal model was used to reflect this heaping. The fitted normal density provided an adequate approximation of the observed histogram, except for the slight excess of 8-h values, and the predicted cumulative distribution closely followed the ECDF. Finally, daily exercise, recorded as a binary variable (0 = no exercise, 1 = exercised), followed a Bernoulli distribution with a group-level mean near 0.25 (i.e., across the sample, individuals exercised on 25% of the days).

3.2. Evolution of Expected Values over Time

To assess the influence of prior knowledge on expectation formation, we compared temporal trajectories of estimated values generated from uninformative versus empirical priors. Bayesian expectation trajectories were generated for each individual using both uninformative (i.e., assuming no knowledge about the distribution) and empirical priors (i.e., using information provided by other participants) across the 28-day study period (Figure 2). For stress, expectations under empirical priors began higher than those under uninformative priors, but the two trajectories gradually converged over time as individual observations accumulated. Individual variability in stress reporting was considerable, particularly under the empirical prior, where a subset of participants showed persistently elevated expected values.

In contrast, sleep expectations remained relatively stable across time under both prior conditions. The empirical and uninformative priors produced similar group-level means, though the empirical prior provided some regularization early in the observation period, with greater between-person variability for the uninformative priors, as evidenced by more dispersed individual trajectories. This pattern supports the notion that sleep patterns differ between individuals but remain relatively consistent over time within the same individual.

For exercise, substantial divergence was observed between prior conditions. The uninformative prior initially assumed a relatively high probability of exercise (i.e., 50%), which decreased quickly with accumulating data. The empirical prior, by contrast, began with lower expectations that more closely reflected population averages and changed minimally over time. This difference in prior assumptions led to markedly different expectation curves, especially during the early observation period. Additionally, the markedly different patterns of individual trajectories indicate strong between-person differences that are only partially captured by the empirical prior: the population average provides some information for expectation, but substantial individual differences remain to be learned during the observation period.
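As a worked illustration of Equation (3) under the flat Beta(1, 1) prior, consider a hypothetical participant (not drawn from the study data) who exercises on 2 of the first 7 days and on 7 of all 28 days. The predicted probability of exercise before day 1, before day 8, and after the full record would be:

```latex
p(x_{1} = 1) = \frac{1 + 0}{1 + 1 + 0} = 0.50, \qquad
p(x_{8} = 1) = \frac{1 + 2}{1 + 1 + 7} = \frac{3}{9} \approx 0.33, \qquad
p(x_{29} = 1) = \frac{1 + 7}{1 + 1 + 28} = \frac{8}{30} \approx 0.27
```

The expectation starts at 50% and moves toward this person's observed rate of 25% as days accumulate, which matches the qualitative pattern described for the uninformative prior.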

3.3. Surprisal Differences by Prior Type

To evaluate the impact of prior specification on surprisal estimation, we compared dynamically updated Bayesian surprisal values to static empirical surprisal values, which were calculated using each individual’s full set of observed data and represent a reasonable benchmark for what could be learned with complete information. As shown in Figure 3 for exercise, the average surprisal difference (dynamic minus static) was negative for both prior types, indicating that dynamic surprisal values were generally lower than those based on a static empirical distribution. This difference was greatest early in the study period, reflecting greater uncertainty under limited data conditions.

Notably, surprisal estimates based on the uninformative prior diverged more substantially from the static reference early on but began to converge toward the empirical estimates after approximately two weeks. In contrast, surprisal estimates under the empirical prior maintained a consistent bias with narrower confidence bands, suggesting more stable performance across time. These findings underscore the sensitivity of information-theoretic metrics to prior assumptions, particularly during the early phase of sequential data collection.

4. Discussion

This study demonstrates a practical framework for estimating surprisal in real time by dynamically updating expectations using Bayesian inference. Building on prior work showing that surprisal predicts imminent migraine attacks [8,11], we sought to address a key limitation of prior methods: their reliance on retrospectively computed expectations from fully observed data. By incorporating both uninformative and empirically derived priors, we simulated the process of expectation formation under conditions of limited personal data, mirroring practical use cases in ecological forecasting and digital health applications.

The results illustrate how variable-specific measurement characteristics influence the learning dynamics of expectation and, by extension, surprisal. For example, in the case of stress, a highly skewed, semi-continuous variable, the empirical prior provided a strong initial approximation of expected values, which then adapted to individual idiosyncrasies over time. Conversely, for sleep, which followed a more stable but individualized pattern, expectations were relatively robust across both prior types, reflecting consistent intra-individual patterns. In contrast, exercise behavior exhibited substantial between-person variability and sparsity, making it especially sensitive to prior specification. These findings emphasize that the importance of inferring prior distributions for expectation formation is uneven across variable types, particularly in the early stages of data collection.

Importantly, we found that surprisal values based on dynamic Bayesian expectations may systematically diverge from hindsight-based estimates, particularly early in the observation period. This divergence was larger and more variable under uninformative priors, while empirically derived priors produced more stable trajectories, albeit with a modest and consistent bias relative to the fully informed benchmark. These findings highlight the importance of prior specification, not only for improving model stability and convergence, but also for enhancing the fidelity of applied surprisal values. In clinical and digital health contexts, where early inferences may inform behavioral guidance or therapeutic recommendations, the influence of prior choice is important.

A promising direction for improving early model calibration is the use of individually informed priors, or priors informed by aggregating data from relevant subpopulations using formal mixed models. Such information could be obtained using baseline self-reports about typical sleep duration, exercise frequency, or stress levels. These personalized prior beliefs, elicited during study enrollment or digital onboarding, may offer a pragmatic middle ground between population-derived empirical priors and uninformed assumptions. Models based on these individualized assumptions could accelerate learning, reduce early surprisal volatility, and better reflect each individual’s expectations in the absence of long observation periods.

The study has several limitations. First, the analyses focused on modeling only three variables that differ in their statistical and behavioral properties. In practice, many variables (e.g., >100) may be incorporated, ranging from dietary exposures to mood states and physiological responses. Generalization to other constructs or domains should be formally evaluated. Next, while the Bayesian models simulated real-time learning, all data were previously collected in a controlled observational context with structured diaries; performance may differ in passive sensing or naturalistic mobile data streams. Additionally, by estimating the surprisal of each variable separately for each individual, an opportunity is lost to consider the surprisal associated with the joint distributions between variables, or to ‘draw strength’ from the distributions of other individuals. However, this method of estimation mirrors how the approach will be conducted in practice, where different individuals may choose to track differing numbers of variables that could range into the hundreds of triggers. Finally, the analysis ignored missing data in the diaries, as such information was not considered relevant for evaluating expectations over time. Yet, missing data that are conditional on a considered variable are almost certainly important in applied settings, where participants may skip headache diaries when they are at particular risk for an attack.

In conclusion, these findings suggest that real-time surprisal modeling is feasible but highly sensitive to the choice of prior. Incorporating empirical or personalized priors may substantially improve model behavior in early phases of data collection, though individual learning remains essential for long-term calibration. Future work should explore how best to combine individual baseline information with adaptive prior structures.

Author Contributions

Conceptualization, D.P.T. and T.T.H.; Data curation, D.P.T., E.C., T.P. and T.T.H.; Formal analysis, D.P.T. and T.T.H.; Funding acquisition, D.P.T.; Investigation, D.P.T.; Methodology, D.P.T. and T.T.H.; Project administration, E.C. and T.P.; Software, T.T.H.; Supervision, D.P.T.; Validation, T.T.H.; Visualization, T.T.H.; Writing—original draft, D.P.T., E.C., T.P. and T.T.H.; Writing—review & editing, D.P.T., E.C., T.P. and T.T.H. All authors have read and agreed to the published version of the manuscript.

Funding

The research reported in this publication was supported by the National Institute of Neurological Disorders and Stroke of the National Institutes of Health under award number R01NS113823.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of Mass General Brigham (protocol code: 2019P002176 and date of approval: 1 August 2019).

Data Availability Statement

Restrictions apply to the datasets. The datasets presented in this article are not readily available because the data are part of ongoing analyses. Requests to access the datasets should be directed to the corresponding author.

Conflicts of Interest

Dana P. Turner received funding from the National Institute of Neurological Disorders and Stroke to support this study. The authors have no additional conflicts of interest to declare.


Figure 1. Observed and fitted distributions for stress, sleep, and exercise variables. Top left: Histogram and fitted hurdle-Gamma density for stress ratings, with the spike at zero reflecting the hurdle component. Top right: ECDF of stress ratings compared to the fitted hurdle-Gamma cumulative distribution. Middle left: Histogram of self-reported sleep duration overlaid with a rounded Normal density fit. Middle right: ECDF of sleep compared to the predictive cumulative distribution from the Normal model. Bottom left: Bar plot of exercise occurrence (0 = no, 1 = yes) with Bernoulli-predicted probabilities. Bottom right: ECDF of exercise compared to the Bernoulli cumulative distribution.

Figure 2. Evolution of expected values over time for stress, sleep, and exercise under uninformative (blue) and empirical (red) priors. Top left: Mean expected stress values across participants with 95% confidence intervals. Top right: Individual stress expectation trajectories. Middle left: Mean expected sleep duration over time. Middle right: Individual sleep expectation trajectories, showing intra-individual consistency. Bottom left: Mean expected probability of exercise over time. Bottom right: Individual exercise trajectories, highlighting wide between-person variability and prior-related differences in early estimates.

Figure 3. Mean difference between dynamic Bayesian surprisal estimates and static empirical surprisal values for exercise across the 28-day study period. Differences are plotted separately for empirical priors (red) and uninformative priors (blue), with shaded ribbons representing 95% confidence intervals. The dashed horizontal line at zero indicates perfect agreement. Dynamic surprisal estimates based on empirical priors exhibited smaller and more stable deviations from static estimates, whereas those based on uninformative priors showed greater early divergence that gradually diminished over time.
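As a rough illustration of the pattern summarized in Figure 3, the following Python sketch compares prospective (dynamic) and retrospective (static) surprisal for a binary exercise exposure. It assumes a conjugate Beta-Bernoulli update, which is one simple way to obtain a predictive probability for a Bernoulli variable and is not necessarily how the study's models were fitted. The function names, the 28-day example series, and the prior parameters Beta(1, 1) (standing in for an uninformative prior) and Beta(2, 6) (standing in for an empirically informed prior) are hypothetical and are not taken from the study.

```python
import math


def bernoulli_surprisal(p_event, observed):
    """Surprisal in bits of a binary observation, given P(event = 1)."""
    p = p_event if observed == 1 else 1.0 - p_event
    return -math.log2(p)


def dynamic_surprisal(days, prior_a, prior_b):
    """Score each day using only the prior and the days before it.

    The predictive probability of exercise is the mean of the current
    Beta(a, b) distribution; a and b are updated after scoring.
    """
    a, b = prior_a, prior_b
    scores = []
    for y in days:
        scores.append(bernoulli_surprisal(a / (a + b), y))
        a += y          # one more day with exercise
        b += 1 - y      # one more day without exercise
    return scores


def static_surprisal(days):
    """Score each day with the frequency observed over the full series."""
    p = sum(days) / len(days)
    return [bernoulli_surprisal(p, y) for y in days]


def mean_abs_gap(dynamic, static, start, stop):
    """Mean absolute dynamic-minus-static difference over days [start, stop)."""
    gaps = [abs(d - s) for d, s in zip(dynamic[start:stop], static[start:stop])]
    return sum(gaps) / len(gaps)


# Hypothetical 28-day diary: exercise on every fourth day.
days = [0, 0, 0, 1] * 7
static = static_surprisal(days)
uninformative = dynamic_surprisal(days, 1.0, 1.0)  # assumed Beta(1, 1) prior
informed = dynamic_surprisal(days, 2.0, 6.0)       # assumed Beta(2, 6) prior

for label, dyn in (("uninformative", uninformative), ("informed", informed)):
    early = mean_abs_gap(dyn, static, 0, 7)
    late = mean_abs_gap(dyn, static, 21, 28)
    print(f"{label}: first week {early:.3f} bits, last week {late:.3f} bits")
```

In this toy series the first day scored under Beta(1, 1) carries exactly 1 bit of surprisal, because the predictive probability is 0.5 regardless of the outcome. The gap to the static values shrinks as observations accumulate, which mirrors the attenuation described in the caption. The size of the early gap depends on how close the assumed prior mean happens to be to the person's eventual exercise frequency.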

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

