Perspective on Predictive Modeling: Current Status, New High-Order Methodology and Outlook for Energy Systems
Abstract
:1. Introduction
- (a)
- a mathematical model comprising equations that relate the system’s independent variables and parameters to the system’s state (i.e., dependent) variables;
- (b)
- inequality and/or equality constraints that delimit the ranges of the system’s parameters;
- (c)
- one or several computational results, customarily referred to as “system responses” (or objective functions, or indices of performance), which are computed using the mathematical model; and
- (d)
- experimentally measured responses, with their respective nominal (mean) values and uncertainties (variances, covariances, skewness, kurtosis, etc.).
2. Traditional Least-Square Based Deterministic Predictive Modeling Methodologies
- (i)
- The so-called “data-adjustment methodology”, which is perhaps the oldest such methodology in use and was developed in the nuclear energy field. The initial “data-adjustment methodology”, which was developed in the 1960s for the large-scale time-independent linear systems modeled by the linear neutral particle transport equation, is briefly reviewed in Section 2.1. The data-adjustment methodology for time-dependent nonlinear systems, which could be considered to be the predecessor for the so-called “4D-Var” methodology mentioned below, is reviewed in Section 2.2.
- (ii)
- The so-called “data assimilation” methodology, which is implemented for the large-scale time-dependent systems encountered in the atmospheric and geophysical field, is briefly reviewed in Section 2.3.
2.1. Time-Independent Least-Squares Based Data Adjustment
- (i)
- The (symmetric) relative covariance matrix for the model parameters, , is defined as follows:
- (ii)
- The (symmetric) relative covariance matrix for the measured model responses (effective multiplication factors), , is defined as follows:
- (iii)
- The rectangular matrix contains as elements the relative covariances between the measured responses and model parameters, and is defined as follows:
- (iv)
- Each model response, denoted as , is considered to be a linear function of the model parameters, denoted as , having the following form:
- (v)
- The (column) vector of Lagrange multipliers , which appears in the functional defined in Equation (1), enforces the linear model defined by Equation (7) as a “hard constraint.”
2.2. Time-Dependent Least-Squares Based Data Adjustment
- (i)
- calibrated best-estimate parameter values:
- (ii)
- The calibrated best-estimate covariance matrix, , corresponding to the calibrated best-estimates system parameters:
- (iii)
- The vector of calibrated best-estimate system responses at all time instances :
- (iv)
- The expression of the calibrated best-estimate covariance block-matrix, , for the best-estimate responses:
- (v)
- The best-estimate response-parameter covariance block-matrix :
- (i)
- The expressions for the calibrated best-estimate parameter values take on the following particular form of Equation (32) at time node :
- (ii)
- The components , , of the calibrated best-estimate covariance matrix, , have the following particular form of Equation (35):
- (iii)
- The vector , representing the calibrated best-estimates for the system parameters at a time instance , takes on the following particular form of Equation (37):
- (iv)
- The components of the calibrated best-estimate covariance block-matrix for the best-estimate responses takes on the following particular form of Equation (38) for :
- (v)
- The matrix-valued components , , of the best-estimate response-parameter covariance matrix reduce to the following particular forms of Equation (42):
2.3. Least-Squares Based Variational Data Assimilation
- The functional quantifies, in a least-square sense, the squared differences between the initial state and the background state ; denotes the estimated background covariance matrix. The background state provides an initial guess for minimization of
- The functional quantifies, in a least-square sense, the squared differences between the observed state and initial state at the time instance ; denotes the estimated observation covariance matrix and denotes the observation operator at the time instance .
- The functional quantifies, in a least-square sense, the model errors; denotes the covariance matrix of model errors.
3. BERRU-PM-2+: Second-Order-Plus MaxEnt Forward and Inverse Predictive Modeling Methodology
3.1. Construction of the Second-Order-Accurate MaxEnt Probability Distribution of Computational Model Responses and Parameters
- (i)
- maximizes the Shannon [43] information entropy for the computational model, , defined below:
- (ii)
- satisfies the “moments constraints” defined by Equations (66), (67), (70), (71) and (72);
- (iii)
- satisfies the normalization condition:
3.2. Construction of the Second-Order-Accurate MaxEnt Probability Distribution of Experimentally Measured Responses
3.3. Construction of the Complete Second-Order-Accurate Joint Posterior MaxEnt Probability Distbribution of Computed and Measured Responses and Model Parameters
3.4. Practical Situation: Only Response Measurements Are Available to Be Assimilated
- (i)
- the derivative of a function with respect to a component of is denoted using a subscript, e.g., , , , where denotes the total number of independent variables;
- (ii)
- the superscripts denote the respective component of the inverse Hessian of the respective function, e.g., denotes the -element of the inverse Hessian matrix ;
- (iii)
- an index that appears as a subscript and a superscript implies a summation over all possible values of that index;
- (iv)
- the “hat” denotes that the respective quantity is evaluated at the saddle point of , which is defined as the point at which the gradient of vanishes, i.e., .
3.5. Characteristics of the BERRU-PM-2+ Methodology: Summary
4. Discussion, Conclusions and Outlook
Funding
Conflicts of Interest
Appendix A
Appendix A.1. Fourth-Order Approximate Expressions for the Expected Value of the Computed Response
Appendix A.2. Fourth-Order Approximate Expressions for the Correlations between Computed Responses and Model Parameters
Appendix A.3. Fourth-Order Approximate Expressions for the Correlations between Computed Responses
References
- Humi, I.; Wagschal, J.J.; Yeivin, Y. Multi-group constants from integral data. In Proceedings of the 3rd International Conference on the Peaceful Uses of Atomic Energy, Geneva, Switzerland, 31 August–9 September 1964; Volume 2, p. 398. [Google Scholar]
- Cecchini, G.; Farinelli, U.; Gandini, A.; Salvatores, A. Analysis of integral data for few-group parameter evaluation of fast reactors. In Proceedings of the 3rd International Conference on the Peaceful Uses of Atomic Energy, Geneva, Switzerland, 31 August–9 September 1964; Volume 2. [Google Scholar]
- Usachhev, L.N. Perturbation theory for the breeding ratio and for other number ratios pertaining to various reactor processes. J. Nucl. Energy. Parts A/B React. Sci. Technol. 1964, 18, 571. [Google Scholar] [CrossRef]
- Rowlands, J. The production and performance of the adjusted cross-section set FGL5. In Proceedings of the International Symposium on Physics of Fast Reactors, Tokyo, Japan, 16–19 October 1973. [Google Scholar]
- Gandini, A.; Petilli, M. AMARA: A Code Using the Lagrange Multipliers Method for Nuclear Data Adjustment; CNEN-RI/FI(73)39; Comitato Nazionale Energia Nucleare: Casaccia/Rome, Italy, 1973. [Google Scholar]
- Kuroi, H.; Mitani, H. Adjustment to cross-section data to fit integral experiments by least squares method. J. Nucl. Sci. Technol. 1975, 12, 663. [Google Scholar] [CrossRef]
- Dragt, J.B.; Dekker, J.W.M.; Gruppelaar, H.; Janssen, A.J. Methods of adjustment and error evaluation of neutron capture cross sections. Nucl. Sci. Eng. 1977, 62, 11. [Google Scholar] [CrossRef]
- Weisbin, C.R.; Oblow, E.M.; Marable, J.H.; Peelle, R.W.; Lucius, J.L. Application of sensitivity and uncertainty methodology to fast reactor integral experiment analysis. Nucl. Sci. Eng. 1978, 66, 307. [Google Scholar] [CrossRef]
- Barhen, J.; Cacuci, D.G.; Wagschal, J.J.; Bjerke, M.A.; Mullins, C.B. Uncertainty analysis of time-dependent nonlinear systems: Theory and application to transient thermal hydraulics. Nucl. Sci. Eng. 1982, 81, 23–44. [Google Scholar] [CrossRef]
- Cacuci, D.G. Sensitivity theory for nonlinear systems: I. Nonlinear functional analysis approach. J. Math. Phys. 1981, 22, 2794–2812. [Google Scholar] [CrossRef]
- Lewis, J.M.; Lakshmivarahan, S.; Dhall, S.K. Dynamic Data Assimilation: A Least Square Approach; Cambridge University Press: Cambridge, UK, 2006. [Google Scholar]
- Lahoz, W.; Khattatov, B.; Ménard, R. (Eds.) Data Assimilation: Making Sense of Observations; Springer: Heidelberg, Germany, 2010. [Google Scholar]
- Faragó, I.; Havasi, Á.; Zlatev, Z. (Eds.) Advanced Numerical Methods for Complex Environmental Models: Needs and Availability; Bentham Science Publishers: Bussum, The Netherlands, 2013. [Google Scholar]
- Cacuci, D.G.; Navon, M.I.; Ionescu-Bujor, M. Computational Methods for Data Evaluation and Assimilation; Chapman & Hall/CRC: Boca Raton, FL, USA, 2014. [Google Scholar]
- Cacuci, D.G. Predictive modeling of coupled multi-physics systems: I. Theory. Ann. Nucl. Energy 2014, 70, 266–278. [Google Scholar] [CrossRef]
- Cacuci, D.G. BERRU Predictive Modeling: Best-Estimate Results with Reduced Uncertainties; Springer: Berlin, Germany, 2019. [Google Scholar]
- Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
- Cacuci, D.G. Second-order adjoint sensitivity analysis methodology (2nd-ASAM) for large-scale nonlinear systems: I. Theory. Nucl. Sci. Eng. 2016, 184, 16–30. [Google Scholar] [CrossRef]
- Cacuci, D.G. The Second-Order Adjoint Sensitivity Analysis Methodology; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
- Cacuci, D.G. The nth-order comprehensive adjoint sensitivity analysis methodology for nonlinear systems (nth-CASAM-N): Mathematical framework. J. Nucl. Eng. 2022, 3, 10. [Google Scholar] [CrossRef]
- Cacuci, D.G. The nth-Order Comprehensive Adjoint Sensitivity Analysis Methodology: Overcoming the Curse of Dimensionality. Volume I: Linear Systems; Springer Nature Switzerland: Cham, Switzerland, 2022. [Google Scholar]
- SCALE: A Modular Code System for Performing Standardized Computer Analyses for Licensing Evaluation; ORNL/TM 2005/39, Version 6; Oak Ridge National Laboratory: Oak Ridge, TN, USA, 2009.
- Venard, C.; Santamarina, A.; Leclainche, A.; Mournier, C. The R.I.B. Tool for the determination of computational bias and associated uncertainty in the CRISTAL criticality safety package. In Proceedings of ANS Nuclear Criticality Safety Division Topical Meeting (NCSD 2009), Richland, DC, USA, 13–17 September 2009.
- Gandin, L.S. Objective Analysis of Meteorological Fields; Gridromet: Leningrad, Russia, 1963. (In Russian) [Google Scholar]
- Anthes, R.A. Data assimilation and initialization of hurricane prediction models. J. Atmos. Sci. 1974, 31, 702. [Google Scholar] [CrossRef]
- Sasaki, Y.K. A fundamental study of the numerical prediction based on the variational principle. J. Meteor. Soc. Japan 1955, 33, 262. [Google Scholar] [CrossRef] [Green Version]
- Sasaki, Y.K. An objective analysis based on the variational method. J. Meteor. Soc. Japan 1958, 36, 77. [Google Scholar] [CrossRef] [Green Version]
- Lorenc, A.C. Analysis-methods for numerical weather prediction. Q. J. R. Meteorol. Soc. 1986, 112, 1177. [Google Scholar] [CrossRef]
- Hall, M.C.G.; Cacuci, D.G.; Schlesinger, M.E. Sensitivity analysis of a radiative-convective model by the adjoint method. J. Atm. Sci. 1982, 39, 2038–2050. [Google Scholar] [CrossRef]
- Hall, M.C.G.; Cacuci, D.G. Physical interpretation of the adjoint functions for sensitivity analysis of atmospheric models. J. Atm. Sci. 1983, 40, 2537–2546. [Google Scholar] [CrossRef]
- Práger, T.; Kelemen, F.D. Adjoint methods and their application in earth sciences. In Advanced Numerical Methods for Complex Environmental Models: Needs and Availability; Faragó, I., Havasi, Á., Zlatev, Z., Eds.; Bentham Science Publishers: Bussum, The Netherlands, 2013; Chapter 4, Part A; pp. 203–275. [Google Scholar]
- Derber, J.C. The variational four-dimensional assimilation of analysis using filtered models as constraints. Ph.D. Thesis, University of Wisconsin–Madison, Madison, WI, USA, 1985. [Google Scholar]
- Derber, J.C. Variational four-dimensional analysis using the quasigeostrophic constraint. Mon. Weather. Rev. 1987, 115, 998. [Google Scholar] [CrossRef]
- Hoffmann, R.N. A four-dimensional analysis exactly satisfying equations of motion. Mon. Weather. Rev. 1986, 114, 388. [Google Scholar] [CrossRef]
- Navon, I.M.; de Villiers, R. Combined penalty multiplier optimization methods to enforce integral invariants conservation. Mon. Weather. Rev. 1983, 111, 1228. [Google Scholar] [CrossRef]
- Navon, I.M. A review of variational and optimization methods in meteorology. Dev. Geomath. 1986, 5, 29–34. [Google Scholar]
- Navon, I.M. Practical and theoretical aspects of adjoint parameter estimation and identifiability in meteorology and oceanography. Dyn. Atmos. Ocean. 1998, 27, 55–79. [Google Scholar] [CrossRef]
- Derber, J.C.; Parrish, D.F.; Lord, S.J. The new global operational analysis system at the National Meteorological Center. Weather. and Forecast 1991, 6, 538. [Google Scholar] [CrossRef]
- Mahfouf, J.F.; Buizza, R.; Errico, R.M. Strategy for including physical processes in the ECMWF variational data assimilation system. In Proceedings of the ECMWF workshop on non-linear aspects of data assimilation, Reading, UK, 9–11 September 1996. [Google Scholar]
- Rabier, F. Overview of global data assimilation developments in numerical weather-prediction centers. Q. J. R. Meteorol. Soc. 2005, 131, 3215. [Google Scholar] [CrossRef]
- Bellman, R.E. Dynamic Programming. Rand Corporation; Princeton University Press: Princeton, NJ, USA, 1957; ISBN 978-0-691-07951-6. [Google Scholar]
- Tukey, J.W. The Propagation of Errors, Fluctuations and Tolerances; Technical Reports No. 10–12; Princeton University: Princeton, NJ, USA, 1957. [Google Scholar]
- Shannon, C.E. A Mathematical theory of communication. Bell System Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
- Evangelou, E.; Zhu, Z.; Smith, R.L. Estimation and prediction for spatial generalized linear mixed models using high order Laplace approximation. J. Stat. Plan. Inference 2011, 141, 3564–3577. [Google Scholar] [CrossRef] [Green Version]
- Shun, Z. and P. McCullagh. Laplace approximation of high dimensional integrals. J. R. Stat. Society. Ser. B 1995, 57, 749–760. [Google Scholar]
- Tang, Y.; Reid, N. Laplace and saddle point approximations in high dimensions. arXiv 2021, arXiv:2107.10885. [Google Scholar] [CrossRef]
- Tichonov, A.N. Regularization of non-linear ill-posed problems. Dokl. Akad. Nauk. 1963, 49. [Google Scholar]
- Tichonov, A.N. Solution of incorrectly formulated problems and the regularization method. Sov. Math. Dokl. 1963, 4, 1035. [Google Scholar] [CrossRef]
- Tarantola, A. Inverse Problem Theory and Methods for Model Parameter Estimation; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2005. [Google Scholar]
- Hoefer, A.; Buss, O.; Neuber, J.C. How confident can we be in confidence intervals for the computational bias obtained with the generalized linear least squares methodology?—A toy model analysis. In Proceedings of the International Conference on Nuclear Criticality, ICNC 2011, Edinburgh, Scotland, UK, 19–22 September 2011. [Google Scholar]
- Hoefer, A.; Buss, O.; Neuber, J.C. Limitations of the generalized linear least squares methodology for bias estimation in criticality safety analysis. In Proceedings of the IAEA International Workshop on Burnup Credit Criticality Calculation Methods and Applications, Beijing, China, 25–28 October 2011. Available online: https://www.researchgate.net/publication/276271639 (accessed on 5 December 2022).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cacuci, D.G. Perspective on Predictive Modeling: Current Status, New High-Order Methodology and Outlook for Energy Systems. Energies 2023, 16, 933. https://doi.org/10.3390/en16020933
Cacuci DG. Perspective on Predictive Modeling: Current Status, New High-Order Methodology and Outlook for Energy Systems. Energies. 2023; 16(2):933. https://doi.org/10.3390/en16020933
Chicago/Turabian StyleCacuci, Dan Gabriel. 2023. "Perspective on Predictive Modeling: Current Status, New High-Order Methodology and Outlook for Energy Systems" Energies 16, no. 2: 933. https://doi.org/10.3390/en16020933
APA StyleCacuci, D. G. (2023). Perspective on Predictive Modeling: Current Status, New High-Order Methodology and Outlook for Energy Systems. Energies, 16(2), 933. https://doi.org/10.3390/en16020933