<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Algorithms</journal-id>
<journal-title>Algorithms</journal-title>
<issn pub-type="epub">1999-4893</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/a4010028</article-id>
<article-id pub-id-type="publisher-id">algorithms-04-00028</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Defense of the Least Squares Solution to Peelle's Pertinent Puzzle</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Burr</surname><given-names>Tom</given-names></name><xref ref-type="aff" rid="af1-algorithms-04-00028"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-algorithms-04-00028"><sup>*</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Kawano</surname><given-names>Toshihiko</given-names></name><xref ref-type="aff" rid="af2-algorithms-04-00028"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Talou</surname><given-names>Patrick</given-names></name><xref ref-type="aff" rid="af2-algorithms-04-00028"><sup>2</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Pan</surname><given-names>Feng</given-names></name><xref ref-type="aff" rid="af3-algorithms-04-00028"><sup>3</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Hengartner</surname><given-names>Nicolas</given-names></name><xref ref-type="aff" rid="af4-algorithms-04-00028"><sup>4</sup></xref></contrib></contrib-group>
<aff id="af1-algorithms-04-00028">
<label>1</label> Statistical Sciences, Los Alamos National Laboratory, Los Alamos NM, USA; E-Mail: <email>tburr@lanl.gov</email></aff>
<aff id="af2-algorithms-04-00028">
<label>2</label> Nuclear and Particle Physics, Los Alamos National Laboratory, Los Alamos NM, USA; E-Mail: <email>talou@lanl.gov</email></aff>
<aff id="af3-algorithms-04-00028">
<label>3</label> Decision Applications, Los Alamos National Laboratory, Los Alamos NM, USA; E-Mail: <email>fpan@lanl.gov</email></aff>
<aff id="af4-algorithms-04-00028">
<label>4</label> Information Sciences, Los Alamos National Laboratory, Los Alamos NM, USA; E-Mail: <email>nickh@lanl.gov</email></aff>
<author-notes>
<corresp id="c1-algorithms-04-00028">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>kawano@lanl.gov</email>; Tel.: +1-505-664-0513; Fax: +1-505-667-1931.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<pub-date pub-type="epub">
<day>15</day>
<month>02</month>
<year>2011</year></pub-date>
<volume>4</volume>
<issue>1</issue>
<fpage>28</fpage>
<lpage>39</lpage>
<history>
<date date-type="received">
<day>06</day>
<month>12</month>
<year>2010</year></date>
<date date-type="rev-recd">
<day>15</day>
<month>01</month>
<year>2011</year></date>
<date date-type="accepted">
<day>07</day>
<month>02</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license>
<p>This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/.)</p></license></permissions>
<abstract>
<p>Generalized least squares (GLS) for model parameter estimation has a long and successful history dating to its development by Gauss in 1795. Alternatives can outperform GLS in some settings, and alternatives to GLS are sometimes sought when GLS exhibits curious behavior, such as in Peelle's Pertinent Puzzle (PPP). PPP was described in 1987 in the context of estimating fundamental parameters that arise in nuclear interaction experiments. In PPP, GLS estimates fell outside the range of the data, eliciting concerns that GLS was somehow flawed. These concerns have led to suggested alternatives to GLS estimators. This paper defends GLS in the PPP context, investigates when PPP can occur, illustrates when PPP can be beneficial for parameter estimation, reviews optimality properties of GLS estimators, and gives an example in which PPP does occur.</p></abstract>
<kwd-group>
<kwd>Peelle's puzzle</kwd>
<kwd>mean squared error</kwd>
<kwd>measurement error modeling</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>Generalized least squares (GLS) for parameter estimation has a long and successful history dating to its development by Gauss in 1795. In some settings, alternatives to GLS can be effective, and are sometimes sought when GLS exhibits curious behavior, such as in Peelle's Pertinent Puzzle (PPP).</p>
<p>PPP was introduced in 1987 in the context of estimating fundamental parameters that arise in nuclear interaction experiments [<xref ref-type="bibr" rid="b1-algorithms-04-00028">1</xref>]. PPP is described below and when it occurs, the GLS estimate of the parameter is guaranteed to be outside the range of the data, which has elicited concerns that GLS is flawed and has led to suggested alternatives to GLS estimators.</p>
<p>A GLS estimate lying outside the range of the data causes heartache among nuclear scientists. Therefore, PPP continues to be of theoretical and practical interest. For example, a summary report of International Evaluation of Neutron Cross Section Standards [<xref ref-type="bibr" rid="b2-algorithms-04-00028">2</xref>] discusses PPP in terms of a standard least-squares procedure. Neutron cross sections are fundamental parameters that describe the probabilities of various neutron interactions. These cross sections are typically estimated from multiple experiments so some type of weighted average estimation scheme is used. The cross section estimates from any two experiments can have shared errors arising for example from using the same measured background, which can lead to covariance structures such as those described below.</p>
<p>We quote the original PPP problem proposed by [<xref ref-type="bibr" rid="b1-algorithms-04-00028">1</xref>] from a report of Chiba and Smith [<xref ref-type="bibr" rid="b3-algorithms-04-00028">3</xref>], “Suppose we are required to obtain the weighted average of two experimental results for the same physical quantity. The first result is 1.5, and the second result 1.0. The full covariance matrix of those data is believed to be the sum of three components. The first component is fully correlated with standard error 20% of each respective value. The second and third components are independent of the first and of each other, and correspond to 10% random uncertainties in each experimental result.</p>
<p>Although this PPP statement is vague, by converting it to something more interpretable, GLS can be applied and the resulting estimate is 0.88 (with an associated standard deviation of 0.22), which is outside the range of the measurements. Zhao and Perey [<xref ref-type="bibr" rid="b4-algorithms-04-00028">4</xref>] re-interpreted PPP by introducing a third datum c, through which the common error can be explicitly specified as follows, “Suppose we have two independent measurements. One is <italic>m</italic><sub>1</sub> = 1.5 ± 10%. Another is <italic>m</italic><sub>2</sub> = 1.0 ± 10%. To convert this quantity into another physical quantity, we need a conversion factor <italic>c</italic>, which after intermediate steps omitted here is 1.0 with uncertainty of 20%. Now the experimental results are, <italic>y</italic><sub>1</sub> = <italic>cm</italic><sub>1</sub> = 1.5 and <italic>y</italic><sub>2</sub> = <italic>cm</italic><sub>2</sub> = 1.0. We are required to obtain the weighted average of those experimental data.</p>
<p>In this interpretation, the common error (the “fully correlated” component) is understood to be <italic>multiplicative</italic>, and <italic>m</italic><sub>1</sub> = 1.5 ± 10% is assumed to mean that the <italic>true</italic> standard deviation is 0.15 for <italic>m</italic><sub>1</sub> (and 0.10 for <italic>m</italic><sub>2</sub>). Even after these interpretations, some vagueness remains. There is no convention regarding what confidence is associated with ±10%. Nor is there a convention for whether the standard deviation includes all error sources, or only includes random error effects, ignoring accuracy. In addition, we show below that it can matter whether the standard deviation is expressed as a fraction of the true quantity or of the measured quantity.</p>
<p>One of our contributions is to make explicit assumptions and examine their implications in order to convert vague statements to statements about which it is possible to find agreement among physical scientists and statisticians regarding suitable approaches. We also defend GLS in the PPP context, illustrate when PPP can be beneficial, briefly describe properties of GLS estimators, show that PPP cannot occur for certain measurement error models, and calculate a covariance matrix Σ for <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> for which PPP occurs that follows from a physical description of a realistic measurement scenario.</p></sec>
<sec>
<label>2.</label>
<title>PPP</title>
<p>The vagueness of the original PPP statement is one reason there are so many interpretations of PPP [<xref ref-type="bibr" rid="b5-algorithms-04-00028">5</xref>]. PPP can occur if there is a large positive covariance between <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> and the variances of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> are very different.</p>
<p>Let Σ be the 2-by-2 symmetric covariance matrix for <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> with diagonal entries 
<inline-formula>
<mml:math id="mm1" display="inline">
<mml:semantics id="sm1">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>, 
<inline-formula>
<mml:math id="mm2" display="inline">
<mml:semantics id="sm2">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>, and off-diagonal entry <italic>σ</italic><sub>12</sub>, which denote the variance of <italic>y</italic><sub>1</sub>, the variance of <italic>y</italic><sub>2</sub>, and the covariance of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub>, respectively. Zhao and Perry [<xref ref-type="bibr" rid="b4-algorithms-04-00028">4</xref>] approximated Σ for their definition of PPP (using <italic>y</italic><sub>1</sub> = <italic>cm</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> = <italic>cm</italic><sub>2</sub> as described in the Introduction) as
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm3" display="block">
<mml:semantics id="sm3">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mo>=</mml:mo>
<mml:mtext>Cov</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:mo>≈</mml:mo>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd/></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:mo>≈</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0.1125</mml:mn></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mn>0.06</mml:mn></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0.06</mml:mn></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mn>0.05</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>σ<sub>m</sub></italic><sub>1</sub>, <italic>σ<sub>m</sub></italic><sub>2</sub>, and <italic>σ<sub>c</sub></italic> are the standard deviations in the quantities <italic>m</italic><sub>1</sub>, <italic>m</italic><sub>2</sub>, and <italic>c</italic>, respectively. The first approximation arises because the relatively small term 
<inline-formula>
<mml:math id="mm4" display="inline">
<mml:semantics id="sm4">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> was omitted (see below). The second approximation follows by approximating 
<inline-formula>
<mml:math id="mm5" display="inline">
<mml:semantics id="sm5">
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> by <italic>m</italic><sub>1</sub><italic>m</italic><sub>2</sub>. <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> assumes that <italic>m</italic> and <italic>c</italic> are independent. And, the relative standard deviations of 10%, 10%, and 20% are assumed to be the fractions relative to the measured values of 1.5, 1.0, and 1.0, respectively by definition, so that <italic>σ<sub>m</sub></italic><sub>1</sub> = 0.15, <italic>σ<sub>m</sub></italic><sub>2</sub> = 0.10, and <italic>σ<sub>c</sub></italic> = 0.2. See Section 4 for further discussion.</p>
<p>Readers might find it informative that those with traditional statistical education among the authors were the most willing to accept GLS estimates despite the apparent flaw of lying outside the range of the data. Statisticians will often consider alternatives to GLS, but recognize that GLS estimation is difficult to beat, at least in terms of typical performance measures such as being close on average to the true parameter value over hypothetical repeats of the pair of experiments [<xref ref-type="bibr" rid="b6-algorithms-04-00028">6</xref>]. Also, note that because 
<inline-formula>
<mml:math id="mm6" display="inline">
<mml:semantics id="sm6">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>0.21</mml:mn>
<mml:mo>=</mml:mo>
<mml:mn>2.39</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, one might consider the original PPP to be an unusual data realization. However, we show using elementary algebra and error modeling in Theorem 2 that when PPP occurs, it occurs for <italic>all</italic> data realizations.</p>
<p>Although Peelle [<xref ref-type="bibr" rid="b1-algorithms-04-00028">1</xref>] originally constructed the covariance matrix Σ as in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> and other authors followed, this result is not exact if the common error is multiplicative because of the omitted term 
<inline-formula>
<mml:math id="mm7" display="inline">
<mml:semantics id="sm7">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> and because of the need to estimate <italic>μ<sub>m</sub></italic>. If we include the 
<inline-formula>
<mml:math id="mm8" display="inline">
<mml:semantics id="sm8">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> term, then
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm9" display="block">
<mml:semantics id="sm9">
<mml:mrow>
<mml:mtext>Var</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>c</mml:mi>
<mml:mi>m</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>σ<sub>m</sub></italic> is <italic>σ<sub>m</sub></italic><sub>1</sub> or <italic>σ<sub>m</sub></italic><sub>2</sub> in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref>. The covariance matrix is therefore estimated as
<disp-formula id="FD3">
<label>(3)</label>
<mml:math id="mm10" display="block">
<mml:semantics id="sm10">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd/></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>c</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:mo>≈</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0.1134</mml:mn></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mn>0.06</mml:mn></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0.06</mml:mn></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mn>0.0504</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>where again as in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref>, <italic>μ<sub>m</sub></italic> is estimated by <italic>m</italic><sub>1</sub> or <italic>m</italic><sub>2</sub>.</p>
<p>As shown below, prior to substituting the approximation for 
<inline-formula>
<mml:math id="mm11" display="inline">
<mml:semantics id="sm11">
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>, <xref rid="FD1" ref-type="disp-formula">Equations 1</xref> and <xref rid="FD3" ref-type="disp-formula">3</xref> satisfy conditions under which PPP cannot occur (Theorem 2 in Section 3). However, because in all published investigations we are aware of, 
<inline-formula>
<mml:math id="mm12" display="inline">
<mml:semantics id="sm12">
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> is approximated by <italic>m</italic><sub>1</sub><italic>m</italic><sub>2</sub> in the off-diagonal and 
<inline-formula>
<mml:math id="mm13" display="inline">
<mml:semantics id="sm13">
<mml:mrow>
<mml:msubsup>
<mml:mi>μ</mml:mi>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> is approximated by 
<inline-formula>
<mml:math id="mm14" display="inline">
<mml:semantics id="sm14">
<mml:mrow>
<mml:msubsup>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> in the upper left entry in the second term and by 
<inline-formula>
<mml:math id="mm15" display="inline">
<mml:semantics id="sm15">
<mml:mrow>
<mml:msubsup>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> in the lower right entry in the second term, Σ <italic>as estimated</italic> does not have the condition referred to in Theorem 2 below Therefore, due to estimation error in Σ, PPP does appear to occur, which means that the GLS estimate of <italic>μ</italic> is outside the range of the data for Σ as given by <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> or <xref rid="FD3" ref-type="disp-formula">Equation 3</xref>.</p>
<p>The fact that <italic>μ<sub>m</sub></italic> is approximated in the context of estimating variance and covariance in a multiplicative error model raises at least three issues: (1) the issue of how uncertainty in measurement is expressed; (2) an issue related to simulating observations from the assumed measurement error model as a way to consider likelihoods other than the Gaussian, and (3) accounting for uncertainty in Σ. For issue (1), we will make our measurement error model assumptions explicit throughout in order to eliminate needless ambiguities. Issues (2) and (3) are investigated in [<xref ref-type="bibr" rid="b6-algorithms-04-00028">6</xref>].</p>
<p>To focus on GLS behavior when PPP occurs, <italic>this paper assumes</italic> Σ <italic>is known exactly without error</italic>. However, for historical and presentation purposes, <xref rid="FD1" ref-type="disp-formula">Equations 1</xref> and <xref rid="FD3" ref-type="disp-formula">3</xref> are presented, and both clearly involve approximations. In contrast, Theorem 3 illustrates a measurement error model for which the exact covariance can satisfy the PPP condition.</p></sec>
<sec>
<label>3.</label>
<title>GLS</title>
<p>It is well known that the GLS method can be applied to <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> to obtain the best linear unbiased estimate (BLUE) <italic>μ̂</italic> of <italic>μ</italic> [<xref ref-type="bibr" rid="b7-algorithms-04-00028">7</xref>]. Here, “best” means minimum variance and unbiased means that on average (across hypothetical or real realizations of the same experiment), the estimate <italic>μ̂</italic> will equal its true value <italic>μ</italic>. The GLS estimate for <italic>μ</italic> arising from the model
<disp-formula id="FD4">
<label>(4)</label>
<mml:math id="mm16" display="block">
<mml:semantics id="sm16">
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>μ</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>μ</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>e</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>with Σ = Cov(<italic>e</italic><sub>1</sub>, <italic>e</italic><sub>2</sub>) = Cov(<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) given by
<disp-formula id="FD5">
<label>(5)</label>
<mml:math id="mm17" display="block">
<mml:semantics id="sm17">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>μ</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mi>c</mml:mi>
<mml:msup>
<mml:mi>G</mml:mi>
<mml:mi>t</mml:mi></mml:msup>
<mml:msup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>where the scalar <italic>c</italic> = (<italic>G<sup>t</sup></italic>Σ<sup>−1</sup><italic>G</italic>)<sup>−1</sup>. And the variance <italic>σ</italic><sup>2</sup> of the GLS estimate <italic>μ̂</italic> is given by
<disp-formula id="FD6">
<label>(6)</label>
<mml:math id="mm18" display="block">
<mml:semantics id="sm18">
<mml:mrow>
<mml:msup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msup>
<mml:mi>G</mml:mi>
<mml:mi>t</mml:mi></mml:msup>
<mml:msup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup>
<mml:mi>G</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:semantics></mml:math></disp-formula>where the matrix <italic>G</italic> is given by <italic>G<sup>t</sup></italic> = (1, 1). Putting the covariance of <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> into <xref rid="FD5" ref-type="disp-formula">Equations 5</xref> and <xref rid="FD6" ref-type="disp-formula">6</xref> gives <italic>μ̂</italic> = <italic>c</italic><sub>1</sub><italic>y</italic><sub>1</sub> + <italic>c</italic><sub>2</sub><italic>y</italic><sub>2</sub> = 0.88 and <italic>σ</italic> = 0.22 where <italic>c</italic><sub>1</sub> = −0.24 and <italic>c</italic><sub>2</sub> = 1.24. Notice that <italic>c</italic><sub>1</sub> + <italic>c</italic><sub>2</sub> = 1 (so that <italic>μ̂</italic> is unbiased) but also that <italic>c</italic><sub>1</sub> &lt; 0 and that <italic>μ̂</italic> is smaller than each of the two measured values, <italic>y</italic><sub>1</sub> = 1.5 and <italic>y</italic><sub>2</sub> = 1.0.</p>
<p>GLS is usually introduced in the context of estimating <italic>β</italic> and future <italic>y</italic> values in a linear regression relating the response <italic>y</italic> to predictors <italic>X</italic> via <italic>y</italic> = <italic>Xβ</italic> + <italic>∊</italic> ([<xref ref-type="bibr" rid="b7-algorithms-04-00028">7</xref>]). Therefore, in <xref rid="FD4" ref-type="disp-formula">Equation 4</xref>, the mean <italic>μ</italic> plays the role of the unknown <italic>β</italic>. The GLS solution in <xref rid="FD5" ref-type="disp-formula">Equations 5</xref> and <xref rid="FD6" ref-type="disp-formula">6</xref> then follows from standard calculus or projection matrix results. For example, one can write <italic>μ̂</italic> = <italic>a</italic><sub>1</sub><italic>y</italic><sub>1</sub> + (1 − <italic>a</italic><sub>1</sub>)<italic>y</italic><sub>2</sub>, note that 
<inline-formula>
<mml:math id="mm19" display="inline">
<mml:semantics id="sm19">
<mml:mrow>
<mml:mtext>var</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mover accent="true">
<mml:mi>μ</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:mn>2</mml:mn>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> and solve for <italic>a</italic><sub>1</sub> to minimize var(<italic>μ̂</italic>) by setting the derivative of var(<italic>μ̂</italic>) with respect to <italic>a</italic><sub>1</sub> to 0.</p>
<p>The GLS solution of <xref rid="FD5" ref-type="disp-formula">Equations 5</xref> and <xref rid="FD6" ref-type="disp-formula">6</xref> with covariance Σ of <xref rid="FD3" ref-type="disp-formula">Equation 3</xref> is 0.89 ± 0.22. Because the last term in <xref rid="FD2" ref-type="disp-formula">Equation 2</xref> is smaller than the others, the Σ in <xref rid="FD3" ref-type="disp-formula">Equation 3</xref> that is slightly modified compared to the Σ in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> still leads to PPP. But in Section 4 we describe other modifications to Σ that do not lead to PPP.</p>
<p>GLS estimation has a long and successful history, but met with serious objection within the nuclear physics community in the context of combining estimates from multiple experiments upon observing a tendency to produce estimates that are outside the range of the data. More specifically, to date the tendency has been to produce estimates that are less than the minimum data value, so have been criticized as being “too small” [<xref ref-type="bibr" rid="b2-algorithms-04-00028">2</xref>].</p>
<p>GLS estimation is guaranteed to produce the BLUE even if the underlying data are not Gaussian. However, if the data is not Gaussian, then the minimum variance unbiased estimator (MVUE) is not necessarily linear in the data. Also, though unbiased estimation might sound politically correct, it is not necessarily superior to biased estimation [<xref ref-type="bibr" rid="b8-algorithms-04-00028">8</xref>]. Therefore, PPP has motivated the nuclear physics community to consider estimators other than GLS.</p>
<p>If the data has a Gaussian distribution, then it is well known that the GLS estimate is the same as the maximum likelihood (ML) estimate. This is because the log of the Gaussian likelihood involves a sum of squares, so choosing an estimate (the GLS estimate) that minimizes a sum of squares corresponds to choosing an estimate (the ML estimate) that maximizes the likelihood. Ordinary LS (OLS), weighted LS (WLS), and GLS are all essentially the same technique, but OLS is used if Σ is proportional to a unit-diagonal matrix, WLS is used if Σ is proportional to a diagonal matrix, and GLS is used if Σ is an arbitrary positive definite covariance matrix. The Gauss-Markov theorem [<xref ref-type="bibr" rid="b7-algorithms-04-00028">7</xref>] proves that the OLS estimator is the BLUE, and very similar theorems prove the same result for WLS or GLS.</p>
<p>The ML estimate depends on the assumed distributions for the errors. For example, if we replace the Gaussian (Normal) distributions with logNormal distributions, the ML estimate will change. In the cases considered here, ML gives the same estimate as GLS, because the data distribution is Gaussian. Because the ML approach makes strong use of the assumed error distributions, the ML estimate is sensitive to the assumed error distribution. The ML method has desirable properties, including asymptotically minimum variance as the sample size increases. However, in our example, the sample size is tiny (two), so asymptotic results for ML estimates are not relevant. It still is possible that an ML estimator will be better for nonGaussian data than GLS [<xref ref-type="bibr" rid="b6-algorithms-04-00028">6</xref>]. Typically, “better” is defined as the mean squared error (MSE) of the estimator, which is well known to satisfy MSE = variance + bias<sup>2</sup>. In some cases, biased estimators have lower MSE than unbiased estimators because the bias introduced is more than offset by a reduction in variance [<xref ref-type="bibr" rid="b8-algorithms-04-00028">8</xref>].</p></sec>
<sec>
<label>4.</label>
<title>Closer Look into PPP</title>
<sec>
<title/>
<p>The original PPP does not clearly state whether the common error is additive or multiplicative. This ambiguity was examined in [<xref ref-type="bibr" rid="b5-algorithms-04-00028">5</xref>]. In the “additive” scenario case, <italic>y</italic><sub>1</sub> = <italic>m</italic><sub>1</sub> + <italic>b</italic> and <italic>y</italic><sub>2</sub> = <italic>m</italic><sub>2</sub> + <italic>b</italic>, and the source of correlation may be a common background measurement <italic>b</italic>. And, the situation is somewhat different from PPP, because if <italic>σ<sub>b</sub></italic> is 20% of <italic>y</italic><sub>1</sub> = 1.5, then it is 30% of <italic>y</italic><sub>2</sub> = 1.0. The covariance matrix for this case is
<disp-formula id="FD7">
<label>(7)</label>
<mml:math id="mm20" display="block">
<mml:semantics id="sm20">
<mml:mrow>
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mo>∑</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd/></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0.1125</mml:mn></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mn>0.09</mml:mn></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0.09</mml:mn></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mn>0.1</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>and the GLS solution is 1.15 ± 0.31. With the covariance matrix of <xref rid="FD7" ref-type="disp-formula">Equation 7</xref>, PPP does not occur, and the GLS solution of 1.15 is a weighted average of 1.0 ± 0.1 and 1.5 ± 0.15. Although as <italic>σ<sub>b</sub></italic> changes, the standard deviation <italic>σ<sub>μ̂</sub></italic> is scaled accordingly, the GLS solution does not change. It is reasonable that the GLS solution does not change as <italic>σ<sub>b</sub></italic> changes, because whatever the background fluctuation is, <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> are impacted in the same manner by <italic>b</italic>.</p>
<p>If instead <italic>σ<sub>b</sub></italic> is 20% of <italic>μ<sub>m</sub></italic>, then its value is unknown because <italic>μ<sub>m</sub></italic> is unknown and must be estimated. However, regardless of the value of <italic>σ<sub>b</sub></italic> &gt; 0, PPP cannot occur in the additive case as parameterized in <xref rid="FD7" ref-type="disp-formula">Equation 7</xref> (Theorem 1 below).</p>
<p>In Sivia's [<xref ref-type="bibr" rid="b9-algorithms-04-00028">9</xref>] notation, <xref rid="FD5" ref-type="disp-formula">Equation 5</xref> can be written as
<disp-formula id="FD8">
<label>(8)</label>
<mml:math id="mm21" display="block">
<mml:semantics id="sm21">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>μ</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></disp-formula>with
<disp-formula id="FD9">
<label>(9)</label>
<mml:math id="mm22" display="block">
<mml:semantics id="sm22">
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>−</mml:mo>
<mml:mi>ρ</mml:mi>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>−</mml:mo>
<mml:mi>ρ</mml:mi>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:semantics></mml:math></disp-formula>where <italic>ρ</italic> is the correlation coefficient. Sivia demonstrated that PPP does not occur if the condition
<disp-formula id="FD10">
<label>(10)</label>
<mml:math id="mm23" display="block">
<mml:semantics id="sm23">
<mml:mrow>
<mml:mi>ρ</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mo>min</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>is fulfilled (and does occur if the condition does not hold). It is not difficult to show that the additive case of <xref rid="FD7" ref-type="disp-formula">Equation 7</xref> satisfies this condition, which we state as Theorem 1.</p>
<sec>
<title>Theorem 1</title>
<p>If Σ is given by
<disp-formula id="FD11">
<label>(11)</label>
<mml:math id="mm24" display="block">
<mml:semantics id="sm24">
<mml:mrow>
<mml:mo>∑</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd/></mml:mtr>
<mml:mtr>
<mml:mtd/>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>then PPP does not occur.</p>
<sec>
<title>Proof</title>
<p>The covariance matrix Σ can be expressed as
<disp-formula id="FD12">
<label>(12)</label>
<mml:math id="mm25" display="block">
<mml:semantics id="sm25">
<mml:mrow>
<mml:mo>∑</mml:mo>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mi>ρ</mml:mi>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mi>ρ</mml:mi>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></disp-formula>with 
<inline-formula>
<mml:math id="mm26" display="inline">
<mml:semantics id="sm26">
<mml:mrow>
<mml:mi>ρ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula>. If 
<inline-formula>
<mml:math id="mm27" display="inline">
<mml:semantics id="sm27">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> then 
<inline-formula>
<mml:math id="mm28" display="inline">
<mml:semantics id="sm28">
<mml:mrow>
<mml:mi>ρ</mml:mi>
<mml:mo>≤</mml:mo>
<mml:mo>min</mml:mo>
<mml:mspace width="0.2em"/>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></inline-formula> is equivalent to 
<inline-formula>
<mml:math id="mm29" display="inline">
<mml:semantics id="sm29">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>4</mml:mn></mml:msubsup>
<mml:mo>≤</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow>
<mml:mo>×</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>b</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:semantics></mml:math></inline-formula> which obviously holds. The proof is identical if 
<inline-formula>
<mml:math id="mm30" display="inline">
<mml:semantics id="sm30">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>. And if 
<inline-formula>
<mml:math id="mm31" display="inline">
<mml:semantics id="sm31">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>, then PPP can never occur.</p>
<p>Notice that <xref rid="FD1" ref-type="disp-formula">Equations 1</xref> and <xref rid="FD3" ref-type="disp-formula">3</xref> have the same form as assumed in Theorem 1. Therefore, PPP also cannot occur for the multiplicative error model assumed. However, as discussed, PPP can occur with the multiplicative error model case if Σ is estimated as in <xref rid="FD1" ref-type="disp-formula">Equations 1</xref> and <xref rid="FD3" ref-type="disp-formula">3</xref>.</p>
<p>Several authors explored different values for the covariance matrix Σ to understand the relationship between the covariance matrix and the estimates. Some numerical examples are in [<xref ref-type="bibr" rid="b2-algorithms-04-00028">2</xref>]. Jones <italic>et al.</italic> [<xref ref-type="bibr" rid="b10-algorithms-04-00028">10</xref>] and Finn <italic>et al.</italic> [<xref ref-type="bibr" rid="b11-algorithms-04-00028">11</xref>] also reported linear regression of strongly correlated data and emphasized that if 
<inline-formula>
<mml:math id="mm32" display="inline">
<mml:semantics id="sm32">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>≠</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> then contrary to common belief, positive or negative correlation can be exploited to produce an unbiased GLS estimate that has lower variance than in the zero correlation case. The historical definition of PPP requires the correlation to be positive because only in that case will the GLS estimate lie outside the range of the data for certain Σ.</p>
<p>Next rewrite <xref rid="FD8" ref-type="disp-formula">Equation 8</xref> as 
<inline-formula>
<mml:math id="mm33" display="inline">
<mml:semantics id="sm33">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>μ</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:semantics></mml:math></inline-formula> with 
<inline-formula>
<mml:math id="mm34" display="inline">
<mml:semantics id="sm34">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm35" display="inline">
<mml:semantics id="sm35">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula>. A condition equivalent to <xref rid="FD10" ref-type="disp-formula">Equation 10</xref> is obtained as follows. By setting the first derivative with respect to 
<inline-formula>
<mml:math id="mm36" display="inline">
<mml:semantics id="sm36">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> to zero, one can solve for the value of 
<inline-formula>
<mml:math id="mm37" display="inline">
<mml:semantics id="sm37">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> such that the variance of 
<inline-formula>
<mml:math id="mm38" display="inline">
<mml:semantics id="sm38">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:semantics></mml:math></inline-formula> is minimum subject to 
<inline-formula>
<mml:math id="mm39" display="inline">
<mml:semantics id="sm39">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>. The result is 
<inline-formula>
<mml:math id="mm40" display="inline">
<mml:semantics id="sm40">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:mn>2</mml:mn>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula>. Therefore, a condition equivalent to <xref rid="FD10" ref-type="disp-formula">Equation 10</xref> is: PPP occurs if 
<inline-formula>
<mml:math id="mm41" display="inline">
<mml:semantics id="sm41">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> or 
<inline-formula>
<mml:math id="mm42" display="inline">
<mml:semantics id="sm42">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>. Another interesting fact is that if PPP occurs (so <xref rid="FD10" ref-type="disp-formula">Equation 10</xref> does not hold), then either 
<inline-formula>
<mml:math id="mm43" display="inline">
<mml:semantics id="sm43">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm44" display="inline">
<mml:semantics id="sm44">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> or 
<inline-formula>
<mml:math id="mm45" display="inline">
<mml:semantics id="sm45">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm46" display="inline">
<mml:semantics id="sm46">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, with 
<inline-formula>
<mml:math id="mm47" display="inline">
<mml:semantics id="sm47">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> so that <italic>μ̂</italic> is unbiased. We can now state and prove Theorem 2.</p></sec></sec>
<sec>
<title>Theorem 2</title>
<p>Suppose 
<inline-formula>
<mml:math id="mm48" display="inline">
<mml:semantics id="sm48">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm49" display="inline">
<mml:semantics id="sm49">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> have opposite signs. Then either <italic>μ̂</italic> &lt; min(<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) or <italic>μ̂</italic> &gt; max(<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>). That is, <italic>μ̂</italic> will <italic>always</italic> fall outside the range of the (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>).</p>
<sec>
<title>Proof</title>
<p>First assume 
<inline-formula>
<mml:math id="mm50" display="inline">
<mml:semantics id="sm50">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm51" display="inline">
<mml:semantics id="sm51">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>. If <italic>y</italic><sub>1</sub> &lt; <italic>y</italic><sub>2</sub> then 
<inline-formula>
<mml:math id="mm52" display="inline">
<mml:semantics id="sm52">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>μ</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>&lt;</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:semantics></mml:math></inline-formula> because 
<inline-formula>
<mml:math id="mm53" display="inline">
<mml:semantics id="sm53">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>. If <italic>y</italic><sub>1</sub> &gt; <italic>y</italic><sub>2</sub> then 
<inline-formula>
<mml:math id="mm54" display="inline">
<mml:semantics id="sm54">
<mml:mrow>
<mml:mover accent="true">
<mml:mi>μ</mml:mi>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:msub>
<mml:mo>&gt;</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:semantics></mml:math></inline-formula> because 
<inline-formula>
<mml:math id="mm55" display="inline">
<mml:semantics id="sm55">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>. The proof is completed by next assuming 
<inline-formula>
<mml:math id="mm56" display="inline">
<mml:semantics id="sm56">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>1</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> and 
<inline-formula>
<mml:math id="mm57" display="inline">
<mml:semantics id="sm57">
<mml:mrow>
<mml:msubsup>
<mml:mi>a</mml:mi>
<mml:mn>2</mml:mn>
<mml:mo>′</mml:mo></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, and following similar steps.</p></sec></sec></sec>
<sec>
<label>4.1.</label>
<title>Additional Support for GLS by Numerical Example</title>
<p>The fact that the GLS estimate is the BLUE estimate (and also the MVUE estimate if the data is Gaussian) and that it lies below the range of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) suggests two features. First, <italic>μ</italic> must be likely to fall outside the range of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>). Second, there must better than random chance capability to guess on which side of the range of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) that <italic>μ</italic> lies.</p>
<p>In addition to GLS's BLUE property, we can add support for GLS by numerical example to illustrate features one and two. <xref ref-type="fig" rid="f1-algorithms-04-00028">Figure 1</xref> plots the contours of the bivariate normal density having Σ given by <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> and <italic>μ</italic> = 0.88 which as shown previously is the OLS estimate of the mean in the case <italic>y</italic><sub>1</sub> = 1.5 and <italic>y</italic><sub>2</sub> = 1.0. Informally, we can integrate this density over regions 1 and 3 to see that there is a large probability that both <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> lie either above the mean or below the mean, so indeed <italic>μ</italic> is likely to fall outside the range of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>). Integration of the bivariate normal for Σ given by <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> indicates that with probability approximately 0.40, <italic>μ</italic> lies below the minimum of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) and with the same probability <italic>μ</italic> lies above the maximum of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>). This is a total of approximately 0.80 probability that <italic>μ</italic> falls outside the range of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>), which is an example of feature one. Having an estimate lie outside the range of the data is therefore defensible, provided (feature two) that one can guess with better than random chance performance whether <italic>μ</italic> lies below the minimum or whether <italic>μ</italic> lies above the maximum of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>). To see that one can beat random guessing performance, suppose <italic>y</italic><sub>1</sub> &gt; <italic>y</italic><sub>2</sub> as in our case (<italic>y</italic><sub>1</sub> = 1.5 and <italic>y</italic><sub>2</sub> = 1.0). Then, because 
<inline-formula>
<mml:math id="mm58" display="inline">
<mml:semantics id="sm58">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.1125</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> is larger than 
<inline-formula>
<mml:math id="mm59" display="inline">
<mml:semantics id="sm59">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.05</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref>, <italic>μ</italic> is more likely to fall below <italic>y</italic><sub>2</sub> because if instead <italic>μ</italic> &gt; <italic>y</italic><sub>1</sub> then the distance from <italic>y</italic><sub>1</sub> to <italic>μ</italic> would be smaller than the distance from <italic>y</italic><sub>2</sub> to <italic>μ</italic>, contradicting the fact that 
<inline-formula>
<mml:math id="mm60" display="inline">
<mml:semantics id="sm60">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>&gt;</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>y</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>. To confirm this line of reasoning, in 10,000 simulations (in the statistical computing language R) of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) pairs having <italic>μ</italic> = 0.88, 57% of the simulation runs for which <italic>y</italic><sub>1</sub> &gt; <italic>y</italic><sub>2</sub> did in fact also have <italic>μ</italic> &lt; <italic>y</italic><sub>2</sub>. On the basis of 10,000 simulations, 57% is repeatable to within ± 1% or less, so this is better than random chance (50%) guessing. This is not a formal proof but does suggest a direction to understand when a GLS estimate falling outside the range of the data is effective. Note that <italic>y</italic><sub>1</sub> = 1.5 and <italic>y</italic><sub>2</sub> = 1.0 in the PPP statement, and the GLS estimate is <italic>μ̂</italic> = 0.88 &lt; <italic>y</italic><sub>2</sub>.</p></sec></sec>
<sec>
<label>5.</label>
<title>Example Where PPP Occurs without Approximation</title>
<sec>
<title/>
<p>Thus far we have not demonstrated any error model which exactly (without approximation) produces a Σ that leads to PPP. A situation that leads to PPP without the approximations in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> is expressed in Theorem 3.</p>
<sec>
<title>Theorem 3</title>
<p>Suppose <italic>m</italic><sub>1</sub> = <italic>μ</italic> + <italic>∊<sub>R</sub></italic><sub>1</sub> and <italic>m</italic><sub>2</sub> = <italic>μ</italic> + <italic>∊<sub>R</sub></italic><sub>2</sub> where <italic>∊<sub>R</sub></italic><sub>1</sub> is random error in <italic>m</italic><sub>1</sub> with variance 
<inline-formula>
<mml:math id="mm61" display="inline">
<mml:semantics id="sm61">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula> and similarly for <italic>∊<sub>R</sub></italic><sub>2</sub>. Then if <italic>y</italic><sub>1</sub> = <italic>m</italic><sub>1</sub> + <italic>∊<sub>S</sub></italic> and <italic>y</italic><sub>2</sub> = <italic>m</italic><sub>2</sub> + <italic>α∊<sub>S</sub></italic>, where <italic>∊<sub>S</sub></italic> ∼ <italic>N</italic>(0, <italic>σ<sub>S</sub></italic>) and <italic>α</italic> is any positive scale factor other than 1, the covariance matrix Σ of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) can lie in the PPP region.</p>
<sec>
<title>Proof</title>
<p>The proof is by demonstration. Specify any values <italic>σ</italic><sub>1</sub>, <italic>σ</italic><sub>2</sub>, and <italic>σ</italic><sub>12</sub> that satisfy the PPP condition 
<inline-formula>
<mml:math id="mm62" display="inline">
<mml:semantics id="sm62">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>ρ</mml:mi>
<mml:mo>&gt;</mml:mo>
<mml:mo>min</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:mfrac>
<mml:mo>,</mml:mo></mml:mrow></mml:mtd>
<mml:mtd>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:semantics></mml:math></inline-formula>. Choose any <italic>α</italic> &gt; 0, then 
<inline-formula>
<mml:math id="mm63" display="inline">
<mml:semantics id="sm63">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>/</mml:mo>
<mml:mi>α</mml:mi></mml:mrow></mml:semantics></mml:math></inline-formula>, 
<inline-formula>
<mml:math id="mm64" display="inline">
<mml:semantics id="sm64">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>, and 
<inline-formula>
<mml:math id="mm65" display="inline">
<mml:semantics id="sm65">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:msup>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:msup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>. As an example, let 
<inline-formula>
<mml:math id="mm66" display="inline">
<mml:semantics id="sm66">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.1134</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, 
<inline-formula>
<mml:math id="mm67" display="inline">
<mml:semantics id="sm67">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.0504</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, and 
<inline-formula>
<mml:math id="mm68" display="inline">
<mml:semantics id="sm68">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.06</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula> as in <xref rid="FD3" ref-type="disp-formula">Equation 3</xref>. Then if <italic>α</italic> = 0.7, we have 
<inline-formula>
<mml:math id="mm69" display="inline">
<mml:semantics id="sm69">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mn>12</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>/</mml:mo>
<mml:mi>α</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.06</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>0.7</mml:mn>
<mml:mo>=</mml:mo>
<mml:mn>0.0857</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, 
<inline-formula>
<mml:math id="mm70" display="inline">
<mml:semantics id="sm70">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.1134</mml:mn>
<mml:mo>−</mml:mo>
<mml:mn>0.0857</mml:mn>
<mml:mo>=</mml:mo>
<mml:mn>0.0277</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>, and 
<inline-formula>
<mml:math id="mm71" display="inline">
<mml:semantics id="sm71">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mn>2</mml:mn>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>−</mml:mo>
<mml:msup>
<mml:mi>α</mml:mi>
<mml:mn>2</mml:mn></mml:msup>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mi>S</mml:mi>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0.06</mml:mn>
<mml:mo>−</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>0.7</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo>×</mml:mo>
<mml:mn>0.06</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>0.7</mml:mn>
<mml:mo>=</mml:mo>
<mml:mn>0.018</mml:mn></mml:mrow></mml:semantics></mml:math></inline-formula>.</p>
<p>Note:
<list list-type="bullet">
<list-item>
<p>If <italic>α</italic> = 1, then Σ has the form of <xref rid="FD11" ref-type="disp-formula">Equation 11</xref>, so PPP cannot occur ([<xref ref-type="bibr" rid="b2-algorithms-04-00028">2</xref>]).</p></list-item>
<list-item>
<p>If <italic>α</italic> &lt; 0 then PPP cannot occur. However, our applications have <italic>α</italic> &gt; 0. Jones <italic>et al.</italic> [<xref ref-type="bibr" rid="b10-algorithms-04-00028">10</xref>] showed that if <italic>ρ</italic> → ±1 and <italic>σ</italic><sub>1</sub> ≠ <italic>σ</italic><sub>2</sub>, then <italic>μ</italic> can be estimated with surprisingly small variance. That fact plus the known BLUE property of GLS could convince us to just “live with” the PPP because it can make sense for the GLS estimate <italic>μ̂</italic> to lie outside the range of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) in the case <italic>α</italic> &gt; 0, or <italic>σ</italic><sub>12</sub> &gt; 0.</p></list-item></list></p>
<p>One example in which the assumptions of Theorem 3 hold involves subtracting a background measurement from a region of interest (ROI) measurement to get a net result. Because the background measurement often involves a different number of channels than the ROI measurement, a scale factor <italic>k</italic> is introduced to estimate the net counts as net = peak − <italic>k</italic> × background. <xref ref-type="fig" rid="f2-algorithms-04-00028">Figure 2</xref> illustrates a hypothetical example where each of three peak ROIs have a corresponding background measurement in the plot of the square root of detected neutron counts <italic>versus</italic> neutron energy in arbitrary units (au).</p>
<p>Suppose each ROI and corresponding background are analyzed separately, and consider the first ROI in <xref ref-type="fig" rid="f2-algorithms-04-00028">Figure 2</xref>. The count times could vary between the two experiments, so 
<inline-formula>
<mml:math id="mm72" display="inline">
<mml:semantics id="sm72">
<mml:mrow>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>≠</mml:mo>
<mml:msubsup>
<mml:mi>σ</mml:mi>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:semantics></mml:math></inline-formula>. Both experiment 1 and experiment 2 measure the ROI counts, but in many situations, the background measurement is made only by experiment 1. In that case, experiments 1 and 2 must use the background measurement made by experiment 1. Also, if the ROI is found by analyzing the shape of the curve (the “spectrum”) that describes detected count rates <italic>versus</italic> particle energy, then the ROI for experiments 1 and 2 could differ. We then have <italic>N</italic><sub>1</sub> = <italic>G</italic><sub>1</sub> − <italic>a</italic><sub>1</sub> × <italic>B</italic> and <italic>N</italic><sub>2</sub> = <italic>G</italic><sub>2</sub> − <italic>a</italic><sub>2</sub> × <italic>B</italic>, where <italic>N</italic> is net counts, <italic>G</italic> is gross counts, <italic>B</italic> is background, <italic>a</italic><sub>1</sub> is the scale factor for experiment 1, and <italic>a</italic><sub>2</sub> is the scale factor for experiment 2. The scale factor <italic>a</italic><sub>1</sub> for experiment 1 is the ratio of the number of ROI channels to the number of background channels, and similarly for the scale factor <italic>a</italic><sub>2</sub> for experiment 2. The channel counts have variation from repeat to repeat so the detected counts will vary around the true counts with some error. As an aside, often the channel counts have approximately a Poisson distribution which for large count rates is well approximated by a Gaussian distribution. Regardless of which probability distribution best describes the channel counts, there are measurement errors in <italic>N</italic><sub>1</sub>, <italic>N</italic><sub>2</sub>, and one can divide <italic>N</italic><sub>1</sub> = <italic>G</italic><sub>1</sub> − <italic>a</italic><sub>1</sub> × <italic>B</italic> by <italic>a</italic><sub>1</sub> to convert this pair of equations to those assumed in Theorem 3.</p></sec></sec></sec></sec>
<sec sec-type="conclusions">
<label>6.</label>
<title>Conclusions</title>
<p>Because Peelle's original statement is vague, there have been several interpretations and solutions. In our experience, there is considerable variation among experimentalists in the expression of measurement uncertainty, and a wide the range of analyses can result from vague uncertainty statements.</p>
<p>The three main contributions of this paper are: (1) illustrating examples when PPP cannot occur (Theorem 1); (2) providing insight when PPP is effective and appropriate (related to Theorem 2), and (3) deriving a realistic covariance matrix Σ for which PPP occurs according to physical descriptions of realistic measurement scenarios (Theorem 3). We also showed via numerical integration that an estimate lying outside the range of the data is sensible. This is because the unequal variances of <italic>x</italic><sub>1</sub> and <italic>x</italic><sub>2</sub> provide information regarding whether <italic>μ</italic> is more likely to be less than the minimum or greater than the maximum of <italic>x</italic><sub>1</sub> and <italic>x</italic><sub>2</sub>.</p>
<p>Of course GLS provides a good estimate <italic>μ̂</italic> in general because of its well-known BLUE property (and MVUE if the data is Gaussian) and in particular for the PPP problem if the covariance Σ is well known.</p>
<p>There will almost always be estimation error in Σ̂, and often the measurement errors are nonGaussian. Therefore, we consider the following two topics in [<xref ref-type="bibr" rid="b6-algorithms-04-00028">6</xref>]: (1) alternatives to GLS when there is estimation error in Σ̂, and to provide estimators other than GLS that use the estimated likelihood in the case of non-Gaussian error models that make ML estimation difficult. Regarding (1), it is known that weighted estimates do not always outperform equally-weighted estimates when there is estimation error in the weights. We have already noted that estimation of Σ in <xref rid="FD1" ref-type="disp-formula">Equation 1</xref> introduces apparent PPP when PPP does not actually occur.</p>
<p>Finally, [<xref ref-type="bibr" rid="b10-algorithms-04-00028">10</xref>] showed that when the diagonal entries in Σ are different and the correlation is large and positive, the GLS estimate can have lower variance than if the correlation is zero. This fact does not seem to be widely known, and experimental opportunities to exploit this fact are under investigation.</p></sec></body>
<back>
<sec sec-type="display-objects">
<title>Figures</title>
<fig id="f1-algorithms-04-00028" position="float">
<label>Figure 1.</label>
<caption>
<p>Contours of example bivarate normal density of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> illustrating that <italic>μ</italic> = 0.88 is likely to fall outside the range of <italic>y</italic><sub>1</sub> and <italic>y</italic><sub>2</sub> because of the large probability of (<italic>y</italic><sub>1</sub>, <italic>y</italic><sub>2</sub>) falling in region 1 or 3.</p></caption>
<graphic xlink:href="algorithms-04-00028f1.gif"/></fig>
<fig id="f2-algorithms-04-00028" position="float">
<label>Figure 2.</label>
<caption>
<p>Example region of interest and corresponding background that can lead to the PPP condition.</p></caption>
<graphic xlink:href="algorithms-04-00028f2.gif"/></fig></sec>
<ack>
<p>We acknowledge the next generation safeguards initiative within the U.S. Department of Energy.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-algorithms-04-00028"><label>1.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Peelle</surname><given-names>R.</given-names></name></person-group><source>Peelle's Pertinent Puzzle</source><publisher-name>Oak Ridge National Laboratory Memorandum</publisher-name><publisher-loc>Washington DC, USA</publisher-loc><year>1987</year></citation></ref>
<ref id="b2-algorithms-04-00028"><label>2.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Nichols</surname><given-names>A.L.</given-names></name></person-group><source>International Evaluation of Neutron Cross Section Standards</source><publisher-name>International Atomic Energy Agency</publisher-name><publisher-loc>Vienna, Austria</publisher-loc><year>2007</year></citation></ref>
<ref id="b3-algorithms-04-00028"><label>3.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Chiba</surname><given-names>S.</given-names></name><name><surname>Smith</surname><given-names>D.</given-names></name></person-group><source>A Suggested Procedure for Resolving an Anomaly in Least-Squares Data Analysis Known as Peelle's Pertinent Puzzle and the General Implications for Nuclear Data Evaluation</source><comment>Report ANL/NDM-121</comment><publisher-name>Argonne National Laboratory</publisher-name><publisher-loc>Argonne, IL, USA</publisher-loc><year>1991</year></citation></ref>
<ref id="b4-algorithms-04-00028"><label>4.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Zhao</surname><given-names>Z.</given-names></name><name><surname>Perey</surname><given-names>R.</given-names></name></person-group><source>The Covariance Matrix of Derived Quantities and Their Combination</source><comment>Report ORNL/TM-12106</comment><publisher-name>Oak Ridge National Laboratory</publisher-name><publisher-loc>Oak Ridge, TN, USA</publisher-loc><year>1992</year></citation></ref>
<ref id="b5-algorithms-04-00028"><label>5.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Hanson</surname><given-names>K.</given-names></name><name><surname>Kawano</surname><given-names>T.</given-names></name><name><surname>Talou</surname><given-names>P.</given-names></name></person-group><article-title>Probabilistic Interpretation of Peelle's Pertinent Puzzle</article-title><conf-name>Proceeding of International Conference of Nuclear Data for Science and Technology</conf-name><conf-date>26 September–1 October 2004</conf-date><person-group person-group-type="editor"><name><surname>Haight</surname><given-names>R.C.</given-names></name><name><surname>Chadwick</surname><given-names>M.B.</given-names></name><name><surname>Kawano</surname><given-names>T.</given-names></name><name><surname>Talou</surname><given-names>P.</given-names></name></person-group><publisher-name>AIP Conference Proceedings</publisher-name><publisher-loc>Melville, NY, USA</publisher-loc><year>2005</year></citation></ref>
<ref id="b6-algorithms-04-00028"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burr</surname><given-names>T.</given-names></name><name><surname>Kawano</surname><given-names>T.</given-names></name><name><surname>Talou</surname><given-names>P.</given-names></name><name><surname>Hengartner</surname><given-names>N.</given-names></name><name><surname>Pan</surname><given-names>P.</given-names></name></person-group><article-title>Alternatives to the Least Squares Solution to Peelle's Pertinent Puzzle</article-title><comment>submitted for publication</comment><year>2010</year></citation></ref>
<ref id="b7-algorithms-04-00028"><label>7.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Christensen</surname><given-names>R.</given-names></name></person-group><source>Plane Answers to Complex Questions, The Theory of Linear Models</source><publisher-name>Springer</publisher-name><publisher-loc>New York</publisher-loc><year>1999</year><fpage>23</fpage><lpage>25</lpage></citation></ref>
<ref id="b8-algorithms-04-00028"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burr</surname><given-names>T.</given-names></name><name><surname>Frey</surname><given-names>H.</given-names></name></person-group><article-title>Biased Regression: The Case for Cautious Application</article-title><source>Techometrics</source><year>2005</year><volume>47</volume><fpage>284</fpage><lpage>296</lpage><pub-id pub-id-type="doi">10.1198/004017005000000012</pub-id></citation></ref>
<ref id="b9-algorithms-04-00028"><label>9.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Sivia</surname><given-names>D.</given-names></name></person-group><article-title>Data Analysis—A Dialogue With The Data</article-title><source>Advanced Mathematical and Computational Tools in Metrology VII</source><person-group person-group-type="editor"><name><surname>Ciarlini</surname><given-names>P.</given-names></name><name><surname>Filipe</surname><given-names>E.</given-names></name><name><surname>Forbes</surname><given-names>A.B.</given-names></name><name><surname>Pavese</surname><given-names>F.</given-names></name><name><surname>Perruchet</surname><given-names>C.</given-names></name><name><surname>Siebert</surname><given-names>B.R.L.</given-names></name></person-group><publisher-name>World Scientific Publishing Company</publisher-name><publisher-loc>Lisbon, Portugal</publisher-loc><year>2006</year><fpage>108</fpage><lpage>118</lpage></citation></ref>
<ref id="b10-algorithms-04-00028"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jones</surname><given-names>C.</given-names></name><name><surname>Finn</surname><given-names>J.</given-names></name><name><surname>Hengartner</surname><given-names>N.</given-names></name></person-group><article-title>Regression with Strongly Correlated Data</article-title><source>J. Multivariate Anal.</source><year>2008</year><volume>99</volume><fpage>2136</fpage><lpage>2153</lpage><pub-id pub-id-type="doi">10.1016/j.jmva.2008.02.008</pub-id></citation></ref>
<ref id="b11-algorithms-04-00028"><label>11.</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Finn</surname><given-names>J.</given-names></name><name><surname>Jones</surname><given-names>C.</given-names></name><name><surname>Hengartner</surname><given-names>N.</given-names></name></person-group><article-title>Strong Nonlinear Correlations, Conditional Entropy, and Perfect Estimation</article-title><source>Bayesian Inference and Maximum Entropy Methods in Science and Engineering</source><conf-name>Proceeding of 27th International Workshop Bayesian Inference and Maximum Entropy Methods in Science and Engineering</conf-name><conf-loc>Saratoga Springs, NY, USA</conf-loc><conf-date>2007</conf-date><publisher-name>AIP Conference Proceedings</publisher-name><publisher-loc>Melville, NY, USA</publisher-loc><year>2007</year><fpage>954</fpage></citation></ref></ref-list></back></article>
