Abstract
Over the past decade, a series of procedures has been introduced to estimate, using a non-iterative method, the linear-by-linear association parameter of an ordinal log-linear model. This paper will examine the two key non-iteratively determined estimates of the parameter for the analysis of the association between the two categorical variables that form a contingency table; these are the log and the Beh-Davy non-iterative estimates, referred to simply as the LogNI and the BDNI estimates, respectively. Such an examination will focus on determining their asymptotic characteristics. To do so, a computational study was undertaken for tables of varying sizes to show that these two estimates are asymptotically unbiased. It is also shown that both estimates are asymptotically normally distributed. On the basis of the standard errors, their relative efficiency was established for the 13 commonly analysed contingency tables that appear throughout the literature.
1. Introduction
Log-linear models (LLMs) are one of the most used tools in the analysis of the association between the categorical variables that form a contingency table. Extensive discussions on their development and application can be found in, for example, [1,2,3]. LLMs have been a focal point of research in the statistical and allied disciplines since 1950s and stem from the work of [4], who considered the association between three categorical variables. Since then, many notable contributions have been made to this area of research including those of [5,6,7,8,9,10,11,12,13]—although [14] points out that the term log-linear model was first coined by [15] (p. 119). Much of this work focused on the study of variables consisting of nominally structured categories. However, there are many practical settings where the variables consist of ordered categories. In the past, an analysis of ordered variables would have been undertaken by treating them as nominal variables (this still is the case in some studies even now). Therefore, where a variable consists of ordered categories, it is important to incorporate such a feature into the development of statistical tools used to analyse the association. Such models are commonly referred to as ordinal log-linear models, or OLLMs. A historical account on the development of OLLMs can be found by considering, for example, [3] (pp. 386–395) and [16] (pp. 145–154). Independent discussions of the origins of OLLMs have also been given by [17] and [18]; both authors focused their attention on a two-way contingency table. The development of OLLMs for a singly ordered (column variable) table was discussed by [18], while [17] considered a doubly ordered table. More recent discussions have been made of the type of OLLMs we consider in this paper, which span a variety of different contexts. For example, the books of [1,3] (Section 10.4), [16] (Section 6.2), [19,20] (Section 3.1), (Section 7.1), [21] (p. 248), [22] (Chapter 4), and [23] (Section 6.1.1) all provide various levels of introductory discussion of OLLMs. One may also refer to [24,25,26,27,28,29,30,31,32,33,34,35] for applications and further insights.
An important aspect of LLMs and OLLMs is the estimation of the parameters from these models. In the case of OLLMs for the doubly ordered contingency table (as considered by [17]), the primary parameter of interest is that which reflects the linear-by-linear association between two ordered variables. Traditionally, the estimation of this parameter has involved using a variety of iterative procedures. The two most used procedures are Newton’s algorithm and iterative proportional fitting; one may refer, for example, to [36,37,38,39,40] for more on these algorithms. One may also consider [16] and [19], who provide a detailed account of these methods and have provided a list of computer programs, including GLIM, SPSS, and SAS, for performing the necessary calculations. Rather than considering an iterative algorithm for estimating the association parameter in OLLMs, one may adopt a more direct approach that is built on a non-iterative framework. One such approach is to consider the method originally proposed by [41] and further studied by [40] through empirical studies. The former authors propose one method for estimating the linear-by-linear parameter through non-iterative (direct) means, while the latter authors considered three further, and related, methods. The accuracy and reliability of the non-iterative estimation for the OLLM of a doubly ordered contingency table was investigated by [42,43], who provided a simulation study of these four methods. They demonstrated that using their non-iterative techniques provides exceptionally stable and reliable estimates of the linear-by-linear association parameter when compared with Newton’s method. The links between these non-iterative methods, using the Box-Cox transformation, were demonstrated by [44]. It is this non-iterative method of estimation which we shall focus our attention on in this paper.
With the foundational mathematical links between the non-iterative estimates of the linear-by-linear association parameter established, and their reliability and stability verified through simulation, this paper identifies the asymptotic characteristics of the two key non-iterative estimates studied [40,41,42,43,44]. We shall be paying particular attention to the bias of the estimators—a topic that was briefly examined by [45]. We also examine here the asymptotic distribution and relative efficiency of each of the two non-iterative estimates. To undertake such a study, this paper is divided into five further sections. In Section 2, a brief review of the non-iterative estimation for OLLMs is given, highlighting the two key estimates for which the asymptotic characteristics are considered. Section 3 includes the theoretical and mathematical development of the asymptotic properties and the asymptotic distributions of the two estimates. An evaluation of these properties, through empirical studies and by simulation, is discussed in Section 4. Section 5 provides a discussion of these results and some remarks on the future research in this area of study.
2. Non-Iterative Estimation of the Linear-by-Linear Association Parameter
For a two-way contingency table, N, with I ordered rows and J ordered columns, denotes the proportion of individuals/units in the (i, j)’th cell as , where is the ’th cell value of N, for and . Therefore, . Denote and as the marginal proportion of the ’th row and ’th column categories, respectively, such that . Moreover, let be the expected cell frequency of the ’th cell; for example, when the row and column variables are independent, , although any appropriate model of association or correlation may be used to determine . One such model is the ordinal log-linear model—also known as the uniform association model—for the doubly ordered two-way contingency table, which was considered by [40] (and many of the references given above) and is defined as
The parameters are as follows: is our parameter of interest and reflects the association between the two categorical variables; is the grand mean of the expected cell frequencies, while and are the main effects of the ’th row and ’th column, respectively, such that and . With there being parameters in (1), iterative techniques are commonly used to estimate them. However, the last three can be treated as nuisance parameters [41] (p. 76) and estimated by
and
so that there is only a single parameter that requires estimating, that of . This suggests that a direct, non-iterative, method of estimating can be used, and we shall now turn our attention to how this can be conducted.
To reflect the structure of the ordinal variable, scores are assigned to each of the categories. For (1), and are the scores associated with the ’th row and ’th column and, depending on the nature of the order, are chosen a priori. For example, when the row and column variables consist of increasingly ordered categories then and . The mean of the row scores is denoted by , while is the mean of the column scores.
The parameter of interest in (1) is and is the measure of the linear-by-linear association between the variables since
Therefore, when the row and column scores are chosen such that , is the common log-odds ratio of the contingency table.
Four non-iterative estimation methods for the parameter were considered by [40], but we shall restrict our attention to examining the two key methods. The first is the estimator
which is referred to as the LogNI estimate due to the presence of the logarithm function in the non-iterative (NI) estimate. The second estimate, and one that was originally proposed by [41], is
and is referred to as the BDNI (Beh-Davy NI) estimate of . For (2) and (3), and . The estimate (3) can be obtained by considering the first-order Taylor series expansion . Equations (2) and (3) are also linked through the Box-Cox transformation and [45] discussed the features of the estimation procedure using this transformation. An excellent overview of the various features of and for estimating from the OLLM of (1) was provided by [46] (p. 221–223) and [47]. The method introduced by [48] was also discussed by [47] and is a method of non-iterative estimation based on weighted mean and Mantel–Haenszel type estimators. A link that exists between and and the correspondence analysis of a double ordered two-way contingency table [49,50,51,52,53] was demonstrated by [54]. In doing so, [54] analysed the relationships that exist between the predictors of drug-likeness used in drug discovery to help filter out small molecules (drugs) that may fail clinical trials; see also [55], who applied a non-iterative approach to ordinal log-linear models in the investigation of a log permeability index in drug discovery. Further practical and technical features of and not discussed here have been examined in the following conference papers by the current authors: [56,57]. The latter of these involved modelling gender-specific associations between temperament and character traits and symptoms of psychological distress, such as anxiety, psychoticism, and depression, to motivate the value of creating gender-specific treatments for psychological distress.
To evaluate the performance and compare the non-iterative estimates of using (2) and (3)—and two additional Taylor-series approximations of (2)—with the estimates obtained using the iterative algorithm of Newton’s method, [42,43] performed a series of simulation studies. As part of their studies, strategies needed to be considered for dealing with the presence of random zero-cell frequencies since the logarithm function in (2) leads to non-existent solutions for such cases. There are a variety of different ways in which these zeros can be dealt with. See, for example, [3] (Section 10.6.5), [23] (p. 119) and [58,59,60,61] for a discussion of this issue. In the simulations below, we shall be replacing a zero-cell frequency with 0.05, as [42,43,44] did. Note that determining when random zero-cell frequencies are present does not suffer from this problem. However, the simulation studies of [42,43] suggest that the accuracy and reliability of begins to falter as the dimension of the two-way table increases. Therefore, the discussions made in the following section are limited to the comparison of and .
3. Asymptotic Characteristics of and
In this section, the mean and variance of the sampling distributions of and are determined. Our discussion of the asymptotic distributional properties of the estimates is made using the delta method
where is a function of the variable and is a function of the expected value of : see, for example, [61] (p. 240) and [3] (p. 73). We shall also be making use of the following restrictions on and
in the derivation of the following results, which determine the mean and variance of and . The bias of these estimates is also considered.
3.1. Unbiasedness of and
Suppose we first consider the non-iterative estimate of given by . By substituting into (2), the estimate can be alternatively expressed as
Suppose that the cell frequencies, represented as elements of the vector , are independent Poisson random variables. Then, the random variable on the right-hand side of (6) is , so that
By using the delta method, such that
the expectation of can be expressed as
There are many ways to model the cell values of the contingency table. By considering an unsaturated (linear-by-linear) association model of the type described in [41], (Equation (19)), is the expected cell frequency of the (i, j)’th cell where
The link between this model and the OLLM of (1) was also shown by [40]. Substituting (10) into (9) yields, after simplification,
By using the first-order Taylor series expansion of the logarithm
and after a little algebra using (5), we obtain the asymptotic expectation of
Equation (12) asserts that is (approximately) an unbiased estimator of the linear-by-linear association parameter in the OLLM of (1).
We can similarly show that is also an unbiased estimate of . Following the same argument made above for , the expected value for can be derived by taking the expectation of both sides of (3), yielding
By substituting (10) into (13), and after some simplification, we obtain the expectation of the BDNI estimate
By considering the restriction (5) imposed on and , this expectation simplifies to
3.2. Variance of and
We now turn our attention to the derivation of the variance of and . Consider first the variance of , which is expressed as
One may note that the second term on the right-hand side is a constant. Therefore, by expanding the above variance expression, the cross-product terms are
and are zero from (5). Thus, by using the delta method and simplifying, the variance of is
By following a similar derivation, the variance of is
When there is complete independence between the two categorical variables, (15) and (16) simplify to zero.
3.3. Asymptotic Distribution of and
As we discussed when determining the expectation and variance of the estimates of and , here we shall also use the delta method to determine their asymptotic distribution under the assumption that the observed cell frequencies are independent Poisson random variables. For large samples, the delta method implies that the asymptotic distributions of and are (approximately) true normal distributions, as (17) and (18) show, respectively:
and
By comparing the expressions for (15) and (16) with (17) and (18), it can be observed that for , the asymptotic variance is smaller than the exact variance. This implies that as the sample size increases, the accuracy and reliability of improves.
4. Empirical and Computational Study
To study the asymptotic characteristics of and , 10,000 contingency tables of varying sizes are randomly generated. We consider contingency tables ranging in size from 2 × 2 to 5 × 5 since in many practical studies involving two-way contingency tables, rarely are tables of a size greater than 5 × 5 considered. However, in such cases, the conclusions from our study here are equally applicable. To reflect the varying magnitude of the cell frequencies (which are assumed to be a Poisson random variable), these values are randomly generated using various values of the Poisson parameter λ. Here, we consider λ values of 1.5, 5, 20, and 50; a small value of λ reflects, on average, small cell frequencies, while large values of λ reflect, on average, large cell frequencies. For each combination of dimension and λ, the non-iterative estimates of are calculated, yielding and . Therefore, a population of 10,000 values and a population of 10,000 values is obtained. The mean of the values and the values is denoted by and , respectively; note that and are therefore the population means of the two non-iterative estimates. From each population of 10,000 non-iterative estimate values, samples of and are randomly selected; small samples of and are deemed to be those representing 0.1%, 1%, and 5% of the population. Similarly, moderate sized samples (15%) and large samples (30% and 50%) are also considered. For each sample size, the mean of the sampling distribution of the and values obtained are denoted by and , respectively.
4.1. Unbiasedness of and
Consider Table 1, which summarises the computational study undertaken to verify the unbiasedness of the LogNI estimate; see (12). It can be observed that for smaller values of and small samples, the difference between and is relatively large. This difference arises because small values of increase the chances of a zero-cell value arising by chance. Even with the addition of 0.05 to account for the presence of the logarithm function in (2), it means that such a value may be a relatively serious remedy; to overcome this remedy, one could substitute for the zero-cell frequency an even smaller non-negative value; for example, [58] (p. 13) suggested replacing a random zero-cell frequency with 0.00000001. However, as the sample size increases, even for smaller , the population mean of the estimates obtained from (2), , and the mean of the sampling distribution, , are virtually identical. It is also to be noted that for a larger value of , the difference between and is zero for large samples of and is minimal even for the small sample sizes. Similar conclusions can be reached concerning the bias of the estimates, . The unbiasedness of this non-iterative parameter estimate is apparent from observing the virtually zero difference between and that is summarised in Table 2 for the varying contingency table sizes and Poisson parameter .
Table 1.
Property of unbiasedness of for varying dimensions, sample sizes, and λ s of simulated contingency tables.
Table 2.
Property of unbiasedness of for varying dimensions and λ s of simulated contingency tables.
4.2. Asymptotic Variances of and
The variance of the estimates and , defined by (17) and (18), respectively, is summarised in Table 3. The table shows that the variance of becomes larger as one considers smaller values of λ and small-sized tables (in particular, for those of size 2 × 2, 2 × 3, 2 × 4, and 2 × 5). This is because when λ has a smaller value (λ = 1.5, say), the cell values that are close to, or exactly, zero dominate the association structure between the variables of the contingency table. Its impact therefore leads to being a poor performing estimate of . However, for moderate and larger-sized contingency tables, the variance of is consistently smaller than the variance of for all values of λ and contingency table sizes. When the value of λ is large (λ = 50, say), the variances for and are virtually identical.
Table 3.
Asymptotic variances of and for varying dimensions and λ s of simulated contingency tables.
4.3. Relative Efficiency of and
To compare the variances of the two non-iterative estimation procedures given by (2) and (3), we consider their relative efficiencies
which are summarised in Table 4 based on the variances given in Table 3. Table 4 shows that is more efficient than for estimating for tables of moderate and larger size. It is worth noting that for moderate and larger values of λ, the relative efficiency of approaches 1. This indicates that for contingency tables with large cell values, the performance of both estimates is virtually the same. However, the same cannot be said for small λ values.
Table 4.
Relative efficiencies, RE, of and for varying dimensions and λ s of simulated contingency tables.
4.4. Graphical Display for the Asymptotic Distributions of and
The asymptotic distributions of and , defined by (17) and (18), are graphically demonstrated by considering Figure 1a–h and Figure 2a–h respectively. The graphical displays are shown for a small (λ = 1.5) and large (λ = 50) Poisson parameter and for a 2 × 2 and 4 × 5 contingency table. The same populations of 10,000 and 10,000 values are again considered here, and the sampling distributions have been generated for 1000 samples ranging in size from n = 10 to n = 5000. The histograms depict the sampling distribution of and . They show that, for a small and large Poisson parameter, and for small and moderately sized contingency tables, the distribution of each non-iterative estimate is asymptotically normally distributed.
Figure 1.
Sampling distribution of for various λ and n.
Figure 2.
Graphical display of asymptotic distribution of .
4.5. An Empirical Study
To highlight the practical implications of the results described above, we shall consider the relative efficiency of the estimates and by studying 13 of the datasets analysed by [40] (Table 1). These data were selected for their common use in categorical data analysis. They have been given the same labels here as were given by [40]; therefore, further details on the context of the data and the key references in which they appear can be found in [40]. These datasets are summarised in Table 5 and vary in size. The smallest contingency table (Visitor) is of size 3 × 3 while the largest contingency table analysed (Occupation) is of size 8 × 8. The sample size also varies, from n = 132 (for Visitor) to n = 25,263 (for Swedish). All the zero-cell frequencies have been replaced by 0.05. Table 5 summarises the relative efficiency of the estimates, as follows. Defined by (19), for each dataset. From these results, the variance of is always less than, or equal to, the variance of for all the datasets. Note that for the datasets Aberdeen, Denmark, and Swedish, all of which have very large cell frequencies, the two variances are identical, resulting in a relative efficiency of 1. This empirically verifies our earlier findings, which state that as the Poisson parameter increases (leading to large randomly generated cell frequencies), the variance of both estimates is virtually identical.
Table 5.
Standard errors and efficiency of and for 13 datasets.
5. Conclusions
This paper examined the issues of bias, variance, and sampling distribution properties and was concerned with two standard approaches for non-iteratively estimating the linear-by-linear parameter of an ordinal log-linear model. We have done so assuming a Poisson sampling scheme, but it can also be adapted for a multinomial sampling scheme; see, for example, [62] for a study of asymptotic results for multinomial models. These approaches to estimation are easy to determine mathematically and so are easier to compute than their iterative counterparts (including Newton’s method and iterative proportional fitting). We have confirmed that the two non-iterative estimation procedures lead to estimates that are unbiased and asymptotically normally distributed. We have also demonstrated empirically, and through simulation, that the variance of the BDNI estimate is less than the variance of the LogNI estimate. This therefore suggests that the BDNI estimate is a more efficient, and reliable, non-iterative estimate than the LogNI estimate. This seems contrary to the findings originally proposed by [40,42,43] and so requires further examination.
In this paper we have examined the simple case where the OLLM consists of the estimation of a single parameter. At present, the non-iterative estimation procedures have not been generalised to incorporate estimations for models with multiple association parameters. In fact, there is only a limited amount of literature available on the specification of such models and the interpretation of the parameters. Therefore, an investigation of this issue, and into the properties of the non-iterative estimation for such models, is a topic for future consideration.
Author Contributions
Conceptualization, S.Z., E.J.B. and I.L.H.; methodology, S.Z., S.A.C., E.J.B. and I.L.H.; software, S.Z. and S.A.C.; validation, S.Z., S.A.C., E.J.B. and I.L.H.; formal analysis, S.Z. and S.A.C.; investigation, S.Z., S.A.C., E.J.B. and I.L.H.; resources, S.Z., S.A.C., E.J.B. and I.L.H.; data curation, S.Z. and E.J.B.; writing—original draft preparation, S.Z. and S.A.C.; writing—review and editing, E.J.B. and I.L.H.; visualization, S.Z., E.J.B. and I.L.H.; supervision, E.J.B. and I.L.H.; project administration, E.J.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
All data appear in their source that is referenced or has been generated for simulation purposes.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Christensen, R. Log-Linear Models and Logistic Regression, 2nd ed.; Springer: New York, NY, USA, 1997; p. 501. [Google Scholar]
- Imrey, P.B.; Koch, G.G.; Stokes, M.E. Categorical data analysis: Some reflections on the log-linear model and logistic regression. Part 1: Historical and methodological overview. Int. Stat. Rev. 1981, 49, 265–283. [Google Scholar] [CrossRef]
- Agresti, A. Categorical Data Analysis, 3rd ed.; Wiley: Hoboken, NJ, USA, 2013; p. 752. [Google Scholar]
- Bartlett, M.S. Contingency table interactions. J. R. Stat. Soc. 1935, 2, 248–252. [Google Scholar] [CrossRef]
- Roy, S.N.; Kastenbaum, M.A. On the hypothesis of no “interaction” in a multi-way contingency table. Ann. Math. Stat. 1956, 27, 749–757. [Google Scholar] [CrossRef]
- Darroch, J.N. Interaction in multi factor contingency table. J. R. Stat. Soc. Ser. B 1962, 24, 251–263. [Google Scholar] [CrossRef]
- Plackett, R.L. A note on interactions in contingency table. J. R. Stat. Soc. Ser. B 1962, 24, 162–166. [Google Scholar] [CrossRef]
- Goodman, L.A. Interactions in multi-dimensional contingency tables. Ann. Math. Stat. 1964, 35, 632–646. [Google Scholar] [CrossRef]
- Birch, M.W. Maximum likelihood in three-way contingency table. J. R. Stat. Soc. Ser. B 1963, 25, 220–223. [Google Scholar] [CrossRef]
- Birch, M.W. A new proof of Pearson-Fisher theorem. Ann. Math. Stat. 1964, 35, 817–824. [Google Scholar] [CrossRef]
- Birch, M.W. The detection of partial association I: The 2 × 2 case. J. R. Stat. Soc. Ser. B 1964, 26, 313–324. [Google Scholar] [CrossRef]
- Birch, M.W. The detection of partial association II: The general case. J. R. Stat. Soc. Ser. B 1965, 27, 111–124. [Google Scholar] [CrossRef]
- Mantel, N. Model for complex contingency tables and polychotomous dosage response curves. Biometrics 1966, 22, 83–95. [Google Scholar] [CrossRef]
- David, H.A. First (?) occurrence of common terms in mathematical statistics. Am. Stat. 1995, 49, 121–133. [Google Scholar] [CrossRef]
- Bishop, Y.M.M.; Fienberg, S.E. Incomplete two-dimensional contingency tables. Biometrics 1969, 25, 119–128. [Google Scholar] [CrossRef]
- Agresti, A. Analysis of Ordinal Categorical Data, 2nd ed.; Wiley: Hoboken, NJ, USA, 2010; p. 424. [Google Scholar]
- Haberman, S.J. Log-linear models for frequency tables with ordered classifications. Biometrics 1974, 36, 589–600. [Google Scholar] [CrossRef]
- Simon, G. Alternative analysis for the singly ordered contingency table. J. Am. Stat. Assoc. 1974, 69, 971–976. [Google Scholar] [CrossRef]
- Ishii-Kuntz, M. Ordinal Log-Linear Models; Sage Publications Inc.: Thousand Oaks, CA, USA, 1994; p. 72. [Google Scholar]
- Clogg, C.C.; Shihadeh, E.S. Statistical Models for Ordinal Variables, 1st ed.; Sage: Thousand Oaks, CA, USA, 1994; p. 192. [Google Scholar]
- Simonoff, J.S. Analyzing Categorical Data; Springer: New York, NY, USA, 2003; p. 498. [Google Scholar]
- Powers, D.A.; Xie, Y. Statistical Methods for Categorical Data Analysis, 2nd ed; Elsevier Science Ltd: London, UK, 2008; pp. 296. [Google Scholar]
- Kateri, M. Contingency Table Analysis; Springer: New York, NY, USA, 2014; p. 304. [Google Scholar]
- Ishii-Kuntz, M. Association models in family research. J. Marriage Fam. 1991, 53, 337–348. [Google Scholar] [CrossRef]
- Siddiqi, J. Log-linear and log-bilinear models with ordinal categories. J. Soc. Sci. Humanit. 1995, 35, 53–60. [Google Scholar]
- Agresti, A. Modelling ordered categorical data: Recent advances and future challenges. Stat. Med. 1999, 18, 2191–2207. [Google Scholar] [CrossRef]
- Alin, A. Power-divergence test statistics for testing linear by linear association. Commun. Stat. Simul. Comput. 2009, 38, 1381–1390. [Google Scholar] [CrossRef]
- Alin, A. Penalized power-divergence test statistics for testing linear-by-linear association. J. Stat. Comput. Simul. 2012, 82, 227–239. [Google Scholar] [CrossRef]
- Bagheban, A.A.; Zayeri, F. A generalization of the uniform association model for assessing rater agreement in ordinal scales. J. Appl. Stat. 2010, 37, 1265–1273. [Google Scholar] [CrossRef]
- Beh, E.J.; Farver, T.B. A numerical evaluation of the classification of Portuguese red wine. Curr. Anal. Chem. 2012, 8, 218–223. [Google Scholar] [CrossRef][Green Version]
- Kateri, M. Φ-divergence in contingency table analysis. Entropy 2018, 20, 324. [Google Scholar] [CrossRef] [PubMed]
- Kateri, M. Families of generalized quasisymmetry models: A Φ-divergence approach. Symmetry 2021, 13, 2297. [Google Scholar] [CrossRef]
- Pham, T.M.; Kateri, M. Inference for ordinal log-linear models based on algebraic statistics. J. Algebr. Stat. 2019, 10, 30–50. [Google Scholar] [CrossRef]
- Hakimi, A.; Farughi, H.; Amiri, A.; Arkat, J. Phase II monitoring of the ordinal multivariate categorical processes. Adv. Ind. Eng. 2021, 55, 249–267. [Google Scholar] [CrossRef]
- Fujisawa, K.; Tahata, K. Quasi association models for square contingency tables with ordinal categories. Symmetry 2022, 14, 805. [Google Scholar] [CrossRef]
- Haberman, S.J. The Analysis of Frequency Data; University of Chicago Press: Chicago, IL, USA, 1974; p. 419. [Google Scholar]
- Goodman, L.A. Simple models for the analysis of association in cross-classifications having ordered categories. J. Am. Stat. Assoc. 1979, 74, 537–552. [Google Scholar] [CrossRef]
- Goodman, L.A. Three elementary views of log-linear models for the analysis of cross-classifications having ordered categories. Sociol. Methodol. 1981, 12, 193–239. [Google Scholar] [CrossRef]
- Clogg, C.C. Some models for the analysis of association in multiway cross classifications having ordered categories. J. Am. Stat. Assoc. 1982, 77, 803–815. [Google Scholar] [CrossRef]
- Beh, E.J.; Farver, T.B. An evaluation of non-iterative methods for estimating the linear-by-linear parameter for ordinal log-linear models. Aust. N. Z. J. Stat. 2009, 51, 335–352, Erratum in: Aust. N. Z. J. Stat. 2010, 52, 241–242. [Google Scholar] [CrossRef]
- Beh, E.J.; Davy, P.J. A non-iterative alternative to ordinal log-linear models. J. Appl. Math. Decis. Sci. 2004, 7, 67–86. [Google Scholar] [CrossRef]
- Beh, E.J.; Farver, T.B. On estimating the linear-by-linear parameter for ordinal log-linear models: A computational study. ISRN Comput. Math. 2012, 2012, 340415. [Google Scholar] [CrossRef][Green Version]
- Beh, E.J.; Farver, T.B. A computational study assessing maximum likelihood and non-iterative methods for estimating the linear-by-linear parameter for ordinal log-linear models. ISRN Comput. Math. 2012, 2012, 396831. [Google Scholar] [CrossRef]
- Beh, E.J.; Farver, T.B. The Box-Cox transformation and non-iterative estimation methods for ordinal log-linear models. Aust. N. Z. J. Stat. 2012, 54, 475–484. [Google Scholar] [CrossRef]
- Zafar, S.; Cheema, S.A.; Beh, E.J.; Hudson, I.L. A study of bias for non-iterative estimates of the linear-by-linear association parameter from the ordinal log-linear model. In Advances on Latent Variables—Methods, Models and Applications; Bretari, E., Carpita, M., Eds.; Vita e Pensiero Publishers: Milan, Italy, 2013; p. 6. ISBN 978 88 343 2556. [Google Scholar]
- Von Eye, A.; Mun, E.-Y. Log-Linear Modeling: Concepts, Interpretation and Application; Wiley: Hoboken, NJ, USA, 2013; p. 472. [Google Scholar]
- Espendiller, M. Association in Contingency Tables: An Information Theoretic Approach. Unpublished. Ph.D. Thesis, RWTH Aachen University, Aachen, Germany, 2017. [Google Scholar] [CrossRef]
- Clayton, D.G. Some odds ratio statistics for the analysis of ordered categorical data. Biometrika 1974, 61, 525–531. [Google Scholar] [CrossRef]
- Beh, E.J. Simple correspondence analysis of ordinal cross-classifications using orthogonal polynomials. Biom. J. 1997, 29, 589–613. [Google Scholar] [CrossRef]
- Beh, E.J.; Lombardo, R. Correspondence Analysis: Theory, Methods and New Strategies, 1st ed.; Wiley: Chichester, UK, 2014; p. 592. [Google Scholar]
- Beh, E.J.; Lombardo, R. An Introduction to Correspondence Analysis, 1st ed.; Wiley: Chichester, UK, 2021; p. 240. [Google Scholar]
- Beh, E.J.; Lombardo, R. Features of the polynomial biplot for ordered contingency tables. J. Comput. Graph. Stat. 2022, 31, 403–412. [Google Scholar] [CrossRef]
- Lombardo, R.; Beh, E.J.; Kroonenberg, P.M. Modelling trends in ordered correspondence analysis using orthogonal polynomials. Psychometrika 2016, 81, 325–349. [Google Scholar] [CrossRef]
- Zafar, S.; Cheema, S.A.; Beh, E.J.; Hudson, I.L.; Hudson, S.A.; Abell, A.D. Linking ordinal log-linear models with correspondence analysis: An application to estimating drug-likeness in the drug discovery process. In MODSIM2013, Proceedings of the 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1–6 December 2013; Piantadosi, J., Anderssen, R.S., Boland, J., Eds.; Modelling and Simulation Society of Australia and New Zealand: Canberra, Australia; pp. 1945–1951. ISBN 978-0-9872143-3-1. [CrossRef]
- Zafar, S.; Beh, E.J.; Hudson, I.L. Non-iterative estimation methods for the association parameters of the ordinal log-linear models for three-way tables. In Proceedings of the Australian Statistics Conference 2016 (ASC2016), Canberra, Australia, 5–8 December 2016. [Google Scholar]
- Zafar, S.; Hudson, I.L.; Beh, E.J.; Abell, A. A non-iterative approach to ordinal log-linear models: Investigations of log D in drug discovery. In Proceedings of the 31st International Workshop on Statistical Modelling, Rennes, France, 4–8 July 2016. [Google Scholar]
- Zafar, S.; Hudson, I.L.; Beh, E.J.; Joyce, P.R. Modelling risk profiles of depression symptoms using Cloninger’s temperament and character traits: A non-iterative approach to assess linear-by-linear association within ordered contingency tables. In MODSIM2015, Proceedings of the 21st International Congress on Modelling and Simulation (MODSIM2015), Melbourne, Australia, 12-15 December 2015; Weber, T., McPhee, M.J., Anderssen, R.S., Eds.; Modelling and Simulation Society of Australia and New Zealand: Canberra, Australia, 2015; pp. 1668–1674. ISBN 978-0-9872143-5-5. [Google Scholar]
- Evers, N.; Namboodri, N.K. Monte-Carlo assessment of the stability of log-linear estimates in small sample. In Proceedings of American Statistical Association, Social Statistics Section, Washington, DC, USA; American Statistical Association: Alexandria, VA, USA, 1977. [Google Scholar]
- Clogg, C.C.; Eliason, S.R. Some common problems in log-linear analysis. Sociol. Methods Res. 1987, 15, 4–44. [Google Scholar] [CrossRef]
- Fienberg, S.E. The Analysis of Cross Classified Categorical Data, 2nd ed.; Springer: New York, NY, USA, 2007; p. 198. [Google Scholar]
- Casella, G.; Berger, R.L. Statistical Inference, 2nd ed.; Cengage Learning: Duxbury, MA, USA, 2002; p. 686. [Google Scholar]
- Akoto, I.; Mexia, J.T.; Marques, F.J. Asymptotic Results for Multinomial Models. Symmetry 2021, 13, 2173. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).