
A Reappraisal of the Threshold Hypothesis of Creativity and Intelligence

Institute of Psychology and Pedagogy, Ulm University, Albert-Einstein Allee 47, 89081 Ulm, Germany
Institute of Psychology, University of Kassel, Holländische Strasse 36-38, 34127 Kassel, Germany
Author to whom correspondence should be addressed.
Received: 28 August 2020 / Revised: 26 October 2020 / Accepted: 3 November 2020 / Published: 11 November 2020
(This article belongs to the Special Issue Intelligence and Creativity)


Intelligence has been declared a necessary but not sufficient condition for creativity, a claim that was subsequently (erroneously) translated into the so-called threshold hypothesis. This hypothesis predicts a change in the correlation between creativity and intelligence at around 1.33 standard deviations above the population mean. A closer inspection of previous inconclusive results suggests that the heterogeneity is mostly due to the use of suboptimal data analytical procedures. Herein, we applied and compared three methods that allowed us to handle intelligence as a continuous variable. In more detail, we examined the threshold of the creativity-intelligence relation with (a) scatterplots and heteroscedasticity analysis, (b) segmented regression analysis, and (c) local structural equation models in two multivariate studies (N1 = 456; N2 = 438). We found no evidence for the threshold hypothesis of creativity across different analytical procedures in either study. Given the problematic history of the threshold hypothesis and its unequivocal rejection with appropriate multivariate methods, we recommend the total abandonment of the threshold.

1. Introduction

Can you be creative without being smart? Many researchers argued that creativity presupposes intelligence (e.g., Guilford 1967) and intuitively this proposition probably makes sense for many readers. Indeed, the abilities needed for divergent production/thinking (Guilford 1967) and idea generation and evaluation (Mumford and McIntosh 2017) are closely intertwined with other cognitive abilities, commonly referred to as convergent thinking (Carroll 1993; Cropley 2006). For example, the creativity required to come up with an invention for high-tech problems builds upon substantial expertise in a field, as well as decontextualized fluid intelligence (e.g., Nusbaum and Silvia 2011). However, the intellectual prerequisites for different tasks challenging creativity might vary (Diedrich et al. 2018; Jauk et al. 2013) and the relevance of general intelligence might not be the same at different points in the distribution of creative abilities.
Historically, creative ability was incorporated in most models of intelligence, predominantly as a lower-order factor below general intelligence. Creative abilities1—often measured by divergent thinking tasks, including indicators of fluency or originality (Runco 2008)—are part of the structure of intellect model (Guilford 1967), the three-stratum theory of cognitive abilities (Carroll 1993), and the Berlin intelligence structure model (Jäger et al. 1997; Süß and Beauducel 2005). The relation between intelligence and creativity was evaluated in several studies (e.g., in terms of a lower-order factor in the Cattell–Horn–Carroll model of cognitive abilities; McGrew 2009; Silvia et al. 2013). Recent evidence showed that creative abilities (e.g., divergent thinking scored for fluency) and general intelligence are substantially related (r = 0.46, Karwowski et al. 2018; β = 0.45, Nusbaum and Silvia 2011), especially when a broad set of indicators is used (β = 0.51, Benedek et al. 2012; β = 0.40, Weiss et al. 2020a). This is corroborated by a review concluding that progress in analytical tools, as well as in measurement (e.g., in cognitive neuroscience), has shown that creativity and intelligence are closely related (Silvia 2015). Research reporting lower correlations is often based either on narrow measures of the constructs or on very heterogeneous measures (e.g., a meta-analysis by Kim (2005) found a mean correlation of r = 0.17). Among other things, the substantial correlation between the two constructs resurrected the questions of whether the relation between creativity and intelligence follows not a necessary condition but a necessary but not sufficient condition (Guilford 1967), and whether it is in accordance with the so-called threshold hypothesis (e.g., Karwowski et al. 2016). In the present paper, we reviewed different interpretations of Guilford's original finding and tried to translate them into testable statistical terms.
Moreover, we discussed three analytical approaches to study the relation between intelligence and creativity in two different data sets that varied with regard to the age of the samples and the measures of creativity and intelligence.

2. The Threshold Hypothesis of Creativity and Intelligence

Guilford was one of the first to describe and investigate the relationship between creativity and intelligence. In his initial publication, he stated that “high IQ is not a sufficient condition for high DP [divergent production] ability; it is almost a necessary condition” (Guilford 1967, p. 168). Thus, Guilford assumed that highly intelligent individuals are not necessarily creative but can be creative, while less intelligent individuals are necessarily less creative (Guilford and Christensen 1973), which became an assumption known as the necessary but not sufficient condition. This relationship is schematically depicted in the left plot in Figure 1. If Guilford’s assumption holds and intelligence is a necessary but not sufficient condition for being creative, individuals’ scores scatter within the triangle. Although the original wording of Guilford’s theory was quite unambiguous, comparatively little research was done to test this assumption. Only recently, researchers picked up on the necessary but not sufficient condition (e.g., Karwowski et al. 2016; Shi et al. 2017). In contrast to the necessary but not sufficient condition, one can see that the necessary (and sufficient) condition corresponds to an ordinary linear regression (see Figure 1, middle plot).
The original formulation of a necessary but not sufficient condition was later (erroneously, by many researchers) converted into the so-called threshold hypothesis. The threshold hypothesis states that the relationship between creativity and intelligence varies depending on the level of intelligence. Proponents assume that, below a certain threshold of intelligence, intelligence and creativity show a positive linear relationship, whereas above that threshold intelligence and creativity are uncorrelated (see right plot in Figure 1) or are less strongly correlated. Interestingly, although Guilford is widely named as the originator of the threshold hypothesis, he was no advocate of it in later publications and, both theoretically and analytically, distinguished between the idea of a necessary but not sufficient condition and the idea of a threshold. Guilford and colleagues showed in two studies (including 45 tests of divergent production and two IQ tests with various scales) that none of the scatter plots suggested a threshold and that the ubiquitous positive relationship "shows a continuous, gradual shift from low to high IQ", ultimately leading to the completely opposite conclusion that there is no support for any threshold (Guilford and Christensen 1973, p. 251). Guilford and Christensen concluded that a threshold was absent despite a triangular-shaped scatter in most of their plots (e.g., 20 triangular plots out of 25 semantic tasks), as the linear regression did not show any breaks. This implies that they distinguished between intelligence as a necessary but not sufficient condition for being creative (triangular shape of a scatterplot) and the assumption of a threshold given by a difference in correlations between creativity and intelligence tasks at a certain point (break in the regression line; see Figure 1).
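This distinction can be illustrated with a small simulation, a sketch under assumed values rather than real data: when creativity merely scatters below an intelligence-dependent ceiling (the necessary but not sufficient condition), the scatter is triangular, yet the regression slope is essentially the same below and above the conventional z = 1.33 cutoff, i.e., there is no break. The generating model, ceiling function, and sample size below are illustrative choices, not taken from any of the studies discussed here.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100000
iq = rng.normal(0.0, 1.0, n)  # z-standardized intelligence

# Necessary but not sufficient: creativity is bounded above by an
# intelligence-dependent ceiling, producing a triangular scatter.
ceiling = 0.5 * iq + 2.0
creativity = rng.uniform(-3.0, ceiling)

def slope(x, y):
    """Slope of an ordinary least-squares regression of y on x."""
    return np.polyfit(x, y, 1)[0]

# The OLS slope is roughly constant on both sides of z = 1.33,
# even though the scatter is triangular: no break in the regression.
low, high = iq < 1.33, iq >= 1.33
slope_low = slope(iq[low], creativity[low])
slope_high = slope(iq[high], creativity[high])
```

Under this generating model the conditional mean of creativity is linear in intelligence across the whole range, so both segment slopes hover around the same value, mirroring Guilford and Christensen's observation of a "continuous, gradual shift" without a break.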
In summary, there are (at least) three different perspectives on the link between creativity and intelligence: intelligence being (a) a necessary condition, (b) a necessary but not sufficient condition, and (c) the threshold hypothesis, under which the correlation partly varies across the intelligence continuum. In the following, we discuss the theoretical assumptions and empirical evidence of the threshold hypothesis.

3. Theoretical Underpinnings of the Threshold Hypothesis

What are the theoretical underpinnings for the threshold hypothesis? Unfortunately, a large amount of research regarding the intelligence-creativity link lacks a thorough theoretical explanation as to why a threshold should exist and, if present, where it should be (see Karwowski and Gralewski 2013). The confusion of terms and the different operationalizations to test the theory might be a direct result of sparse theoretical ideas. However, discussing where exactly a threshold should be placed in the ability distribution is irrelevant if the "why" is not clear. Although the threshold hypothesis cannot simply be equated with a non-linear relationship between intelligence and creativity, some researchers borrow their theoretical argumentation from other parts of intelligence research, namely ability differentiation hypotheses such as Spearman's law of diminishing returns (SLODR; Spearman 1927) or age-related differentiation (Garrett 1946), to explain the threshold hypothesis of creativity.
At first glance, lending ideas from SLODR seems to be a viable approach, as (general) intelligence directly affects the ability to be creative (e.g., Forthmann et al. 2019; Gilhooly et al. 2007; Silvia et al. 2013). According to SLODR (Spearman 1927), correlations between cognitive abilities decrease with increasing levels of abilities (e.g., Hartung et al. 2018). Transferring this logic would imply that intelligence might facilitate the use of elemental skills (e.g., long-term memory) and, once an advanced level of intelligence is reached, higher levels of intelligence are no longer beneficial for further increasing creative performance, thus leading to a correlational pattern as discussed above. A further example can be found in the differentiation of language ability (Garrett 1946). Initially, it depends on single skills such as oral language comprehension, but the more mature someone gets, the more differentiated language abilities become (e.g., reading comprehension, linguistic usage). However, the evidence regarding age differentiation is mixed (Breit et al. 2020; Van Der Maas et al. 2006), and theoretical explanations for this phenomenon are surprisingly sparsely elaborated upon. Some findings support ability differentiation (e.g., Legree et al. 1996), while others require more sophisticated data-analytic approaches to find support for the differentiation hypothesis (Hartung et al. 2018).
However, this literature adds only little insight when it comes to why there should be a qualitative gap or threshold in the relation of creativity and intelligence. Moreover, it does not provide any cohesive theoretical background for where to set a cutoff a priori. Despite this weak theoretical foundation, the question of whether there is a threshold still inspired a considerable number of studies. In the next section, we summarize the empirical evidence from these studies and give a systematic overview of the findings.

4. Empirical Evaluation of the Threshold Hypothesis

In Table 1, we summarize prominent findings on the threshold hypothesis and give an overview of the methods and results of the studies. Strikingly, there existed almost as many different thresholds as studies. The diverse set of results can be attributed to (a) different understandings of how the threshold hypothesis is best operationalized, (b) varying sample sizes and sample characteristics, (c) different measures used to assess both intelligence and creative ability, and (d) the analytical procedures used to determine a specific threshold.
Although the sample sizes reported in the studies varied considerably (e.g., N = 88 to N = 12,255), sample size did not seem to affect the results systematically, providing no evidence for a potential publication bias due to insufficient statistical power. The same was true for other sample features, although some may argue that sample characteristics such as age or ability distribution might influence the results. Age itself has been assumed to affect the factor structure of intelligence, as stated in the age differentiation hypothesis (Garrett 1946), but findings were mixed (Breit et al. 2020; Hülür et al. 2011; Tucker-Drob 2009). Moreover, as the threshold was assumed to be at an intelligence score around z = 1.33, some samples might have simply failed to include enough cases above that threshold and thus failed to cover the whole ability spectrum. However, the studies reported in Table 1 show the opposite effect: studies that oversampled highly gifted participants (z > 2, Holling and Kuhn 2008; z > 1.33, Preckel et al. 2006) did not find evidence for the threshold hypothesis.
Second, the measures used to study the threshold hypothesis might have influenced the results. For example, Jauk et al. (2013) derived varying thresholds for different dimensions of divergent thinking (originality: z = 0, creative fluency: z = 1.33; ideational fluency: z = −1.00), but no threshold for the relation between creative achievement (assessed via self-reports) and intelligence. Overall, the measures of creativity used in the different studies differed largely in breadth and depth of their operationalization (Weiss et al. 2020b). With respect to the measures of intelligence, most studies focused on indicators that assessed fluid intelligence—the ability of abstract reasoning in novel situations—which is an important constituent of overall general intelligence (e.g., Heitz et al. 2005). It is recommended to use a broad measure of creativity when assessing the threshold to eliminate potential item selection bias from narrow tests, although no systematic influence was established (Table 1).
In contrast to the aforementioned study characteristics, the analytical strategy affects whether and where a threshold is found (e.g., Karwowski and Gralewski 2013). Both correlational analyses and segmented regression analyses mostly reported the existence of a threshold (e.g., Cho et al. 2010; Jauk et al. 2013), which varied across studies. Two studies that used correlational analyses confirmed a threshold at z = 1.33, despite segmented regression analysis often resulting in different thresholds. Conversely, multi-group confirmatory factor analysis, which evaluates the factor structure (of creativity) in different ability groups, seemed to show no difference between the groups (Holling and Kuhn 2008; Preckel et al. 2006). Based on the previous results, it seemed plausible that the analytical method had a direct impact on the results. Therefore, we considered different methods to probe the threshold hypothesis.

5. Analytical Strategies in the Investigation of the Threshold Hypothesis

Previous studies reported results regarding the threshold hypothesis, most of which were based on (a) correlational analysis in a split sample, (b) segmented regression analysis, or (c) multi-group confirmatory factor analysis. Additionally, the necessary but not sufficient condition analysis (for more information, see Dul 2016) has recently gained attention as a statistical tool in the threshold literature. However, the results of the necessary but not sufficient condition analysis cannot be directly compared to the results of the other methods. Finding a significant proportion above the ceiling line does not necessarily imply a threshold (Guilford and Christensen 1973; Ilagan and Patungan 2018), because this analysis does not test for a break in the regression line (see Figure 1). Moreover, there are several open theoretical issues (e.g., causality assumptions that are neither examined nor problematized) and methodological issues (e.g., no account of sampling error and a high sensitivity to outliers; for a criticism, see Ilagan and Patungan 2018). In the present paper, we focused on methods that were used to study the threshold hypothesis rather than the necessary but not sufficient condition.

5.1. Correlational Analysis in Split Sample

The correlational analysis—which often capitalizes on an extreme group design (Preacher et al. 2005)—is the analytical method with the longest tradition in the investigation of the threshold hypothesis (e.g., Cho et al. 2010; Fuchs-Beauchamp et al. 1993; Getzels and Jackson 1962). For this analytical approach, the sample is split at an a priori set threshold into a low-ability group and a high-ability group, and correlations between intelligence and creativity are computed separately in each group. According to the threshold hypothesis, a threshold exists if the correlation is lower or even zero in the high-ability group compared to the low-ability group (Karwowski and Gralewski 2013). Although this method might seem like a direct translation of the threshold hypothesis into statistical means, it comes with a long list of potential disadvantages. First, the sample split needs a strong theoretical justification for setting the threshold. Given the unclear theoretical roots of the threshold hypothesis, the often-used threshold of z = 1.33 is not sufficiently backed up by theory. Ultimately, this uncertainty concerning the cutoff yields the risk of exploiting researchers' degrees of freedom (Simmons et al. 2011; Wicherts et al. 2016), probing different thresholds until the desired result is achieved. Second, splitting the sample into two subsamples dichotomizes an otherwise continuous variable (i.e., intelligence), which results in all sorts of statistical problems, such as informational loss, an underestimation of the strength of the bivariate relation, and a mis-categorization of participants that are close to the threshold (MacCallum et al. 2002). Third, as the correlational analysis is based on manifest variables, measurement error and task specificity are not taken into account. Fourth, the analysis most likely suffers from a lack of measurement precision at the more extreme points of the ability distribution because fewer items assess the extremes (Byrne 2010).
Fifth, differences in the correlational patterns between two groups are often biased by range restriction, with reliability being lower in the group that is more severely range-restricted. Since the high-IQ group in a heterogeneous sample for obvious reasons often contains only a few cases, the parameter estimates (e.g., the slope of the regression) are less robust. The point estimate is lower by virtue of variance restriction and by virtue of the fact that the item difficulty distribution often follows the ability distribution: fewer items with good discrimination are available in the tails of the distribution. Accordingly, the reliability of person parameters follows the test information function, which is low where few items discriminate. Therefore, sufficient statistical power can often not be reached in extreme groups of small sizes. Consequently, correlational analysis is especially prone to false positive conclusions due to the very nature of the threshold hypothesis.
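The range-restriction problem can be made concrete with a short simulation, a sketch under assumed values rather than real data: even when intelligence and creativity follow a single bivariate normal distribution with a uniform correlation of r = 0.45 (so no threshold exists by construction), splitting the sample at z = 1.33 markedly attenuates the correlation in the high-ability group.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100000
rho = 0.45  # population correlation, constant across the whole range

# Bivariate normal data with a single, uniform linear relation
iq = rng.normal(size=n)
creativity = rho * iq + np.sqrt(1 - rho**2) * rng.normal(size=n)

r_full = np.corrcoef(iq, creativity)[0, 1]

# Split at the conventional threshold of z = 1.33
high = iq >= 1.33
r_high = np.corrcoef(iq[high], creativity[high])[0, 1]
```

Although no threshold exists in the generating model, the correlation in the high group drops to roughly 0.2 purely because of variance restriction above the cutoff; a naive comparison of the two correlations would be mistaken for support of the threshold hypothesis.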

5.2. Segmented Regression Analysis

Segmented linear regression analysis determines whether different (linear) relationships exist across the continuum of intelligence. This analysis involves the estimation of multiple linear models that are fitted to different segments of the data (Ryan and Porth 2007). This means that the intelligence continuum is repeatedly divided into two segments and ordinary least squares (OLS) regressions are fitted separately within these segments. A break in the linear regression, which is referred to as a threshold (such as displayed in Figure 1, right panel), means that the slopes of the two regressions differ significantly. A possible advantage of this method is that it can detect a previously unknown breakpoint rather than merely confirming an a priori set breakpoint (Ryan and Porth 2007). The method is usually applied if there is a strong theoretical assumption that justifies a break in the relation, often in terms of a dose-response relationship (e.g., a critical level of stress leads to preterm birth, Whitehead et al. 2002). Such a strong theoretical basis cannot be assumed for the relation between intelligence and creativity. Furthermore, segmented regression comes with the usual assumptions of OLS regression, i.e., normally distributed, independent, and homoscedastic residuals (Ryan and Porth 2007). However, studies reporting results based on segmented regression analysis often fail to report tests of the homoscedasticity of the data or QQ-plots that examine the normal distribution of residuals. We chose segmented regression analysis to allow a direct comparison to previous research and because the basic assumptions were met (i.e., homoscedasticity of residuals). Robust alternatives to segmented regression, such as robust bent line regression (Zhang and Li 2017) or the Robin Hood algorithm for curvilinear relations (Simonsohn 2018), can be considered if the assumptions are violated.
These analytical methods assume an unknown change point in a non-linear regression of manifest variables, but the theoretical basis for such an assumption is vague. Moreover, these methods also suffer from problems such as imprecise false positive rates (Type I errors) and the assumption of a change in sign of the regression in two regions (Simonsohn 2018). Besides, methods such as the quantile regression have been applied to investigate thresholds (Dumas 2018; Karwowski et al. 2020), although they do not provide a direct test of a threshold as the segmented regression analysis does.

5.3. Local Structural Equation Modeling

The last analytical method we want to present is a novel approach, termed local structural equation models (LSEM; Hildebrandt et al. 2016). To understand its merits, we will first address the shortcomings of multi-group confirmatory factor analysis (MGCFA), which has been previously used in the threshold literature. MGCFA is a method within the framework of structural equation modeling to analyze measurement parameters (e.g., factor loadings, item intercepts) across different ability groups beyond a simple comparison of correlations (Vandenberg and Lance 2000). Although the latent variable approach is superior compared to simple regressions of manifest variables in an extreme group design, the multi-group setting requires an arbitrary dichotomization of a continuous variable (e.g., z = 2.00, Holling and Kuhn 2008; z = 1.33, Preckel et al. 2006). Another disadvantage of the method is that it does not allow for the direct examination of the correlation of creativity and intelligence, as well as its change across the intelligence continuum. In general, studying the factor variance of creativity over the intelligence continuum might indicate a notable change or threshold (e.g., Holling and Kuhn 2008), i.e., a systematic increase or decrease in factor variance is one way that (de-)differentiation can manifest (Molenaar et al. 2010). However, multi-group confirmatory factor analyses that rely on discretizing a continuous variable at an arbitrary point can mask such a change in the variance. A recent extension of the structural equation models that ameliorates the drawback of an artificial dichotomization of the continuous variable intelligence is LSEM (Hildebrandt et al. 2016). In a nutshell, LSEM involves the fitting of several “conventional” structural equation models along the distribution of a continuous moderator with weighted observations (Olaru et al. 2019). 
The weight of each observation is based on its proximity to a specific value of the moderator, so that observations near this focal point contribute more information to model estimation than more distant observations. In the present context, a series of measurement models for creativity was estimated with intelligence as a continuous moderator. Based on this method, changes in model fit, the factor structure, mean values, and variances can be examined without splitting the sample into arbitrary groups (see, for example, Hartung et al. 2020).

6. The Present Studies

The threshold hypothesis is often attributed to Guilford, although he proposed a necessary but not sufficient condition between intelligence and creativity. In fact, he opposed the idea of a threshold based on empirical findings (Guilford and Christensen 1973). Since then, the threshold hypothesis has developed a life of its own, despite the weak empirical support. In our reading, the theoretical basis of the cognitive mechanisms behind the threshold hypothesis, as well as the data analytical approaches, is often not treated with the necessary rigor. Applying Occam's razor, no threshold should be assumed or postulated unless convincingly demonstrated otherwise. In the present manuscript, we re-analyzed two data sets that varied with respect to participants' age and the indicators of creativity and intelligence with different analytical strategies. More specifically, we evaluated the relation between intelligence and creativity in both data sets based on the following analytical strategies: (a) scatterplots and heteroscedasticity analysis, (b) segmented regression analysis, and (c) local structural equation models.

7. Method

7.1. Samples and Design

7.1.1. Study 1

The first data set included measures of intelligence, emotional intelligence, and creativity. It was published in the context of investigating the self-other knowledge asymmetry (Neubauer et al. 2018). After data cleaning (excluding n = 6 multivariate outliers with a Mahalanobis distance > 15; Meade and Craig 2012), the total sample included N = 456 adolescents and young adults (ranging from 13 years to 20 years). About 55% of the participants were female. The students were recruited from 13 different public and private schools in rural and urban areas of Austria. For more information, please see Neubauer et al. (2018). The dataset is available online via the Open Science Framework (OSF).

7.1.2. Study 2

The second data set was part of a larger multivariate study of creativity and its covariates (Goecke et al. 2020; Steger et al. 2020; Weiss et al. 2020a). The analysis was based on N = 438 participants after excluding n = 12 multivariate outliers with a Mahalanobis distance > 15. Two participants showed high-end performance regarding all creativity indicators. They were not excluded from the data set as they were not flagged as multivariate outliers. The sample included adults between 18 and 49 years. About 65% of the participants were female. For more information regarding the sample and data preparation, see Weiss et al. (2020a). The dataset is available online via the Open Science Framework (OSF).

7.2. Measures and Scoring

7.2.1. Study 1

In the first study by Neubauer et al. (2018), intelligence was measured based on the "Intelligenz-Struktur-Analyse" (ISA; Fay et al. 2001), which includes three subtests for verbal, numerical, and spatial reasoning. Creativity was measured using three items from the "Alternate Uses Task" (Jauk et al. 2013). Participants were instructed to name as many original alternate uses for an umbrella, plastic bottle, and a shoe as possible within two minutes. We presented the results for the fluency scoring of answers, i.e., the human coding of the quantity of solutions (for more information, see Neubauer et al. 2018). The fluency scores matched the instruction, were highly correlated with the originality scores, and are frequently applied in the literature. Additionally, we present the results based on originality scores in the supplementary material (Figures S2–S4).

7.2.2. Study 2

In the second study, intelligence was measured using the verbal and figural subtest of the “Berlin Test of Fluid and Crystallized Intelligence” (Wilhelm et al. 2014). Divergent thinking was measured based on six verbal and figural tests that were either instructed for fluency or originality. The similar attributes test (including 6 items) and the inventing names test (including 18 items) were both adapted from verbal creativity tests (Schoppe 1975). The other two fluency indicators were a typical retrieval fluency test (including 6 items), and the figural fluency test (including 4 items; Jäger et al. 1997). All fluency indicators were rated by humans for the frequency of solutions. Two additional tests (combining objects, French et al. 1963) and inventing nicknames (Schoppe 1975) were rated for the originality/creativity of solutions. Three human raters scored participants’ answers on a five-point rating scale (Amabile 1982; Silvia et al. 2008). For more detailed information, please see Weiss et al. (2020a).

7.3. Statistical Analyses

The heteroscedasticity analysis and segmented regression analysis were based on manifest variables. We used z-standardized mean values, including either all creativity indicators (Study 1: three items of the Alternate Uses Task; Study 2: six tests of fluency and originality) or all intelligence indicators (Study 1: three subtests for verbal, figural, and numerical fluid intelligence; Study 2: indicators for figural and verbal fluid intelligence). The local structural equation modeling relied on a measurement model for creativity using z-standardized values. In Study 1, the measurement model was just-identified with the three indicators of the alternate uses task, whereas in Study 2 the model fitted the data well (χ2(9) = 13.31, p = 0.15, CFI = 0.99, RMSEA = 0.03, SRMR = 0.03). In comparison to Weiss et al. (2020a), we modeled creativity as a single factor of divergent thinking, excluding the nested originality factor in the present analysis because it showed low factor saturation and factor variance, which caused estimation problems in LSEM.

7.3.1. Scatterplots and Heteroscedasticity

First, we investigated whether a threshold existed using a scatterplot analysis. Since the visual inspection of scatterplots is highly subjective, we additionally tested for heteroscedasticity. We assumed that if a non-linear (e.g., threshold-like) relationship between creativity and intelligence existed, the residuals of a linear regression should be heteroscedastic, which can be tested with the Breusch–Pagan test (Breusch and Pagan 1979). The null hypothesis of the Breusch–Pagan test assumes a constant residual variance (i.e., homoscedasticity). A non-significant test for heteroscedasticity would render the existence of a threshold very unlikely.
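The logic of the Breusch–Pagan test can be sketched in a few lines: regress the squared residuals of the linear model on the predictor and compare n·R² of this auxiliary regression against a χ² distribution. The sketch below uses Python with simulated data purely for illustration (the analyses reported here were run in R, where lmtest::bptest provides the same test); the generating models are illustrative assumptions.

```python
import numpy as np
from scipy import stats

def breusch_pagan(x, y):
    """Breusch-Pagan LM test for heteroscedastic residuals of y ~ x."""
    X = np.column_stack([np.ones_like(x), x])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    e2 = (y - X @ beta) ** 2                 # squared residuals
    gamma, *_ = np.linalg.lstsq(X, e2, rcond=None)
    fitted = X @ gamma                       # auxiliary regression e2 ~ x
    r2 = 1 - np.sum((e2 - fitted) ** 2) / np.sum((e2 - e2.mean()) ** 2)
    lm = len(y) * r2                         # LM statistic = n * R^2
    return lm, stats.chi2.sf(lm, df=X.shape[1] - 1)

# Simulated illustration: residual spread increasing with x vs. constant
rng = np.random.default_rng(2)
x = rng.uniform(0.0, 1.0, 2000)
y_het = x + (0.2 + 2.0 * x) * rng.normal(size=2000)  # heteroscedastic
y_hom = x + 0.5 * rng.normal(size=2000)              # homoscedastic

lm_het, p_het = breusch_pagan(x, y_het)
lm_hom, p_hom = breusch_pagan(x, y_hom)
```

For the heteroscedastic data the p-value is essentially zero, while the homoscedastic data typically yield a non-significant result, illustrating why a non-significant test speaks against a threshold-like relation.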

7.3.2. Segmented Regression Analysis

We used segmented regression analysis as a second approach to investigate the threshold hypothesis. In this case, a significant change in the slope of the linear regression between the two segments indicates the existence of a threshold. In both studies, intelligence was analyzed as the independent variable and divergent thinking as the dependent variable. In addition to estimating the number and position of possible breakpoints, we used the Davies test to examine whether any breakpoint occurred between the second smallest and the second largest value of intelligence (Davies 2002; Muggeo 2008), using the recommended default of 10 evaluation points. A non-significant Davies test indicates that the regression parameters are constant across the complete intelligence range.
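The core of segmented regression can be sketched as a grid search over candidate breakpoints: for each candidate c, a continuous piecewise-linear model y = b0 + b1·x + b2·(x − c)+ is fitted, the candidate minimizing the residual sum of squares is retained, and b2 quantifies the change in slope at the break. This is a simplified stand-in for the iterative estimator in the R package segmented, shown here in Python on simulated data with a built-in break at z = 1.33; all numeric choices are illustrative assumptions.

```python
import numpy as np

def fit_segmented(x, y, grid):
    """Grid-search a single breakpoint c in y = b0 + b1*x + b2*max(x - c, 0)."""
    best = None
    for c in grid:
        X = np.column_stack([np.ones_like(x), x, np.clip(x - c, 0.0, None)])
        beta, *_ = np.linalg.lstsq(X, y, rcond=None)
        sse = np.sum((y - X @ beta) ** 2)
        if best is None or sse < best[0]:
            best = (sse, c, beta)
    return best[1], best[2]  # breakpoint estimate, (b0, b1, b2)

# Simulated data with a true break: slope 0.5 below z = 1.33, flat above
rng = np.random.default_rng(4)
x = rng.normal(size=5000)
y = 0.5 * np.minimum(x, 1.33) + 0.3 * rng.normal(size=5000)

c_hat, (b0, b1, b2) = fit_segmented(x, y, np.linspace(-1.5, 2.0, 71))
```

Here the estimated breakpoint lands near 1.33 and b2 is close to −0.5, the drop in slope above the threshold; on data without a break, b2 would be indistinguishable from zero, which is what the Davies test formalizes.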

7.3.3. Local Structural Equation Modeling

Finally, we used LSEM to investigate the threshold hypothesis. In contrast to MGCFA, which relies on the categorization of intelligence as a moderator (e.g., Holling and Kuhn 2008), LSEM allows for the investigation of the factor structure of creativity (Figure 2) across the intelligence continuum. LSEM is a person-sampling method used to investigate deviations in the measurement model across observations (Olaru et al. 2019). Whereas MGCFA requires the grouping of participants, the observations in LSEM are weighted as a function of their proximity to a focal point of intelligence (Hildebrandt et al. 2009). The weights are normally distributed around the focal point: observations at the focal point receive full weight, and weights decrease according to the probability density of the normal distribution with increasing distance from the focal point. For example, if the measurement model of divergent thinking (Figure 2) is estimated at the focal point z = 1.33, all participants with an intelligence score of z = 1.33 are assigned the highest weight (i.e., 1), and weights decrease the more distant a score is from z = 1.33. For each focal point of intelligence, the measurement model of creativity is estimated based on the weighted sample (Hildebrandt et al. 2016). In Studies 1 and 2, we used general intelligence as the moderator with a grid of focal points in steps of z = 0.5, ranging from z = −1.50 to z = 1.50, resulting in seven focal points. The effective sample sizes ranged between Neff ≈ 106 and Neff ≈ 215 in Study 1 and between Neff ≈ 92 and Neff ≈ 223 in Study 2.
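The weighting scheme can be sketched as follows. This is a simplified Python illustration; the sirt package handles bandwidth selection and estimation internally, and the bandwidth value and simulated moderator scores below are assumptions, not the settings used in the studies. Summing the weights is one common way to express the effective sample size at a focal point.

```python
import math
import random

def lsem_weights(moderator, focal, bw):
    """Gaussian kernel weights: 1 at the focal point, decreasing with
    the normal probability density as distance from the focal point grows."""
    return [math.exp(-0.5 * ((m - focal) / bw) ** 2) for m in moderator]

random.seed(3)
iq = [random.gauss(0, 1) for _ in range(456)]      # z-standardized moderator
focal_points = [i * 0.5 for i in range(-3, 4)]     # -1.5, -1.0, ..., 1.5
# effective sample size at each focal point = sum of kernel weights
n_eff = {f: sum(lsem_weights(iq, f, bw=1.0)) for f in focal_points}
```

Because the moderator is roughly normally distributed, the effective sample size shrinks toward the tails of the grid, which is exactly why the studies report smaller Neff at the extreme focal points.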

7.4. Open Science

We conducted all analyses using R version 4.0.2. Segmented regression analyses were estimated using the R package segmented (Muggeo 2008), whereas LSEM was conducted using the packages lavaan and sirt (Robitzsch 2020; Rosseel 2012). To make the present analyses transparent and reproducible, we provided all material (i.e., data set of Study 2, syntax, and supplemental material) at the Open Science Framework. The data set of Study 1 is available online. We also report descriptive statistics (i.e., mean values, standard deviations, and correlations) for the indicators used in the following analysis in the supplementary material (Table S1 and Figure S1).

8. Results

8.1. Scatterplots and Heteroscedasticity

Scatterplots and tests for heteroscedasticity were our first means to investigate the datasets and to screen for breakpoints in the relation between creativity and intelligence (see Figure 3). In Study 1, the correlation between creativity and intelligence was lower (r = 0.19, p < 0.01) than in Study 2 (r = 0.27, p < 0.01). At first glance, the scatterplots (Figure 3, upper part) showed no sign of a threshold. The heteroscedasticity plots (Figure 3, lower part) showed flat lines based on the loess smoothing function, indicating evenly distributed residuals across the fitted values. Additionally, the Breusch–Pagan test for heteroscedasticity was not significant in either study (Study 1: BP(1) = 0.64, p = 0.42; Study 2: BP(1) = 1.16, p = 0.28), so that homoscedasticity could be assumed. The scatterplot and heteroscedasticity plot based on the originality scores (Study 1) are presented in the supplementary material (Figure S2). The Breusch–Pagan test was not significant for originality (BP(1) = 0.47, p = 0.49).

8.2. Segmented Regression Analysis

The segmented regression analysis estimates breakpoints in an otherwise linear relationship between two variables. For all breakpoints, the change in slope was not significant; Figure 4 displays the largest change in slope for Studies 1 and 2. The largest change in slope for the originality indicators (Study 1), which was also not significant, is presented in the supplementary material (Figure S3). In sum, there was no evidence for the threshold hypothesis using segmented regression analysis. Nevertheless, we estimated ΔR2 based on Fisher's z-standardized correlation coefficients with z = 1.33 as a breakpoint, because this cutoff has often been selected as a potential threshold. In both studies, the number of participants above the breakpoint was small (n1 = 47, n2 = 43). The resulting difference was ΔR2 = 0.06 in Study 1 and ΔR2 = 0.05 in Study 2.
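The classic two-group comparison implicit in such threshold tests, correlating creativity and intelligence separately below and above z = 1.33 and testing the difference via Fisher's r-to-z transformation, can be sketched as follows. The correlations and group sizes plugged in are hypothetical illustrations, not values from the studies.

```python
import math

def fisher_z_diff(r1, n1, r2, n2):
    """z test for the difference between two independent correlations
    using Fisher's r-to-z transformation."""
    z1, z2 = math.atanh(r1), math.atanh(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return z, p

# Hypothetical example: r = .25 below the cutoff (n = 400), r = .10 above (n = 43)
z, p = fisher_z_diff(0.25, 400, 0.10, 43)
```

With only about 40 participants above the cutoff, the standard error is dominated by the small group, so even sizable-looking differences in r remain non-significant. This illustrates why correlation comparisons at z = 1.33 rest on weak evidence.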

8.3. Local Structural Equation Models

To detect possible changes in the factor variance of creativity along the intelligence continuum as an indication of a threshold, we fitted local structural equation models in Studies 1 and 2. The model in Study 1 was identified, and the measurement model for creativity fitted well along the intelligence continuum in Study 2, with a slight deterioration in model fit at the tails of the distribution (CFImin = 0.92, RMSEAmax = 0.10, and SRMRmax = 0.05). No systematic changes in the factor variance of divergent thinking across general intelligence as a moderator were detectable (see Figure 5; see Figure S4 in the supplement for changes in the factor variance of originality). Furthermore, we fitted a model that constrained the factor loadings to equality to examine whether model fit deteriorated. The constraints were introduced with the joint estimation approach for LSEM (the separate models at the focal points are estimated equivalently in a multiple-group model context; implemented in sirt::lsem.estimate). Similar factor loadings and no decrement in model fit contradict the idea of a threshold. The loadings at the different focal points in Studies 1 and 2 are displayed in the supplementary material (Study 1: Figure S5; Study 2: Figure S6). As there was no significant change in model fit, it can be assumed that the loadings did not change substantially across focal points in either study.
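As a rough illustration of what LSEM inspects, one can compute a weighted covariance matrix of the creativity indicators at each focal point and track its leading eigenvalue as a crude proxy for common variance. This is a deliberate simplification of the measurement models estimated with sirt::lsem.estimate, and the simulated indicators (a single factor unrelated to the moderator) are illustrative.

```python
import math
import random

def weighted_cov(rows, w):
    """Weighted covariance matrix of row-wise observations."""
    wsum = sum(w)
    p = len(rows[0])
    mean = [sum(wi * r[j] for wi, r in zip(w, rows)) / wsum for j in range(p)]
    return [[sum(wi * (r[i] - mean[i]) * (r[j] - mean[j])
                 for wi, r in zip(w, rows)) / wsum
             for j in range(p)] for i in range(p)]

def leading_eigenvalue(m, iters=200):
    """Largest eigenvalue of a symmetric PSD matrix via power iteration."""
    v = [1.0] * len(m)
    lam = 0.0
    for _ in range(iters):
        nv = [sum(m[i][j] * v[j] for j in range(len(m))) for i in range(len(m))]
        lam = math.sqrt(sum(x * x for x in nv))
        v = [x / lam for x in nv]
    return lam

random.seed(11)
n = 450
iq = [random.gauss(0, 1) for _ in range(n)]              # moderator
f = [random.gauss(0, 1) for _ in range(n)]               # latent factor
# three indicators loading 0.7 on the factor, unrelated to the moderator
rows = [[0.7 * f[k] + random.gauss(0, 0.5) for _ in range(3)] for k in range(n)]
profile = []
for focal in [i * 0.5 for i in range(-3, 4)]:
    w = [math.exp(-0.5 * (z - focal) ** 2) for z in iq]  # kernel weights
    profile.append(leading_eigenvalue(weighted_cov(rows, w)))
```

A flat eigenvalue profile across the focal points, analogous to the flat factor variances reported here, speaks against a threshold.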

9. Discussion

Investigations of a change in the relation between variables above and below a threshold are not limited to creativity research, but can be encountered in many fields, such as second language learning (e.g., Cummins 1979). In our reading, these threshold hypotheses share a common origin: they are overgeneralizations of evidence derived mainly from studies with small sample sizes. Moreover, these studies often lacked comprehensive theoretical underpinnings, which stands in stark contrast to the extensive attention these hypotheses have attracted over the past decades. Threshold assumptions should therefore be met with skepticism, given various conceptual and methodological problems (Takakuwa 2003), and they prompt some essential questions.

9.1. Does a Threshold Exist?

In the present case, we reanalyzed two studies with different operationalizations of both fluency/originality and intelligence, using three analytical approaches to investigate a potential threshold. Despite these efforts, we were unable to find any compelling evidence for the existence of a threshold. First, the scatterplots of intelligence and creativity did not show any abnormalities, and the data were homoscedastic. Second, we found no significant breakpoints using segmented regression analysis. Finally, the factor variance and factor loadings of a measurement model of creativity did not change across the intelligence continuum. Moreover, since our findings were based on relatively large samples, including different age groups and a variety of measures of both constructs, we deem it unlikely that our results were distorted by a lack of power or sampling issues. This finding is congruent with a number of previous studies that were also unable to find support for an intelligence-creativity threshold (e.g., Preckel et al. 2006; Sligh et al. 2005). Remember that Guilford himself led the way when he concluded from two large multivariate studies that he found “no evidence to support a threshold hypothesis regarding the relation of creative potential to IQ” (Guilford and Christensen 1973, p. 252). We concur with this statement. Despite our systematic approach, we did not find any evidence to support the existence of a threshold.

9.2. Why Do Researchers Keep on Finding Evidence Anyway?

While the inference drawn from our results is unambiguous, previous research on the existence of a threshold of creativity and intelligence is not. A narrative review might conclude that the results are mixed, with some evidence against a threshold (e.g., Holling and Kuhn 2008; Preckel et al. 2006) and some evidence in favor of one (e.g., Jauk et al. 2013; Karwowski et al. 2016). What are potential causes for these inconclusive results? As we saw in the literature review, some differences were caused by the choice of specific analytical approaches, yet the problem goes deeper. Perhaps most apparent is that the threshold is not set a priori. Short of a convincing theory, these cutoffs are arbitrary and leave ample room for researchers' degrees of freedom in the data analysis. This problem is exacerbated by the different handling of outliers, the choice of analytical tools, etc. (Simmons et al. 2011). Declaring a threshold presupposes its existence, and a specific number suggests a precision rarely found in the behavioral sciences; in this sense, the threshold is identified in a purely data-driven manner rather than derived theoretically. Thresholds also suggest a qualitative difference between people below and above the value that is implausible for creativity and intelligence specifically, and for psychological dispositions more generally. Even more nuanced approaches, such as the conditional threshold theory (Harris et al. 2019), which supposes that openness plays a critical role in the intelligence-creativity threshold, add further complexity and researchers' degrees of freedom. There are additional shortcomings of the prevalent data analytical strategy, such as violated model assumptions (e.g., normally distributed and homoscedastic residuals; Gelman and Hill 2006). In sum, the statistical issues discussed here presumably led to the inconsistent results reported in the literature.
It is important to note that there seems to be a confirmation bias in psychology. This bias typically occurs when subjects are asked to evaluate ambiguous evidence and see their initial expectations confirmed. Equipped with the hypothesis that the relation between intelligence and creativity is weaker above some threshold, and given the inconclusive literature with partial support for a threshold, researchers are more likely to conclude that a threshold exists than to contemplate why their results are at odds with what seems to be a compelling and positive result. Indeed, a critical reader will likely suspect that studies unable to find a threshold were somehow flawed, perhaps suffering from methodological deficiencies such as small sample sizes, inadequate measures, or other biases. This suspicion is often justified, since most of the research (including the “positive” findings) suffers from these shortcomings (Ioannidis 2005). In the same vein, researchers confronted with a negative result might feel the urge to search a little harder to escape such allegations, or to get their results published more easily (Bakker et al. 2012). We are afraid this explanatory bias helps the threshold hypothesis escape extinction.

9.3. How Should We Approach the Threshold Hypothesis?

We aimed to shed light on a research question that has produced diverging results for over 50 years. We applied analytical strategies that have been used previously, such as the test of heteroscedasticity and segmented regression analysis, but both approaches usually rely on manifest variables. Therefore, we proposed local structural equation modeling as an additional, novel, and powerful analytical tool for a continuous treatment of moderators. However, LSEM requires large sample sizes to estimate models at each focal point. In the case of the intelligence-creativity threshold hypothesis, this implies that large sample sizes (about N = 150; e.g., Muthén and Muthén 2002) are needed at the tails of the distribution, which further increases sampling difficulties for normally distributed variables such as creativity and intelligence. In contrast to other methods, LSEM allows for the detection of non-linear trends and an investigation into the origins of violations of measurement invariance (e.g., Hartung et al. 2020; Olaru et al. 2019).
With the present manuscript, we sought to demonstrate that the search for a specific threshold between intelligence and creativity is a wild goose chase. That said, we do not want to discourage theoretically well-informed studies conducted with the necessary methodological rigor. However, we remain skeptical that a profound theoretical basis exists for further assuming a threshold or a non-linear relationship. In sum, there is no convincing evidence, theoretical or analytical, for the existence of a threshold in the relation between creativity and intelligence. Intelligence is certainly relevant for producing divergent ideas, but the relation appears linear across the continuum of intelligence. If measured broadly, the magnitude of the correlation also falls within an expectable range, which mitigates prior concerns about the strength of the relation between intelligence and creativity. We assume that differentiation will not appear for other factors of creativity (e.g., originality) and intelligence (e.g., crystallized intelligence; Sligh et al. 2005). Nevertheless, studying such aspects in the future, for example, the relation between general retrieval ability, creative retrieval, and crystallized intelligence (e.g., Forthmann et al. 2019), or the overlap between fluency and originality, would further our understanding of cognitive abilities and their relationship with creativity.

Supplementary Materials

The following are available online: Table S1: Descriptive Statistics for all Indicators, Figure S1: Correlations and Bivariate Scatterplots between Manifest Scores, Figure S2: Scatterplot and Heteroscedasticity Plot in Study 1: Originality, Figure S3: Segmented Regression Analysis in Study 1: Originality, Figure S4: Factor Variances at each Focal Point along the Intelligence Continuum in Study 1: Originality, Figure S5: Loadings at the Focal Points in Dataset 1, Figure S6: Loadings at the Focal Points in Dataset 2.

Author Contributions

Conceptualization, S.W., D.S., U.S., and O.W.; Methodology, S.W., D.S., U.S., and O.W.; Validation, D.S., U.S., and O.W.; Formal Analysis, S.W.; Investigation, S.W. and D.S.; Resources, O.W. and U.S.; Data Curation, S.W.; Writing—Original Draft Preparation, S.W.; Writing—Review & Editing, D.S., U.S., and O.W.; Visualization, S.W.; Supervision, O.W. and U.S.; Project Administration, S.W.; Funding Acquisition, O.W. and U.S. All authors have read and agreed to the published version of the manuscript.

Funding
This research received no external funding.

Acknowledgments
We thank Andrea Hildebrandt and Yadwinder Kaur for recruitment and data collection, as well as for helping to conceptualize the creativity measurement. We also thank the numerous research assistants who helped during data collection and with the human coding of the creativity measures. We are also thankful for the possibility to analyze the data set published by Neubauer et al. (2018), and we thank Aljoscha Neubauer, Anna Pribil, Alexandra Wallner, and Gabriela Hofer for allowing the publication of these results.

Conflicts of Interest

The authors declare no conflict of interest.

References
  1. Amabile, Teresa. M. 1982. Social psychology of creativity: A consensual assessment technique. Journal of Personality and Social Psychology 43: 997–1013. [Google Scholar] [CrossRef]
  2. Arendasy, M., L. F. Hornke, M. Sommer, J. Häusler, M. Wagner-Menghin, G. Gittler, and M. Wenzl. 2004. Manual Intelligence-Structure-Battery (INSBAT). Mödling: Schuhfried Gmbh. [Google Scholar]
  3. Bakker, Marjan, Annette van Dijk, and Jelte M. Wicherts. 2012. The rules of the game called psychological science. Perspectives on Psychological Science 7: 543–54. [Google Scholar] [CrossRef] [PubMed][Green Version]
  4. Benedek, Mathias, Fabiola Franz, Moritz Heene, and Aljoscha C. Neubauer. 2012. Differential effects of cognitive inhibition and intelligence on creativity. Personality and Individual Differences 53: 480–85. [Google Scholar] [CrossRef] [PubMed][Green Version]
  5. Breit, Moritz, Martin Brunner, and Franzis Preckel. 2020. General intelligence and specific cognitive abilities in adolescence: Tests of age differentiation, ability differentiation, and their interaction in two large samples. Developmental Psychology 56: 364–84. [Google Scholar] [CrossRef]
  6. Breusch, Trevor S., and Adrian R. Pagan. 1979. A simple test for heteroscedasticity and random coefficient variation. Econometrica 47: 1287–94. [Google Scholar] [CrossRef]
  7. Byrne, Barbara M. 2010. Structural Equation Modeling with AMOS: Basic Concepts, Applications, and Programming, 2nd ed. London: Routledge. [Google Scholar]
  8. Carroll, John B. 1993. Human Cognitive Abilities: A Survey of Factor-Analytic Studies. Cambridge: Cambridge University Press. [Google Scholar]
  9. Cattell, Raymond B., and A. K. S. Cattell. 1960. Culture Fair Intelligence Test: Scale 2. Champaign: Institute for Personality and Ability Testing. [Google Scholar]
  10. Cho, Sun Hee, Jan Te Nijenhuis, Annelies E. M. Vianen, Heui-Baik Kim, and Kun Ho Lee. 2010. The relationship between diverse components of intelligence and creativity. The Journal of Creative Behavior 44: 125–37. [Google Scholar] [CrossRef]
  11. Cropley, Arthur. 2006. In praise of convergent thinking. Creativity Research Journal 18: 391–404. [Google Scholar] [CrossRef]
  12. Cummins, James. 1979. Linguistic interdependence and the educational development of bilingual children. Review of Educational Research 49: 222–51. [Google Scholar] [CrossRef]
  13. Davies, Robert B. 2002. Hypothesis testing when a nuisance parameter is present only under the alternative: Linear model case. Biometrika 89: 484–89. [Google Scholar] [CrossRef]
  14. Diedrich, Jennifer, Emanuel Jauk, Paul J. Silvia, Jeffrey M. Gredlein, Aljoscha C. Neubauer, and Mathias Benedek. 2018. Assessment of real-life creativity: The Inventory of Creative Activities and Achievements (ICAA). Psychology of Aesthetics, Creativity, and the Arts 12: 304–16. [Google Scholar] [CrossRef]
  15. Dul, Jan. 2016. Necessary condition analysis (NCA) logic and methodology of “necessary but not sufficient” causality. Organizational Research Methods 19: 10–52. [Google Scholar] [CrossRef]
  16. Dumas, Denis. 2018. Relational reasoning and divergent thinking: An examination of the threshold hypothesis with quantile regression. Contemporary Educational Psychology 53: 1–14. [Google Scholar] [CrossRef]
  17. Fay, E., G. Trost, and G. Gittler. 2001. Intelligenz-Struktur-Analyse (ISA). Frankfurt: Swets Test Services. [Google Scholar]
  18. Finke, Ronald A. 1990. Creative Imagery: Discoveries as Inventions Invisualization. Mahwah: Lawrence Erlbaum Associates, Inc. [Google Scholar]
  19. Forthmann, Boris, David Jendryczko, Jana Scharfen, Ruben Kleinkorres, Mathias Benedek, and Heinz Holling. 2019. Creative ideation, broad retrieval ability, and processing speed: A confirmatory study of nested cognitive abilities. Intelligence 75: 59–72. [Google Scholar] [CrossRef]
  20. French, John W., Ruth B. Ekstrom, and Leighton A. Price. 1963. Manual for Kit of Reference Tests for Cognitive Factors (Revised 1963). Princeton: Educational Testing Service. [Google Scholar]
  21. Fuchs-Beauchamp, Karen D., Merle B. Karnes, and Lawrence J. Johnson. 1993. Creativity and intelligence in preschoolers. Gifted Child Quarterly 37: 113–17. [Google Scholar] [CrossRef]
  22. Garrett, Henry E. 1946. A developmental theory of intelligence. American Psychologist 1: 372–78. [Google Scholar] [CrossRef] [PubMed]
  23. Gelman, Andrew, and Jennifer Hill. 2006. Data Analysis Using Regression and Multilevel/hierarchical Models. Cambridge: Cambridge University Press. [Google Scholar]
  24. Getzels, Jacob W., and Philip W. Jackson. 1962. Creativity and Intelligence: Explorations with Gifted Students. Hoboken: Wiley. [Google Scholar]
  25. Gilhooly, K. J., Evie Fioratou, S. H. Anthony, and V. Wynn. 2007. Divergent thinking: Strategies and executive involvement in generating novel uses for familiar objects. British Journal of Psychology 98: 611–25. [Google Scholar] [CrossRef][Green Version]
  26. Goecke, Benjamin, Selina Weiss, Diana Steger, Ulrich Schroeders, and Oliver Wilhelm. 2020. It's more about what you don't know than what you do know: Perspectives on overclaiming. Intelligence 81. [Google Scholar] [CrossRef]
  27. Guilford, J. P., and Paul R. Christensen. 1973. The one-way relation between creative potential and IQ*. The Journal of Creative Behavior 7: 247–52. [Google Scholar] [CrossRef]
  28. Guilford, Joy Paul. 1967. The Nature of Human Intelligence. New York: McGraw-Hill. [Google Scholar]
  29. Harris, Alexandra M., Rachel L. Williamson, and Nathan T. Carter. 2019. A conditional threshold hypothesis for creative achievement: On the interaction between intelligence and openness. Psychology of Aesthetics, Creativity, and the Arts 13: 322–37. [Google Scholar] [CrossRef]
  30. Hartung, Johanna, Laura E. Engelhardt, Megan L. Thibodeaux, K. Paige Harden, and Elliot M. Tucker-Drob. 2020. Developmental transformations in the structure of executive functions. Journal of Experimental Child Psychology 189: 104681. [Google Scholar] [CrossRef]
  31. Hartung, Johanna, Philipp Doebler, Ulrich Schroeders, and Oliver Wilhelm. 2018. Dedifferentiation and differentiation of intelligence in adults across age and years of education. Intelligence 69: 37–49. [Google Scholar] [CrossRef]
  32. Heitz, Richard P., Nash Unsworth, and Randall W. Engle. 2005. Working memory capacity, attention control, and fluid intelligence. In Handbook of Understanding and Measuring Intelligence. Edited by O. Wilhelm and R. W. Engle. Thousand Oaks: Sage Publications, pp. 61–78. [Google Scholar]
  33. Hildebrandt, Andrea, Oliver Lüdtke, Alexander Robitzsch, Christopher Sommer, and Oliver Wilhelm. 2016. Exploring factor model parameters across continuous variables with local structural equation models. Multivariate Behavioral Research 51: 257–78. [Google Scholar] [CrossRef] [PubMed]
  34. Hildebrandt, Andrea, Oliver Wilhelm, and Alexander Robitzsch. 2009. Complementary and competing factor analytic approaches for the investigation of measurement invariance. Review of Psychology 16: 87–102. [Google Scholar]
  35. Holling, Heinz, and Jörg-Tobias Kuhn. 2008. Does intellectual giftedness affect the factor structure of divergent thinking? Evidence from a MG-MACS analysis. Psychology Science Quarterly 50: 283–94. [Google Scholar]
  36. Hülür, Gizem, Oliver Wilhelm, and Alexander Robitzsch. 2011. Intelligence differentiation in early childhood. Journal of Individual Differences 32: 170–79. [Google Scholar] [CrossRef]
  37. Ilagan, Michael John, and Welfredo Patungan. 2018. The relationship between intelligence and creativity: On methodology for necessity and sufficiency. Archives of Scientific Psychology 6: 193–204. [Google Scholar] [CrossRef]
  38. Ioannidis, John P. A. 2005. Why most published research findings are false. PLoS Medicine 2: e124. [Google Scholar] [CrossRef][Green Version]
  39. Jäger, A. O., Heinz Martin Süß, and A. Beauducel. 1997. Berliner Intelligenzstruktur-Test: BIS-Test. Göttingen: Hogrefe. [Google Scholar]
  40. Jauk, Emanuel, Mathias Benedek, Beate Dunst, and Aljoscha C. Neubauer. 2013. The relationship between intelligence and creativity: New support for the threshold hypothesis by means of empirical breakpoint detection. Intelligence 41: 212–21. [Google Scholar] [CrossRef][Green Version]
  41. Karwowski, Maciej, and Jacek Gralewski. 2013. Threshold hypothesis: Fact or artifact? Thinking Skills and Creativity 8: 25–33. [Google Scholar] [CrossRef]
  42. Karwowski, Maciej, Dorota M. Jankowska, Arkadiusz Brzeski, Marta Czerwonka, Aleksandra Gajda, Izabela Lebuda, and Ronald A. Beghetto. 2020. Delving into creativity and learning. Creativity Research Journal 32: 4–16. [Google Scholar] [CrossRef]
  43. Karwowski, Maciej, Jan Dul, Jacek Gralewski, Emanuel Jauk, Dorota M. Jankowska, Aleksandra Gajda, Michael H. Chruszczewski, and Mathias Benedek. 2016. Is creativity without intelligence possible? A Necessary Condition Analysis. Intelligence 57: 105–17. [Google Scholar] [CrossRef]
  44. Karwowski, Maciej, Marta Czerwonka, and James C. Kaufman. 2018. Does intelligence strengthen creative metacognition? Psychology of Aesthetics, Creativity, and the Arts 14: 353–60. [Google Scholar] [CrossRef]
  45. Kaufman, Alan S., and Nadeen L. Kaufman. 1993. The Kaufman Adolescent and Adult Intelligence Test Manual. Circle Pines: American Guidance Service. [Google Scholar]
  46. Kim, Kyung Hee. 2005. Can only intelligent people be creative? A meta-analysis. Journal of Secondary Gifted Education 16: 57–66. [Google Scholar] [CrossRef]
  47. Legree, Peter J., Mark E. Pifer, and Frances C. Grafton. 1996. Correlations among cognitive abilities are lower for higher ability groups. Intelligence 23: 45–57. [Google Scholar] [CrossRef]
  48. MacCallum, Robert C., Shaobo Zhang, Kristopher J. Preacher, and Derek D. Rucker. 2002. On the practice of dichotomization of quantitative variables. Psychological Methods 7: 19–40. [Google Scholar] [CrossRef] [PubMed]
  49. McGrew, Kevin S. 2009. CHC theory and the human cognitive abilities project: Standing on the shoulders of the giants of psychometric intelligence research. Intelligence 37: 1–10. [Google Scholar] [CrossRef]
  50. Meade, Adam. W., and S. Bartholomew Craig. 2012. Identifying careless responses in survey data. Psychological Methods 17: 437–55. [Google Scholar] [CrossRef][Green Version]
  51. Molenaar, Dylan, Conor V. Dolan, Jelte M. Wicherts, and Han LJ Van Der Maas. 2010. Modeling differentiation of cognitive abilities within the higher-order factor model using moderated factor analysis. Intelligence 38: 611–24. [Google Scholar] [CrossRef]
  52. Muggeo, Vito M. R. 2008. Segmented: An R package to fit regression models with broken-line relationships. R News 8: 20–25. [Google Scholar]
  53. Mumford, Michael D., and Tristan McIntosh. 2017. Creative thinking processes: The past and the future. The Journal of Creative Behavior 51: 317–22. [Google Scholar] [CrossRef]
  54. Muthén, Linda K., and Bengt O. Muthén. 2002. How to use a Monte Carlo study to decide on sample size and determine power. Structural Equation Modeling 9: 599–620. [Google Scholar] [CrossRef]
  55. Neubauer, Aljoscha C., Anna Pribil, Alexandra Wallner, and Gabriela Hofer. 2018. The self–other knowledge asymmetry in cognitive intelligence, emotional intelligence, and creativity. Heliyon 4: e01061. [Google Scholar] [CrossRef] [PubMed][Green Version]
  56. Nusbaum, Emily C., and Paul J. Silvia. 2011. Are intelligence and creativity really so different? Fluid intelligence, executive processes, and strategy use in divergent thinking. Intelligence 39: 36–45. [Google Scholar] [CrossRef][Green Version]
  57. Olaru, Gabriel, Ulrich Schroeders, Johanna Hartung, and Oliver Wilhelm. 2019. Ant colony optimization and local weighted structural equation modeling. A tutorial on novel item and person sampling procedures for personality research. European Journal of Personality 33: 400–19. [Google Scholar] [CrossRef]
  58. Preacher, Kristopher J., Derek D. Rucker, Robert C. MacCallum, and W. Alan Nicewander. 2005. Use of the extreme groups approach: A critical reexamination and new recommendations. Psychological Methods 10: 178–92. [Google Scholar] [CrossRef][Green Version]
  59. Preckel, Franzis, Heinz Holling, and Michaela Wiese. 2006. Relationship of intelligence and creativity in gifted and non-gifted students: An investigation of threshold theory. Personality and Individual Differences 40: 159–70. [Google Scholar] [CrossRef]
  60. Raven, John, John C. Raven, and John H. Court. 2003. Manual for Raven’s Progressive Matrices and Vocabulary Scales. Section 1: General Overview. San Antonio: Harcourt Assessment. [Google Scholar]
  61. Robitzsch, Alexander. 2020. sirt: Supplementary Item Response Theory Models. Available online: (accessed on 20 January 2020).
  62. Rosseel, Yves. 2012. Lavaan: An R package for structural equation modeling and more. Version 0.5–12 (BETA). Journal of Statistical Software 48: 1–36. [Google Scholar] [CrossRef][Green Version]
  63. Runco, Mark A. 2008. Commentary: Divergent thinking is not synonymous with creativity. Psychology of Aesthetics, Creativity, and the Arts 2: 93–96. [Google Scholar] [CrossRef]
  64. Ryan, Sandra E., and Laurie S. Porth. 2007. A Tutorial on the Piecewise Regression Approach Applied to Bedload Transport Data; Washington: US Department of Agriculture, Forest Service, Rocky Mountain Research Station.
  65. Schoppe, Karl-Josef. 1975. Verbaler Kreativitäts-Test-VKT: ein Verfahren zur Erfassung verbal-produktiver Kreativitätsmerkmale. Göttingen: Verlag für Psychologie CJ Hogrefe. [Google Scholar]
  66. Shi, Baoguo, Lijing Wang, Jiahui Yang, Mengpin Zhang, and Li Xu. 2017. Relationship between divergent thinking and intelligence: An empirical study of the threshold hypothesis with Chinese children. Frontiers in Psychology 8: 254. [Google Scholar] [CrossRef][Green Version]
  67. Silvia, Paul J. 2015. Intelligence and creativity are pretty similar after all. Educational Psychology Review 27: 599–606. [Google Scholar] [CrossRef]
  68. Silvia, Paul J., Beate P. Winterstein, John T. Willse, Christopher M. Barona, Joshua T. Cram, Karl I. Hess, Jenna L. Martinez, and Crystal A. Richard. 2008. Assessing creativity with divergent thinking tasks: Exploring the reliability and validity of new subjective scoring methods. Psychology of Aesthetics, Creativity, and the Arts 2: 68–85. [Google Scholar] [CrossRef]
69. Silvia, Paul J., Roger E. Beaty, and Emily C. Nusbaum. 2013. Verbal fluency and creativity: General and specific contributions of broad retrieval ability (Gr) factors to divergent thinking. Intelligence 41: 328–40.
70. Simmons, Joseph P., Leif D. Nelson, and Uri Simonsohn. 2011. False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychological Science 22: 1359–66.
71. Simonsohn, Uri. 2018. Two lines: A valid alternative to the invalid testing of U-shaped relationships with quadratic regressions. Advances in Methods and Practices in Psychological Science 1: 538–55.
72. Sligh, Allison C., Frances A. Conners, and Beverly Roskos-Ewoldsen. 2005. Relation of creativity to fluid and crystallized intelligence. The Journal of Creative Behavior 39: 123–36.
73. Spearman, Charles. 1927. The Abilities of Man. New York: Macmillan.
74. Steger, Diana, Ulrich Schroeders, and Oliver Wilhelm. 2020. Caught in the act: Predicting cheating in unproctored knowledge assessment. Assessment.
75. Süß, Heinz Martin, and André Beauducel. 2005. Faceted models of intelligence. In Handbook of Understanding and Measuring Intelligence. Edited by Oliver Wilhelm and Randall W. Engle. Thousand Oaks: Sage Publications.
76. Takakuwa, Mitsunori. 2003. Lessons from a paradoxical hypothesis: A methodological critique of the threshold hypothesis. Paper presented at the 4th International Symposium on Bilingualism. Tempe: Arizona State University, p. 12.
77. Terman, Lewis, and Maud Merrill. 1973. Manual for the Third Revision (Form L-M) of the Stanford-Binet Intelligence Scale. Boston: Houghton Mifflin.
78. Torrance, E. Paul. 1981. Thinking Creatively in Action and Movement. Earth City: Scholastic Testing Service.
79. Torrance, E. Paul. 1999. Torrance Tests of Creative Thinking: Thinking Creatively with Pictures, Form A. Earth City: Scholastic Testing Service.
80. Tucker-Drob, Elliot M. 2009. Differentiation of cognitive abilities across the life span. Developmental Psychology 45: 1097–118.
81. Urban, Klaus K. 2005. Assessing creativity: The test for creative thinking—drawing production (TCT-DP). International Education Journal 6: 272–80.
82. Van Der Maas, Han L., Conor V. Dolan, Raoul P. Grasman, Jelte M. Wicherts, Hilde M. Huizenga, and Maartje E. Raijmakers. 2006. A dynamical model of general intelligence: The positive manifold of intelligence by mutualism. Psychological Review 113: 842–61.
83. Vandenberg, Robert J., and Charles E. Lance. 2000. A review and synthesis of the measurement invariance literature: Suggestions, practices, and recommendations for organizational research. Organizational Research Methods 3: 4–70.
84. Wechsler, David. 1981. WAIS Manual: Wechsler Adult Intelligence Scale—Revised. New York: The Psychological Corporation.
85. Weiss, Selina, Diana Steger, Yadwinder Kaur, Andrea Hildebrandt, Ulrich Schroeders, and Oliver Wilhelm. 2020a. On the trail of creativity: Dimensionality of divergent thinking and its relation with cognitive abilities and personality. European Journal of Personality.
86. Weiss, Selina, Oliver Wilhelm, and Patrick Kyllonen. 2020b. A review and taxonomy of creativity measures. Psychology of Aesthetics, Creativity, and the Arts. Submitted.
87. Whitehead, Nedra, Holly A. Hill, Donna J. Brogan, and Cheryl Blackmore-Prince. 2002. Exploration of threshold analysis in the relation between stressful life events and preterm delivery. American Journal of Epidemiology 155: 117–24.
88. Wicherts, Jelte M., Coosje L. S. Veldkamp, Hilde E. M. Augusteijn, Marjan Bakker, Robbie C. M. van Aert, and Marcel A. L. M. van Assen. 2016. Degrees of freedom in planning, running, analyzing, and reporting psychological studies: A checklist to avoid p-hacking. Frontiers in Psychology 7: 1832.
89. Wilhelm, Oliver, Ulrich Schroeders, and Stefan Schipolowski. 2014. Berliner Test zur Erfassung fluider und kristalliner Intelligenz für die 8. bis 10. Jahrgangsstufe [Berlin Test of Fluid and Crystallized Intelligence for Grades 8–10]. Göttingen: Hogrefe.
90. Wilson, Robert C., J. P. Guilford, Paul R. Christensen, and Donald J. Lewis. 1954. A factor-analytic study of creative-thinking abilities. Psychometrika 19: 297–311.
91. Zhang, Feipeng, and Qunhua Li. 2017. Robust bent line regression. Journal of Statistical Planning and Inference 185: 41–55.
We understand creativity as the ability to produce divergent ideas; thus, we do not further distinguish between creativity and divergent thinking for the purpose of this paper and use the terms interchangeably from now on.
Figure 1. Schematic representations of the relation between creativity and intelligence. The x- and y-axes display z-standardized values.
Figure 2. Measurement models for divergent thinking in Study 1 (left model) and Study 2 (right model), including standardized loadings and standard errors. Study 1: AUT indicators are single items of the alternate uses task. Study 2: indicators are test scores. Fluency test scores are as follows: sa (similar attributes), in (inventing names), ff (figural fluency), and rf (retrieval fluency). Co (combining objects) and ni (nicknames) are originality indicators that were instructed and scored for originality only.
Figure 3. Scatterplots and heteroscedasticity plots. Scatterplots (including the 95% confidence interval) for the correlation between divergent thinking and intelligence are presented in the upper part. Heteroscedasticity plots including standard errors (grey) and standard deviations of the fitted values (dashed line) are given in the lower part.
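The heteroscedasticity check underlying Figure 3 can be sketched in a few lines: regress divergent thinking on intelligence, then test whether the squared residuals vary with the predictor (a Breusch-Pagan-style auxiliary regression). The snippet below is a minimal illustration on simulated data, not the authors' analysis code; all variable names are ours.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 456
intelligence = rng.normal(size=n)              # z-standardized predictor
dt = 0.4 * intelligence + rng.normal(size=n)   # homoscedastic errors by construction

# Fit a simple linear regression and obtain residuals
slope, intercept = np.polyfit(intelligence, dt, 1)
residuals = dt - (slope * intelligence + intercept)

# Breusch-Pagan-style auxiliary regression: squared residuals on the predictor.
# Under homoscedasticity, n * R^2 is approximately chi-square distributed (1 df).
aux_slope, aux_intercept = np.polyfit(intelligence, residuals**2, 1)
fitted_aux = aux_slope * intelligence + aux_intercept
sq = residuals**2
r2 = 1 - np.sum((sq - fitted_aux) ** 2) / np.sum((sq - sq.mean()) ** 2)
lm_stat = n * r2
print(f"LM statistic = {lm_stat:.2f}")  # small values: no evidence of heteroscedasticity
```

A threshold in the sense of Figure 1 would show up here as residual variance that systematically shrinks or grows along the intelligence continuum.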
Figure 4. Segmented regression analysis: the estimated breakpoint for the relation between general intelligence and divergent thinking. The dotted lines represent the 95% confidence interval.
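Segmented regression estimates the breakpoint by finding the knot location that minimizes the residual sum of squares of a piecewise-linear fit. A minimal grid-search sketch on simulated, threshold-free data (assuming z-standardized scores; illustrative names, not the study's code) might look like:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 400
x = rng.normal(size=n)                         # z-standardized intelligence
y = 0.5 * x + rng.normal(scale=0.8, size=n)    # one linear relation, no true threshold

def sse_for_breakpoint(x, y, bp):
    """Fit y ~ x + max(x - bp, 0) and return the residual sum of squares."""
    X = np.column_stack([np.ones_like(x), x, np.maximum(x - bp, 0.0)])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    fitted = X @ beta
    return np.sum((y - fitted) ** 2)

# Grid search for the breakpoint with the lowest SSE
grid = np.linspace(-1.5, 1.5, 61)
sses = [sse_for_breakpoint(x, y, bp) for bp in grid]
best_bp = grid[int(np.argmin(sses))]
print(f"estimated breakpoint: {best_bp:.2f}")
```

In practice, the confidence interval around the breakpoint matters as much as the point estimate: when no threshold exists, the estimated knot is essentially arbitrary and its interval is wide.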
Figure 5. Standardized factor variances at each focal point along the intelligence continuum.
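Local structural equation models estimate parameters at focal points of a continuous moderator by weighting observations with a kernel centered on each focal point. The core weighting idea behind Figure 5 can be sketched with a kernel-weighted variance, a simplified stand-in for the full weighted SEM estimation (simulated data; names and bandwidth are illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 450
g = rng.normal(size=n)                          # moderator: intelligence (z-scores)
dt = 0.4 * g + rng.normal(scale=0.9, size=n)    # divergent-thinking scores

def local_weighted_variance(scores, moderator, focal, bandwidth=0.5):
    """Gaussian-kernel weights around a focal point, then a weighted variance."""
    w = np.exp(-0.5 * ((moderator - focal) / bandwidth) ** 2)
    w /= w.sum()
    mean = np.sum(w * scores)
    return np.sum(w * (scores - mean) ** 2)

# Evaluate the local variance along the intelligence continuum
focal_points = np.linspace(-2, 2, 9)
variances = [local_weighted_variance(dt, g, f) for f in focal_points]
for f, v in zip(focal_points, variances):
    print(f"focal point {f:+.1f}: local DT variance = {v:.2f}")
```

Under the threshold hypothesis, the factor variance (and the creativity-intelligence covariance) would change markedly beyond some focal point; a flat profile across focal points speaks against a threshold.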
Table 1. Previous investigations of the relation between creativity and intelligence.
| Study | Sample | Analytical Method | Measures of Creative Ability (DT) | Measures of Intelligence | Results | Threshold (z-Standardized) |
|---|---|---|---|---|---|---|
| Guilford and Christensen (1973) | 360 (students) | Scatterplots | 10 verbal and figural DT tests 1 | e.g., Stanford Achievement Test | No threshold | - |
| Fuchs-Beauchamp et al. (1993) | 496 (pre-schoolers) | Correlations in two IQ groups | Thinking Creatively in Action and Movement 2 | e.g., Stanford-Binet Intelligence Scale 8 | Threshold | 1.33 |
| Sligh et al. (2005) | 88 (college students) | Correlations in two IQ groups | Finke Creative Invention Task 3 | KAIT 9 | No threshold | - |
| Preckel et al. (2006) | 1328 (students) | Correlations and multigroup CFA | BIS-HB 4 | BIS-HB 4 | No threshold | - |
| Holling and Kuhn (2008) | 1070 (students) | Multigroup CFA | BIS-HB 4 | Culture Fair Test 10 | No threshold | - |
| Cho et al. (2010) | 352 (young adults) | Correlations in two IQ groups | Torrance Test 5 | e.g., WAIS 11 | Threshold | 1.33 |
| Jauk et al. (2013) | 297 (adults) | SRA | Alternate Uses and Instances 6 | Intelligence-Structure-Battery 12 | Threshold | −1.00 to 1.33 |
| Karwowski and Gralewski (2013) | 921 (students) | Regression analysis and CFA | Test for Creative Thinking-Drawing Production 7 | Raven's Progressive Matrices 13 | Threshold | 1.00 to 1.33 |
| Shi et al. (2017) | 568 (students) | SRA, among others | Torrance Test 5 | Raven's Progressive Matrices 13 | Threshold | 0.61 to 1.12 |
SRA = Segmented Regression Analysis; CFA = Confirmatory Factor Analysis. 1 Wilson et al. (1954); 2 Torrance (1981); 3 Finke (1990); 4 BIS-HB = Berlin Intelligence Structure Test, Jäger et al. (1997); 5 Torrance (1999); 6 Jauk et al. (2013); 7 Urban (2005); 8 Terman and Merrill (1973); 9 KAIT = Kaufman Adolescent and Adult Intelligence Test, Kaufman and Kaufman (1993); 10 Cattell and Cattell (1960); 11 WAIS = Wechsler Adult Intelligence Scale, Wechsler (1981); 12 Arendasy et al. (2004); 13 Raven et al. (2003).
Share and Cite

Weiss, S.; Steger, D.; Schroeders, U.; Wilhelm, O. A Reappraisal of the Threshold Hypothesis of Creativity and Intelligence. J. Intell. 2020, 8, 38.
