The 2016 US Presidential Elections: What Went Wrong in Pre-Election Polls? Demographics Help to Explain

Zeedan, Rami

doi:10.3390/j2010007

Jewish Studies Program, The University of Kansas, Lawrence, KS 66045, USA

J 2019, 2(1), 84-101; https://doi.org/10.3390/j2010007

Submission received: 14 December 2018 / Revised: 1 February 2019 / Accepted: 25 February 2019 / Published: 1 March 2019

(This article belongs to the Special Issue Feature Papers for J-Multidisciplinary Scientific Journal)

Abstract:

This study examined the accuracy of the various methods used to forecast the 2016 US Presidential Elections. The findings revealed a high accuracy in predicting the popular vote. However, predicting the popular vote is most suitable in an electoral system which is not divided into constituencies. Because of the Electoral College method used in US elections, forecasting should instead focus on predicting the winner in every state separately. Mispredicted results in only a few states led to a false forecast of the elected president in 2016. The current methods proved less accurate in predicting the vote in states that are less urbanized and have a less diverse society regarding race, ethnicity, and religion. The most challenging task was predicting the vote of people who are White, Protestant Christian, and highly religious. In order to improve pre-election polls, this study suggests a few changes to the current methods, mainly to adopt the “Cleavage Sampling” method that can better predict the expected turnout of specific social groups, thus leading to higher accuracy of pre-election polling.

Keywords:

pre-election polls; 2016 US presidential elections; polling methods; race; ethnicity; religion; urban

1. Introduction

Pre-election polls are a tool that helps candidates win their political campaigns, from the top tier of national elections to the lowest level of local elections [1]. Pre-election polls, along with exit polls [2], are tools used for different purposes by the media, academic researchers, and the public. However, regardless of the user, the main reason to use these election-focused surveys is the desire for accurate information under uncertainty. For this purpose, the science of polling has been developing over the past century to minimize the uncertainty in the forecast of election results by obtaining more statistically significant results [3].

While trying to predict as exactly as possible the results of upcoming elections, thus increasing accuracy, pollsters cope with [4]: methodological problems (such as coverage, sampling, non-responding, weighting, adjustment, or treatment of non-disclosers); socio-political problems (such as characteristics of the campaign, of the parties, or the electoral system); and sociological problems (such as characteristics of the society, its cleavages, or traditions).

Nonetheless, errors happen occasionally, fueling criticism of mispredicted outcomes [5]. Examples come from around the world, such as previous presidential elections in the US (the underestimation of Ronald Reagan’s victory in 1980 and the overestimation of Bill Clinton’s victory in 1996 [6]) or Hungary’s “Black Sunday” in 2002 [7]. Despite those errors and others, over the years, pre-election and exit polls have established a significant role in media and campaigns and have become more reliable as a tool to predict election results, regardless of the country, its political structure, or electoral system [8]. However, elections held in 2015–2016 in several countries added to the number of mispredicted results. Some examples are shown in Appendix Table A1; they are outliers when considering the outcome of hundreds of other campaigns where pre-election polls were accurate [9]. Nevertheless, the focus of this study is to examine this phenomenon. The examples in this research represent democracies with a variety of political structures and different electoral systems: parliamentary democracy in Israel and the UK, presidential democracy in the US, and semi-presidential democracy in Poland.

In May 2015, pre-election presidential polls in Poland predicted a lead for the then president, Mr. Komorowski, over other candidates, such as Mr. Duda. Given the margin of error, they even left open the possibility that Mr. Komorowski would win in the first round; for example, a pre-election poll from 8 May 2015 showed 39% to 31% [10]. The exit polls showed a different expected result: a near tie with a slight lead for Mr. Duda over Mr. Komorowski (34.8% to 32.2%) [11], which was close to the actual vote (34.8% to 33.77%) in the first round. Eventually, Mr. Duda was declared president after winning the second round of the elections.

A similar situation was found in the same month during the national legislative elections in Britain. Pre-election polls predicted a close race between the Conservative Party and the Labour Party (for example, 219 to 219 seats) [12]. However, the exit polls showed different expected results: a lead for the Conservative Party (316 to 239 seats) [13]. Eventually, the election results were not as close as the pre-election polls predicted. The Conservative Party gained a majority in parliament (330 to 232 seats), even more than predicted by the exit polls. Neither the pre-election polls nor the exit polls in the UK served as an accurate predictor of the results. Thus, as in Poland, they failed to serve their goal.

A much more disturbing situation was the 2015 Israeli elections, which are known as “The Black Tuesday of pollsters in Israel.” The pre-election polls predicted a lead for the Zionist Union party over the Likud party (for example, 25 to 21 seats, out of 120 seats). The pre-election national polls also predicted a slight lead for the bloc of left-center parties over the bloc of right-religious parties (64 to 56 seats). As in the UK case, the exit polls showed a different situation than the pre-election polls. The Likud party was predicted to lead over the Zionist Union (27 to 26 seats). However, the lead of the left-center parties over the right-religious parties was maintained (65 to 55 seats) in the exit polls [14]. In contrast to the UK and Polish cases, the election results in Israel were accurately predicted by neither the pre-election polls nor the exit polls. The Likud party eventually became the biggest party in the Knesset, well ahead of the Zionist Union (30 to 24 seats).

A year later, in June 2016, a week before the Brexit referendum, some pollsters predicted that the “Remain” supporters were more likely to narrowly win over the “Leave” supporters (by as little as 1% or as much as 10%). However, it is worth noting that several online polls predicted a “Leave” win. Eventually, the “Leave” supporters won the referendum by 51.9% to 48.1%, raising more questions about the accuracy of the polling methodologies [15].

Pre-election polls in all of the above examples were not able to serve their primary goal: to give an accurate estimation of the outcome on election day. However, these examples are different in many ways. They differ in the election methods, the polling methods, the depth of the mistake (i.e., mispredicted pre-election polls vs. mispredicted exit polls, or both), and the source of the mistakes. These differences apply as well to the US presidential elections. The US political system is a presidential democracy, in which the president is the head of state and leads the executive branch, which is separated from the legislative branch and the judicial branch [16]. The election system in the US is based on the Electoral College. This means that the President of the US is elected by an absolute majority within a body of 538 electors. Each state is assigned a number of electors that is equal to the combined total of the state’s number of members in the House of Representatives and the Senate [17]. Every state chooses those electors, in most cases by a majority of votes in the state [18].
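The Electoral College arithmetic described above can be sketched in a few lines of Python. This is a minimal illustration, not part of the study: the function names are hypothetical, the winner-take-all tally is a simplification of the "in most cases" rule mentioned in the text, and the District of Columbia's three electors fall outside the House-plus-Senate formula.

```python
# Sketch of the Electoral College arithmetic described in the text.
# Function names and example inputs are illustrative assumptions.

TOTAL_ELECTORS = 538
SENATORS_PER_STATE = 2


def electors_for_state(house_seats: int) -> int:
    """Electors of a state = its House members + its two senators."""
    return house_seats + SENATORS_PER_STATE


def majority_threshold(total_electors: int = TOTAL_ELECTORS) -> int:
    """Smallest number of electors that forms an absolute majority."""
    return total_electors // 2 + 1


def tally(state_winners: dict, state_electors: dict) -> dict:
    """Winner-take-all tally: all of a state's electors go to its winner.

    This simplifies the real rules; the text notes that states choose
    electors by majority only "in most cases".
    """
    totals = {}
    for state, winner in state_winners.items():
        totals[winner] = totals.get(winner, 0) + state_electors[state]
    return totals
```

With these definitions, `majority_threshold()` gives the 270 electors needed to win, and a state with 27 House seats has `electors_for_state(27)`, that is, 29 electors.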

The 2016 US Presidential Elections have been labeled as “Unprecedented” Elections because of the triumph of non-establishment presidential candidates during the primaries [19], the unpopularity of the final candidates [20], and the growing importance of gender, race, religion, and ethnicity [21]. Given that, and the errors in predicting the vote in recent national elections in some democratic countries, it was essential to examine the various methods of predicting the vote in the 2016 US presidential election. The 2016 US Presidential Election ultimately ended the same way as the examples mentioned above: with a failure to predict the winner. In the following sections, we assess the accuracy of some leading methods. While assessing the various methods used in this cycle, it is worth noting some atypical methods that were accurate, such as “The Keys to the White House” and “The Primary Model.”

“The Keys to the White House” [22] is a method that is based on a set of 13 true/false statements. If any six of them are false, the incumbent party loses the presidency, and the challenging party wins. However, it is worth noting that assessing true/false is subjective and may change over the campaign. Following this method, Lichtman predicted that the “Democrats would not be able to hold on to the White House.” Using this assessment, although without giving specifics on how it was made, this method predicted the change of power in the presidency from Democrats to Republicans.
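The decision rule of this method is simple enough to express directly. The sketch below is only an illustration of the threshold logic as summarized here; it assumes that "any six" means six or more false statements, and it says nothing about how each statement is judged, which is the subjective part. The function name is hypothetical.

```python
def keys_prediction(statements: list, false_threshold: int = 6) -> str:
    """Apply the threshold rule to 13 true/false statements.

    Assumption: the incumbent party loses when six OR MORE statements
    are false. Returns "challenger" or "incumbent".
    """
    if len(statements) != 13:
        raise ValueError("the method uses exactly 13 statements")
    false_count = sum(1 for s in statements if not s)
    return "challenger" if false_count >= false_threshold else "incumbent"
```

For example, seven true and six false statements yield `"challenger"`, while eight true and five false yield `"incumbent"`.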

“The Primary Model” [23] is a method that relies on presidential primaries as a predictor of the vote in the general election; it also makes use of a swing of the electoral pendulum that is useful for forecasting. As in the case of “The Keys to the White House,” “The Primary Model” also predicted the victory of Donald Trump [24]. However, “The Primary Model” predicted a Trump win in the popular vote, which turned out not to be accurate.

In contrast to the first two methods, the following methods that we examined did not accurately predict the Trump win. “National newspaper endorsements” is a method that indicates possible media influence on voters rather than an accurate estimator of the results [25]. In this cycle, the Democratic candidate Hillary Clinton gained the support of a long list of editorial boards, with a total of more than 500 endorsements [26], while her Republican rival Trump received fewer than 30. Moreover, most of the newspapers that endorsed Trump were less regarded and less circulated (the most circulated, the Las Vegas Review-Journal, is ranked 57th in circulation).

The next method examined was the “Mock Presidential Election” by Western Illinois University [27]. This method was promoted as one that has been accurate in predicting the next president since 1975. The “Mock Presidential Election” conducted in November 2015 predicted a win for Sanders with more than 400 Electoral College votes [28]. However, Sanders was not on the Democratic ticket, and the Democratic nominee, Clinton, did not come close to this number of electors.

The last non-public-opinion-polling method we examined was “Market Predictions.” It has been examined and found useful in recent years in sports and politics as well as in other areas [29]. Other researchers showed that poll-based forecasts outperform “Market Predictions” [30]. In any case, as of the night before election day of the 2016 US campaign, leading “Market Predictions” companies were accepting investments that implied a much higher chance of success for Clinton than for Trump (for example, “Predictwise” with 88.9% for Clinton) [31]. A recent study raised doubts about using “Market Predictions,” suggesting the possibility of bias such as market manipulation or misunderstanding by participants even when they invest their own money [32].

Following this discussion, this research examined the accuracy of several methods used in the 2016 US Presidential Elections. Despite the growing importance of non-public opinion methods, as of 2016, public opinion polling methods remain the leading tools to predict the results of elections worldwide. However, there are various methods of pre-election polls. These include, for example, telephone interviews using a landline, mobile, or combination of the two; internet surveys of a recruited group of people or voluntary participants; face-to-face interviews; and mail surveys [33]. The research questions are: What is the accuracy of the different methods in predicting votes? Why do pre-election polls mispredict results? What improvements are needed in the pre-election polling methodology?

The current literature is split on this matter. Some acknowledge an existing problem and try to suggest new methods. Prosser and Mellon [34] describe the failure to predict the results of the recent elections in the UK and the US. So do Lauderdale et al. [35], who suggest a new method to improve accuracy following those recent elections: applying multilevel regression and post-stratification methods to pre-election polling. Others have joined them, such as Cowley and Kavanagh [36], Moon [37], and Toff [38]. The Report of the Inquiry into the 2015 British general election opinion polls [39] stated that the error is not new, but in any case, suggested that the primary cause of the error in 2015 was unrepresentative samples.

Others do not acknowledge any problem, such as Jennings and Wlezien [9], who claim that the recent performance of polls has not been outside the ordinary. This claim is similar to Jennings’ recent claim [40]. Even a report by the American Association for Public Opinion Research (AAPOR) [41] stated that the problem is only in some state-level polls and predicted that the errors “are perhaps unlikely to repeat.” Tourangeau [42] acknowledges the challenges but does not accept that there was a “failure” (quotation marks as in the source) in 2016. Panagopoulos et al. [43] even examined the state-wide polls in the US 2016 elections and found no significant bias.

2. Materials and Methods

To examine the accuracy of the pre-election polls, the root-mean-square error (RMSE) was employed, as shown in Equation (1), to determine how accurate each poll was compared to the actual results [44]. It is the square root of the average of the squared errors, where each error is the difference between the estimate ($\hat{M}_{it}$) and the actual result for each candidate ($M_{it}$) [45]. $\hat{M}_{it}$ is the vector of the predictions of the results $\hat{M}$ of the $n$ candidates in the elections, from a specific poll in a specific time frame $t$. $M_{it}$ is the vector of the actual results of the elections for each candidate $i$. The advantage of using RMSE is that it has the same units as the quantity being estimated; for an unbiased estimator, the RMSE is the square root of the variance, known as the standard deviation. The lower the RMSE, the closer the poll is to the actual results.

The RMSE is computed as:

$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left(\hat{M}_{it} - M_{it}\right)^{2}} \qquad (1)$$
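As a minimal sketch of Equation (1), the function below computes the RMSE between one poll's predicted shares and the actual shares. The function name and the example numbers are hypothetical and are not taken from the study's data.

```python
import math


def rmse(predicted: list, actual: list) -> float:
    """Root-mean-square error between predicted and actual vote shares.

    Both lists hold one value per candidate, in the same order and in
    the same units (for example, percentage points).
    """
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical two-candidate poll: each share is off by 2 points.
example = rmse([52.0, 48.0], [50.0, 50.0])  # 2.0
```

Because the result is in the same units as the inputs, an RMSE of 2.0 here reads directly as an average miss of two percentage points per candidate.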

This research hypothesizes that demographic characteristics explain the mispredicted results. To examine that, this study relies on the following independent factors. For race and ethnicity, we used the percentage of the Hispanic population, the percentage of the White population, the percentage of the Black population [46], and the Diversity Index [47]. For religion, we used the percentage of Evangelical Protestant, Mainline Protestant, Historically Black Protestant, Catholic, Mormon, Other Christian, Total Christian, Non-Christian, Unaffiliated, and the level of religiosity [48]. The dependent factors are the RMSE and a binary factor indicating whether the winner was predicted correctly (1 for true, 0 for false).

To examine the research questions, we focused on the correlation between the abovementioned dependent and independent variables by using two different methods. For the correlation between the RMSE and all the independent factors, we used the Pearson correlation coefficient [49]. If the Pearson correlation value is positive, it means that the higher the value of the relevant factor, the higher the RMSE of the prediction, which in turn means a less accurate prediction. When the Pearson correlation value is negative, it means that the higher the value of the relevant factor, the lower the RMSE of the prediction, which in turn means a more accurate prediction. Given that the true/false prediction of the winner is a dichotomous factor, we examined its correlation with the independent variables using the Biserial correlation coefficient [50]. If the Biserial correlation coefficient is positive, it means that the higher the value of the relevant factor, the higher the chances of a correct prediction of the winner.
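The two coefficients can be sketched as follows. This is an illustration under stated assumptions, not the study's computation: the function names are hypothetical, and the second function implements the point-biserial form (Pearson's r with a 0/1 variable), which may differ from the exact Biserial coefficient the study used.

```python
import math


def pearson_r(x: list, y: list) -> float:
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for a constant list")
    return cov / math.sqrt(ss_x * ss_y)


def point_biserial_r(correct: list, factor: list) -> float:
    """Correlation between a 0/1 outcome and a continuous factor.

    Assumption: point-biserial form, i.e. Pearson's r with the
    dichotomous variable coded as 0 and 1.
    """
    if any(c not in (0, 1) for c in correct):
        raise ValueError("outcome must be coded 0 or 1")
    return pearson_r([float(c) for c in correct], factor)
```

A positive value from `point_biserial_r` means that states coded 1 (winner predicted correctly) tend to have higher values of the factor, which matches the interpretation given in the text.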

3. Results

3.1. Popular Vote

Table 1 presents the results of the examination of the predictions published by leading pollsters in the last week before election day, compared to the final results of the popular vote: Clinton 48.2%, Trump 46.1%, others 5.7% [51] (for full details on these polls see Table A2 in the Appendix). Most polls predicted a Clinton lead in the popular vote, except two. More than 60% of these last-week polls were accurate, with an RMSE smaller than 5%. Most polls fell outside their declared margin of error.
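To make the margin-of-error comparison concrete, the sketch below uses the textbook 95% margin of error for a proportion from a simple random sample. This is an assumption for illustration: pollsters declare their own margins, which may account for weighting and design effects, and the sample size and shares used in the example are hypothetical.

```python
import math


def margin_of_error(share_pct: float, sample_size: int, z: float = 1.96) -> float:
    """Approximate margin of error, in percentage points.

    Assumes a simple random sample; z = 1.96 corresponds to a 95%
    confidence level.
    """
    p = share_pct / 100.0
    return 100.0 * z * math.sqrt(p * (1.0 - p) / sample_size)


def within_margin(predicted_pct: float, actual_pct: float, moe_pct: float) -> bool:
    """True if the actual result lies within the margin of the prediction."""
    return abs(predicted_pct - actual_pct) <= moe_pct


# Hypothetical poll of 1000 respondents with a candidate near 50%.
moe = margin_of_error(50.0, 1000)  # roughly 3.1 points
```

Under these assumptions, a poll that put a candidate at 45.0% when the actual share was 48.2% would fall outside a 3.1-point margin, while a poll at 47.0% would fall inside it.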

According to this examination, the United Press International/CVoter International poll was the most accurate. It is worth noting that their method was based on internet interviews in which participants self-selected to participate.

However, when examining the various methods during the 2016 campaign, little difference was found in the accuracy of predicting the results, as presented in Figure A1 and Figure A2 in the Appendix. Comparing internet interviews with phone interviews reveals that internet interviews had slightly better accuracy. Mixed methods were used in only two polls of our sample; thus, we were not able to conclude whether they performed better. Applying the same comparison to polls that weighted for social and demographic stratifications (such as race/ethnicity, age, gender, region, partisanship, education, annual income, and marital status) gives better accuracy for polls with no weighting.

3.2. Poll-Aggregators

Based on those last-week polls, all major poll-aggregators predicted a Clinton lead in the popular vote and had a relatively high level of accuracy, as shown in Table 2. Among them, RealClearPolitics was the most accurate. The accuracy of the top three aggregators was better than the average and the median (4.4%) of the last-week polls (as reported in Table 1). Only the prediction by FiveThirtyEight was less accurate.

Based on these predictions, most poll aggregators formulated a projection of “the chance of winning.” All the aggregators mentioned above, like others, projected a Clinton win: Princeton Election Consortium—99% Clinton; The Huffington Post—98% Clinton; The New York Times—85% Clinton; FiveThirtyEight—71% Clinton. Some of these projections gave a slight chance for a Trump win, after analyzing the various ways leading to the presidency. Alternatively, as FiveThirtyEight’s editor-in-chief stated on the eve of the Election Day [52]: “…there’s a wide range of outcomes, and most of them come up with Clinton.”

3.3. State Polls and the Electoral College

Instead of focusing on predicting the popular vote, predictions based on pre-election polls in the US presidential elections need to be based on predicting the winner of the electors in every state, as already mentioned by other scholars who examined the US 2016 elections [53]. However, examining the performance of pollsters at the state level during the 2016 presidential election also reveals a problem. There were many differences between pollsters, but their conclusions were similar: a Clinton win. For example, the polls published by the New York Times (NYT) predicted the results accurately in 44 states and Washington DC [54]. This is considered a high level of accuracy. However, the problem is that these polls mispredicted the winner in “only” six states, as shown in Table 3. Those six states together have 108 electors, all of which went to Trump. This number of electors made all the difference between losing and winning for Clinton and Trump, respectively, and made all the difference between predicting the outcome of the presidential elections accurately or incorrectly.

Based on such state polls, poll aggregators gave a lead for Clinton in the Electoral College (as shown in Table 4), while the final results were Trump 304 to Clinton 227 (and seven write-ins/other). The accuracy of such predictions was very low and even lower than the least accurate poll of the popular vote.
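The weight of those six states can be checked with simple arithmetic on the figures quoted in this section. The sketch below is only an illustration: it moves the 108 electors from one candidate to the other in the final tally, and it ignores how individual electors actually voted, so the counterfactual totals are a rough indication rather than a reconstruction of any forecast. The function name is hypothetical.

```python
FINAL_TALLY = {"Trump": 304, "Clinton": 227, "other": 7}
MISSED_STATE_ELECTORS = 108  # electors of the six mispredicted states
MAJORITY = 538 // 2 + 1      # 270


def shift_electors(tally: dict, electors: int, loser: str, gainer: str) -> dict:
    """Return a copy of the tally with electors moved between candidates."""
    shifted = dict(tally)
    shifted[loser] -= electors
    shifted[gainer] += electors
    return shifted


# What the tally would look like had the six states gone as predicted.
as_predicted = shift_electors(FINAL_TALLY, MISSED_STATE_ELECTORS, "Trump", "Clinton")
```

Under this simplification, `as_predicted` gives Clinton 335 and Trump 196, so the six states alone are enough to move the majority of 270 from one candidate to the other.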

Therefore, this required a more thorough investigation of the state-level pre-election polls. The results, as shown in Table 5, suggest the following. Only one of the demographic characteristics by state was correlated with correctly predicting the winner of the state’s popular vote. The higher the Diversity Index is in a state, the higher the chances of predicting the winner in that state (r = 0.39, r = 0.38, and r = 0.38 for the three predictions examined). This suggests that the current methods used by pollsters are better adjusted to a diverse population and are less able to predict the winner accurately in less diverse states.

This is also supported by the Pearson value of the correlation between the Diversity Index and the RMSE of all three predictions examined. It shows that the higher the Diversity Index value is, the lower the RMSE is (r = −0.53, r = −0.50, and r = −0.50). In other words, the higher the diversity is, the higher the accuracy of the prediction.

When examining the correlation between the other demographic factors and the RMSE of the NYT predictions, trends similar to those of the Huff-Post predictions were found. However, both were different from the FiveThirtyEight predictions. In the FiveThirtyEight predictions, we found significant correlations with only two factors: Black population and Historically Black Protestant. In both cases, the RMSE of the FiveThirtyEight predictions was correlated positively with the factor (r = 0.37 and r = 0.29, respectively). This suggests that the higher the Black population is in a state, the higher the RMSE of the FiveThirtyEight predictions. The same holds for Historically Black Protestant, which supports the same conclusion. It means that their models are less useful in predicting trends among Black communities or members of Historically Black Protestant churches.

Despite some differences between the NYT predictions and the Huff-Post predictions, we can still conclude the following for both. Accurate predictions are more likely found in states with a higher Diversity Index (r = −0.53 or r = −0.50) and with higher percentages of people religiously affiliated as Catholic (r = −0.40 or r = −0.48), “Other Christians” (r = −0.40 or r = −0.44), Non-Christians (r = −0.52 or r = −0.54), and Unaffiliated (r = −0.42 or r = −0.43). Accurate predictions are also more likely found in states with a higher percentage of people living in an urban setting, compared to rural (r = −0.54 or r = −0.59).

Less accurate predictions are more likely in states with higher religiosity (r = 0.46 or r = 0.49) and a higher percentage of Christians (r = 0.51 or r = 0.53), especially higher percentages of Mainline Protestants (r = 0.38 or r = 0.42) and Evangelical Protestants (r = 0.45 or r = 0.51). It was also evident that less accurate predictions are more likely in states with a higher percentage of White population (r = 0.40 or r = 0.48).

These findings suggest that the NYT and the Huff-Post predictions were based on models that can cope with states with highly urbanized populations and a more diverse society regarding race, ethnicity, and religion. However, those models proved less accurate in predicting the vote in states that were less urbanized, with a homogeneous population and a less diverse society regarding race, ethnicity, and religion. The current models are not able to predict the vote of White people who are Christian, mostly Protestant (Evangelical or Mainline), highly religious, and living in rural areas.

3.4. Explanations of the Errors

None of the public opinion polling methods in the 2016 US presidential elections were accurate to a level that was useful for predicting the elected president. Polls that predicted the popular vote were relatively accurate; however, this did not lead to accurately calling the winner of the elections. Moreover, polls that were supposed to predict the distribution of the Electoral College, and hence the winner, wrongly predicted a Clinton win.

Identifying the sources of the failure requires an understanding that such a failure exists, followed by a thorough examination. Explanations may include, for example, sampling errors, last-minute changes, or “The Shy Theory.”

Sampling errors happen when pollsters fail to achieve a representative sample [55]. The sources of such a failure vary. First, pollsters may have lacked an accurate database of phone numbers. Second, people who refused to participate, for various reasons, may have skewed the sample. Third, pollsters assumed that people who did not vote in the 2012 election would not vote in 2016, so these people were excluded from the sample or underweighted; some of them turned out to be Trump supporters, and they did cast their ballots. There was also a financial aspect that led to fewer state-level polls being conducted. In addition, it is claimed that many people, usually younger people, only have a mobile phone, while many pollsters rely mainly on landline phones, which are used more by older people [56]. However, all of the sample polls we examined used a combination of landline and mobile phones.

Last-minute changes in voters’ choices, or last-minute decisions not to cast a ballot, went unnoticed by pollsters. This may have been caused by a variety of reasons. For example, FBI director James Comey’s late decision to review additional Clinton emails might have swayed some voters without being noticed by the pollsters [57]. The Get Out the Vote [58] campaign may also have been much more effective for one candidate than the other, resulting in more supporters of one candidate casting their ballots than initially expected.

The “Shy Theory,” or “lying” phenomenon, describes the situation in which respondents do not give candid answers, or an effect of the “Spiral of Silence” [59], in which voters become reluctant to express their preferences to pollsters. According to this explanation, Clinton supporters were more likely than Trump supporters to admit their preference to pollsters. Moreover, voters were sheepish about admitting to a human pollster that they were backing Trump. There is even a claim that Trump supporters considered pollsters and the media to be biased against Trump and thus lied about their preferred choice to make the polls intentionally mistaken. Research that was conducted on this claim found “no evidence in support of the ‘Shy Trump Supporter’ hypothesis” [60].

Despite the errors, the findings from the 2016 US presidential election indicate that pre-election polls are still an accurate tool to predict the results. This is evident in the data shown above concerning the popular vote and most of the state-level pre-election polls. This fact alone should eliminate almost all the explanations of the failure. For example, one cannot claim that there were significant sampling errors, significant last-minute changes, or a significant number of interviewees who lied to pollsters or misled them in only a few states, while in the majority of states these effects were not significant.

Hence, we suggest that the way to explain the failure and to fix the problem should not be based on those claims: sampling errors (i.e., database or misrepresentation of non-voters in 2012), or last-minute changes, or the “Shy Theory.” Instead, it needs to be based on this research’s findings—that the sampling methods are not robust enough for the various social structures.

4. Discussion

The importance of popular vote predictions is well-established in the literature [61]. Such predictions are most suitable in an electoral system which is not divided into constituencies, such as in Israel [62]. However, this research focused on the US elections, in which the president is decided by the Electoral College, with each state serving as a presidential constituency. The findings of this research question the use of the popular vote to project the winner of the presidential elections in the US. Given that this is the second time in five election cycles (2000 and 2016) that the candidate who won the Electoral College, and became president, had lost the popular vote [63], this study argues that predicting the popular vote is becoming less relevant.

Instead, predictions of the US presidential elections need to be based on predicting the winner of the Electoral College in every state. However, this research showed that predicting the vote in the constituencies—the states—is less accurate than the popular vote in some cases. Therefore, an improvement is required in the method of predicting the vote at the state level, in order to secure the highest level of accuracy. Accurate prediction of the outcome of elections in all states independently will produce an accurate prediction of the elected president.

Furthermore, this study suggests that the primary failure in predicting the US 2016 presidential election was due to the non-adjustment of the methods to social structures and their social groups. Among the reasons for the failure to predict the winner in the 2016 campaign is that pollsters underestimated Trump’s support among voters who are White, Christian, mostly Protestant (Evangelical or Mainline), highly religious, and who live in rural areas.

The mispredicted 2016 US election is one of a series of failures of pre-election polls around the world in the past two years (Poland, the UK, and Israel), in which polls mainly missed the win of a more conservative party or candidate. Hence, it may be a sign of a change happening in democratic societies, regardless of the type of political system or electoral method. However, further research in those countries and others is required to confirm this claim. From our perspective in this paper, focusing on the US, the polling approach and methods have not yet been able to reflect this change. For that, pollsters need to adapt their methodology. Otherwise, polls risk becoming irrelevant in some of the election cycles ahead and in more countries.

Our conclusion of an existing problem that needs to be examined and solved is in line with the findings of other scholars such as Prosser and Mellon [34], Cowley and Kavanagh [36], Moon [37], and Toff [38]. All agree that current methods need to be re-examined to improve the accuracy of pre-election polls. The Report of the Inquiry into the 2015 British general election opinion polls [39] and Lauderdale et al. [35] mainly focus on the unrepresentative samples, as this research suggests.

However, our findings contradict those of other scholars, mainly Jennings and Wlezien [9], Jennings [40], the AAPOR report [41], and Tourangeau [42]. They also contradict the results of Panagopoulos et al. [43], since we find a significant bias in some of the state-level pre-election polls. In this study, we show that there is a problem in sampling that is not representative of all social groups. The main argument of these scholars is that the failures are “outliers,” limited to a few state-level cases, and unlikely to repeat. However, we showed in this research that this kind of error can sway the prediction to the wrong outcome. It could happen in any country, regardless of its political structure or electoral system. To minimize such failures, improvement is needed.

Based on the outcome of this research, we suggest adopting “Cleavage Sampling” to improve pre-election polls. It can better represent the expected turnout of specific social groups, such as some of Trump’s supporters. This is in line with the post-stratification methods suggested by Lauderdale et al. [35]. Polls aim to minimize the uncertainty in the forecast of election results by obtaining more statistically significant results. Sampling is the selection of a subset of individuals from a statistical population in order to estimate characteristics of the whole population. To do so, the sample needs to reflect the population better, which would resolve some of the sampling errors.

Following the mispredicted election results of 2015–2016, the main contribution of this article is to show that higher accuracy can be achieved by better reflecting the actual trends of turnout and choice among the social cleavages in the states: region, race, religion, and ethnicity. In this study, we found that polls that weighted for social and demographic strata (such as race/ethnicity, age, gender, region, partisanship, education, annual income, and marital status) were more accurate than polls with no weighting. The argument is that societies in Western countries are becoming more politically divided, which once again emphasizes the differences between social cleavages in democratic states. The findings suggest that the current models were less able to accurately predict the vote in states that are less urbanized and have a more homogeneous, less diverse society regarding race, ethnicity, and religion. The voters whom the current models fail to predict can be characterized as White people who are Christian, mostly Protestant (Evangelical or Mainline), and highly religious. As such, among the various sampling methods (e.g., simple random sampling, cluster sampling, systematic sampling, and multiple sampling), stratified sampling in the form of “Cleavage Sampling” is expected to be the most suitable for predicting the results. “Cleavage Sampling” should produce more accurate estimates of the population than a simple random sample of the whole population [64]. It works best when the population can be split into reasonably homogeneous groups. Pollsters need to focus on specific states, and every state needs to be sampled according to its social cleavages: region (urban vs. rural), ethnicity, race, religion, and religiosity.
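To make the stratified idea concrete, the following is a minimal Python sketch, not the procedure used in this study. It assumes a state split into just two strata (urban and rural); the population shares, interview counts, and support levels are hypothetical numbers chosen for illustration, and the names `Stratum`, `proportional_allocation`, `unweighted_estimate`, and `stratified_estimate` are invented here. It shows how interviews could be allocated in proportion to each stratum's share of expected voters, and how a stratified estimate differs from the raw sample mean when one stratum is under-represented.

```python
from dataclasses import dataclass


@dataclass
class Stratum:
    name: str
    population_share: float  # assumed share of the state's expected voters
    respondents: int         # interviews actually completed in this stratum
    support: float           # share of those respondents backing candidate A


def proportional_allocation(shares, total_n):
    """Split total_n interviews across strata in proportion to their shares.

    Assumes the shares sum to 1.
    """
    raw = {name: share * total_n for name, share in shares.items()}
    alloc = {name: int(value) for name, value in raw.items()}
    leftover = total_n - sum(alloc.values())
    # Hand out the interviews lost to rounding, largest remainder first.
    by_remainder = sorted(raw, key=lambda name: raw[name] - alloc[name], reverse=True)
    for name in by_remainder[:leftover]:
        alloc[name] += 1
    return alloc


def unweighted_estimate(strata):
    """Raw sample mean: every respondent counts equally."""
    total = sum(s.respondents for s in strata)
    return sum(s.respondents * s.support for s in strata) / total


def stratified_estimate(strata):
    """Each stratum's result is weighted by its share of the population."""
    return sum(s.population_share * s.support for s in strata)


# Hypothetical state: rural voters are 40% of the electorate
# but only 20% of the completed interviews.
strata = [
    Stratum("urban", 0.60, 800, 0.55),
    Stratum("rural", 0.40, 200, 0.35),
]

target = proportional_allocation({"urban": 0.60, "rural": 0.40}, 1000)
print("target interviews per stratum:", target)
print("unweighted estimate for A:", round(unweighted_estimate(strata), 3))
print("stratified estimate for A:", round(stratified_estimate(strata), 3))
```

In this hypothetical case the raw sample puts candidate A ahead, while the stratified estimate puts A below 50%. This is the kind of gap that sampling along social cleavages is meant to close.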

Also, other suggestions need to be examined, such as the use of multiple methods and the “Cross-section” distribution of the undecided.

Pollsters have already concluded that, to better understand the trends among voters, polls need to be conducted repeatedly over the campaign period. Moreover, others have already shown that the use of multiple methods improves the accuracy of the prediction [29,65,66,67]. This is a potential solution to last-minute changes. In this study, we found no significant difference between the different methods, including the more traditional phone interviews and internet interviews. We were not able to examine the use of multiple methods because of the small sample (few surveys used multiple methods), so we suggest that it be examined in future research. The multiple-methods approach combines the more traditional methods of collecting data (landline and mobile phone surveys) with internet polling. It also includes the use of social media trends, such as Twitter, and Google searches [68]. Recent research tested a new method of collecting data in the field by placing a mobile polling station in various places within a constituency [64]. Although this was a one-time experiment, the method deserves further testing, as it proved to be much more accurate than traditional phone interviews or internet surveys because it reached groups of people who are not usually represented in traditional samples.

In addition, the distribution of the undecided is one of the significant problems that pollsters cope with. Many pollsters assume a proportional distribution of the undecided, and many polls therefore fall short of predicting the results because the undecided are distributed wrongly. In such a changing world, we suggest examining the “Cross-section” distribution of the undecided. In this approach, more of the undecided are allocated to the challenger if there is an incumbent [69], or to the challenger of the candidate from the incumbent’s party. This is in line with the basic approach of “The Keys to the White House” and “The Primary Model,” both of which turned out to be more accurate in predicting the results of the 2016 US presidential election. It is also a potential solution to the “Spiral of Silence.” Under this approach, in a two-candidate election, the challenger receives the higher share of the undecided, similar to the leading candidate’s percentage, and the leading candidate receives the lower share, similar to the challenger’s percentage.
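The difference between the two allocation rules can be sketched in a few lines of Python. This is one possible reading of the description above, not a definitive implementation. It assumes that each candidate's "percentage" is the share of the decided two-candidate vote, so that the two shares sum to one. The function name `distribute_undecided` and the poll figures (46%, 44%, 10% undecided) are hypothetical.

```python
def distribute_undecided(leader, challenger, undecided, method="proportional"):
    """Return (leader_total, challenger_total) after allocating the undecided.

    leader, challenger, undecided are percentages from a two-candidate poll.
    Assumption: a candidate's "percentage" is the share of decided voters.
    """
    decided = leader + challenger
    if decided <= 0:
        raise ValueError("the poll needs at least some decided voters")
    leader_share = leader / decided
    challenger_share = challenger / decided

    if method == "proportional":
        # Each candidate gets undecided voters in line with current support.
        to_leader, to_challenger = leader_share, challenger_share
    elif method == "cross-section":
        # Shares are swapped: the challenger gets the leader's (larger) share.
        to_leader, to_challenger = challenger_share, leader_share
    else:
        raise ValueError(f"unknown method: {method}")

    return leader + undecided * to_leader, challenger + undecided * to_challenger


# Hypothetical poll: leader 46%, challenger 44%, undecided 10%.
proportional = distribute_undecided(46.0, 44.0, 10.0, "proportional")
cross_section = distribute_undecided(46.0, 44.0, 10.0, "cross-section")
print("proportional :", [round(x, 2) for x in proportional])
print("cross-section:", [round(x, 2) for x in cross_section])
```

Under this normalization the cross-section rule narrows the leader's margin compared with the proportional rule but does not reverse it on its own. A stronger tilt toward the challenger would require a different, explicitly stated weighting.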

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. Examples of mispredicted election results, 2015–2016.

Period	Country	Example of Pre-Election Prediction	Example of Exit Poll Results	Actual Results
March 2015	Israel	Zionist Union party: 25 seats; Likud party: 21 seats	Zionist Union party: 26 seats; Likud party: 27 seats	Zionist Union party: 24 seats; Likud party: 30 seats
May 2015	Poland	Mr. Komorowski: 39%; Mr. Duda: 31%	Mr. Duda: 34.8%; Mr. Komorowski: 32.2%	Mr. Duda: 34.8%; Mr. Komorowski: 33.8%
May 2015	UK	Conservative party: 219 seats; Labour party: 219 seats	Conservative party: 316 seats; Labour party: 239 seats	Conservative party: 330 seats; Labour party: 232 seats
June 2016	UK, BREXIT referendum	“Leave”: 45%; “Stay”: 46%	“Leave”: 48%; “Stay”: 52%	“Leave”: 51.9%; “Stay”: 48.1%

A1. Sources for Table A1

Results as presented on the Israeli Channel 10 at the end of the election day—17 March (22:00);
Central Election Commission for the 20th Knesset. Israeli 2015 election results. The Knesset, 2016, Jerusalem, Israel (In Hebrew);
See references [10,11,12,13,14,15].

A2. Sources for Table 1 and Table A2

A3. Sources for Table 2 and Table 3

A4. Sources for Table 4

Data retrieved from the following websites on 08 November 2016 at 06:00:
http://www.nytimes.com/interactive/2016/us/elections/polls.html
https://projects.fivethirtyeight.com/2016-election-forecast/national-polls/#now
http://elections.huffingtonpost.com/pollster/2016-general-election-trump-vs-clinton
http://election.princeton.edu/2016/11/08/final-mode-projections-clinton-323-ev-51-di-senate-seats-gop-house/

Table A2. Pre-election polls conducted in the last week before election-day of the 2016 US presidential election (sorted by higher accuracy).¹

Pollster/Publisher	Sample Size	Sample Character	Sampling Method	Source of Sample	Weighting	Estimated Error	Clinton	Trump	Others	RMSE
United Press International/CVoter International	1625	LV	Internet	Individuals self-select to participate	N/A	3.0%	48.9%	46.1%	5.0%	0.6%
Insights West	940	LV	Internet	N/A	Age, gender, and region	3.2%	49.0%	45.0%	6.0%	0.8%
Google Surveys	21,049	LV	Internet	N/A	Age, gender, and state	0.7%	48.0%	45.1%	6.9%	0.9%
Monmouth	748	LV	Phone	Landline and mobile	Age, gender, race, and partisanship	3.5%	50.0%	44.0%	6.0%	1.6%
Fox News	1295	LV	Phone	Landline and mobile	State voters’ size	2.5%	48.0%	44.0%	8.0%	1.8%
Angus Reid Institute	1151	RV	Internet	Panel	N/A	2.9%	48.0%	44.0%	8.0%	1.8%
Franklin Pierce University/Boston Herald	1009	LV	Phone	Landline and mobile	Selection, respondent gender, respondent age and region; East, West, Mid-West, South	3.1%	48.0%	44.0%	8.0%	1.8%
American Broadcasting Company/Washington Post Tracking	2220	LV	Phone	Landline and mobile	N/A	2.5%	47.0%	43.0%	10.0%	3.1%
Gravis Marketing	16,639	RV	Phone and Internet	N/A	Voting patterns	0.8%	47.0%	43.0%	10.0%	3.1%
LA Times/University of Southern California Tracking	2935	LV	Internet	Randomly selected	N/A	4.5%	43.6%	46.8%	9.6%	3.5%
Investor’s Business Daily/TIPP Tracking	1107	LV	Phone	Landline and mobile	N/A	3.1%	43.4%	45.0%	11.6%	4.4%
Rasmussen Reports	1500	LV	Phone and Internet	Automated polling systems and online survey	N/A	2.5%	45.0%	43.0%	12.0%	4.4%
National Broadcasting Company News/SurveyMonkey	70,194	LV	Internet	Non-probability survey	N/A	1.0%	47.0%	41.0%	12.0%	4.7%
POLITICO/Morning Consult	1482	RV	Internet	N/A	Age, race/ethnicity, gender, educational attainment, region, annual household income, home ownership status, and marital status	3.0%	45.0%	42.0%	13.0%	5.2%
McClatchy/Marist	940	LV	Phone	Landline and mobile	N/A	3.2%	44.0%	43.0%	13.0%	5.2%
Economist/YouGov	3677	LV	Internet	Individuals self-select to participate. Then interview chosen randomly, but stratified by gender, age, race, education, and region.	Age, gender, income, race, and region	N/A	45.0%	41.0%	14.0%	5.9%
Columbia Broadcasting System News/New York Times	1426	LV	Phone	Landline and mobile	Demographic variables	3.0%	45.0%	41.0%	14.0%	5.9%
Selzer and Company/Bloomberg	799	LV	Phone	Landline and mobile	Age and race	3.5%	44.0%	41.0%	15.0%	6.6%
The Times-Picayune/Lucid	1200	LV	Internet	Individuals self-select to participate.	National demographics	N/A	45.0%	40.0%	15.0%	6.7%
National Broadcasting Company News/Wall St. Journal	1282	LV	Phone	Landline and mobile.	N/A	2.7%	44.0%	40.0%	16.0%	7.3%
Reuters/Ipsos	2195	LV	Internet	N/A	Gender, age, education and ethnicity	2.3%	42.0%	39.0%	19.0%	9.4%

¹ For sources of the data, see Section A2 in the Appendix; LV—Likely voters; RV—registered voters.
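The RMSE column in Table A2 can be checked with a short Python sketch. It rests on two assumptions that are not stated in this appendix: the error is taken over the three categories (Clinton, Trump, Others), and the final national result is rounded to 48.2%, 46.1%, and 5.7%. The names `ACTUAL` and `rmse` are introduced here for illustration only.

```python
import math

# Assumed final national popular-vote result in percent (rounded values,
# not taken from this appendix).
ACTUAL = {"Clinton": 48.2, "Trump": 46.1, "Others": 5.7}


def rmse(predicted, actual):
    """Root mean square error, in percentage points, over all categories."""
    errors = [predicted[key] - actual[key] for key in actual]
    return math.sqrt(sum(e * e for e in errors) / len(errors))


# Three rows copied from Table A2.
polls = {
    "United Press International/CVoter International": {
        "Clinton": 48.9, "Trump": 46.1, "Others": 5.0},
    "Monmouth": {"Clinton": 50.0, "Trump": 44.0, "Others": 6.0},
    "Reuters/Ipsos": {"Clinton": 42.0, "Trump": 39.0, "Others": 19.0},
}

for name, poll in polls.items():
    print(f"{name}: RMSE = {rmse(poll, ACTUAL):.1f}%")
```

With these assumed results, the three rows reproduce the RMSE values listed in the table to one decimal place, which suggests that the three-category definition matches the one used there.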

Figure A1. The relation between sample size and the RMSE (r = −0.03).

Figure A2. The relation between estimated error and the RMSE (r = 0.04).
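The weak relation shown in Figure A1 can be recomputed from the sample sizes and RMSE values in Table A2. The sketch below uses a plain Pearson correlation coefficient, and the function name `pearson` is introduced here. Whether the figure was produced in exactly this way is an assumption.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Sample size and RMSE (%) for the 21 polls, in the row order of Table A2.
sample_size = [1625, 940, 21049, 748, 1295, 1151, 1009, 2220, 16639, 2935,
               1107, 1500, 70194, 1482, 940, 3677, 1426, 799, 1200, 1282, 2195]
rmse_pct = [0.6, 0.8, 0.9, 1.6, 1.8, 1.8, 1.8, 3.1, 3.1, 3.5,
            4.4, 4.4, 4.7, 5.2, 5.2, 5.9, 5.9, 6.6, 6.7, 7.3, 9.4]

r = pearson(sample_size, rmse_pct)
print(f"r(sample size, RMSE) = {r:.2f}")
```

The coefficient should land close to the value reported for Figure A1. In other words, in this set of polls a larger sample did not go together with a noticeably smaller error.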

References and Note

1. Mludzinski, T.; Peacock, K. Outside the Marginals: Constituency and Regional Polling at the 2015 General Elections. In Political Communication in Britain; Palgrave Macmillan: Cham, Switzerland, 2017; pp. 63–75.
2. Pre-election polls are polls conducted up to a few days before election day, while exit polls are conducted during election day, usually by asking voters to repeat their vote at a semi-polling station outside an official polling station.
3. Katz, D. Do interviewers bias poll results? Public Opin. Q. 1942, 6, 248–268.
4. Crespi, I. Pre-Election Polling: Sources of Accuracy and Error; Russell Sage Foundation: New York, NY, USA, 1988.
5. Daves, R.P.; Newport, F. Pollsters under Attack: 2004 Election Incivility and Its Consequences. Public Opin. Q. 2005, 69, 670–681.
6. Martin, E.A.; Traugott, M.W.; Kennedy, C. A review and proposal for a new measure of poll accuracy. Public Opin. Q. 2005, 69, 342–369.
7. Bodor, T. Hungary’s “Black Sunday” of public opinion research: The anatomy of a failed election forecast. Int. J. Public Opin. Res. 2011, 24, 450–471.
8. Jennings, W.; Wlezien, C. The timeline of elections: A comparative perspective. Am. J. Political Sci. 2016, 60, 219–233.
9. Jennings, W.; Wlezien, C. Election polling errors across time and space. Nat. Hum. Behav. 2018, 2, 276–283.
10. IBRiS. The Pre-Election Poll of the 2015 Poland Presidential Elections. Available online: http://wiadomosci.onet.pl/kraj/ostatni-sondaz-prezydencki-przed-cisza-wyborcza/6fsqj6 (accessed on 11 May 2015). (In Polish).
11. TVPINFO. IPSOS Exit-Poll of the 2015 Poland Presidential Elections. Available online: http://www.tvp.info/20004330/duda-wygral-pierwsza-ture-komorowski-tuz-za-nim-swietny-wynik-kukiza-sondazowe-wyniki-wyborow (accessed on 11 May 2015). (In Polish).
12. BMG Research Ltd. Election 2015: New exclusive poll puts Labor and Tories on exactly 33.7 percent each. Available online: http://www.bmgresearch.co.uk/; http://www.may2015.com (accessed on 15 May 2015).
13. Ipsos MORI. 2015 UK Elections – Exit Polls Results. Available online: www.bbc.com (accessed on 20 May 2015).
14. Gershoni-Eliaho, N. Project 61: Database of published pre-election polls of Israel’s 2015 election. Available online: http://www.InfoMeyda.com (accessed on 13 March 2015).
15. ComRes. EU Referendum Poll. Available online: https://www.comresglobal.com/polls/sun-eu-referendum-poll-june-2016/ (accessed on 15 September 2016).
16. Horowitz, D.L. Comparing democratic systems. J. Democr. 1990, 1, 73–79.
17. Engstrom, R.L. The United States: The Future – Reconsidering Single-Member Districts and the Electoral College. In The Handbook of Electoral System Choice; Palgrave Macmillan: London, UK, 2004; pp. 164–176.
18. Ross, R.E. Federalism and the Electoral College: The Development of the General Ticket Method for Selecting Presidential Electors. Publius J. Fed. 2016, 46, 147–169.
19. Groshek, J.; Koc-Michalska, K. Helping populism win? Social media use, filter bubbles, and support for populist presidential candidates in the 2016 US election campaign. Inf. Commun. Soc. 2017, 20, 1389–1407.
20. Liu, H.; Jacobson, G.C. Republican Candidates’ Positions on Donald Trump in the 2016 Congressional Elections: Strategies and Consequences. Pres. Stud. Q. 2018, 48, 49–71.
21. Schaffner, B.F.; MacWilliams, M.; Nteta, T. Understanding white polarization in the 2016 vote for president: The sobering role of racism and sexism. Political Sci. Q. 2018, 133, 9–34.
22. Lichtman, A.J. The keys to the White House, 1996: A Surefire guide to Predicting the Next President; Rowman and Littlefield Publishers: Lanham, MD, USA, 2016.
23. Norpoth, H. On the razor’s edge: The forecast of the primary model. Political Sci. Politics 2008, 41, 683–686.
24. Norpoth, H. Forecasting presidential elections since 1912. Available online: http://primarymodel.com/ (accessed on 8 November 2016).
25. Chiang, C.-F.; Knight, B. Media bias and influence: Evidence from newspaper endorsements. Rev. Econ. Stud. 2011, 78, 795–820.
26. Watrel, R.H.; Weichelt, R.; Davidson, F.M.; Heppen, J.; Fouberg, E.H.; Archer, J.C.; Morrill, R.L.; Shelley, F.M.; Martis, K.C. (Eds.) Atlas of the 2016 Elections; Rowman & Littlefield: Lanham, MD, USA, 2018.
27. Hardy, R.J. The Presidential Sweepstakes: A Mock Election; University of Missouri: Columbia, MO, USA, 1980.
28. The Western Illinois University. Democratic Ticket Wins Mock Presidential Election; WIU Students Voted Sanders/O’Malley to Take Top Seats. Available online: http://www.wiu.edu/news/newsrelease.php?release_id=13059 (accessed on 15 September 2016).
29. Berg, J.E.; Nelson, F.D.; Rietz, T.A. Prediction market accuracy in the long run. Int. J. Forecast. 2008, 24, 285–300.
30. Erikson, R.S.; Wlezien, C. Are political markets really superior to polls as election predictors? Public Opin. Q. 2008, 72, 190–215.
31. Predictwise. Data-driven polling, change-making insights. Available online: Predictwise.com (accessed on 8 November 2016).
32. Graefe, A. Prediction market performance in the 2016 US presidential election. Foresight Int. J. Appl. Forecast. 2017, 38–42.
33. Brooker, R.G.; Schaefer, T. Public Opinion in the 21st Century: Methods of Measuring Public Opinion. (Unpublished Work). Available online: http://www.uky.edu/AS/PoliSci/Peffley/pdf/473Measuring%20Public%20Opinion.pdf (accessed on 5 December 2018).
34. Prosser, C.; Mellon, J. The Twilight of the Polls? A Review of Trends in Polling Accuracy and the Causes of Polling Misses. Gov. Oppos. 2018, 1–34.
35. Lauderdale, B.E.; Bailey, D.; Blumenau, Y.J.; Rivers, D. Model-Based Pre-Election Polling for National and Sub-National Outcomes in the US and UK. (Unpublished Work). Available online: https://www.jackblumenau.com/papers/mrp_polling.pdf (accessed on 5 December 2018).
36. Cowley, P.; Kavanagh, D. Wrong-Footed Again: The Polls. In The British General Election of 2017; Palgrave Macmillan: Cham, Switzerland, 2018; pp. 259–283.
37. Moon, N. The performance of the polls. In Political Communication in Britain 2017; Palgrave Macmillan: Cham, Switzerland, 2017; pp. 39–48.
38. Toff, B. Rethinking the Debate over Recent Polling Failures. Political Commun. 2018, 35, 327–332.
39. Sturgis, P.; Baker, N.; Callegaro, M.; Fisher, S.; Green, J.; Jennings, W.; Kuha, J.; Lauderdale, B.; Smith, P. Report of the Inquiry into the 2015 British General Election Opinion Polls; British Polling Council and the Market Research Society: London, UK, 2016.
40. Jennings, W. The Polls in 2017. In Political Communication in Britain 2019; Palgrave Macmillan: Cham, Switzerland, 2019; pp. 209–220.
41. Kennedy, C.; Blumenthal, M.; Clement, S.; Clinton, J.D.; Durand, C.; Franklin, C.; McGeeney, K.; Miringoff, L.; Olson, K.; Rivers, D.; et al. An evaluation of the 2016 election polls in the United States. Public Opin. Q. 2018, 82, 1–33.
42. Tourangeau, R. Presidential Address: Paradoxes of Nonresponse. Public Opin. Q. 2017, 81, 803–814.
43. Panagopoulos, C.; Endres, K.; Weinschenk, A.C. Preelection poll accuracy and bias in the 2016 US general elections. J. Elect. Public Opin. Parties 2018, 28, 157–172.
44. Jagers, P.; Oden, A.; Trulsson, L. Post-stratification and ratio estimation: Usages of auxiliary information in survey sampling and opinion polls. Int. Stat. Rev./Rev. Int. Stat. 1985, 53, 221–238.
45. Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)? – Arguments against avoiding RMSE in the literature. Geosci. Model Dev. 2014, 7, 1247–1250.
46. The Henry J. Kaiser Family Foundation. Population Distribution by Race/Ethnicity. Available online: https://www.kff.org/other/state-indicator/distribution-by-raceethnicity/?currentTimeframe=0&sortModel=%7B%22colId%22:%22Location%22,%22sort%22:%22asc%22%7D (accessed on 5 December 2018).
47. Lee, B.A.; Martin, M.J.; Matthews, S.A.; Farrell, C.R. State-level changes in US racial and ethnic diversity, 1980 to 2015: A universal trend? Demogr. Res. 2017, 37, 1031–1048.
48. Pew Research Center. America’s Changing Religious Landscape. Available online: http://www.pewforum.org/2015/05/12/americas-changing-religious-landscape/ (accessed on 5 December 2018).
49. Lin, L.I.-K. A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989, 45, 255–268.
50. Tate, R.F. Correlation between a discrete and a continuous variable. Point-biserial correlation. Ann. Math. Stat. 1954, 25, 603–607.
51. Wasserman, D. Final Results Based on the “2016 National Popular Vote Tracker,” Cook Political Report. Available online: https://docs.google.com/spreadsheets/d/133Eb4qQmOxNvtesw2hdVns073R68EZx4SfCnP4IGQf8/edit#gid=19 (accessed on 2 January 2017).
52. FiveThirtyEight. Statement by Nate Silver, 7 November 2016. Available online: www.fivethirtyeight.com (accessed on 8 November 2016).
53. Kostadinov, B. Predicting the Next US President by Simulating the Electoral College. J. Humanist. Math. 2018, 8, 64–93.
54. Katz, J. Who Will Be President? The New York Times website. Available online: https://www.nytimes.com/interactive/2016/upshot/presidential-polls-forecast.html (accessed on 8 November 2016).
55. Wang, W.; Rothschild, D.; Goel, S.; Gelman, A. Forecasting elections with non-representative polls. Int. J. Forecast. 2015, 31, 980–991.
56. Callegaro, M.; Ayhan, O.; Gabler, S.; Haeder, S.; Villar, A. Combining Landline and Mobile Phone Samples: A Dual Frame Approach; (GESIS-Working Papers, 2011/13); GESIS Leibniz-Institut für Sozialwissenschaften: Köln, Germany, 2011; Available online: https://nbn-resolving.org/urn:nbn:de:0168-ssoar-282659 (accessed on 5 December 2018).
57. McElwee, S.; McDermott, M.; Jordan, W. 4 Pieces of Evidence showing FBI Director James Comey cost Clinton the Election. Vox.com. 11 January 2017. Available online: https://www.vox.com/the-big-idea/2017/1/11/14215930/comey-email-election-clinton-campaign (accessed on 31 January 2019).
58. Green, D.P.; Gerber, A.S. Get Out the Vote: How to Increase Voter Turnout; Brookings Institution Press: Washington, DC, USA, 2015.
59. Noelle-Neumann, E. The Spiral of Silence: Public Opinion, Our Social Skin, 2nd ed.; University of Chicago Press: Chicago, IL, USA, 1993.
60. Coppock, A. Did Shy Trump Supporters Bias the 2016 Polls? Evidence from a Nationally-representative List Experiment. Stat. Politics Policy 2017, 8, 29–40.
61. Buchanan, W. Election predictions: An empirical assessment. Public Opin. Q. 1986, 50, 222–227.
62. Rahat, G.; Hazan, R.Y. The barriers to electoral system reform: A synthesis of alternative approaches. West Eur. Politics 2011, 34, 478–494.
63. Bartels, L.M.; Zaller, J. Presidential vote models: A recount. PS Political Sci. Politics 2001, 34, 9–20.
64. Zeedan, R. Predicting the Vote in Kinship-Based Municipal Elections: The Case of Arab Localities in Israel. J. Muslim Minority Aff. 2018, 38, 87–102.
65. Cuzán, A.G.; Armstrong, J.S.; Jones, R.J., Jr. How we computed the PollyVote. Foresight Int. J. Appl. Forecast. 2005, 1, 51–52.
66. Wolfers, J.; Zitzewitz, E. Prediction markets. J. Econ. Perspect. 2004, 18, 107–126.
67. Atkeson, L.R.; Adams, A.N.; Alvarez, R.M. Nonresponse and mode effects in self-and interviewer-administered surveys. Political Anal. 2014, 22, 304–320.
68. Tumasjan, A.; Sprenger, T.O.; Sandner, P.G.; Welpe, I.M. Election Forecasts with Twitter: How 140 Characters Reflect the Political Landscape. Soc. Sci. Comput. Rev. 2010, 29, 402–418.
69. Mitofsky, W.J. Review: Was 1996 a Worse Year for Polls Than 1948? Public Opin. Q. 1998, 62, 230–249.

Table 1. Pre-election polls conducted in the last week before election day of the 2016 US presidential election (sorted by higher accuracy). ¹

Pollster/Publisher	Clinton	Trump	Others	RMSE
United Press International/CVoter International	48.9%	46.1%	5.0%	0.6%
Insights West	49.0%	45.0%	6.0%	0.8%
Google Surveys	48.0%	45.1%	6.9%	0.9%
Monmouth	50.0%	44.0%	6.0%	1.6%
Fox News	48.0%	44.0%	8.0%	1.8%
Angus Reid Institute	48.0%	44.0%	8.0%	1.8%
Franklin Pierce University/Boston Herald	48.0%	44.0%	8.0%	1.8%
American Broadcasting Company/Washington Post Tracking	47.0%	43.0%	10.0%	3.1%
Gravis Marketing	47.0%	43.0%	10.0%	3.1%
LA Times/University of Southern California Tracking	43.6%	46.8%	9.6%	3.5%
Investor’s Business Daily/TIPP Tracking	43.4%	45.0%	11.6%	4.4%
Rasmussen Reports	45.0%	43.0%	12.0%	4.4%
National Broadcasting Company News/SurveyMonkey	47.0%	41.0%	12.0%	4.7%
POLITICO/Morning Consult	45.0%	42.0%	13.0%	5.2%
McClatchy/Marist	44.0%	43.0%	13.0%	5.2%
Economist/YouGov	45.0%	41.0%	14.0%	5.9%
Columbia Broadcasting System News/New York Times	45.0%	41.0%	14.0%	5.9%
Selzer and Company/Bloomberg	44.0%	41.0%	15.0%	6.6%
The Times-Picayune/Lucid	45.0%	40.0%	15.0%	6.7%
National Broadcasting Company News/Wall St. Journal	44.0%	40.0%	16.0%	7.3%
Reuters/Ipsos	42.0%	39.0%	19.0%	9.4%

¹ For sources of the data, see Section A2 in the Appendix. Notes: As of 8 November 2016 06:00; TIPP—polling unit of TechnoMetrica Market Intelligence.

Table 2. Predictions by major poll aggregators on the last night before election day of the 2016 US presidential election (sorted by higher accuracy).¹

Poll-Aggregator	Clinton	Trump	Others	RMSE
RealClearPolitics	46.8%	43.6%	9.6%	2.8%
The Huffington Post	47.3%	42.0%	10.7%	3.8%
The New York Times	45.9%	42.8%	11.3%	4.0%
FiveThirtyEight	45.7%	41.8%	12.5%	4.9%

¹ For sources of the data, see Section A3 in the Appendix. Notes: As of 8 November 2016 06:00; The Princeton Election Consortium was not included due to their decision this cycle not to present their accurate aggregated projection of the popular vote for each candidate. Instead, it was presented as the projected margin between the leading candidate and the challenger: Clinton +4%.

Table 3. Example of mispredicted results by state (New York Times) on the last night before election day of the 2016 US presidential election.¹

State	Pre-Election Polls (Difference Clinton–Trump)	Actual Results (Difference Clinton–Trump)	Electoral Votes of Mispredicted States
Florida	+2% Clinton	−1.2% Trump	29
Michigan	+7% Clinton	−0.2% Trump	16
North Carolina	+1% Clinton	−3.7% Trump	15
Ohio	0% Even	−8.1% Trump	18
Pennsylvania	+5% Clinton	−0.7% Trump	20
Wisconsin	+6% Clinton	−0.8% Trump	10
Total			108

¹ For sources of the data, see Section A3 in the Appendix. Note: As of 8 November 2016 06:00.

Table 4. Predictions of the Electoral College by major poll aggregators on the last night before election day of the 2016 US presidential election (sorted by higher accuracy).¹

Poll-Aggregator	Clinton	Trump	Others	RMSE
FiveThirtyEight	302	235	1	11.0%
Princeton Election Consortium	307	231	0	11.6%
The New York Times	322	216	0	13.9%
The Huffington Post	324	214	0	14.2%

¹ For sources of the data, see Section A4 in the Appendix. Notes: As of 8 November 2016 06:00. RMSE was calculated on the percentage of electors out of 538. RealClearPolitics did not publish an Electoral College projection.
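The note on Table 4 says that RMSE was calculated on the percentage of electors out of 538. The Python sketch below illustrates that conversion. The actual result is not given in this section, so the sketch assumes the electoral votes as finally cast (Clinton 227, Trump 304, others 7). This choice is an inference made because it reproduces the table; the counts of pledged electors would give slightly different figures. The names `ACTUAL` and `electoral_rmse` are introduced here.

```python
import math

TOTAL_ELECTORS = 538

# Assumption: electoral votes as cast (Clinton, Trump, others).
ACTUAL = (227, 304, 7)


def electoral_rmse(forecast, actual=ACTUAL, total=TOTAL_ELECTORS):
    """RMSE in percentage points between forecast and actual elector shares."""
    squared = [((f - a) / total * 100) ** 2 for f, a in zip(forecast, actual)]
    return math.sqrt(sum(squared) / len(squared))


# Forecasts copied from Table 4 as (Clinton, Trump, others).
forecasts = {
    "FiveThirtyEight": (302, 235, 1),
    "Princeton Election Consortium": (307, 231, 0),
    "The New York Times": (322, 216, 0),
    "The Huffington Post": (324, 214, 0),
}

for name, forecast in forecasts.items():
    print(f"{name}: RMSE = {electoral_rmse(forecast):.1f}%")
```

Under this assumption, all four rows match the RMSE column of Table 4 to one decimal place.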

Table 5. Correlation between RMSE, true/false prediction, and demographic characteristics by state.¹

Topic	Demographic Characteristics by State	NYT Predictions: RMSE	NYT Predictions: True/False Prediction	Huff-Post Predictions: RMSE	Huff-Post Predictions: True/False Prediction	FiveThirtyEight Predictions: RMSE	FiveThirtyEight Predictions: True/False Prediction
Race/ Ethnicity	Hispanic population	−0.52	…	−0.49	…	…	…
	White population	0.40	…	0.48	…	…	…
	Black population	…	…	…	…	0.37	…
	Diversity Index	−0.53	0.39	−0.50	0.38	−0.50	0.38
Religion	Evangelical Protestant	0.45	…	0.51	…	…	…
	Mainline Protestant	0.38	…	0.42	…	…	…
	Historically Black Protestant	…	…	…	…	0.29*	…
	Catholic	−0.40	…	−0.48	…	…	…
	Mormon	…	…	…	…	…	…
	Other Christian	−0.40	…	−0.44	…	…	…
	Total Christian	0.51	…	0.53	…	…	…
	Non-Christian	−0.52	…	−0.54	…	…	…
	Unaffiliated	−0.42	…	−0.43	…	…	…
	Level of religiosity	0.46	…	0.49	…	…	…
Urban	Urban	−0.54	…	−0.59	…	…	…

¹N = 49. Notes: Significance was calculated at p ≤ 0.01; “…” is for not significant; RMSE—by Pearson correlation coefficient; true/false prediction—by Biserial correlation coefficient; States not included were Maine and Nebraska, due to their method of distributing the Electoral College; * Significant at p ≤ 0.05.
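For the true/false prediction columns, a correlation between a binary outcome and a continuous characteristic is needed. The sketch below shows the point-biserial coefficient described by Tate [50]. Treating the coefficient in the note as point-biserial is an assumption, the six data points are hypothetical (they are not the state-level data behind Table 5), and the function names are introduced here. The sketch also checks that the coefficient equals a Pearson correlation computed with the outcome coded as 0/1.

```python
import math


def point_biserial(values, flags):
    """Point-biserial correlation between a continuous and a 0/1 variable."""
    n = len(values)
    ones = [v for v, f in zip(values, flags) if f]
    zeros = [v for v, f in zip(values, flags) if not f]
    if not ones or not zeros:
        raise ValueError("both outcome groups must be non-empty")
    mean_all = sum(values) / n
    # Population standard deviation (divide by n) of the continuous variable.
    sd = math.sqrt(sum((v - mean_all) ** 2 for v in values) / n)
    mean_diff = sum(ones) / len(ones) - sum(zeros) / len(zeros)
    return mean_diff / sd * math.sqrt(len(ones) * len(zeros) / n ** 2)


def pearson(xs, ys):
    """Pearson correlation coefficient, used here as a cross-check."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / math.sqrt(sum((x - mx) ** 2 for x in xs)
                           * sum((y - my) ** 2 for y in ys))


# Hypothetical data: a diversity index for six states and whether the
# forecast for each state was correct (1) or not (0).
diversity = [0.20, 0.30, 0.35, 0.50, 0.60, 0.70]
correct = [0, 0, 1, 1, 1, 1]

r_pb = point_biserial(diversity, correct)
print(f"point-biserial r = {r_pb:.2f}")
print(f"Pearson r on 0/1 coding = {pearson(diversity, correct):.2f}")
```

A positive coefficient in this toy example means that correct forecasts go together with higher diversity, which is the direction reported in the Diversity Index row.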

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
