Friday, September 28, 2012

September 28, 2012 Analysis

Polls included:
R+0 - Rasmussen Daily Track
O+6 - Bloomberg
O+1 - AP/GfK
O+3 - Battleground
O+5 - Fox News

O+1.88% - Current RCP Average
O+1.49% - Average using the 2008 turnout model
R+1.28% - Average using the 2010 turnout model
R+1.34% - Average using the 2004 turnout model
R+2.04% - Average using the Rasmussen Party ID turnout model

Today the WSJ/NBC poll drops out of the RCP average and the Fox News poll is included.  This results is a slight shift in the averages toward Obama.  Also I'm a little surprised to see Rasmussen tied.  Usually on a Friday Obama begins to pick up a point or two.  Typically, Obama is leading by 2 in the daily track my Monday.  It will be unsurprising if that happens again this weekend.

Thursday, September 27, 2012

Fox News Poll - September 27, 2012


Tonight Fox released a new poll.  It is getting better in terms of partisan split, but still not quite there.

Top line is a 5 point lead for Obama, 48-43 with 7% undecided.

Let's start with the good things they have done.  The partisan split is D+3, which is on the surface, defensible.  However, it over samples Democrats beyond the 2008 turnout, and undersamples Independents.  It is also continuing to show Independents favor Obama, which is counter to all the other polls out there.  It is closer though, the last time it was by 10 points, this time it is by 4 points. The D/R/I sample is 41/38/21.

So lets plug these numbers into the reweighting model.

Starting with an assumption that undecides will break 50/50 we get the following:
O+5 - Current result
O+7 - 2008 turnout
O+3 - 2010 turnout
O+3 - 2004 turnout
O+1 - Rasmussen Party ID

Now using my default assumption that undecides break 2:1 to the challenger:
O+5 - Current result
O+5 - 2008 turnout
O+1 - 2010 turnout
Even - 2004 turnout
R+1 - Rasmussen Party ID

People are going to use this poll as proof that Romney's campaign is off the rails.  However, when you look at the trend in this poll specifically, it is movement of several points toward Romney.  I will include this poll in the average tomorrow along with the Rasmussen update.


I want to highlight something...


Down in the methodology post something important is buried.  I think it needs its own post.

What really happened in 2008?

If you listen to the stories of the Hope and Change enthusiasm, you would think that the Democrats got millions upon millions of new voters to show up at the polls and vote for them.  This is compounded by the turnout discussion we use regarding 2008, "It was a D+7 year".

But what does D+7 really mean?

In 2004 122 million people voted.  The partisan ID of those votes are:
D 37% - 45.1 million
R 37% - 45.1 million
I 26% - 31.9 million

In 2008 131 million people voted. The partisan ID of those votes are:
D 39% - 51.1 million
R 32% - 41.9 million
I 29% - 38 million

So the actual changes from 2004 to 2008 are:
D + 6 million
R - 3.2 million
I + 6.1 million

The combination of Republicans staying home and additional Independent voters is 50% higher than the number of new Democrats that showed up.

It shows clearly why Obama is running the campaign he is running.  He needs to do three things. 
  1. He needs to keep his base voting at historic levels.  
  2. He needs to keep Republicans at home like they were in 2008.  
  3. And he needs to make Romney toxic to Independents.
How is that working out for him?

A lot of reports are out there that Democrat enthusiasm is low, however the Dem convention did help him, so we can give him a solid B+.

Republican enthusiasm matches or exceeds Democrat enthusiasm whenever it is measured.  There are a lot of broken glass voters against Obama.  Call this a D-.

Finally every poll shows at least 50/50 support among Independent for Romney, sometimes much higher.  So this is a failure, I'm giving him an F here.

So a back of the envelop expected result for 2012 looks like:
D - 50 million
R - 52 million
I -38 million (at a 52/48 split it would be 19.8m Romney/18.2m Obama)

Obama: 68.2 million votes
Romney: 71.8 million votes


September 27, 2012 Analysis

Polls included:
R+0 - Rasmussen Daily Track
O+5 - WSJ/NBC
O+6 - Bloomberg
O+1 - AP/GfK
O+3 - Battleground

O+1.88% - Current RCP Average
O+1.46% - Average using the 2008 turnout model
R+1.50% - Average using the 2010 turnout model
R+1.58% - Average using the 2004 turnout model
R+2.31% - Average using the Rasmussen Party ID turnout model

September 26, 2012 Analysis


Polls included:
R+2 - Rasmussen Daily Track
O+5 - WSJ/NBC
O+6 - Bloomberg
O+1 - AP/GfK
O+3 - Battleground

Note regarding Rasmussen, he released two numbers, one with leaners and one without.  RCP chose to use the "without leaners" in their average, even though other polls they include do include leaners.  I am averaging using the "with leaners" number for consistency.

O+1.63% - Current RCP Average
O+1.21% - Average using the 2008 turnout model
R+1.75% - Average using the 2010 turnout model
R+1.83% - Average using the 2004 turnout model
R+2.56% - Average using the Rasmussen Party ID turnout model

Methodology

Warning: Math ahead...

The election of 2012 is turning into a joke of epic proportion regarding the public polling provided.  Public polls are supposed to be predictive of voter behavior, not instruments of propaganda.  While I personally support Mitt Romney, I am not interested in cherry picking polls or pushing a specific result.  I am interested in knowing where my chosen candidate stands with the electorate.


4 and 8 years ago we were able to look at the Real Clear Politics (RCP) average and get a sense of the state of the race.  However, in 2012 the RCP average is not indicative of where the race stands.  


If you take the polls used in the RCP average, and look at their internals, you find that the sampling is not based in reality.  In 2008 the Democrats had their best year in several generations, with 39% of the electorate compared to 32% Republican (D+7).  This was a turnout proportion unmatched in modern times.  In 2010 we saw an Even turnout and in 2004 an Even turnout.  Many of the polls in the RCP average are reporting results that are higher than the 2008 turnout results.  D+10 or D+11 are not untypical, and one of the Pew polls was even D+19!


What I am doing is similar to what is being done at unskewedpolls.com, except I am offering a range of results instead of a single one.  I am going into internals of the public polls and reweighting them to give a different polling average based on 2008 actual turnout, 2010 actual turnout, 2004 actual turnout,  and a 3 month rolling average of the Rasmussen Party Identification poll.

Turnout Models

We start with a predicted make up of the voting population on election day.  Three turnout models are used that match the actual voter identification during three recent elections, 2004, 2008, and 2010.

2004

37% - Democrat
37% - Republican
26% - Independent

2008
39% - Democrat
32% - Republican
29% - Independent 

2010
36% - Democrat
36% - Republican
28% - Independent 

One interesting thing to note, which has important implications on the analysis of polls, is that in 2008 Democrats turned out only 2-3% above their normal levels.  The real difference is that Republicans didn't vote.  If you are to believe the polls used in the RCP average, then you must believe that Democrats will turn out at levels four times higher than than their record surge, or believe that Republicans are going to stay home at even higher levels than in 2008.  Do either of these assumptions make any sense in the age of Obama?

Rasmussen Party ID poll

Every month, Rasmussen conducts a poll on partisan identification.  He asks 15,000 people whether they identify themselves as Democrat, Republican, or Independent.  In polling terms, 15,000 samples is a huge number, resulting in a margin of error (MOE) of less than 1%.  Interestingly, he doesn't use this information in weighting his daily tracking poll, instead using an internals weighting based on responses received in the same poll over the last few weeks.  The Rasmussen party ID poll has proven to be a good indicator of party performance in recent elections, in 2004 it reported D+1 and in 2008 D+8, which were within 1 point of the actual turnout (leaning slighting in favor of Democrats).

What I am doing is using a 3 month average of this poll to provide a 4th turnout model in my reweighting.  My reasoning for using a 3 month average is that his August poll produced a huge jump in Republican identification, and I want to even out the numbers. Here are the current results:


June

34.0% - Democrat
35.4% - Republican
30.5% - Independent

July

34.0% - Democrat
34.9% - Republican
31.1% - Independent

August
33.3% - Democrat
37.6% - Republican
29.2% - Independent

Average (3 month - 45,000 samples)
33.8% - Democrat
36.0% - Republican
30.3% - Independent

Reweighting Polls

In order to reweight the polls, I start with the top line result for the poll.  For example the AP/GfK poll is reporting a result of Romney -1%.

Next I look at the internals of the poll to get the partisan breakdown.  In the AP/GfK poll it is Democrat/Republican/Independent (D/R/I) of 50/37/13.  Obviously the polling sample in this poll is way off with a D+13 sample.

Next I calculate the modification to the top line number that should be made under each of the turnout models.  The formula for this is ((Target Party ID) - (Reported Party ID)) x (Target Party ID/50).  In English, this is the difference between what was reported and what is expected for each identification times the expected identification divided by 50.  It results in how much the top line result should be adjusted for each possible party identification.  Using the Democrats and the 2008 model the Democrats would have (39-50)*(39/50) = -8.58 adjustment to the top line result (-1 + 8.58 = 7.58 toward Romney).


I do not adjust for partisan splits.  Some polls report x% of Republicans will support Obama, and vice versa.  I assume the value of a Republican in votes for Romney is equal to the value of a Democrat in votes for Obama.  I also don't try to adjust for voter enthusiasm.  If you want a sense of that, then pay attention to the turnout models.  2004 is a good Republican year, the Rasmussen Party ID would be an unprecedented Republican year.


I also adjust Independents to match the specific model.  By default I assume an independent to be a neutral choice, worth 50% to Romney and 50% to Obama.  However, if a poll reports a preference of Independents, then I will weight the result to account for the differential.


Finally, I am including a modification factor for the undecideds reported by the poll.  I maintain this as an independent variable.  When I assume a 50% split of undecides, then there is no affect on the poll.  Most of the time I will assume a 2:1 split toward the challenger Romney, since that matches historical trends.  Note there are exceptions though, in 2004 they split 2:1 for the incumbent.


The reweighting of the AP/GfK poll used as an example thus results in:

Original - Obama +1
2008 - Romney +6
2010 - Romney +10
2004 - Romney +10
Rasmussen - Romney +11

As a final step I am averaging all of these reweighted polls and giving the result.  Note that I do not include polls of which I can't see the internals.  Doing so would force apples to oranges comparisons, which negates the point of this analysis. This means I can't include Gallup in the average, and drop it out of the RCP average as well.  This is why my reporting of the RCP average will differ from what they post on their web site.