Event Calendar
May 2012
(view month)
S M T W R F S
* * 01 02 03 04 05
06 07 08 09 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30 31 * *
<< (add event) >>


User Blox 4
- Put stuff here

Barack Obama
"Lincoln Sells Out Slaves"
by: Rob Kailey - Sep 13
1 Comments
If You Haven't Seen This
by: Rob Kailey - Apr 28
5 Comments
Impeach the President?
by: Rob Kailey - Mar 16
15 Comments
It's the system, stupid!
by: Jay Stevens - Oct 25
7 Comments

Search




Advanced Search


Rob Kailey is a working schmuck with no ties or affiliations to any governmental or political organizations, save those of sympathy.

Previewing the Congressional primary liveblog tomorrow

by: Twohundertseventy

Mon Jun 07, 2010 at 09:32:22 AM MST


(Meet Rasmus Pianowski, a 17-year-old kid from Hamburg, Germany, obsessed with American politics. Apparently, he followed Nate Silver's 538.com pretty closely, and has been inspired to try out his own election prediction models. He's done well enough to earn himself a research intern position at Pollster.com. He's volunteered for a lot of Democratic statewide candidates -- Al Franken, for one -- and recently was inspired by Tyler Gernant's candidacy to volunteer for him...Enjoy! - promoted by Jay Stevens)

This is going to be a fairly technical post, so don't say I didn't warn you.

I'm going to liveblog the incoming results of the Democratic Congressional Primary tomorrow, and I likely won't have time to explain what I'm doing then, so I'll simply do it now.

I'm going to employ two separate statistical methods to extrapolate incoming county returns to the other parts of the state, so that we'll hopefully know where the race is roughly going to end up when 10 or 15 counties have reported.

The first one is a fairly straightforward ordinary least squares multiple regression analysis.

In layman's terms that means that the results of the counties that already have reported are broken down and analyzed based on up to 16 different socio-economic and political variables. The goal is to explain what drives the election results-- that's fairly intuitive. For example, in the Alabama general election 2008 it's obvious that the racial make-up of each county is a good predictor of the McCain-Obama election result.

Now, you need to quantify that, and you also need to expand it to more than just one variable- and that's what this OLS-multiple linear regression does.

Basically, we'll end up with a formula like (I'm making this up) GernantVote%= MedianHouseholdIncome*0.00004+percentage w/o health insurance*-1.2+Gore voteshare*0.8+20.

Then, we just need to plug in the MHI, percentage w/o health insurance and Gore's voteshare for each MT county, and we have an estimate for each county. I'm weighting them by the percentage that the counties historically contribute to Democratic Primary electorates, and then we have an estimate for the whole primary. This is a fairly robust method.

The second one is a so-called  k-nearest neighbor Analysis.

Basically, I'm analyzing how similar Montana's counties are to each other by standardizing them. Essentially, if they have roughly similar values on a number of variables, their similarity score will be very low, if they're diverse, it will be high.

The 'twin counties'- the most similar to each other- are Sanders and Mineral at 0.69, while the two most different ones are Gallatin and Roosevelt at 5.08.

Then, for each county that hasn't reported yet, I'm taking the three most similar counties out of those that do have reported, and average the results of those counties.

When those results are again weighted by turnout, we have a second estimate of where the race is going to end up.

This should work fairly well when the counties come in in an order that's independent from the voting results. If for example McDonald's strongholds all come in early and Tyler's late, it wouldn't do well, as it could choose only from McDonald-counties when it's looking at what  counties are similar.

If both approaches show roughly the same result, we can be relatively sure that it's going to work out that way.

I'm already very much looking forward to tomorrow.  

Twohundertseventy :: Previewing the Congressional primary liveblog tomorrow
Tags: , , , , , (All Tags)
Bookmark and Share
Print Friendly View Send As Email

Thanks 270! (0.00 / 0)
Looking forward to reading your updates.

Live blog & math (0.00 / 0)
Well reasoned and explained. I'll be following the live blogging and subsequent analysis with great interest.

or we could just count the freaking votes as they come in..... (0.00 / 0)
obvious solutions sometimes escape people, but not bears.

[ Parent ]
We'll have both! (0.00 / 0)
i think the math just gives us a possible advance picture.  At the very least, it will be interesting to see how it works right?

I would be interested to know how you are really weighing the counties (0.00 / 0)
Is a 5.08 weight difference really significant?  Thanks for doing this, it is great to see math/science applied to politics.  

Cheers,  


[ Parent ]
Those numbers are not weights, they are (0.00 / 0)
scores that basically tell you how different/similar two counties are demographically and politically.

What I'm doing is taking the mean and the standard deviation (measuring the variance) of several variables. Then I'm calculating z-scores for them, which measure how far above or below the mean a county is.

If, for example, Al Gore's average vote share in Montana was 33% with a standard deviation of 4%, a county in which he got 41% would receive a score of 2, a county in which he got 29% would receive  a score of -1.

Then for each pair of counties I'm adding up the differences and dividing by ten (because imo smaller numbers are easier to digest).

So that basically two counties with a difference of 5.08 about 2-3 standard deviations apart on average, which means that they have, like, nothing in common. The two counties with a difference of 0.69 are on average just about 0.5 standard deviations apart, which means that they fall in the same range on most things.

I'm weighing the county results by past turnout, so that for example Missoula gets a weight of 11.8% (because in past primaries they contributed about 11.8% of the Dem primary electorate), while Petroleum gets a weight of 0.04%.  


[ Parent ]
Gotcha (0.00 / 0)
But what variables are you using?  

[ Parent ]
Jay's going to post the raw data I'm (0.00 / 0)
using, and I'm going to explain the variables then.  

[ Parent ]
I'll post the variables in a spreadsheet soon... (0.00 / 0)


[ Parent ]
I'd love to see this data for the GOP primary... (0.00 / 0)
...seems that race fits in more with the national anti-incumbent narrative...

me too. (0.00 / 0)
there is some scary stuff going on in Republican primaries this cycle.

[ Parent ]
i suspect... (0.00 / 0)
...it would be easy enough to simply do GOP primary results into the spreadsheet, with the same variables...I've got the spreadsheet, I'll have to post it...

[ Parent ]
It would probably do okay as well, yeah. (0.00 / 0)
I will probably do some more extensive post-primary analysis of all the primaries as well, when the final results are in.  

[ Parent ]
Thanks 270! (4.00 / 1)
This is great.  If you email me at mntnacowgirl@gmail.com, I'd be happy to share with you what I mean about this video.  Your call of course either way.

[ Parent ]
If you didn't see it yet, I wrote you a mail :) (0.00 / 0)


[ Parent ]
I replied. Thanks for your advice and insight : ) (0.00 / 0)


[ Parent ]
Menu

Make a New Account

Username:

Password:



Forget your username or password?


Bookmark and Share

Poll
Voting. Useful or not?
Yes!
No!
Maybe, but only if you vote my way.
There are theories that ...
Meh ...

Results

Blog Roll
  • A Secular Franciscan Life
  • Big Sky Blog
  • David Crisp's Billings Blog
  • Discovering Urbanism
  • Ecorover
  • Great Falls Firefly
  • Intelligent Discontent
  • Intermountain Energy
  • Lesley's Podcast
  • Livingston, I Presume
  • Great Falls Firefly
  • Montana Cowgirl
  • Montana Main St.
  • Montana Maven
  • Montana With kids
  • Patia Stephens
  • Prairie Mary
  • Speedkill
  • Sporky
  • The Alberton Papers
  • The Fighting Liberal
  • The Montana Capitol Blog
  • The Montana Misanthrope
  • Thoughts From the Middle of Nowhere
  • Treasure State Judaism
  • Writing and the West
  • Wrong Dog's Life Chest
  • Wulfgar!

  • Powered by: SoapBlox