Projection, Projection, Projection

May 26, 2010 01:23

So I was a bit curious as to how projections other than PCA might work on the senate data.

Random Projection: So this looks pretty easy, just multiply by a random matrix! But how could that possibly work? In fact it sounds stupid! Well according to some complicated math that I don't really understand: it could actually work pretty well:

2007-20082009




Well it sort of works but not really. There is a price you pay for just being random!

Principal Component Analysis(PCA): So "random projection" is actually a random LINEAR projection and PCA is in some sense the BEST linear projection so we would expect PCA to be better right???

2007-20082009




Yeah, it actually is better. But it also requires much more computation- so there is a trade off there.

Multi-dimensional Scaling(MDS): This is in some sense the BEST NON-LINEAR projection. Unfortunately we can't compute it exactly but MATLAB has a function to approximate it. I actually have no idea how good of a job the function does, but its code is several pages long, so it has to be good ... right?

2007-20082009




CONCLUSIONS?
So what "value-add" do you get from this that you wouldn't get just from following politics? After all some things that jump out in the visualizations are pretty well known:

(1) Senator Collins and Snowe of Maine are the most liberal Republicans.

(2) Ben Nelson of Nebraska is an awfully conservative Democrat.

etc.

But here is one thing that I noticed that I thought was kind of interesting:

Democrats seem to vote together more frequently than Republicans.

This is a bit counter intuitive since Democrats are considered to be more ideologically diverse than Republicans. Perhaps it is just a function of being in the majority: the Majority Leader only schedules votes on bills on which most Democrats agree.

Still it is not the kind of thing that would be easy to notice without quantitative data.

science

Previous post Next post
Up