doubts about the data - Marist versus Trafalgar

blazespinnaker commented 3 years ago

Just wanted to say, huge fan of this dataset, huge fan of Nate (bought and loved the book) for like forever. Really hope decency and rationalism wins in 2020.

But, I have some doubts about the pollster rankings, after looking at some data[1][2]. Given that you're being so transparent about this data and I haven't found anything on the web, I assume I'm missing something obvious. Please let me know what it is. Using real math, if you can. I love math. You can't argue with Math!

In particular, I'm looking at Marist College (ranked A+, with 0.2% R house effect!) versus Trafalgar (Ranked C- with R+3.1% ).

I picked these two pollsters as Trafalgar was pretty dramatically right in 2016 and looking at their 2018 results, they caught the crazy squeakers in Florida. Marist is interesting because a lot of polls and ranked highly by 538.

So Trafalgar and that 48 to 46 margin in PA for Trump in 2016. 48 to 44 in FL. Wow. Scott/DeSantis to win in Florida in 2018. It's like they are from the Future.

That said, in 2018, they missed and seemed to over estimate R in some other key states (NV, AZ, went to Ds), but then missed over estimated Dems in OH (went to R). Not sure what the political partisan calculus is there, if there is any, but ok.

Trafalgar also over estimated dems in LA for Gov win in 2018. Not very interesting as it was a 20% margin of victory state (and he did win, after all). But - you have no entry for bias there. Hmmm. I guess if I wanted to fix my house effect I might put my thumb on the scales of polls that don't really matter to me and could help say that I'm fair and balanced? Not sure why you left that one blank.

Now onto Marist. What happened in 2018? Who paid these folks? The average over estimate was like 4-5% in favor of democrats across the board. 50-46 in Florida, dems to win, 4. Dems to win Senate in Indiana! Indiana! They were off by around 8% in favor of Dems, and of course the Rs took it. What was Marist smoking? I want some.

I don't think there is a lot of signal in their call for Ds to underperform in Wicker/Baria Senate race in MS. Nice to see Marist finally show some love to the republicans, but did it have to be in a 20% margin of victory race?

If I were unconsciously biasing polls in a way that helped close races for my political ideology, I probably would unconsciously not care about biasing polls in a way that didn't help in races I didn't care about. So, I like to drop this result in the same that the result for Trafalgar seemed to be dropped (or at least was blank). It seems fair.

Without the Wicker/Baria race, I think the average error is about 5% or so in favor of democrats. With it, around 3% in favor of democrats? Anyways, more than Trafalgar's average error for republicans

To add insult to Injury, Marist called 2016 Florida President General 45.5 to 45 - For Hillary. I mean, come on :)

You know, I am ok with rating Trafalgar poorly. I am sure there is a good reason for that which I won't argue. But relatively speaking, Marist is just as bad if not worse, frankly. Sure, they've been around forever, but results and accuracy matter.

blazespinnaker commented 3 years ago

All the data is not in yet, however looking at NC/FL it's obvious Marist did very poorly and their 'house effect' is so obviously not correct. As did 538, generally speaking. I would suggest doing away with the pollster grading until you get your house in order. Some suggestions:

Consider splitting it up into a score card, instead of a grade mechanism. This will clarify. The grading mechanism as it stands is condescending and clearly not valid.
You may already do this, but it would be good to do more. Weight polls more by areas of accuracy. Dig into cross tabs especially. Where did the tabs line up with reality? Where did they not? I know we don't have great exit polling
Regression! I don't see you talking enough about this. How are you regressing against previous elections? Why aren't you regressing? If you did, 2020 would have been much more accurate.

e-traudt commented 3 years ago

Regression is a statistical skill, 538 is a group of sportsball spreadsheet jockeys with no actual math ability.

On Wed, Nov 4, 2020 at 7:20 PM blazespinnaker notifications@github.com wrote:

All the data is not in yet, however looking at NC/FL it's obvious Marist did very poorly. As did 538, generally speaking. I would suggest doing away with the pollster grading until you get your house in order. Some suggestions:

Consider splitting it up into a score card, instead of a grade mechanism. This will clarify. The grading mechanism as it stands is condescending and clearly not valid.

You may already do this, but it would be good to do more. Weight polls more by areas of accuracy. Dig into cross tabs especially. Where did the tabs line up with reality? Where did they not? I know we don't have great exit polling

Regression! I don't see you talking enough about this. How are you regressing against previous elections? Why aren't you regressing? If you did, 2020 would have been much more accurate.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/fivethirtyeight/data/issues/267#issuecomment-722045786, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADEXKGYYAMVGV6HWRJSEXRLSOHVWFANCNFSM4TFXXHCA .

abitrolly commented 3 years ago

Wow. That's a lot of grief. :D Care to fill a PR with code and less emotion? Arguments like "clearly not valid", "it's obvious" reminds me of lovely state propaganda I eat every day in Belarus.

blazespinnaker commented 3 years ago

My initial post was a bit wordy, but I think the point is valid - using regression properly would have overweighted pollsters that got it right in 2016 and I suspect the results would have been more accurate. It would have also sussed out the 'house effect' that I think is definitely wrong here.

As for the other one, I think it's also valid statement. The grading is condescending, there really is no other way to say that. If you look at the marist results for NC (+6 D) and Florida (+5 D), how can you possibly say the grading is valid? Their average error is worse than Trafalgar. 3.25 for R, 2.35 for D. Trafalgar - 1.25 for R and 2.25 for D.

I'm not arguing that Trafalgar doesn't have issues, but there is a blind spot here with Marist. And Marist frequently shows up in 538 because they poll a lot.

The house effect for Marist is 0.01 R. That's just impossible to justify and for an A+ pollster I don't understand how this was missed.

BTW, it's really not that hard to do regression. Plenty of tutorials out there and np / pandas / scikit make it very easy to do. Logit regression has always produced good results for me.

I'll try to do an example of it.

abitrolly commented 3 years ago

@blazespinnaker GitHub seems to be able to render Python notebooks pretty well, although I am looking at https://observablehq.com/ for doing any proof work for visualization.

e-traudt commented 3 years ago

Here’s a sharper summary

https://mobile.twitter.com/Peoples_Pundit/status/1325875305211367424

Peoples_Pundit @Peoples_Pundit https://mobile.twitter.com/Peoples_Pundit All of these so-called "models" were wrong — again — because they're not real models. These are overpaid poll-readers, who cannot even do that with competence. Shame of them now. Shame on you later if you even bother to visit their garbage websites.

On Thu, Nov 5, 2020 at 5:03 AM blazespinnaker notifications@github.com wrote:

Yeah, I'll rewrite it. Sorry.

— You are receiving this because you commented.

Reply to this email directly, view it on GitHub https://github.com/fivethirtyeight/data/issues/267#issuecomment-722274742, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADEXKG454JUO3WE2GHJCJI3SOJZ6TANCNFSM4TFXXHCA .

blazespinnaker commented 3 years ago

Please, let's keep the politics out of a technical forum. It is obfuscating the root issue, which is the rational of giving an A+ pollster a 0.001 R house effect when it clearly has a large D bias.

e-traudt commented 3 years ago

Polling is not a technical profession, it is paid political advertising. The only remaining question is how and how much Joey 2-Masks paid.

[image: image.png]

On Wed, Nov 11, 2020 at 11:59 PM blazespinnaker notifications@github.com wrote:

Please, let's keep the politics out of a technical forum. It is obfuscating the root issue, which is the rational of giving an A+ pollster a 0.001 R house effect when it clearly has a large D bias.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/fivethirtyeight/data/issues/267#issuecomment-725837933, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADEXKGZOVOTPW3AVDFA6HUTSPNTUPANCNFSM4TFXXHCA .

ScottSupak commented 1 year ago

lol

blazespinnaker commented 1 year ago

Marist's obvious dem bias paid off bigly in the midterms. Also, shifting to text-to-web surveys was obviously the right call and has improved their accuracy. Do they deserve a 'higher grade' now than they did going into the midterms? Sure, just as Trafalgar deserves a lower grade.

FWIW, I think the future looks very bright for polling, much more so than it did before. Polling companies are realizing that they can no longer rely on phones and need to use appropriate data science / ML technique to do predictive polling.

Methodology is about to get very very opaque, but also very very accurate.

fivethirtyeight / data

doubts about the data - Marist versus Trafalgar #267