ROTOWIRE.COM NFL MLB / Minors NBA NHL GOLF RACING    COMMUNITY FORUMS PODCASTS MYROTOWIRE ASK AN EXPERT GAMES

Sample Size and Statistical Significance

I'm not going to mention a certain Twins catcher by name, but I do want to make a distinction between the two concepts above because it seems in our discussion of said catcher, they have been confused.

When you're looking at a set of data - a player's at-bats, a pitcher's innings, whatever - and you see something in it that appears to be an outlier, you're faced with the question of whether there's a cause to that anomaly or whether it's just random fluctuation. In other words, is Justin Verlander the same guy he was last year, but for one month, he faced a combination of free swingers, favorable shadows over the batter's box, hitters who faced Tim Wakefield the day before, or is there a more central cause - that Verlander's stuff is harder, better located and/or nastier?

And in considering that question, the sample size of the data is important because over a short period of time, random fluctuation is more likely to occur. Put differently, you might flip an evenly weighted coin 10 times and get 8 heads. You will never in a trillion years flip it 10,000 times and get 8,000 heads. Okay, most of us grasp that.

The question is - at what point do we conclude that the coin is *not* evenly weighted? Well, it's not simply a matter of how many times you flip the coin, i.e., sample size. It's *also* a matter of the *magnitude of the deviation*. If you get 16 heads and four tails, I wouldn't be so sure the coin is unevenly weighted. But if you get 20 heads and no tails, it's almost certainly so. (Less than 1 in a million odds of that). In both cases, the sample size is quite small, but in the latter, it's more than sufficient. When the coin lands at a an 80-percent heads clip, you need a larger sample to determine the coin is rigged because the magnitude of the deviation from the baseline (50/50) is less. And if the coin lands heads at a 55-percent clip, you need an even larger number of flips to determine whether it's rigged.

So understand that the sample size is only one of two factors in determining the significance of the outlier. The other is the magnitude.

That's why when you see Verlander strike out 60 batters in 44 IP or Joe Mauer - f*** it, I'll mention him, hit 11 home runs, you cannot simply say, "it's only one month, I'm not a believer" without also considering the magnitude of the deviation.

Is 60K in 44 IP a big enough magnitude to make one month significant. Is 11 home runs? In my opinion, yes. But whatever your opinion, you must address both factors if you're going to get a good gauge of whether it's dumb luck or a new baseline.

Comments

By: Colin Wyers
On: 6/2/2009 5:24:00 PM
You're forgetting a third factor. The odds of a MLB player having a hot streak like Mauer's are pretty low. The odds of AN MLB player having a hot streak like Mauer's are considerably less so. In other words, given a normal distribution we should expect SOME unlikely things to occur in the population of MLB players. Picking them post facto and looking at how many SDs they are from the mean isn't instructive.
 
By: ephinz
On: 6/2/2009 7:26:00 PM
Mauer homered again - an even better 15 HR pace for June!
 
By: iceguy
On: 6/3/2009 9:08:00 AM
What Mauer has done would be unlikely for a player with a 1 HR every 20 AB rate (0.5% of hitting 12 or more in 102). Is simply impossible for a player with a 1 HR every 61 AB rate (less than 1 in 10,000,000). Which is to say that it's safe to conclude his rate has gone up. But how much, is the question.

And to that end, Colin's point is a good one. Sure, there is less than 1% chance that a 1-in-20 Mauer would hit 12 in 102. But considering all of the 250 hitters with 100 ABs this season, there's a 72% chance that at least one of them would do something that rare (ie 0.5%). So that it happens to be Joe Mauer is surprising, but that any random player would perform so far outside his skill set is not that surprising.

If I had to guess at Mauer's HR rate the rest of the year, I would say somewhere around 1 in 20. That would make him a .320-25-115 C over a full season. I think that's worth a first round pick, but more like a late first-rounder.
 
By: Chris Liss
On: 6/3/2009 3:53:00 PM
Right, but I think Colin's point is better addressed by looking at the specific player. If Mark Buehrle strikes out 9 guys per nine for a month, but his stuff is largely the same, you'd want to be skeptical. But if he's throwing 97 mph, you'd be more likely to believe. Mauer's physical stature and age make it more believable than a random player with a home run spike, e.g., Jason Bartlett or Ryan Theriot. But it is another factor - but one where the difference between players and coins becomes more apparent.
 
By: robin1
On: 1/12/2011 11:28:00 PM
Very important, time management and the creative flow. A great way to get into the creative flow is by utilizing time wisely concentrating through meditation.
___________________________
wool rugs
 

Leave a comment

Commenting is restricted to registered users only. Please register or login now to submit a comment.

Tell Someone

  • Digg it
  • submit to reddit reddit
  • Add to Mixx!

Recent Favorites

How BABIP Affects the Value of Strikeouts and Walks
The league-average batting-average-on-balls-in-play (BABIP) has been on a downward trajectory for the last half decade, something that has implications for the values of different at-bat outcomes. BABIP is down precipitously this season (.289), though it's hard to tell to what extent that's due to all the shifting defenses (better data on where balls are hit have enabled this), or is simply just an April cold-weather/small sample anomaly.
NHL Playoff Observations -- Monday
Just a few things that caught my eye ...
  • Alexander Semin was on the wrong side of the grocery stick most of the night. You know, that spot between the back-up goalie and the rest of the team ... the one where the dead wood goes so it doesn't get in the way of on-the-fly changes.
  • Honey badger don't care. Honey badger doesn't give a $%!&.  T.J. Oshie gave up four inches and 50 pounds to Dustin Penner, but he didn't care. Then again, honey badger didn't care when he hit Rick Nash a few years ago, either. I wonder if the NHL would let me get a Blues #74 jersey with Honey Badger on the back.
Buy Lowest
Every year, we hear advice about how it's time to "buy low" on struggling stars like Albert Pujols, Giancarlo Stanton and Robinson Cano. "Get them now for cheap before they heat up!" Except that the market for superstars - at least in competitive leagues - usually doesn't move much after one bad month. (Whether it should in these cases is another question). It's more likely that the Pujols and Stanton owners are doubling down, figuring that only a hot streak from a player of that caliber can undo the damage that's already been done.

That doesn't mean you can't take advantage of the market's early overreaction, but you'll have to aim a little lower and take on quite a bit more risk. The players listed below are those who should be available at an actual discount.

Fantasy Rookies
Now that the 2012 NFL Draft is complete and rosters look about the way they will when training camps open in late July, it’s time to start the speculation train full steam ahead to fantasy draft days. Many players taken in this class will have a fantasy impact, some immediate, some in small spurts as rookies and some not for years to come.
Cowboys Draft: First Blush Thoughts
My initial reactions to the 2012 Dallas draft crop.

RSS Feeds