Baseball Simulations
How likely are the world records we hold dear? Should they have happened? Should they been set by the people who did them? There's an New York Times Opinion piece written by some researchers who examined Joe DiMaggio's 56 game hitting streak to determine how likely it was to happen again. Turns out it's very likely.
In the 10, 000 simulations the researchers ran on the entire history of baseball:
More than half the time, or in 5,295 baseball universes, the record for the longest hitting streak exceeded 53 games. Two-thirds of the time, the best streak was between 50 and 64 games.
In other words, streaks of 56 games or longer are not at all an unusual occurrence. Forty-two percent of the simulated baseball histories have a streak of DiMaggio’s length or longer. You shouldn’t be too surprised that someone, at some time in the history of the game, accomplished what DiMaggio did.
The real surprise is when the record was set. Our analysis reveals that 1941 was one of the least likely seasons for such an epic streak to occur.
In the rest of the article they discuss the other people more likely (based on the simulations) to have made the streak.
tags: emerging tech, etech
| comments: 8
| Sphere It
submit:
0 TrackBacks
TrackBack URL for this entry: http://blogs.oreilly.com/cgi-bin/mt/mt-t.cgi/6416
Comments: 8
I can't imagine that a simple statistical model would be all that helpful; certainly there're a lot of other factors (didn't Yogi Berra say that baseball is 90% mental, and the other half is physical?), and I could guarantee, as a manager, that I could break any streak: just have the pitcher(s) walk the guy at every at bat.
The problem with statistical simulation models is that they start by assuming that x = 1, then go about the whole algorithm to end up with the learning that in most cases, x = 1 or thereabouts. And I say this from professional experience in financial modelling.
Those simulations forget that these are humans and the pressure builds as the streak grows longer. The constant bombardment by the media makes these long streaks nearly impossible.
What this analysis omits is the recognition of a different era in baseball. As stated above, there were simulations which indicate consecutive game hitting streaks of 50-64 games. I would agree this record could be broken if the current brand of baseball were the same as it were in 1941 - where 1 pitcher would pitch the entire game.
I truly believe this record in unbreakable in the modern era of baseball. Today hitters face a starting pitcher for approx. the 1st 6 innings then multiple relievers to close the game. It's a large advantage for a hitter to face a pitcher - seeing the same arm angle, speeds, pitch sequences....etc. There are too many 'specialty relief pitchers' out there that are dominant against right or lefthanded batters. In todays game - the 56 game hitting streak and the batting .400 mark will not be broken due to the fact that hitters have to face constant pitching variations that hitters in previous eras of baseball did not.
Post A Comment:
STAY CONNECTED
RECENT COMMENTS
- Baseball Fan on Baseball Simulations: What this analysis omit...
- Baseball Handicappers on Baseball Simulations: Those simulations forge...
- Ajeet Khurana on Baseball Simulations: The problem with statis...
- Ross Stapleton-Gray on Baseball Simulations: I can't imagine that a ...
- Tony Talbot on Baseball Simulations: If its so common why ha...
- Jorge Rosa on Baseball Simulations: Sorry: STEPHEN Jay Goul...
- Jorge Rosa on Baseball Simulations: A must read os the subj...
- Thomas Lord on Baseball Simulations: Perhaps we celebrate D...
Thomas Lord [03.31.08 02:41 PM]
Perhaps we celebrate DiMaggio for a different reason.
Statistics capture some of the dynamics of the situation but they do not tell the story. Ok, some record was inevitable but.... why was it Joe?
Perhaps he knew a thing or two that helped him do that.
Nothing is accidental.
-t