I can’t tell you how many times I’ve been watching a game on and all of the sudden, my twitter feed blows up when something big happens, like Giancarlo Stanton hitting another 450+ ft homerun. But I missed it because I didn’t know that he was at bat. So I decided to make something that will allow me to customize my baseball viewing experience. Something that will allow me to see as much of the baseball that I want to see. I got the idea from using Dan Brooks’ (of RedZone, which switches between the games with the highest leverage index. This allowed me to see the potential of using MLB’s gameday data.

Enter the Dashboard, which allows you to customize what you want to watch the most, and automatically switches between games based on your priorities. Here are some of the things you can customize:

  • Batters: Want to see Bryce Harper, Mike Trout, Giancarlo Stanton, or even Bartolo Colon at the plate? Add it to your list, and the application will switch to their game when they come to the plate.
  • Pitchers: Don’t want to miss Jake Arrieta or Clayton Kershaw pitch? This will switch to their game while they are on the mound and go to another game on your list while their team is batting.
  • Baserunners: It doesn’t get much more exciting than when Billy Hamilton is on the bases. If the runner of your choosing is on 1st or 2nd base with the next base open, the application will make sure you see it.
  • Fantasy Teams: The three settings above allows you add all of the players on your fantasy team to track their progress.
  • Teams: Let’s say you’re a huge Braves fan and that’s mostly what you want to see. But you also want to see each of Manny Machado’s at bats. Put Machado at #1 and Atlanta at #2 and you’ll get to watch your Braves games, but the application switches to Orioles games when Machado comes to the plate. It will switch back when his at bat is over.
  • Leverage Index: If you don’t want to miss a tense moment in any game, set the LI to something like “>= 3.0″ and it will switch to any game that meets that criteria.
  • No-hitters: Don’t want to miss a potential no-hitter, but don’t think it’s important until after the 7th inning? You can set that as a priority.
  • Vin Scully: There’s only a few more months left that we get to appreciate the greatest announcer in baseball history. This setting will switch to Dodger games when they are at home, or on the road in San Diego, Anaheim and San Francisco.
  • Position Players Pitching: Who doesn’t love to see a left fielder pitch in a blowout or in the 18th inning? This setting will switch to a game if a non-pitcher is on the mound.
  • Extra Innings: Self-explanatory. This setting will switch to games that are in extra innings.
  • Replay Challenge/Review: Let’s say you are a masochist and you want to see all replay challenges. This will switch to those situations.

Now suppose none of your priority items are met, or your teams are on commercial break. In that event, the application will switch to the game with the current highest leverage index. It will keep switching between these games until any of your priority criteria are met.

In addition to the above priority items, you can also avoid watching any teams of your choosing. If you’re blacked out from seeing a certain team, you can add them to your ignore list and the application will avoid changing to their games since you won’t be able to watch them.

We all know has a delay from the actual live game to when you see it on your screen. You can adjust the delay timer setting if the games are switching too early or late. However, I recommend not changing this setting too much since it can severely alter the experience.

Finally, you can set whether or not you want the application to wait for the current at bat to finish before switching to a higher priority game, or if you want to it change immediately. The default setting is to wait. Changing the setting to switch immediately will allow you to not miss higher priority situations, but beware that you could see a lot of changing at inopportune times.

If you choose to use this, I hope you enjoy it. Let me know what you think.

Posted in Announcements, General, Site Additions | Leave a comment

Game Star Ratings

I’ve added a star rating to each game. It measures the “enjoyability” of a game based on a few different factors. There are many elements to a game that can make it enjoyable to the unbiased fan. I’ve tried to include the most important of these.

The rating system ranges from 0 stars to 5 stars and goes in increments of .25 stars. The average game will be around 2.5 stars.

Leverage Index (aLI)
The first and most important element is leverage index, which measures the importance of each situation in the game. The more crucial a moment in a game is to the outcome, the higher the leverage index will be. A leverage index of “1” is average. A leverage index of “3” is equal to three times as important as the average play. FanGraphs has a primer on LI for those interested.

In the game rating formula, I use average leverage index over the course of the entire game. I could have chosen to just go with the top X plays in a game, or the number of plays over a certain threshold, but I felt the average over the course of the game is best suited to gauge the intensity of the entire game.

Win Expectancy Change (WE+/-)
Next is change in win expectancy per play. Suppose an RBI single increases a team’s win expectancy from 55% to 65%. That would obviously be an increase of 10 percentage points. I calculate the average absolute value of WE change over the course of the entire game and use that number for my rating formula.

The average play in the average game will have a win expectancy change of about 3.3 percentage points. Bigger and more exciting plays will increase this number, while plays in blowout games will do the opposite.

Leverage index and win expectancy change very likely have a high correlation, which would cause these elements to be “double counted”. I have taken that into consideration and am fine with it since they are the most important factors in gauging a game’s intensity.

Championship Leverage Index (CLI)
Championship leverage index is similar to in-game leverage index (above), in that it gauges the importance of a single game as opposed to a single play. The game importance is measured in how much a team’s probability of winning the World Series changes in a win versus a loss.

The average game will have a CLI of 1 and is equal to the average game on opening day. In the 2nd Wild Card era (2012-present), the average game on opening day can change a team’s chances of winning the World Series by 0.59 percentage points.

The CLI used in this ratings formula is the average of the two team’s CLI for this game.

Examples: A team that is already eliminated has a 0% chance of winning the World Series. A win will not increase their chances, so their CLI will be 0. The same goes for a team that has already clinched their division. A division title ensures that a team is 1 of the 8 teams in the postseason tournament, meaning they have a 12.5% chance of winning the World Series. A win or a loss after clinching the division will not change this number. But a one-game playoff for the division (game 163) is a “win or go home” scenario and will have a CLI of around 21, since it is 21 times more important than the average game on opening day.

Comeback (CB)
The final element is comeback, which is defined as the highest win expectancy the losing team reached during the game. A comeback can range anywhere from 50 percentage points to 100 percentage points. A comeback of 100 percentage points means that the losing team had a 100% chance of winning, but still managed to lose the game. A comeback of 50 percentage points means the losing team was never able to increase their win expectancy above the 50% level at the beginning of the game and likely means the game was never much in doubt.

Formula and Weights
Each of the four elements (LI, WE+/-, CLI, CB) are individually compared to a large sample of games ranked in a percentile. These percentiles are then weighted and combined to create the star rating. The weights are:
aLI = 1.5
WE+/- = 1.5
CLI = 1
CB = 1

Example: A game has an average leverage index of 1.25, an average win expectancy change of 4.5 percentage points, a championship leverage index of 1.55, and a 85% comeback. Their percentiles and weights are:
aLI = 70 * 1.5
WE+/- = 82 * 1.5
CLI = 92 * 1
CB = 90 * 1

Their sum is 410. This number is divided by 25 and rounded to the nearest whole number. It is finally divided by 4 to give you the star rating. This game would be a 4 star game (410 / 25 = 16.4 = 16 / 4 = 4).

Elements of a game not currently included in star rating system
Individual game performances and milestones. A player hitting 4 HR in a game is exciting and uncommon and makes each of the at bats more important. A pitcher taking a no-hitter or perfect game late into the game has the same effect. These types of elements are currently not included, but are “on the table” for future versions.

Star Players
One could argue that the more superstar players in a game could make it more enjoyable. This rating system does not take the players superstar status or skill level into account.

Special Games
While Derek Jeter’s final home game was exciting in its own right, I would argue that it was even more enjoyable since it was his final game at Yankee Stadium. This rating system doesn’t take these rare situations into account.

The Home Crowd’s Enjoyment
As mentioned above, this star rating measures the enjoyment for the unbiased fan. The home crowd may have a different definition of an enjoyable game based on whether their team wins, but this system makes no such distinction.

Posted in Announcements, General, Statistical Analysis | 5 Comments

2016 Retrosheet Database

One of the best days of every offseason for a baseball nerd is Retrosheet annual end of season release day. It’s the day one of the best sites on the internet releases the play by play data from the previous season. If you’re like me, you download it immediately and go to town. But one thing I’ve always wanted was the ability to access the data during the current season.

So this past offseason, I designed a way to take mlb’s gameday data and convert it into a Chadwick-style retrosheet database. The database (.csv files) will be available and updated daily* in the downloads section. I’m making it available mainly because I know there are others out there, like me, that are interested in having an in-season pbp database. But also because I’d like to have more than one set of eyes on it, to help iron out the kinks and catch any errors.

Error Checking
I run a few processes to check for errors and to validate the data. But there is still the possibility that errors will come up from time to time. I’d like to make this a forum for error reporting, for those who are interested in helping.

Daily Download
Just as with this website, I intend to have updates available daily. Usually, the site is updated in the morning. But with a full-time job and two toddlers at home, there can sometimes be a delay.

Missing columns
There are a few columns in the events table that I have left blank:
“EVENT_TX”: It turns out that it is a huge process to replicate this. While I believe the “EVENT_TX” column is helpful in quickly identifying the play, I don’t use it in my queries and felt it wasn’t worth the hassle. The same goes for the “BAT_PLAY_TX” and the “RUNX_PLAY_TX” columns.

“BATTEDBALL_LOC_TX”: Gameday does include hit locations for all balls in play, but I have yet to dive into this data. If there is someone who has experience with this data and is willing to assist in converting gameday’s x and y coordinates to Project Scoresheet locations codes, please let me know.

“UMP_ID”: These columns for the six umpires are currently left blank.

“GWRBI_BAT_ID”: This is left blank because game winning RBI’s are no longer officially recorded.

ID’s for players making their Major League debut
Since these players have yet to be assigned official ID’s by retrosheet, I just give them the next available ID for their name. For example, if a John Smith were to make a debut, he would be assigned ID “smitj005″, since 005 would be next in line.

Building a Retrosheet Database
For those who are interested in using the data, but lack experience, David Temple at TechGraphs recently created a helpful two part tutorial.

If you find this data useful and have some disposable income, please consider donating. I do not get paid for my work on this website and while it is my passion to work with baseball data, it does take a lot of time and money (server costs) to keep it up. I’d like to also suggest donating to David Smith and the Retrosheet team.

Posted in Announcements, General, Site Additions | Leave a comment

2011 Royals Farm System

The Royals are on the brink of winning their first World Series in 30 years, leading the Mets three games to one. While it’s certainly not over (5 of 43 teams have come back from being down 3-1 in the World Series), I figured it was as good of a time as any to write about the Royals 2011 Farm System.

Compilation of Royals Top Prospect Rankings
Prospect Pos FG BA BP Sickels Avg
Mike Moustakas 3B 1 3 1 1 1.5
Eric Hosmer 1B 2 1 3 2 2.0
Wil Myers OF 3 2 4 3 3.0
John Lamb LHP 5 4 2 6 4.3
Mike Montgomery LHP 4 5 5 5 4.8
Danny Duffy LHP 7 7 7 4 6.3
Chris Dwyer LHP 9 8 6 9 8.0
Christian Colon SS 8 6 8 11 8.3
Jeremy Jeffress RHP 11 8 9.5
Brett Eibner OF 12 10 14 10 11.5
Tim Collins LHP 13 13 10 16 13.0
Aaron Crow RHP 14 9 16 14 13.3
Tim Melville RHP 15 14 15 14.7
Johnny Giavotella 2B 21 18 9 12 15.0
Yordano Ventura RHP 10 12 13 27 15.5
Cheslor Cuthbert 3B 17 15 12 19 15.8
Louis Coleman RHP 20 19 17 13 17.3
Jason Adam RHP 16 11 15 28 17.5
Robinson Yambati RHP 19 16 11 26 18.0
Salvador Perez C 18 17 20 18 18.3
Patrick Keating RHP 22 22 17 20.3
Will Smith LHP 19 25 22.0
Derrick Robinson OF 23 26 18 22.3
Jarrod Dyson OF 26 20 23.0
Kevin Chapman LHP 23 23.0
David Lough OF 24 25 22 23.7
Jeff Bianchi SS 30 21 21 24.0
Orlando Calixte SS 24 24.0
Clint Robinson 1B 28 20 24.0
Buddy Baumann LHP 24 24.0
Noah Arguelles LHP 25 25.0
Humberto Arteaga SS 27 23 25.0
Henry Barrera RHP 27 27.0
Crawford Simmons LHP 28 28.0
Lucas May C 29 29.0
Elisaul Pimentel RHP 29 29.0
Kelvin Herrera RHP 30 30.0
Greg Holland RHP

Below is a list of the top ten farm systems in 2011, using five different rankings from the industry. The first three were a consensus among most of the publications, with Kansas City topping the list in each one.

Compilation of Farm System Rankings
Rank Team BA BP Law Sickels THT Avg
1 Royals 1 1 1 1 1 1.0
2 Rays 2 2 2 2 2 2.0
3 Braves 3 3 3 4 3 3.2
4 Blue Jays 4 5 4 5 4 4.4
5 Yankees 5 4 9 6 14 7.6
6 Reds 6 9 8 7 11 8.2
7 Indians 7 7 17 3 9 8.6
8 Angels 15 6 6 8 8 8.6
9 Phillies 10 8 5 11 12 9.2
10 Twins 12 15 7 9 6 9.8

Rankings alone won’t do the praise for this system justice, so here are some of the comments coming from those who ranked them:

Kevin Goldstein at Baseball Prospectus:

This is not just the best minor-league system in baseball, it’s the best by a wide margin. The more I wrote about these prospects, the more trouble I had figuring out any way for things to go wrong. Another winning record could occur as early as 2012, but more importantly, the team should return to annual playoff contention shortly thereafter.

Keith Law at ESPN:

The phrase “Mission Accomplished” has acquired an ironic connotation of late, but if anyone could use the phrase earnestly to describe his own efforts, it would be [Dayton] Moore, as the Royals have arms coming out of their ears.

That’s particularly impressive when you consider that Kansas City’s top two prospects are bats, and there are some solid position player prospects further down in the system.

Jim Callis at Baseball America:

The Royals set a record by placing nine players on our Top 100 Prospects list, starting with three of the very best hitting prospects in the minors in 1B Eric Hosmer, 3B Mike Moustakas and OF Wil Myers. They also have an enviable collection of lefthanders, led by John Lamb, Mike Montgomery, Danny Duffy and Chris Dwyer.

John Sickels at

What can you say? This is one hell of a farm system. While the young pitching gets a large amount of attention, and deservedly so, the Royals also have three of the most elite young bats in baseball in the Moustakas/Hosmer/Myers troika.

Matt Hagen at The Hardball Times:

Kansas City has a plethora of top-end impact talent and loads of depth throughout. The best system in baseball and a reason to follow America’s pastime for long-suffering Royal fans.

It’s difficult to find one negative comment about this farm system. It contained high impact talent AND depth at almost all positions. Years of futility earned the Royals multiple early round draft picks. From 2005-2010, they had a top five pick on five different occasions. Additionally, Kansas City was active on the international front, signing players out of the Dominican (Kelvin Herrera & Yordano Ventura), Venezuela (Salvador Perez), and even Nicaragua (Cheslor Cuthbert).

In 2011, Doug Gray at assigned a monetary value to every prospect in baseball based on John Sickel’s grading system. He estimated the Kansas City farm system to be worth $243 million while the next best team (Tampa Bay) was worth $184 million. Here is a graph Doug provided, showing all farm systems in 2011:
2011 Farm System dollar values

But having a top farm system has never guaranteed success. Scott McKinney at Royals Review studied prospect success and failure rates and determined that 70% of Baseball America Top 100 prospects are failures. In August of this year, Alex Speier wrote in the Boston Globe:

Remarkably, none of the last 14 organizations to be designated with the top farm system by Baseball America has won a World Series since receiving that accolade. The last team to hoist a championship trophy following a top farm system ranking was the 2005 White Sox, four years after they’d been named the top farm system in 2001.

Of course, this will change if the Royals can win just one of the next three World Series games.

So what does a team with the best farm system need to do to reach the next step?
First, they need to continue to develop these players, as none of them are finished products.
The Royals did just that, as a good number of their prospects reached the big league level. Of course, not all of them have reached their potential, but that is expected.

Next, they need to surround this core with complimentary players.
Whether it is from outside the organization through free agency or via trade, Dayton Moore may have done his best work in this aspect. He parlayed Wil Myers, Jake Odorizzi & Mike Montgomery into James Shields and more importantly, Wade Davis. He also signed Chris Young, Edinson Volquez, Kendrys Morales & Ryan Madson to fill out the roster. Finally, at the trade deadline this year, Moore traded prospects to acquire the final pieces to the puzzle in Johnny Cueto and Ben Zobrist.

Finally, they need luck.
Because sometimes no matter how hard you try, things just don’t go as planned. On the other hand, there’s always a chance to find that diamond in the rough that you weren’t expecting. As Branch Rickey said, “Luck is the residue of design.” It’s hard to tell how much of the Royals success is luck, but by putting the organization in the best situation possible, they have been in position to capitalize on many breaks.

We can now look back retroactively at the 2011 farm system to see where it ranks among the 30 teams in terms of wins above replacement. Granted, it is far too early to make a final judgement on these systems as most of the players are still beginning their careers. Below is a table showing how many wins above replacement each team’s 2011 farm system has produced, along with their winning percentages in subsequent seasons. The top prospect is the player with the most WAR in that farm system.

# Teams WAR 2011 2012 2013 2014 2015 Top Prospect
1 DBacks 100.4 .580 .500 .500 .395 .488 Paul Goldschmidt
2 Royals 91.4 .438 .444 .531 .549 .586 Salvador Perez
3 Angels 88.2 .531 .549 .481 .605 .525 Mike Trout
4 Braves 85.3 .549 .580 .593 .488 .414 Andrelton Simmons
5 Cardinals 74.9 .556 .543 .599 .556 .617 Matt Carpenter
6 Rays 70.6 .562 .556 .564 .475 .494 Desmond Jennings
7 Reds 70.3 .488 .599 .556 .469 .395 Todd Frazier
8 Indians 63.9 .494 .420 .568 .525 .503 Jason Kipnis
9 Pirates 61.6 .444 .488 .580 .543 .605 Starling Marte
10 Nationals 60.6 .497 .605 .531 .593 .512 Bryce Harper
11 Blue Jays 59.2 .500 .451 .457 .512 .574 Brett Lawrie
12 Mets 58.1 .475 .457 .457 .488 .556 Matt Harvey
13 White Sox 56.8 .488 .525 .389 .451 .469 Chris Sale
14 Astros 56.3 .346 .340 .315 .432 .531 Jose Altuve
15 Mariners 50.7 .414 .463 .438 .537 .469 Kyle Seager
16 Athletics 49.8 .457 .580 .593 .543 .420 Josh Donaldson
17 Twins 46.8 .389 .407 .407 .432 .512 Brian Dozier
18 Dodgers 42.7 .509 .531 .568 .580 .568 Kenley Jansen
19 Yankees 42.2 .599 .586 .525 .519 .537 Jose Quintana
20 Padres 42.2 .438 .469 .469 .475 .457 Anthony Rizzo
21 Rockies 41.5 .451 .395 .457 .407 .420 Nolan Arenado
22 Orioles 40.5 .426 .574 .525 .593 .500 Manny Machado
23 Red Sox 40.0 .556 .426 .599 .438 .481 Josh Reddick
24 Marlins 38.1 .444 .426 .383 .475 .438 Christian Yelich
25 Giants 36.5 .531 .580 .469 .543 .519 Brandon Crawford
26 Cubs 33.4 .438 .377 .407 .451 .599 Welington Castillo
27 Brewers 26.7 .593 .512 .457 .506 .420 Mike Fiers
28 Tigers 24.4 .586 .543 .574 .556 .460 Drew Smyly
29 Phillies 23.6 .630 .500 .451 .451 .389 Jarred Cosart
30 Rangers 23.4 .593 .574 .558 .414 .543 Pedro Strop

*Note: I have only included players with positive career WAR in these totals.

Surprisingly, the Diamondbacks have accumulated the most WAR of any team from the 2011 prospect class. However, Kansas City is not far behind in second place. Whether or not this farm system turns out to be among the all-time greats remains to be seen. But if they end up winning the World Series in the next few nights, it will be impossible not to deem it a success.

Thanks to Hawkins DuBois for helping out with prospect lists.

Posted in General, Historical, Statistical Analysis | Leave a comment

Top Plays of the 2015 Postseason (so far)

During the postseason, I have been making series win probability charts. They’re available on the front page of the site and regularly on Twitter. I’ve also been calculating championship win probability (cWPA) added for all players. This is similar to single-game win probability added, but in the context of a postseason series. cWPA takes into account the level of postseason series. Each series is twice as important as the previous level. For example, the average play in the League Championship Series is twice as important as the average play in the Division Series. On the extreme ends, the World Series is eight times as important as the Wild Card game.

What follows are the top eleven plays during the 2015 postseason, according to championship win probability added. Note: .077 cWPA can be interpreted as increasing a team’s probability of winning the World Series by 7.7 percentage points.

1) Jose Bautista homers off Sam Dyson (.077 cWPA)

Game Five ALDS
Toronto’s ALDS win probability went from 64% to 94% on this swing. Whether you agree with the bat flip or not, you can’t deny the impact of the play. According to championship win probability added, it was the second biggest homerun by a Blue Jay in postseason history. Of course, the biggest was Joe Carter’s walk-off in Game Six of the 1993 World Series, which was worth .300 cWPA.

2) Jose Bautista homers off of Ryan Madson (.063 cWPA)

Game Six ALCS
Bautista lacks no flare for the dramatic. This homerun increased the Blue Jays series win probability by 13 percentage points (5% to 18%). While his homerun off Dyson a week earlier was a 30 point increase in series win probability, this one came in the ALCS, a round twice as important as the ALDS, making the two very comparable in value.

3) Luis Valbuena homers off of Johnny Cueto (.058 cWPA)

Game Five ALDS
The two runs from Valbuena’s shot were the first on the board in the deciding Game Five and they increased Houston’s series win probability from 42%-65%. Unfortunately for Valbuena and the Astros, they were unable to hold the lead.

4) Daniel Murphy homers off of Zack Greinke (.051 cWPA)

Game Five NLDS
This wasn’t the most memorable play of the series. That goes to the Chase Utley slide. It’s probably not even the most memorable of the game. That would be Murphy’s stolen base after the Lucas Duda walk. However, it did have the most impact. It increased New York’s series win probability from 41% to 62%, giving the Mets a lead they wouldn’t relinquish.

5) Edwin Encarnacion homers off of Cole Hamels (.051 cWPA)

Game Five ALDS
Encarnacion’s tied the game, albeit briefly, and increased Toronto’s series win probability from 39% to 60%.

6) Wade Davis strikes out Ben Revere (.048 cWPA)

Game Six ALCS
The biggest non-homerun of the postseason. Revere struck out in just 10% of his plate appearances during the regular season, but Davis was able to get him swinging with runners at 2nd and 3rd with one out in the 9th. However, there was an obviously questionable strike two call that should have been ball three. The strikeout increased the Royals series win probability from 83% to 92%.

7) Carlos Correa error, scoring two runs (.047 cWPA)

Game Four ALDS
This is an even bigger play when you consider the alternative, a double play that scores just one run. Instead, this play increased the Royals series win probability from 22% to 41%. Correa had been having a tremendous game, with two homeruns earlier. It should be noted that the ball did skip off of Tony Sipp’s glove, making for a very unusual hop for Correa.

8) Rougned Odor scores on Russell Martin’s error (.047 cWPA)

Game Five ALDS
The most bizarre play of the 2015 postseason, very unlikely to be topped. Choo’s bat, Martin’s misfortune and Odor’s heads-up base running increased the Rangers series win probability from 43% to 61%.

9) Eric Hosmer drives in Lorenzo Cain from first base (.045 cWPA)

Game Six ALCS
Multiple people deserve credit for this play. Hosmer for the single, Cain for the amazing hustle, Bautista for not hitting the cutoff man, and third base coach Mike Jirschele for the awareness to send Cain after Bautista threw to Tulowitzki. All-in-all, the play increased the Royals series win probability from 87% to 96%.

10) Daniel Murphy’s RBI double off of Zack Greinke (.042 cWPA)

Game Five NLDS
New York got an early run in the first inning on Murphy’s double. Kiké Hernandez had trouble picking up the ball off the wall, enabling Murphy to take third base. Murphy should be given credit for his hustle and not just coasting into second base. This play increased the Mets series win probability from 44% to 61%.

11) Javier Baez homers off of John Lackey (.042 cWPA)

Game Four NLDS
Baez, who was starting in place of the injured Addison Russell, gave the Cubs a lead on the first pitch from John Lackey. Baez was batting ninth, behind the pitcher Jason Hammel, who extended the inning on a single to center field. The homerun increased the Cubs series win probability from 70% to 84%.

It should be noted there are sure to be bigger plays in the World Series, given the bigger importance in the postseason’s highest level. In the 2014 postseason, the top 18 plays (according to cWPA) came in the World Series.

Posted in General, Statistical Analysis | Leave a comment

Greatest Postseason Comebacks

The Houston Astros and Kansas City Royals are headed for a decisive Game Five tonight. On Monday in Game Four, the Astros held a 6-2 lead and were just six outs from advancing to the ALCS. With one out in the bottom of the seventh, the Royals had a 0.8% series win expectancy. If Kansas City wins tonight, their comeback will go down as one of the greatest in postseason history. If they lose, it will be all for naught.

The following are the fifteen biggest comebacks in postseason history according to series win expectancy. These are comebacks from the brink of elimination, so you will not see single game comebacks such as Game Four of the 1929 World Series when the Athletics scored ten runs in the seventh inning after being down 8-0. This is because Philadelphia was not close to elimination.

15) 2012 NLDS

Giants defeat Reds (7.0% series win expectancy)

In 2012, MLB added a second wild card. To fit the wild card game in between the end of the regular season and the beginning of the division series, they changed the format of the division series from 2-2-1 to 2-3 in order to eliminate an extra travel day. This made it more difficult for the Giants, who lost both games at home and would have to win the final three in Cincinnati.

Low Point:
During the second inning of Game Three, the Reds held a 1-0 lead. Cincinnati was just seven innings from completing the sweep against the Giants.

The Outcome:
The Giants tied the game and that score remained until extra innings when the Reds scored on an error to force a fourth game. San Francisco never trailed during Game Four and the big blow came in the fifth inning of Game Five on a Buster Posey grand slam off of Mat Latos.

14) 1992 NLCS

Braves defeat Pirates (6.6% series win expectancy)

This was essentially the Pirates last opportunity at a World Championship. The nucleus of the team that lost both the 1990 and 1991 championship series was breaking apart. Barry Bonds and Doug Drabek were all but gone at the end of the season because the Pirates wouldn’t be able to compete with large market clubs in free agency.

While the Braves, with one of the best farm systems in baseball and the backing of billionaire owner Ted Turner, were looking for their second straight World Series appearance.

Low Point:
Going into the bottom of the ninth during Game Seven, Atlanta was down 2-0. They were facing Doug Drabek, who was just three outs away from a complete game shutout.

The Outcome:
The Braves quickly loaded the bases on a double, error, and a base on balls. Pirates manager Jim Leyland brought in his closer, Stan Belinda, in hopes of stopping the rally. Atlanta scored their first run on a sacrifice fly and reloaded the bases on a Damon Berryhill walk. A popout by pinch hitter Brian Hunter gave the Pirates their first out. With the pitcher’s spot due up, Braves manager Bobby Cox went with Francisco Cabrera, who had just eleven plate appearances all season.

In the outfield, Pirates center fielder Andy Van Slyke motioned to left fielder Barry Bonds to move in a few steps. Bonds did not oblige and it proved to be costly. Cabrera singled to left, easily scoring David Justice. Sid Bream rounded third as Bonds fielded the ball and threw home. The throw was a little off-line and Bream was safe by inches. Had Bonds moved in, most baseball fans might not be able to recognize Francisco Cabrera’s name.

13) 2003 ALCS

Yankees defeat Red Sox (6.0% series win expectancy)

It had been 85 years since the Red Sox last won a World Series. During that time, the rival Yankees had won 26 World Championships. It was known as “The Curse of the Bambino” and the Red Sox were going to have to get through their rival to end the suffering. The series was everything that Fox could have dreamed for and more. The two biggest markets, literally fighting their way to the World Series.

Low Point:
In the bottom of the eight in Game Seven, the Yankees were down 5-2 with just five outs remaining. On the mound for Boston was Pedro Martinez, who was going through arguably the most dominant stretch by any pitcher in baseball history.

The Outcome:
After the Yankees quickly scored their third run, Red Sox manager Grady Little went to the mound, but ultimately chose to stick with Martinez, rather than go with late inning relievers Mike Timlin or Alan Embree. The decision would ultimately be the defining moment in Little’s career as the Yankees tied the score just two batters later.

The game went into the 11th when Yankees third basemen Aaron Boone hit just the second pennant-clinching walk-off homerun in LCS history.

12) 1960 World Series

Pirates defeat Yankees (5.9% series win expectancy)

This was the classic David vs Goliath matchup. The Yankees were in their eleventh World Series in the last fourteen years while the Pirates were the champions of the senior circuit for the first time in 33 years. While the two teams split the first six games, the Yankees had outscored the Pirates 46-17.

Low Point:
In the top of the eighth in Game Seven, the Yankees scored two runs to increase their lead to 7-4. The Pirates had just six outs remaining vs Bobby Shantz.

The Outcome:
The bottom of the eighth began with three straight singles, which cut the lead to two. Yankees manager Casey Stengel replaced Shantz with Jim Coates, who then retired the first two batters he faced. Roberto Clemente followed with a single that cut the lead to one. That brought up backup catcher Hal Smith, who entered the game in the sixth inning. At the beginning of Smith’s plate appearance, the Pirates had a series win probability of 29%. Five pitches later, Smith hit the biggest series changing homerun in baseball history. The three-run shot gave the Pirates a 9-7 lead and a series win probability of 94%.

Unfortunately for Hal Smith and the Pirates, the Yankees scored two in the top of the ninth to tie the game. What followed would be the first World Series clinching Walk-Off homerun in history. The highlight of Bill Mazeroski dodging Pirates fans while waving his helmet around would be seen countless times by baseball fans. All while Hal Smith’s contribution would go largely forgotten.

11) 1995 ALDS

Mariners defeat Yankees (5.7% series win expectancy)

The Mariners shouldn’t have been in the playoffs. On August 15th, they were 12.5 games behind the division leading Angels and had a 0.4% chance of winning the division. Over the final month and a half of the regular season, they battle back and won a one-game tiebreaker.

Low Point:
During the third inning of Game Four, the Yankees were up 5-0 and just six innings away from advancing to the ALCS.

The Outcome:
Seattle quickly scored four runs in the fourth inning and eventually took the lead for good in the eighth inning of Game Four, forcing a deciding Game Five.

Through seven innings of Game Five, New York held a 4-2 lead. Their starter David Cone, had thrown 118 pitches and went back to the mound to face the heart of the Mariners order in the eighth. A Ken Griffey Jr solo homerun and a Doug Strange bases loaded walk tied the game.

Randy Johnson, who had thrown seven innings just two days before, came in the ninth inning to stop a Yankees rally and then struck out the side in the tenth. The Yankees would finally get a run across on him in the eleventh, moving just three outs away from a series victory.

The bottom of the eleventh would go on to be arguably the greatest moment in Mariners history. Facing New York’s ace, Jack McDowell, Seattle would get back-to-back singles by Joey Cora and Ken Griffey Jr. Designated Hitter Edgar Martinez would follow with a two-run double down the left field line as Griffey came around to score the winning run. The 1995 Mariners created so much excitement in Seattle, they are often credited for saving baseball in the city, as they would soon get funding for what would become Safeco Field.

10) 2012 NLDS

Cardinals defeat Nationals (3.7% series win expectancy)

After so many years in the basement of the National League East, Washington was finally benefiting from multiple early first round draft picks. During the regular season, they won a franchise record 98 games. The Cardinals were defending World Series champions and in their ninth postseason in the previous thirteen years.

Low Point:
The two clubs split the first four games, going into a deciding Game Five in Washington. The Nationals jumped out to a 6-0 lead, hitting three homeruns in the first three innings on Cardinals ace Adam Wainwright.

The Outcome:
St. Louis slowly chipped away, scattering five runs in the next five innings. They would go into the top of the ninth down by two, facing the Nationals closer Drew Storen.

A double and two walks loaded the bases with two outs. Daniel Descalso singled in two runs to tie the score and Pete Kozma singled in two more to take a 9-7 lead. Cardinals closer Jason Motte provided a perfect bottom of the ninth to advance to the NLCS.

9) 1980 NLCS

Phillies defeat Astros (3.6% series win expectancy)

The Phillies, who have been in existence since 1883, were still looking for their first World Series championship. They had lost three of the last four NLCS.

Low Point:
During Game Four, in the bottom of the sixth inning, the Astros were leading 2-0 and had just loaded the bases with just one out. Houston was just three innings away from appearing in their first World Series.

The Outcome:
With right-handed batter Luis Pujols coming up with the bases loaded, Phillies manager Dallas Green chose to replace his ace Steve Carlton with right-handed reliever Dickie Noles. Pujols then flew out to rightfielder Bake McBride, scoring Gary Woods from third on the sacrifice fly. However, several Phillies noticed that Woods left the base before McBride made the catch. An appeal followed and Woods was declared out and the run was taken off the board, ending the inning.

Philadelphia then scored three runs in the eighth before allowing Houston to tie it in the ninth, forcing extra innings for the third straight game. In the tenth, two runs on a Pete Rose single, and doubles by Greg Luzinski and Manny Trillo put the Phillies up for good.

Game Five would be the fourth straight extra inning game. There were four lead changes with each team having at least a 90% probability of winning before blowing their lead. The 1980 NLCS would go down as one of the most exciting postseason series in baseball history.

8) 2014 AL Wild Card

Royals defeat Athletics (3.2% series win expectancy)

The Kansas City Royals had not appeared in the postseason since 1985

Low Point:
Through seven innings, Oakland was ahead 7-3 with their ace Jon Lester on the mound.

The Outcome:
In the eighth, the Royals cut the deficit to one run by scoring three while stealing four bases on Oakland’s backup catcher Derek Norris, Jon Lester, and Luke Gregerson (who replaced Lester midway through the inning).

In the bottom of the ninth, the Royals manufactured the tying run with a Josh Willingham single, another stolen base by pinch runner Jarrod Dyson, and a sacrifice fly by Nori Aoki. The game would remained tied until the twelfth when Oakland took another lead with a walk, wild pitch, and single. Kansas City would have to mount yet another come back. A Christian Colon single scored Eric Hosmer (who tripled) to tie the game. Colon stole the Royals seventh base of the game to get into scoring position. Finally, catcher Salvador Perez singled in the winning run down the left field line, advancing Kansas City to the division series.

For a much more detailed account, you’ll certainly enjoy Andy McCullough’s The night Kansas City baseball came back to life.

7) 1968 World Series

Tigers defeat Cardinals (2.8% series win expectancy)

1968 was the year of the pitcher. Cardinals ace Bob Gibson posted a ridiculous 1.12 ERA while Tigers hurler Denny McLain became the first 30 game winner since Dizzy Dean in 1934.

Low Point:
St. Louis took three of the first four games of the series. In the fourth inning of Game Five, they held a 3-0 lead and had two runners aboard with just one out.

The Outcome:
On the mound for Detroit was Mickey Lolich, who would throw three complete games during the series and earn MVP honors. Lolich would quickly stop the threat in the fourth inducing a flyout and a strikeout. In the bottom half, the Tigers cut the lead to one with two triples, a sacrifice fly and single. They would take the lead for good in the seventh and force a Game Six (which was a 13-1 Tigers blowout).

Game Seven looked like a classic pitchers duel between Lolich and Gibson, with each club scoreless through six innings. In the top of the seventh, Gibson got two quick outs before Detroit started to rally. Norm Cash and Willie Horton each singled, but the big blow came on Jim Northrup’s two-run triple. Detroit held on to the lead and won their first World Series since 1945.

6) 2003 NLCS

Marlins defeat Cubs (2.2% series win expectancy)

The Cubs had not won a World Series since 1908 and had not won a pennant since 1945, while the Marlins were looking for their second World Series appearance in their eleventh season.

Low Point:
In the top of the eighth in Game Six, the Cubs held a 3-0 lead and were just five outs away from returning to the World Series.

The Outcome:
After getting the first out of the inning, Cubs pitcher Mark Prior allowed a double to Marlins leadoff hitter Juan Pierre. The next batter, Luis Castillo lofted a flyball down the leftfield line in foul territory. In what would forever be known as the “Steve Bartman Incident”, Cubs leftfielder Moises Alou leaped at the wall, only to have the ball deflected by a fan going for the souvenir. Alou was visibly upset because it would extend the at bat, which turned into a base on balls. The next batter, Ivan Rodriguez singled in Pierre, cutting the lead to two runs.

The next play is the most pivotal of the series, and should be more infamous than the Steve Bartman interference. Marlins rookie Miguel Cabrera hit a routine groundball to shortstop Alex Gonzalez. It’s possible the Cubs could have turned an inning ending double play. At the least, they could settle for one out. But Gonzalez misplayed the ball, and the error loaded the bases with just one out. Seven more runs would score in the inning and the Marlins would live to see another day.

Game Seven got off to a poor start for the “North Siders” as Miguel Cabrera hit a three-run homerun off Kerry Wood. But that lead wouldn’t last long as the Cubs scored three in the second to tie it and another two in the third inning to take the lead. The Cubs were now eighteen outs away from the World Series. Whether it was the “Curse of the Billy Goat“, bad luck or poor performance, the Marlins re-took the lead for good in the fifth inning, advancing to the Fall Classic.

5) 2011 World Series

Cardinals defeat Rangers (2.1% series win expectancy)

The Rangers were back in the World Series after losing in 2010 to the Giants. The Cardinals were in their eighth postseason in the previous twelve years.

Low Point:
In the bottom of the ninth inning of Game Six, the Rangers led 7-5 and were just two outs away from a World Series championship.

The Outcome:
Facing Texas closer Neftali Feliz, Albert Pujols doubled into the left-centerfield gap. The next batter, Lance Berkman drew a four-pitch walk, putting the tying run on base. Feliz got Allen Craig to strikeout for the second out of the inning. The Rangers outfield was playing deep in hopes to limit any hit to a single. Unfortunately, rightfielder Nelson Cruz was not playing deep enough as David Freese lined a ball just out of his reach and off the wall, bringing in both runs and to tie the score.

The Cardinals “momentum”, however, was short lived as in the top of the tenth inning, Josh Hamilton hit a two-run homerun off of Cardinals closer Jason Motte. Surely this would be enough to give the Rangers their first World Championship. But in the bottom half, the Cardinals manufactured two runs on three singles, a sacrifice bunt and an RBI groundout. The game was yet again tied.

After Jake Westbrook held the Rangers scoreless in the top of the eleventh, ninth inning hero David Freese came to plate. On a full-count and the seventh pitch of the at bat, Freese hit a walk-off homerun to dead center, forcing a Game Seven. As the ball cleared the fence, play-by-play announcer Joe Buck told the audience “We will see you tomorrow night”. This was a nod to his late father, who twenty years earlier in the 1991 World Series used the same line when Kirby Puckett hit a walk-off homerun.

In Game Seven, the Rangers once again took a two-run lead in the first inning on doubles by Josh Hamilton and Michael Young. The lead would be short-lived as the Cardinals answered back with two of their own in the bottom of the first. In the second, David Freese added to his story with a two-run double. This is all the Cardinals would need as they held the Rangers scoreless the rest of the way.

4) 2004 ALCS

Red Sox defeat Yankees (1.8% series win expectancy)

After the disappointment of losing the 2003 ALCS, the Red Sox were facing the Yankees once again a year later.

Low Point:
This time around, the Yankees took the first three games and looked to be well on their way to another American League pennant. No team in baseball history had ever come back from being down 3-0 in a seven game series.

In the top of the ninth in Game Four, the Yankees held a 4-3 lead as Derek Jeter led off with a walk.

The Outcome:
Closer Keith Foulke, in his second inning of work, quickly retired the side on a groundout, strikeout and popup. Just after midnight in the bottom half of the inning, with three outs remaining, Kevin Millar drew a walk against future Hall of Famer Mariano Rivera. With the tying run on base, Boston manager Terry Francona replaced Millar with pinch runner Dave Roberts. With Bill Mueller at the plate, Roberts drew three straight pickoff throws from Rivera. But on the first pitch of the at bat, Roberts took off and slid into second base, just ahead of the throw from Jorge Posada. It was the biggest stolen base in Red Sox history. The next pitch, Mueller showed bunt but wisely pulled back. On the third pitch, Mueller shot a groundball into center to tie the score. The game would go into the twelfth inning, when David Ortiz hit a two-run walk-off homerun off of Paul Quantrill.

Game Five would last fourteen innings and ended on another walk-off hit by David Ortiz. This time it was an RBI single scoring Johnny Damon.

Game Six was back in New York and was known as the “Bloody Sock Game”. Red Sox starter Curt Schilling would need a tendon in his right ankle temporarily sewn in place, allowing him to pitch. Schilling went seven innings and allowed one run, earning the win and forcing the series to to a seventh game.

Game Seven was the least competitive of all the games as Boston jumped out to an early 6-0 lead, capped off by a Johnny Damon grand slam in the second inning. The Red Sox became the first team in history to win the final four games after losing the first three.

3) 2002 World Series

Angels defeat Giants (1.5% series win expectancy)

The Anaheim Angels were in the World Series for the first time in their 42nd year of existence. The Giants were looking for their first World Series championship since their New York days in 1954.

Low Point:
In the bottom of the seventh inning in Game Six, the Giants held a 5-0 lead, just eight outs away from winning the World Series.

The Outcome:
With Giants starter Russ Ortiz still on the mound in the seventh, the Angels got two runners aboard on singles by Troy Glaus and Brad Fuller. With thunder sticks banging and the Rally Monkey jumping, Scott Speizio took Giants reliever Felix Rodriguez deep to cut the lead to two runs. It would take the Giants two more relievers to stop the rally and keep the two-run lead.

In the eighth, lead-off hitter Darin Erstad cut the lead to one run with a solo shot off of Tim Worrell. That was followed by Tim Salmon and Garret Anderson singles, with still nobody out. Giants manager Dusty Baker then decided to bring in his closer Robb Nen to stop the bleeding. However, Troy Glaus, the first batter to face Nen, drove in the tying and go-ahead runs on a double. The Angels lead would stand as they forced a final game.

San Francisco got out to an early Game Seven lead in the second inning on a sacrifice fly by Reggie Sanders. This would not last long as the Bengie Molina would tie the score in the next half with an RBI double. A bases-clearing double by Garret Anderson in the third innging brought the score to 4-1 and the Angels the first World Series championship.

2) 1986 ALCS

Red Sox defeat Angels (0.8% series win expectancy)

It had been 68 years since the Red Sox had last won a World Series and their last appearance in the fall classic in 1975 would end in heartbreak. The Angels were looking for their first World Series appearance in their 26 year history.

Low Point:
California took three of the first four games and held a 5-2 lead through eight innings of Game Five. They were just three outs away from going to the World Series.

The Outcome:
In the top of the ninth, facing Angels starter Mike Witt, had one aboard on a Bill Buckner single and two outs remaining. Red Sox designated hitter and former Angels MVP Don Baylor cut the lead to one on a two-run homerun. Witt would get the next batter, Dwight Evans, to pop up for the second out. Angels manager Gene Mauch finally replaced Witt with reliever Gary Lucas, who would hit Rich Gedman on the first pitch. With the tying run aboard, Mauch brought in his closer Donnie Moore to face Dave Henderson. Henderson deposited the seventh pitch over the left field wall to take the lead. As a child, I thought that Henderson jumped six feet into the air as he watched the ball clear the fence. While his vertical leap is not 72 inches, he did get some major hang time.

What is often forgotten in this game is the Angels tied the score in the bottom of the ninth on a Rob Wilfong RBI single. The score would remained tied until the eleventh inning when Dave Henderson would once again come up to face Donnie Moore, this time with the bases loaded and no outs. Henderson drove in another run on a sacrifice fly to center fielder Gary Pettis. This lead would hold and Boston would live to see another day.

Boston easily won Game Six 10-4 and Game Seven 8-1 to get another shot at the World Series.

1) 1986 World Series

Mets defeat Red Sox (0.6% series win expectancy)

After Boston came back from having a 0.8% win expectancy in the ALCS, they were set to face the 108-win New York Mets in the World Series.

Low Point:
In the bottom of the tenth inning of Game Six, the Red Sox held a 5-3 lead with just two outs and the bases empty.

The Outcome:
The Mets’ last hope, catcher Gary Carter, singled to left. The pitcher’s spot was up next, so Mets manager Davey Johnson called on pinch hitter Kevin Mitchell, who lined the next pitch into center, moving Carter to second base. Ray Knight, the potential go-ahead run, lined an 0-2 pitch to right-center, scoring Carter and cutting Boston’s lead to one run.

Red Sox manager John McNamara chose to replace Schiraldi with Bob Stanley, who would uncork a wild pitch on the eighth pitch vs Mookie Wilson. This tied the game as Kevin Mitchell crossed the plate, while Ray Knight moved to second base. The next play would be the defining moment in first basemen Bill Buckner’s career. Mookie Wilson grounded the next pitch down the first base line in what looked to be a routine groundout to end the inning. Unfortunately for Buckner and the Red Sox, the ball scooted through his legs as Knight came around to score the winning run.

Boston took a three-run lead in the third inning of Game Seven, on solo homeruns by Dwight Evans and Rich Gedman and an RBI single by Wade Boggs. That lead lasted until the sixth when the Mets scored three of their own to tie the game. New York would score three more an inning later and that would be all they would need for the first World Series championship since 1969.

Posted in General, Historical, Statistical Analysis | Leave a comment

Season Similarity Scores

While Bill James was writing his Abstracts in the 1980’s, he came up with Similarity Scores, which puts a number on how similar the statistics are from two different players. Scores can range from 1000 for the most identical players to 0 for the most dissimilar. Consider that two players that are very much alike in Andre Thornton and John Mayberry Sr, who have a similarity score of 974 while Neifi Perez and Babe Ruth have a similarity score of 0.

I decided to take this concept and apply it to the winning percentages of teams in a single season and compare it to teams in other seasons. The method is very simple:
Find the difference between each team’s winning percentage from the two seasons being compared. For example, the New York Yankees in 2015 had a winning percentage of .537 while the Yankees of 1927 had a winning percentage of .714. The difference is .177 (.714-.537). Do this for all teams and find the average. Then multiply that average by 1000 and subtract it from 1000. Two identical seasons would have a score of 1000.

When comparing years with expansion teams (1961-present) to years before expansion (1901-1960), I only included teams that were active during the two seasons. So when comparing 2015 to 1915, only 16 teams will be included in the comparison since 14 teams were not active in 1915.

I was most curious to see which season is the most similar to 2015. Surprisingly (I guess any season before 2010 would surprise me), the most similar season to 2015 is 1945, which had a similarity score of 966. In fact, this is the second closest match (highest score) between two seasons in the entire history of baseball.

2015 vs 1945 (Similarity Score: 966):


Of course, when comparing 2015 to 1945, we are only considering the winning percentages of 16 of the 30 teams. So if we limit our search to the 30-team era (1998-2015), we’ll find that the most similar season to 2015 is …… 2014. Kind of boring, I know. But it’s natural that the most similar seasons are sequential since that is when team’s will have the most similar personnel.

2015 vs 2014 (Similarity Score: 937):


Now what if we want to see which season is most dissimilar to 2015 during the 30-team era? That would go to the 2002 season.

2015 vs 2002 (Similarity Score: 908):


Which two seasons in history have the most similar winning percentages? The 1922 and 1923 seasons. Again, back-to-back seasons are more likely to have higher scores because their team’s have changed the least.

1923 vs 1922 (Similarity Score: 968):


Next, these two seasons have the most dissimilar winning percentages:

1953 vs 1909 (Similarity Score: 819):


Finally, I’ve included a list of the top and bottom 15 similarity scores of all-time:

Highest Scores   Lowest Scores
Years Score Teams   Years Score Teams
1923 vs 1922 968 16   1953 vs 1909 819 16
2015 vs 1945 966 16   1954 vs 1909 831 16
1973 vs 1971 963 24   1961 vs 1912 836 16
1957 vs 1956 963 16   1915 vs 1906 836 16
1959 vs 1958 962 16   1946 vs 1906 839 16
1973 vs 1972 962 24   1943 vs 1909 839 16
1997 vs 1959 961 16   2002 vs 1908 840 16
1964 vs 1963 961 20   1956 vs 1912 840 16
1911 vs 1910 961 16   1942 vs 1907 840 16
1931 vs 1930 960 16   1953 vs 1912 842 16
1997 vs 1958 959 16   1942 vs 1913 842 16
1942 vs 1941 959 16   1942 vs 1909 842 16
1937 vs 1936 959 16   1957 vs 1912 843 16
1966 vs 1965 959 20   1928 vs 1915 843 16
1999 vs 1958 958 16   1921 vs 1909 843 16
Posted in General, Historical, Statistical Analysis | 2 Comments

Double Plays and Opportunities

One of my pet peeves in baseball is when an announcer mentions a stat without giving any context. It’s lazy and misleading and it’s usually done with counting stats. For example, both A.J. Pierzynski and Prince Fielder have grounded into 19 double plays this season, but Pierzynski has done it in 60 fewer opportunities. Opportunities are defined as plate appearances with a runner on first with less than two outs.

Why do opportunities need to be considered? Not all plate appearances are created equal. Lead-off batters are guaranteed to have one plate appearance per game that will not end up in a double play. Teams with high on base percentages as a whole, will have more opportunities because they have more players reaching first base. Players in the 2/3/4/5 spots in the lineup will have more opportunities because the hitters in front of them will reach base more often (assuming proper lineup construction).

To show this graphically, I decided to make a scatter plot for double plays and double play opportunities. It includes all players with at least 25 opportunities this season.

*Data grabbed from the indispensable Baseball-Reference

The red line shows the league average double play rate, which is around 11% of opportunities. Basically, the farther away from the line to the top left, the better the player has been at avoiding double plays. Conversely, the farther from the line to the bottom right, the worse they are. I added labels for the biggest outliers and most notable offensive players.

Some things the players on the top left have in common:
– Left-handed. Left-handed batters are a few feet closer to first base, so their time to first is, on average, shorter.
– Speed. This is simple. Faster players will beat out more throws to first than slower players.
– Flyball hitters. Fewer ground balls, fewer double plays.
– High strikeouts: You can’t ground into a double play if you don’t put the ball in play.

A couple of notes:
– Lonnie Chisenhall has had 70 opportunities and has yet to ground into a double play.
– The Cubs Rizzo and Bryant have combined for 284 opportunities and have grounded into 15 double palys (8 and 7 respectively). Compare that to Yasmany Tomas who has grounded into 15 double plays in just 72 opportunities.
– It’s no coincidence that the games best hitters (Trout, Harper, Donaldson, Goldschmdit, Votto, etc are all at league average or better.

Posted in General, Quick Posts, Statistical Analysis | Leave a comment

Traveling Championship

Suppose the World Series winner from the previous season held a championship they had to defend every game, sort of a “championship belt”. I’ll call this the “traveling championship”. Once the traveling champion loses, the championship is then transferred to the team that beat them. Then, that team goes on to defend the title. This goes on until the end of the regular season and the last team to hold the title is the “Traveling Champion”.

You might ask:
What does this tell us?
Nothing. Absolutely nothing.

Is the “traveling champion” the best team?
Most likely no.

Why do this?
It’s fun, or at least I think it is.

Without further adieu, the current 2015 Traveling Champion is the Cleveland Indians. Here is how it got there:

Date Champion Defeated
4/10 SDP SFG
4/13 ARI SDP
4/14 SDP ARI
4/18 CHC SDP
4/19 SDP CHC
4/22 COL SDP
4/25 SFG COL
4/27 LAD SFG
4/28 SFG LAD
4/29 LAD SFG
5/13 FLA LAD
5/15 ATL FLA
5/19 TBD ATL
5/20 ATL TBD
5/22 MIL ATL
5/23 ATL MIL
5/25 LAD ATL
5/27 ATL LAD
5/28 SFG ATL
5/30 ATL SFG
6/11 NYM SFG
6/13 ATL NYM
6/14 NYM ATL
6/17 TOR NYM
6/20 BAL TOR
6/24 BOS BAL
6/25 BAL BOS
6/29 TEX BAL
7/10 ANA SEA
7/11 SEA ANA
7/12 ANA SEA
7/23 MIN ANA
7/25 NYY MIN
7/29 TEX NYY
8/10 BAL SEA
8/11 SEA BAL
8/14 BOS SEA
8/16 SEA BOS
8/17 TEX SEA
8/18 SEA TEX
8/19 TEX SEA
8/20 DET TEX
8/21 TEX DET
8/25 TOR TEX
8/27 TEX TOR
8/31 SDP TEX
9/13 HOU ANA
9/14 TEX HOU
9/18 SEA TEX
9/19 TEX SEA
9/20 SEA TEX
9/23 KCR SEA
9/25 CLE KCR

Here is the number of games each team has successfully defended the championship:

Team Successfully Defended Times Won Championship
TEX 15 13
ANA 12 6
LAD 9 6
ATL 6 6
SDP 6 4
BAL 5 4
SFG 4 6
NYM 3 4
NYY 3 1
TOR 3 2
ARI 2 4
COL 2 1
SEA 2 9
BOS 1 2
KCR 1 1
MIN 1 1
CHC 0 1
CLE 0 1
DET 0 1
FLA 0 1
HOU 0 1
MIL 0 3
TBD 0 1
Posted in General, Quick Posts | Leave a comment

Disparity in Division Quality

It’s time to change the playoff seeding, just as the NBA did recently.

The following is each division’s record vs non-divisional opponents during the wild card era (1994-present). As you can see, the NL Central (.555) has the seventh best winning percentage of any division, which equates to a 90-72 record over 162 games.

The NL East, however, has the fourth worst winning percentage (.424), which equates to a 69-93 record.

Year Lg Div W L W-L%
2002 AL West 251 165 0.603
2001 AL West 250 166 0.601
2008 AL East 255 194 0.568
2012 AL West 237 183 0.564
2013 AL East 243 188 0.564
2010 AL East 251 199 0.558
2015 NL Central 228 183 0.555
1998 AL East 316 254 0.554
2003 NL East 237 192 0.552
2011 AL East 248 202 0.551
1994 AL Central 205 168 0.550
2009 AL West 230 190 0.548
2005 NL East 239 199 0.546
2002 NL West 234 195 0.546
1994 NL East 208 175 0.543
1997 AL East 307 263 0.539
1997 NL East 307 263 0.539
2006 AL West 226 194 0.538
2000 NL West 298 256 0.538
2013 NL Central 231 199 0.537
2004 NL Central 232 200 0.537
2006 AL Central 233 201 0.537
2007 NL West 241 209 0.536
2009 AL East 241 209 0.536
2002 NL East 229 199 0.535
2001 NL West 230 200 0.535
2011 NL East 240 209 0.535
2009 NL West 240 210 0.533
2012 AL East 240 210 0.533
1999 AL East 300 264 0.532
2003 AL West 221 195 0.531
2008 NL Central 263 233 0.530
2015 AL East 218 196 0.527
2004 AL West 219 197 0.526
1997 NL West 265 239 0.526
2003 AL East 226 204 0.526
1994 AL East 215 195 0.524
2012 NL East 236 214 0.524
2010 NL East 236 214 0.524
1996 AL Central 287 261 0.524
2004 AL East 225 205 0.523
2000 AL West 260 237 0.523
2007 AL West 219 201 0.521
1995 AL West 220 202 0.521
2005 AL West 219 205 0.517
1996 NL West 254 238 0.516
2014 AL East 221 209 0.514
1999 AL West 258 244 0.514
2005 AL East 226 214 0.514
1998 NL Central 327 310 0.513
2015 AL Central 212 201 0.513
2014 AL West 220 210 0.512
2006 NL East 225 215 0.511
1998 NL West 292 279 0.511
2010 NL West 230 220 0.511
1995 NL West 214 206 0.510
2014 NL Central 219 211 0.509
2000 AL Central 286 276 0.509
1999 NL West 282 274 0.507
2007 AL East 228 222 0.507
1999 NL East 283 276 0.506
2005 NL Central 251 245 0.506
2003 NL West 217 212 0.506
1994 NL Central 183 179 0.506
2014 AL Central 217 213 0.505
1995 NL East 231 228 0.503
1999 NL Central 301 297 0.503
2011 NL West 226 223 0.503
2008 AL Central 226 224 0.502
2007 NL East 225 225 0.500
1996 AL West 245 245 0.500
1998 AL West 251 252 0.499
2006 NL West 217 218 0.499
2007 AL Central 224 226 0.498
1995 AL Central 231 233 0.498
2014 NL East 214 216 0.498
2011 AL West 209 211 0.498
2004 NL East 213 217 0.495
1996 NL East 272 278 0.495
2015 AL West 205 210 0.494
2005 AL Central 215 221 0.493
2000 NL East 274 283 0.492
2012 NL West 221 229 0.491
1996 NL Central 270 280 0.491
2006 AL East 215 223 0.491
2000 AL East 275 287 0.489
2013 AL Central 210 220 0.488
2001 NL East 210 220 0.488
1995 NL Central 227 238 0.488
2015 NL West 202 212 0.488
2010 AL Central 219 231 0.487
2013 NL West 209 221 0.486
1997 AL West 244 260 0.484
1995 AL East 222 238 0.483
2008 NL East 217 233 0.482
2002 AL East 207 223 0.481
2008 AL West 201 218 0.480
2009 NL East 215 235 0.478
1996 AL East 262 288 0.476
2010 AL West 198 222 0.471
1998 NL East 268 302 0.470
2003 NL Central 220 248 0.470
2013 NL East 201 229 0.467
2009 NL Central 232 266 0.466
2001 NL Central 218 250 0.466
2001 AL Central 200 230 0.465
2004 NL West 200 230 0.465
2014 NL West 199 231 0.463
2011 AL Central 208 242 0.462
1997 AL Central 261 309 0.458
2013 AL West 197 234 0.457
1997 NL Central 260 310 0.456
2011 NL Central 226 270 0.456
2000 NL Central 272 326 0.455
2012 NL Central 225 271 0.454
2004 AL Central 195 235 0.454
2010 NL Central 227 275 0.452
2001 AL East 194 236 0.451
1998 AL Central 256 313 0.450
1994 NL West 156 193 0.447
2009 AL Central 201 249 0.447
2007 NL Central 221 275 0.446
2012 AL Central 199 251 0.442
1999 AL Central 248 317 0.439
2008 NL West 195 255 0.433
2006 NL Central 210 275 0.433
2002 NL Central 184 247 0.427
2005 NL West 189 255 0.426
2015 NL East 175 238 0.424
2003 AL Central 180 250 0.419
1994 AL West 144 201 0.417
2002 AL Central 177 253 0.412

The top three teams in the National League Central also have the three best records in the league. If the season ended today, two of them would have to settle for a one game “play-in”, just to enter the tournament. This type of situation wasn’t as big of a problem before the addition of the second wild card in 2012, when all playoff teams were in the tournament. But now, it is unfair to force better teams to have to win a one game crap shoot.

If we are stuck with three divisions and five playoff teams, I would rather see all division winners plus two wild card teams advance to the playoffs with the caveat that the two teams with the worst record face each other in the “play-in” game. Admittedly, it’s not perfect, but it is better than what we have now.

Posted in General, Historical, Quick Posts, Statistical Analysis | Leave a comment