Park Neutralized Stats

One thing that gives people like me headaches is having to deal with the fact that all ballparks are different. Don’t get me wrong, the uniqueness of the sport is part of what makes baseball so special. But it’s an imperfect science when attempting to compare players from different teams.

The other day, my friend and baseball statistic guru Ryan Spaeder was asked which park was the toughest on hitters, and this was his response:

I thought it was perfect. Obviously, Coors Field is easily the most “hitter friendly” ballpark, but it is almost a no-win situation for a hitter. The Rockies have zero Hall of Famers in their 24 years of existence and their two best candidates (Larry Walker and Todd Helton) are unlikely to get in any time soon. The argument is their stats are inflated due to Coors Field, although it is impossible to say by exactly how much. We have park neutralized metrics such as OPS+ and wRC+ which include a park adjustment, and while I use them regularly, I admit they aren’t perfect.

The common practice in adjusting for ballpark is to take the ballpark factor, which estimates how a park influences run scoring compared to league average, and apply to it to each hitter. However, all players receive the same adjustment, no matter if they bat left-handed or right-handed, or if they are fly ball, ground ball, pull or spray hitters, etc. The assumption that all types of hitters should be treated the same is what I’m attempting to correct.

So while I know I’m not completely fixing the problem here, I’m offering an alternative. What I have are park adjusted career totals based off home and road splits. My first attempt was to just multiply road stats by two, but that would completely eliminate the player from playing ANY games in their home park and turn ballpark advantages into disadvantages and vice versa.

Instead, I decided to include home stats, but only at the same rate that a player would visit other parks. For example, Babe Ruth played in an era when his league had eight teams. That means that if he played in all ballparks an equal amount of time, he’d play in his home ballpark 12.5% (1/8) of his games. The other 87.5% would come from his road stats. A player in 2016 would play in their home park 6.67% (1/15) of the time.

Example

The formula is simple. For Babe Ruth’s home runs, we take his home HR/PA (347 / 5150 = .0674) and divide that by the number of teams in his league (.0674 / 8 = .00842). Next we take his road HR/PA (367 / 5473 = .0671) and multiply that by (7 / 8 = .875), which is the number of opponents in the league divided by the number of teams in the league (.0671 * .875 = .0587). Next, we add those two numbers together to get his new HR rate (.00842 + .0587 = .06712). To get his final career HR total, we multiply his rate by his career plate appearances (.06712 * 10623 = 713 HR). Surprisingly, he actually loses a HR, even though he played most of his career with a short right field at Yankee Stadium. We’ll see, however, that many players have a bigger difference in their adjusted career totals.

500 HR
After neutralizing the stats, there are no new members of the 500 HR club, although we lose six players. The biggest drop is by Mel Ott, who loses over 20% of his career total. Ott played his entire career at The Polo Grounds, where the right field foul pole was just 258 feet from home plate. During his career there, lefthanded batters hit about 80% more HR there than they did at the other National League parks.
Capture
What is interesting about Mel Ott and The Polo Grounds is that while it allowed Ott to hit far more home runs, it came at the expense of other hits. So we take away 104 HR, but also credit him 83 singles, 87 doubles, and 21 triples. Overall, his production was increased at home, but not by as much as his home run total would indicate.

The second biggest drop is a bit surprising in Frank Thomas, who lost 90 home runs. Comiskey Park does favor HR hitters, but it’s far from the most drastic ballpark. Still, over his career, Thomas hit a HR in 6.2% of his plate appearances at home and 4.1% on the road.

David Ortiz is not only the biggest gainer on the list, but also among all players. He went from 525 HR to 589, and we’ll see that Fenway Park wreaks havoc on these neutralized stats.
HR Change

The common theme with both of these lists is that those who saw their totals increase all played in parks that favored pitchers with the long ball, while those who saw their totals decrease played in parks that favored home runs.

Now let’s look at what is probably the second most popular career batting list, the “3,000 Hit Club”. (Note: Since we only have home/away splits going back to 1913, any player who began their career before this season is not included. Thus, no Cobb, Wagner or Speaker).
3000 Hits
Just as with the “500 HR Club”, the “3000 Hit Club” only lost players. While some players see an increase their production and some see a decrease from these neutralized stats, as a whole, players will lose some production. This is due to home field advantage and because the majority of these neutralized stats are influenced by road stats. This may partly explain why there are no new members of either club.

Already, we see some of Fenway Park’s impact with David Ortiz’s increase in his home run total and both Yaz and Boggs seeing big decreases in their hit totals. Maybe the most telling is the list of players that saw the biggest decrease in their doubles.
Capture
Nine of the top 10 played most or all of their career at Fenway Park. You just don’t see this type of thing in other sports.

Players with Big Increases in Production

Capture
A fun thought experiment is to imagine how their careers would have turned out had Joe DiMaggio and Ted Williams been traded for each other, with DiMaggio taking advantage of the Green Monster and Williams facing a short RF porch. Instead for DiMaggio, he had to contend with 451 feet to left-center field at Yankee Stadium. We estimate that with a neutral park, he would hit 45 more home runs and increase his overall production, with 31 more points of OPS.

Capture
Rick Wilkins may be the player farthest from your mind when you started reading, yet here he is. What is most amazing about him is just how drastic his home/road splits were when he spent the majority of his career at a hitters park in Wrigley Field. For his career, Wilkins hit .216/.298/.350 at home and .272/.366/.471 on the road. Who knows? Maybe he just got incredible nights sleep in hotel beds.

Capture
It’s no secret that AT&T Park favors pitchers, and what makes Buster Posey so special is that his raw stats are impressive, even before a park adjustment. But if we estimate what they would look like at more favorable parks, it becomes even more obvious that he’s on an early path to the Hall of Fame.

Capture
In the Willie Davis comment in his New Historical Baseball Abstract, Bill James describes a method for converting a players stats from one run environment to another. This has come to be known as the “Willie Davis Method” and it is currently used on this site and is the basis for Baseball-Reference’s neutralized stats (with some additional adjustments). The problem with this method is it treats all batting events the same and are adjusted at the same proportion. As we have seen with Fenway Park, this is not the case. Anyway, Bill James introduced his method in Davis’s player comment because he spent much of his career at a horrible hitters park. As we can see from this neutralization method, Davis’s stats improve, and his +29 triples and +210 total bases are the most of any player in history.

Capture
As if Mike Piazza didn’t already have the most impressive statistics for any catcher in baseball history, they get even better after they are neutralized. In fact, every single home ballpark during Piazza’s career had a park factor below 1.

Players with Big Decreases in Production

Capture
Chuck Klein spent much of his career in the Baker Bowl, which was 280 ft to right field and 300 ft to right-center field. So it’s easy to see why he hit 63% of his home runs at home. When we neutralize his stats, his overall numbers are much less impressive, especially given the hitters era in which he played.

Capture
As we saw earlier, Wade Boggs takes a big hit, with his OPS dropping 63 points. This is similar to other Red Sox players, such as Bobby Doerr (-.084), Rico Petrocelli (-.069), Dom DiMaggio (-.059), Jim Rice (-.053) and Carl Yastrzemski (-.051).

Capture
Barry Larkin is an interesting case. He has a 21 point drop in OPS while his hit total increases by 29. The biggest change was losing 130 walks, decreasing his walk percentage from 10.4% to 8.9%. In fact, Riverfront Stadium regularly increased walks for right handed batters every season of Larkin’s career. This is just a reminder that park factors are not limited to balls in play.

Capture
As mentioned above, playing at Coors Field can be a no-win situation. Obviously, Larry Walkers stats have received a boost. But if we neutralize them, he still compares well to these two Hall of Famers.
Capture
Throw in 9 Gold Gloves and +94 fielding runs, and this should quell any fears you may have about his Coors-inflated stats.

Capture
CarGo loses 102 points in OPS, which is the most of anyone with at least 2000 career plate appearances. This may indicate that his style of play is more affected by Coors Field. It’s also possible that he has a tougher time adjusting to the different approaches opposing pitchers take on the road, as Eno Sarris suggests. This may point to a flaw in the neutralization method, especially for extreme ballparks like Coors Field.

Change in Type of Production

Capture
Hank Aaron played his career at two parks, Milwaukee County Stadium and Atlanta-Fulton County Stadium. Milwaukee favored pitchers in terms of the long ball, but Atlanta was known as “The Launching Pad” and had a big affect on home runs. Naturally, he sees a drop in his home run total, but also an increase in singles, doubles and triples. The overall level of production didn’t change much after neutralization, just how it changed.

Capture
Jay Buhner’s neutralized OPS is nearly identical to his actual OPS, but his peripherals see some changes. His singles and home runs increase, but his singles, doubles and walks decrease. This is just another example that a ballpark can change how a game is played while having very little impact on the run environment.

Flaws in the System

The unbalanced schedule and interleague play make it so not all teams visit the same ballpark an equal amount of times. This means that Rockies players will visit pitchers parks such as Petco, At&T and Dodgers Stadium more often than teams in other divisions. It’s possible this can be corrected by equalizing the amount a team will see a road park. However, this would complicate the process and I’m not completely comfortable with it.

As mentioned in the Carlos Gonzalez comment, it is possible that some players’ away stats are affected due to different approaches taken by pitchers based on the ballpark. I suspect this is only the case in the very extreme and unique parks. It is something to keep an eye on.

This method only uses career statistics, which contain large samples when dealing with home/road splits. A single season may not offer a big enough sample to completely trust, especially with part-time players.

Conclusion

This method is admittedly imperfect, but it does fix the problem with applying the same run adjustment to all players. If anything, it is an alternative to other methods of neutralizing ballparks. I’m open to any suggestions on improving this method and I may publish a pitching neutralization shortly.

For those interested, I have included a spreadsheet with neutralized stats that can be viewed here. It includes all players with at least 1000 PA and began their career after 1912.

This entry was posted in General, Historical, Statistical Analysis. Bookmark the permalink.

2 Responses to Park Neutralized Stats

  1. Nick O'Lodeon says:

    Several years ago, I emailed Bill James to ask about what to do about road/home stats. Part of my question touched on the inconsistency of park factors from year to year even in the same park. His response…

    “They’re (park factors) not consistent because 80 games just isn’t enough to get a true read on anything. In general, I don’t use INDIVIDUAL home/road splits at all, and I would argue that it is normally inappropriate to use them. What we are concerned with is the VALUE of what the player has done. That isn’t dependent on his individual home/road stats.

    Suppose that were a player who, playing in a neutral park, hit .350 with 20 home runs at home, but .250 with 10 home runs on the road, while a teammate hits .300 home and road with 15 homers each
    place. Things like that do happen all the time, but suppose that this was a CONSISTENT effect, rather than an anomaly as it almost always is. Would one player be more valuable than the other? No, he wouldn’t, or at least not appreciably more valuable. The player’s value depends on the ratio between the runs he creates and run context in which he performs. His idiosyncratic tendencies are irrelevant.

    The textbook case here is the two Yankee catchers, Elston Howard and Bill Dickey. Both were Yankee catchers, careers of similar length, and both of them had brilliant four-year stretches relatively late in their careers 1936-39 for Dickey, 1961-64 for Howard. What many people don’t know is that Dickey
    was tremendously helped by playing in Yankee Stadium, while Howard was tremendously hurt by playing in the same park. In road games, Howard had not only more power than Dickey, but vastly more power…as I recall, Howard out homered Dickey in road games about 2 to 1. Taking ALL offense into account, Howard on the road may have been a better hitter than Dickey…not sure if that is true, but if he wasn’t better, he was certainly very close.

    But does this make Howard “actually” better than Bill Dickey? No, it doesn’t, because REAL BASEBALL GAMES WERE WON BY DICKEY’S ABILITY TO TAKE ADVANTAGE OF THE PARK. If you adjust Dickey’s advantage out of existence because it is a park “illusion”–which it is–then you are adjusting real wins out of existence. You can’t do that. So you have to live with the conclusion that Dickey was better than Howard. He was better because he had a superior ability to take advantage of the park–even though it was the same park, and had essentially the same hitting characteristics in Howard’s time that it did in
    Dickey’s.”

    Bill

    • Dan Hirsch says:

      Thanks for commenting. This is similar to all those Red Sox hitters who seemed to be helped by Fenway. It is hard to believe that a player like Wade Boggs wouldn’t be a great hitter if he played his home games in another park, even if he didn’t play nearly as well away from Fenway. Thanks for sharing.

Leave a Reply

Your email address will not be published. Required fields are marked *


7 − three =

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>