{"id":12786,"date":"2011-03-15T13:41:55","date_gmt":"2011-03-15T20:41:55","guid":{"rendered":"http:\/\/www.seamheads.com\/?p=12786"},"modified":"2011-03-16T15:51:43","modified_gmt":"2011-03-16T22:51:43","slug":"ballparks-database-updated","status":"publish","type":"post","link":"https:\/\/seamheads.com\/blog\/2011\/03\/15\/ballparks-database-updated\/","title":{"rendered":"Ballparks Database Updated!"},"content":{"rendered":"<p>Last month we rolled out the online version of the Seamheads Ballparks database, which contained descriptive information about every park ever used as a major league stadium, plus calculations of the impact on batting components for LH and RH batters beginning in 1950.<\/p>\n<p>Today we\u00e2\u20ac\u2122ve released an update to the original data.\u00c2\u00a0\u00c2\u00a0 The latest detailed documentation can always be found <a href=\"http:\/\/www.seamheads.com\/ballparks\/about.php\">here<\/a>, but here is a quick summary of the improvements:<\/p>\n<p>1.\u00c2\u00a0 Added the descriptive park data from Ron Selter\u00e2\u20ac\u2122s book <em>Ballparks of the Deadball Era.<\/em> This new and improved data covers parks used in the 1901 \u00e2\u20ac\u201c 1919 seasons.\u00c2\u00a0\u00c2\u00a0 One side effect of using this newer data is that, for some parks, it made it appear that a change occurred in 1920 to the park, as the dimensions now differ between 1919 and 1920, when in reality it was just that the 1919 data was more accurate.\u00c2\u00a0\u00c2\u00a0 To mitigate this issue, we extrapolated Mr. Selter\u00e2\u20ac\u2122s data past 1919 and into the 1920\u00e2\u20ac\u2122s until we reached a season where we were reasonably certain that physical changes were actually made to the park.<\/p>\n<p>2.\u00c2\u00a0 Added data provided by Clem Comly of Retrosheet.org for the years 1919-1949 from the Retrosheet box score event files that enable us to create estimated LH\/RH splits for these pre-1950 seasons.\u00c2\u00a0\u00c2\u00a0\u00c2\u00a0 They are not yet \u00e2\u20ac\u02dctrue\u00e2\u20ac\u2122 observed splits as, without play by play data, switch hitters must be excluded from our calculations, but they should be some of the best estimated splits you can find anywhere.<\/p>\n<p>We\u00e2\u20ac\u2122ll be diving into the data in some future articles, but for now, just a brief word about the park factor calculations.\u00c2\u00a0 We provide two sets of calculations \u00e2\u20ac\u201c 1-year factors and 3-year factors.<\/p>\n<p>The 1-year factors are \u00e2\u20ac\u02dcobserved\u00e2\u20ac\u2122 factors.\u00c2\u00a0 \u00c2\u00a0While we do use an \u00e2\u20ac\u02dcother parks corrector\u00e2\u20ac\u2122 as described in the detail documentation, these are essentially the factors that were observed for that particular year \u00e2\u20ac\u201c so a 120 doubles factor for LH batters in Fenway Park means that left-handed batters hit 20% more doubles at Fenway than LH batters for those same teams\u00e2\u20ac\u2122 batters hit in games away from Fenway.<\/p>\n<p>The 3-year factors are attempts at calculating the \u00e2\u20ac\u02dctrue\u00e2\u20ac\u2122 factors.\u00c2\u00a0 There are many, many ways we could have constructed our formula, and it\u00e2\u20ac\u2122s difficult to determine what the \u00e2\u20ac\u02dcright\u00e2\u20ac\u2122 way is, but we believe our way is at least a good and defensible way.\u00c2\u00a0\u00c2\u00a0\u00c2\u00a0 Our basic formula is to use the 1-year factors for the season in question, the season immediately preceding, the season immediately following, and then the park\u00e2\u20ac\u2122s long-term historical factor, all weighted equally.\u00c2\u00a0\u00c2\u00a0\u00c2\u00a0 As some parks have rather long histories, while other may have life for only a few seasons, this is not a perfect method, but we believe it retains a basic simplicity while providing for a high degree of accuracy in estimating a park\u00e2\u20ac\u2122s impact on offensive events.<\/p>\n<p>We welcome any feedback on any of the data or suggestions for improvement, so try it out and enjoy!<\/p>\n<p><a href=\"http:\/\/www.seamheads.com\/ballparks\/index.php\">http:\/\/www.seamheads.com\/ballparks\/index.php<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Last month we rolled out the online version of the Seamheads Ballparks database, which contained descriptive information about every park ever used as a major league stadium, plus calculations of the impact on batting components for LH and RH batters beginning in 1950. Today we\u00e2\u20ac\u2122ve released an update to the original data.\u00c2\u00a0\u00c2\u00a0 The latest detailed [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,4235,5],"tags":[],"class_list":["post-12786","post","type-post","status-publish","format-standard","hentry","category-general","category-top-stories","category-statistical-analysis"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/posts\/12786","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/comments?post=12786"}],"version-history":[{"count":0,"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/posts\/12786\/revisions"}],"wp:attachment":[{"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/media?parent=12786"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/categories?post=12786"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/seamheads.com\/blog\/wp-json\/wp\/v2\/tags?post=12786"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}