PECOTA
PECOTA, an acronym for Player Empirical Comparison and Optimization Test Algorithm, is a sabermetric system for predicting Major League Baseball player performance.[1] It was invented by Nate Silver of Baseball Prospectus. It relies on fitting a given player's past performance statistics to the performance of "comparable" Major League ballplayers by means of similarity scores. Although drawing on the underlying concept of Bill James' similarity scores, PECOTA calculates these scores in a distinct way that leads to a very different set of "comparables" than James' method.[2] Separate sets of predictions are developed for hitters and pitchers. The comparable players are drawn from a database of all major league player-seasons since 1946. The raw statistics in this database are first adjusted to take into account park effects and the era in which a player played.
PECOTA also draws on Clay Davenport's translations (the so-called Davenport Translations or DT's) of minor league and international baseball statistics to estimate the major league equivalent performance of each player.[3] In this way, PECOTA is able to make projections for more than 1,600 players each year, including many players with little or no prior major league experience.
Unlike performance forecasts that commonly assume a single pattern of change during a player's career, PECOTA employs several models that take into account not just a player's performance in the previous three years but also his age, speed, handedness, and body type (basically, body mass index). Furthermore, instead of focusing on making point estimates of a player's future performance (such as batting average, home runs, and strike-outs), PECOTA relies on the historical performance of the given player's historical "comparables" to produce a probability distribution of the given player's predicted performance during the next five years.
First introduced in 2003, PECOTA projections are produced each year and published both in the Baseball Prospectus annual monographs and on the BaseballProspectus.com website.[13] PECOTA has undergone several improvements since 2003. The 2006 version introduced metrics for the market valuation of players based on the predicted performance levels. The 2007 version introduces adjustments for league effects, to account for differences in the competitive environment of the two major leagues. The logic and methodology underlying PECOTA have been described in several publications (see References), but the detailed formulas are proprietary and have not been shared with the baseball research community. The test of PECOTA is its ability to make accurate forecasts in comparison with alternative forecasting methods. A comparison for the 2006 season shows that PECOTA outperformed several other forecasting systems in predicting hitting (OPS) and performed nearly as well as the best of the other systems in predicting pitching (ERA).[4]
Although designed primarily for predicting individual player performance, PECOTA has been applied also to predicting team performance. For this purpose, projected team rosters are established with projected playing times for each team member based on the expertise of the Baseball Prospectus staff. A team's expected wins is based on applying an improved version of Bill James' Pythagorean Formula to the estimated number of runs scored and allowed by the roster of players under the given playing-time assumptions.[5] PECOTA has been used in preseason forecasts of how many wins teams will attain and in mid-season simulations of the number of wins each team will attain and its odds of reaching the playoffs.[6] In 2006, PECOTA's preseason forecasts compared favorably to other forecasting systems (including Las Vegas betting line odds) in predicting the number of wins teams would earn during the season.[7]
[edit] Notes
- ↑ The acronym was actually based on the name of journeyman major league player Bill Pecota,[1] who with a lifetime batting average of .249 is perhaps representative of the typical PECOTA entry.
- ↑ This difference is explained and illustrated in Nate Silver, "Introducing PECOTA," Baseball Prospectus 2003 (Dulles, VA: Brassey's Publishers, 2003): 507-514. Also see Baseball Prospectus' glossary entry for "Comparable Players"[2].
- ↑ See Clay Davenport, "DT's vs. MLEs — A Validation Study," BaseballProspectus.com, January 30, 1998[3]; Clay Davenport, "Winter and Fall League Translations: Just How Good Are These Leagues, Anyway?," BaseballProspectus.com, January 27, 2004[4]; and Clay Davenport, "Over There! A Second Review of Translating Japanese Statistics, and Translating the Mexican League," Baseball Prospectus 2004 (New York: Workman, 2004): 585-590.
- ↑ Dan Szymborski, "2006 Projections," BaseballThinkFactory.com (December 14, 2006)[5].
- ↑ On the Pythagenport formula, see Clay Davenport and Keith Woolner, "Revisiting the Pythagorean Theorem: Putting Bill James' Pythagorean Theorem To the Test," BaseballProspectus.com, June 30, 1999[6] as well as the Baseball Prospectus glossary entry for "Pythagenport"[7]. On the construction of the depth charts for each team and the application of PECOTA to estimating team wins, see Nate Silver, "PECOTA Projects the American League," BaseballProspectus.com, March 21, 2005[8]; and Nate Silver, "PECOTA Breaks Hearts," BaseballProspectus.com, March 29, 2006[9].
- ↑ See Clay Davenport, "Playoff Odds Report: The Addition of PECOTA," BaseballProspectus.com, May 3, 2006[10] and Baseball Prospectus Statistics[11].
- ↑ Nate Silver, "Projection Reflection," BaseballProspectus.com, October 11, 2006[12].
[edit] References
William Hageman, "Baseball by the Numbers," Chicago Tribune, January 4, 2006.
Alan Schwarz, "Predicting Futures in Baseball, and the Downside of Damon," New York Times, November 13, 2005.
Nate Silver, "The Science of Forecasting," BaseballProspectus.com, March 11, 2004[14].
Nate Silver, "Introducing PECOTA," Baseball Prospectus 2003 (Dulles, VA: Brassey's Publishers, 2003): 507-514.
Nate Silver, "PECOTA Takes on the Field: How'd It Fare Against Six Other Projections Systems?" BaseballProspectus.com, January 16, 2004[15].
Nate Silver, "PECOTA 2004: A Look Back and a Look Ahead," Baseball Prospectus 2004 (New York: Workman Publishers, 2004): 5-10.
Nate Silver, "Rearranging PECOTA," Baseball Prospectus 2006 (New York: Workman Publishers, 2006): 6-11.
Childs Walker, "Baseball Prospectus Makes Predicting Future Thing of Past," Baltimore Sun, February 21, 2006.
