Applications of Improvements to the Pythagorean Won-Lost Expectation in Optimizing Rosters
Alexander F. Almeida (),
Kevin Dayaratna (),
Steven J. Miller () and
Andrew K. Yang ()
Additional contact information
Alexander F. Almeida: Georgetown University
Kevin Dayaratna: Georgetown University
Steven J. Miller: Williams College
Andrew K. Yang: University of Cambridge
A chapter in Artificial Intelligence, Optimization, and Data Sciences in Sports, 2025, pp 333-353 from Springer
Abstract:
Abstract Bill James’ Pythagorean formula has for decades done an excellent job estimating a baseball team’s winning percentage from very little data: if the average runs scored and allowed are denoted by RS $$\mathrm {RS}$$ and RA $$\mathrm {RA}$$ , respectively, there is some γ $$\gamma $$ such that the winning percentage is approximately RS γ ∕ ( RS γ + RA γ ) $$\mathrm {RS}^\gamma / (\mathrm {RS}^\gamma + \mathrm {RA}^\gamma )$$ . One important consequence is to determine the value of different players to the team, as it allows us to estimate how many more wins we would have given a fixed increase in run production. We summarize earlier work on the subject and extend the earlier theoretical model of Miller (who estimated the run distributions as arising from independent Weibull distributions with the same shape parameter; this has been observed to describe the observed run data well). We now model runs scored and allowed as being drawn from independent Weibull distributions where the shape parameter is not necessarily the same and then use the Method of Moments to solve a system of four equations in four unknowns. Doing so yields a predicted winning percentage that is consistently better than earlier models over the last 30 MLB seasons (1994 to 2023). This comes at a small cost as we no longer have a closed-form expression but must evaluate a two-dimensional integral of two Weibull distributions and numerically estimate the solutions to the system of equations; as these are trivial to do with simple computational programs, it is well worth adopting this framework and avoiding the issues of implementing the Method of Least Squares or the Method of Maximum Likelihood.
Keywords: James’ pythagorean Won-Lost formula; Weibull distribution; 62P99 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:spochp:978-3-031-76047-1_13
Ordering information: This item can be ordered from
http://www.springer.com/9783031760471
DOI: 10.1007/978-3-031-76047-1_13
Access Statistics for this chapter
More chapters in Springer Optimization and Its Applications from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().