Revisiting Replacement Level
Posted: Thu Feb 01, 2024 9:22 pm
Many threads have discussed replacement level on this board... here's a Google search:
https://www.google.com/search?q=replace ... s-wiz-serp
As I have dug into one-number metrics over the years, I'm realizing why there was so much debate in the early years.
To summarize--teams can be as bad as -12 or even -15 as a whole. If replacement level is -2....do we really believe the team as a whole is below replacement level?
But conversely--when looking at ASPM and BPM 1.0 and RAPM, the average rating of minimum salary or 10-day contract players was in the vicinity of -2.0.
I argued then that worse players than replacement level were being played, in general, for development purposes and/or small sample size and luck made them appear worse than they actually were.
I've been working with better prior-informed RAPM data, and I think what was actually happening was this:
1. RAPM shrinks all players toward 0 (particularly low minutes players)
2. Metrics like BPM are built on this "shrunk" basis and also by nature of regression don't capture the full space (box score can't capture full impact).
3. We look at low minutes players and see -2.0.
Well, with some better prior-informed RAPM data, I think I'm ready to change my tune. I have tuned the prior so the model residuals are centered nicely around 0 for low minutes players--there is no minutes-based bias throughout the minutes spectrum.
With that in place, low minutes players average more like -3.5!
Now, again some of that may be from "development players", but certainly not enough to still argue for -2.0.
The new version of BPM/VORP will likely feature a lower replacement level.... Possibly around -3.0. Research continues.
https://www.google.com/search?q=replace ... s-wiz-serp
As I have dug into one-number metrics over the years, I'm realizing why there was so much debate in the early years.
To summarize--teams can be as bad as -12 or even -15 as a whole. If replacement level is -2....do we really believe the team as a whole is below replacement level?
But conversely--when looking at ASPM and BPM 1.0 and RAPM, the average rating of minimum salary or 10-day contract players was in the vicinity of -2.0.
I argued then that worse players than replacement level were being played, in general, for development purposes and/or small sample size and luck made them appear worse than they actually were.
I've been working with better prior-informed RAPM data, and I think what was actually happening was this:
1. RAPM shrinks all players toward 0 (particularly low minutes players)
2. Metrics like BPM are built on this "shrunk" basis and also by nature of regression don't capture the full space (box score can't capture full impact).
3. We look at low minutes players and see -2.0.
Well, with some better prior-informed RAPM data, I think I'm ready to change my tune. I have tuned the prior so the model residuals are centered nicely around 0 for low minutes players--there is no minutes-based bias throughout the minutes spectrum.
With that in place, low minutes players average more like -3.5!
Now, again some of that may be from "development players", but certainly not enough to still argue for -2.0.
The new version of BPM/VORP will likely feature a lower replacement level.... Possibly around -3.0. Research continues.