Hi everyone,
Sorry for the 2 week delay in checking this thread.
I was spending last week slandering Jokic's 30/20/20 since the statline didn't show the defense.
I'll catchup with all the replies and respond accordingly over the weekend.
To rephrase my argument, I don't think RAPM is complete garbage, just that it doesn't meet the standards for ranking players.
- Essentially any stat that tries to predict +/- or RAPM is destined for failure since the inherent noise makes it so that the upper bound on how much of the variance can be predicted is something like 50%.
- *70% not 50% for predicting Multiyear RAPM
I was going off memory,
but I assume the 50% figure will hold if you're looking for single year rankings which are necessary/relevant.
Single game is also useful/important
--
see the tweet for the full context:
https://x.com/sportsandmath1/status/190 ... 82117?s=46
Generally (note I'm making assumptions) Dean Oliver agrees with my sentiment about RAPM and his Net PTS stat follows the general idea of ignoring RAPM for a box score centric approach that uses the +/- for context.
The main difference (I assume) is that his coefficients are handcrafted from his decades of experience whereas mine are based on linear regression to fit the opinion of 10-15 people. I'm not qualified to tune the coefficients (when I tried before I posted anything online c. 2019 the results were mixed)
--
Since there's an upper limit on how well RAPM or +/- can be predicted (let's say ~50% for single season, ~70% for multiseason),
we can say that RAPM or +/- data in general is made up of 70% of data that's useful for player rankings and 30% of "inherent noise" that's related to lineup effects, team context, Off-Court +/-, or randomness. My initial post calling it all Off-Court +/- was reductive.
when EPM, LEBRON, DARKO, BPM, xRAPM, etc. are trying to predict (long term) RAPM or +/-, the RAPM serves as a useful guide to get up to ~70% of the answer (long term, ~50% for a single season) but the other 30-50% of the answer simply doesn't exist in +/- data and trying to predict player rankings better than 70% via RAPM can be counterproductive since they're fitting to data that is unrelated to player rankings.
---------
Sorry if this isn't as polished as I hoped - I had this written out a bit better but the draft got deleted while switching tabs so I just did my best to rewrite what I had earlier.
Anyways, I'm looking forward to reading all the replies!