DSMok1 wrote:permaximum, this is very useful research you are doing!
It appears you are taking season-long stats from Y-1 and using them to predict each of the 82 games in year Y. Is that correct?
Something that you will find with that approach, which may cause issues with the findings, is that unusual games will show high "roster turnover".
- Blowouts typically end up with unusual minutes distributions. Bad players play more when the team plays well! (Similar to the causation issue of the football running=winning idea.)
- Games with many players injured or resting. These would have high turnover, but may not reflect how those players would play together if they played together on a normal basis.
- End of season games, where one or both teams have little to play for. These games often feature unusual minutes patterns and players who haven't played get a chance to play.
Actually I did these tests a long time ago and I wasn't going to share the results. I thought this thread was very related to them and decided to share some of the results. As for my method, I take regular-season per-minute or per-possesion averages (depending on the metric) from Y-1 (some exceptions for rookies, injured players and below 250MP players) and predict all games that happen both in regular season and playoffs in the next season depending on the actual possession or minutes that take place in those games.
For more detail, you can check here where I explained it in a previous post.
1. You're right. But the sample size is too big for it to become a problem and a better metric should actually be good in those situations too. That's what I was looking for. And finally, it's the exact same playing field for all metrics where it's completely fair.Retrodiction Method: Each game was predicted by calculating previous year's per-possesion or per-minute metric score depending on the metrics' formula and regular season average in the previous year for players that take part in the game. Then each team assigned a Total Metric Score by using the actual minutes or possessions and thus winner was predicted. On 2-6 rare occasions where the metric scores were equal to each other home team claimed winner because of the factor of HCA. Players below 250 MP in the previous year and rookies were assigned average values. Then each game's unique roster turnover rate was calculated depending on the new founding of teams, signings, trades, rookies etc and most importantly their in-game minutes for that particular game. E.g: 90% RT minutes from team A and 80% RT minutes from team B makes the game's turnover score 85%. For those that curious there were only 6 games where the roster turnover was 100%. It means both teams had 100% roster turnover and all minutes came from completely new players for those teams. Average roster turnover rate for games was around 33%.
2. That's exactly the situation we're looking for.
3. That's exactly the situation we're looking for.
To summarize, more chaos is better for what I tested which is individual player impact or skill.