NCAA Statistical Plus/Minus 2009-2010 (DSMok1, 2010)

Home for all your discussion of basketball statistical analysis.
Post Reply
Crow
Posts: 10565
Joined: Thu Apr 14, 2011 11:10 pm

NCAA Statistical Plus/Minus 2009-2010 (DSMok1, 2010)

Post by Crow »

page 1 of 2

Author Message
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Mon Jan 11, 2010 12:34 pm Post subject: NCAA Statistical Plus/Minus 2009-2010 Reply with quote
NCAA Statistical Plus/Minus 2009-2010

Statistical plus/minus has been an underused metric for evaluating college players--it is the perfect place to use it! SPM can be readily corrected for strength of schedule and is calculated directly from the stat line.

So I have adapted the latest version of the SPM formula (see "Building an Updated Statistical Plus/Minus Model for 2009" and the actual regression) to the NCAA, similar to what I did for last year (only corrected, improved and updated).

The theory is the same: compile NCAA stats, convert to tempo-free rates, run the SPM regression, and sum the team SPM's to their Pomeroy Efficiency Margin. Several fixes: I realized the SPM regression runs on per 40 minute NBA rates--which would be per 76 possessions. I converted the NCAA stats to that baseline. Also, the output of the SPM is in points/100 possessions--a fact which had me confused previously.

Also, I added in a Bayesian "True Talent" estimation based on the system I outlined on this forum. This was a regression based on team margin, MPG, age, and position, to estimate the SPM, creating a prior from inputs not included in SPM (except MPG). A standard error of the prior was also estimated (along with the standard error of each player's SPM). Basically, I tried to replicate Rosenbaum's method for the SPM standard errors.

The result is multifaceted. We have the pure SPM, which is per 100 player-possessions. We can look at the actual player contribution, which is per 100 TEAM-possessions (probably the best actual rating). We can look at the true talent estimate--which regresses too hard due to the high errors on the individual SPM's, but with more data and perhaps more years would be very useful. I created an overall rank based on a weighted average of the player's contribution to the team (x2) and their true talent estimate.

Access the Spreadsheet HERE. It is editable so you can sort it (let me know if that works)--please don't delete or modify the data. (Don't worry--I have backups!)

The top 30 Players:
Code:
Player Team Pos Yr SPM Contrib True Talent
Jon Scheyer Duke G Sr 14.86 13.20 11.16
Jackson Emery Brigham Young G Jr 17.25 10.83 8.37
Jarvis Varnado Mississippi St. F Sr 15.81 11.04 7.09
Trevon Hughes Wisconsin G Sr 12.21 9.54 8.01
Jimmy Butler Marquette G Jr 12.32 9.82 7.28
Jeremy Lin Harvard G Sr 14.69 11.57 6.36
Ekpe Udoh Baylor C Jr 10.72 9.14 7.60
James Anderson Oklahoma St. G Jr 13.21 10.98 6.40
Dominique Jones South Florida G Jr 12.73 11.26 6.29
Wes Johnson Syracuse F Jr 10.35 8.43 8.55
Robbie Hummel Purdue F Jr 12.19 8.80 7.58
Draymond Green Michigan St. F So 16.41 9.97 6.45
Damion James Texas F Sr 12.09 8.16 7.69
Jacob Pullen Kansas St. G Jr 12.20 8.88 6.74
Gordon Hayward Butler F So 10.59 8.77 6.91
Aubrey Coleman Houston G Sr 10.83 9.91 6.28
Jon Leuer Wisconsin F Jr 11.49 7.95 7.15
Damian Saunders Duquesne F Jr 13.00 11.35 5.89
Jimmer Fredette Brigham Young G Jr 11.03 7.53 7.70
Jerome Dyson Connecticut G Sr 9.59 8.12 6.67
Roman Martinez New Mexico F Sr 10.91 8.39 6.43
Kevin Jones West Virginia F So 9.46 7.51 7.51
Manny Harris Michigan G Jr 11.84 10.47 5.88
Ben Uzoh Tulsa G Sr 9.94 7.92 6.27
Greivis Vasquez Maryland G Sr 9.61 7.65 6.29
Kyle Singler Duke F Jr 8.02 6.80 9.24
David Schneider William & Mary G Sr 12.17 9.44 5.80
Lavoy Allen Temple F Jr 8.47 7.10 6.75
Greg Monroe Georgetown C So 8.39 7.07 7.06
Marqus Blakely Vermont F Sr 15.35 13.05 5.31
Lazar Hayward Marquette F Sr 9.93 7.32 6.37
Cole Aldrich Kansas C Jr 10.30 6.66 8.23
Damian Johnson Minnesota F Sr 11.35 7.29 6.38
Trevor Booker Clemson F Sr 9.76 7.08 6.58
John Wall Kentucky G Fr 8.96 6.99 6.82
Patrick Christopher California G Sr 8.23 6.69 7.30
Jeremy Hazell Seton Hall G Jr 11.72 9.24 5.69
Xavier Henry Kansas G Fr 9.43 6.63 7.69
Landry Fields Stanford G Sr 12.15 10.76 5.28
Back to top
View user's profile Send private message Send e-mail Visit poster's website
Crow



Joined: 20 Jan 2009
Posts: 818


PostPosted: Mon Jan 11, 2010 4:02 pm Post subject: Reply with quote
So only about ten of the guys in the first round of major mock drafts are in the top 30 on overall rating. I will be looking more at who in the mock draft is way down the list on SPM or talent level or overall rating than who is not on the mock draft and yet high on these lists, though a few of such cases may be worth more attention. This is adjusted for strength of schedule yet the gap between top college performance and perceived top NBA potential remains. Probably mostly right but something to consider further.
Back to top
View user's profile Send private message
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Mon Jan 11, 2010 5:06 pm Post subject: Reply with quote
Crow wrote:
So only about ten of the guys in the first round of major mock drafts are in the top 30 on this SPM. I haven't waded thru the file yet but will be looking more at who in the mock draft is way down the list on SPM than who is not on the mock draft and yet high on this SPM list, though a few of such cases may be worth more attention. This metric is adjusted for strength of schedule yet the gap between top college performance and perceived top NBA potential remains. Probably mostly right but something to consider further.


Do look at what year each player is--if they are in the top 30 as a freshman, then they're probably NBA material. Senior, not so much so. I haven't done the aging curves, but there is a significant climb. Also, remember that some players already are highly skilled, and some are rather raw still--while still posting good numbers. Some have a lot of untapped potential still; others are already nearly as good as they can get.
Back to top
View user's profile Send private message Send e-mail Visit poster's website
Crow



Joined: 20 Jan 2009
Posts: 818


PostPosted: Mon Jan 11, 2010 5:28 pm Post subject: Reply with quote
This isn't comprehensive but Scheyer, Udoh, Varnado, James Anderson, Damion James, Vasquez are among those who do quite well here. S. Collins, Alabi and Favors among those not producing here as well as their mock draft rank.
Back to top
View user's profile Send private message
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Tue Jan 12, 2010 10:24 am Post subject: Reply with quote
I decided to change the spreadsheet to READ-ONLY after it got messed up last night--download the spreadsheet if you want to sort/edit it.
Back to top
View user's profile Send private message Send e-mail Visit poster's website
tawtaw



Joined: 25 Jun 2008
Posts: 28
Location: Oregon

PostPosted: Tue Jan 12, 2010 8:05 pm Post subject: Reply with quote
Great stuff DSMok. Thanks a bunch.
Back to top
View user's profile Send private message
Statman



Joined: 20 Feb 2005
Posts: 242
Location: Arlington, Texas

PostPosted: Thu Feb 25, 2010 1:19 pm Post subject: Reply with quote
Are you getting your data from the NCAA boxscores also?

I am still trying to find a RELIABLE play by play data with proper substitutions.....
_________________
Dan

My current national college player rankings (and other stuff):
http://www.pointguardu.com/f136/statman ... post355594
Back to top
View user's profile Send private message Send e-mail
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Fri Feb 26, 2010 9:18 am Post subject: Reply with quote
Statman wrote:
Are you getting your data from the NCAA boxscores also?

I am still trying to find a RELIABLE play by play data with proper substitutions.....


Yes, I get the data from NCAA.org, from pages like this: http://stats.ncaa.org/team/stats?org_id ... l_id=10260. I think NCAA.org has reliable play-by-play data, though I haven't worked with it myself. As usual, all substitution data is present except who started the halves. I haven't figured out the game ID# system... An example play-by-play log: http://stats.ncaa.org/game/play_by_play/114002.
Back to top
View user's profile Send private message Send e-mail Visit poster's website
Statman



Joined: 20 Feb 2005
Posts: 242
Location: Arlington, Texas

PostPosted: Fri Feb 26, 2010 9:30 am Post subject: Reply with quote
DSMok1 wrote:
Statman wrote:
Are you getting your data from the NCAA boxscores also?

I am still trying to find a RELIABLE play by play data with proper substitutions.....


Yes, I get the data from NCAA.org, from pages like this: http://stats.ncaa.org/team/stats?org_id ... l_id=10260. I think NCAA.org has reliable play-by-play data, though I haven't worked with it myself. As usual, all substitution data is present except who started the halves. I haven't figured out the game ID# system... An example play-by-play log: http://stats.ncaa.org/game/play_by_play/114002.


At statsheet.com, he appears to be using the NCAA PbP data also. I looked just at Arizona, and 10 of the 26 games appeared to have errors (the +/- totals didn't sum correctly). The last game Arizona played WSU specifically (18 point Arizona loss) - the sub out info was so bad (especially 12 minutes left in the game) that the results were completely flawed.

I went ahead and dropped my +/- project I was doing with Arizona because I couldn't find accuarate enough PbPs.
_________________
Dan

My current national college player rankings (and other stuff):
http://www.pointguardu.com/f136/statman ... post355594
Back to top
View user's profile Send private message Send e-mail
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Fri Feb 26, 2010 9:45 am Post subject: Reply with quote
Statman wrote:


At statsheet.com, he appears to be using the NCAA PbP data also. I looked just at Arizona, and 10 of the 26 games appeared to have errors (the +/- totals didn't sum correctly). The last game Arizona played WSU specifically (18 point Arizona loss) - the sub out info was so bad (especially 12 minutes left in the game) that the results were completely flawed.

I went ahead and dropped my +/- project I was doing with Arizona because I couldn't find accuarate enough PbPs.


Ken Pomeroy parsed PbP's for last year's games (see the examples here: http://www.kenpom.com/team.php?team=Arizona&y=2009), but I note that he skipped many games. I think those are the games with flawed PbP data.
Back to top
View user's profile Send private message Send e-mail Visit poster's website
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Fri Feb 26, 2010 12:51 pm Post subject: Reply with quote
Spreadsheet updated:
http://www.editgrid.com/user/dsmok1/NCA ... -10_Feb_26
Back to top
View user's profile Send private message Send e-mail Visit poster's website
tawtaw



Joined: 25 Jun 2008
Posts: 28
Location: Oregon

PostPosted: Mon Apr 05, 2010 4:46 pm Post subject: Reply with quote
Great stuff DSMok1.

Looking forward to the season final totals if you post.
Back to top
View user's profile Send private message
DSMok1



Joined: 05 Aug 2009
Posts: 611
Location: Where the wind comes sweeping down the plains

PostPosted: Mon Apr 12, 2010 11:55 am Post subject: Reply with quote
tawtaw wrote:
Great stuff DSMok1.

Looking forward to the season final totals if you post.


Final Season Totals:

http://www.editgrid.com/user/dsmok1/NCA ... 9-10_Final

The issues obvious: steals and blocks dominate to a large extent; also volume shooters skew because of the TSA^2 term.
Back to top
View user's profile Send private message Send e-mail Visit poster's website
tawtaw



Joined: 25 Jun 2008
Posts: 28
Location: Oregon

PostPosted: Tue Apr 13, 2010 10:15 am Post subject: Reply with quote
Great stuff, thanks.
Back to top
View user's profile Send private message
tpryan



Joined: 11 Feb 2005
Posts: 100


PostPosted: Sun Apr 18, 2010 1:12 am Post subject: Reply with quote
Well, I haven't been following this thread but I see that you have some interesting numbers, such as John Wall at #49 and Derrick Favors at #126, with both expected to be lottery picks and Wall perhaps going first in the draft. As a Ga. Tech fan, I was also interested to see Gani Lawal at #561. Personally, I can't see Lawal, a junior who has declared, as an NBA player who will contribute to any degree because he has very limited range on his shot and he is a poor free throw shooter. That might be okay if he were 7-3, but he is 6-9. Favors, also of Ga. Tech, has considerably more potential, in my opinion. It would be interesting to see how these rankings correlate with future NBA achievements.

page 2 of 2

Author Message
DSMok1



Joined: 05 Aug 2009
Posts: 608
Location: Where the wind comes sweeping down the plains

PostPosted: Mon Apr 19, 2010 8:15 am Post subject: Reply with quote
tpryan wrote:
Well, I haven't been following this thread but I see that you have some interesting numbers, such as John Wall at #49 and Derrick Favors at #126, with both expected to be lottery picks and Wall perhaps going first in the draft. As a Ga. Tech fan, I was also interested to see Gani Lawal at #561. Personally, I can't see Lawal, a junior who has declared, as an NBA player who will contribute to any degree because he has very limited range on his shot and he is a poor free throw shooter. That might be okay if he were 7-3, but he is 6-9. Favors, also of Ga. Tech, has considerably more potential, in my opinion. It would be interesting to see how these rankings correlate with future NBA achievements.


They probably don't correlate well... first, they don't measure NCAA achievement too well, because the SPM weights were derived for the NBA. Second, these are retrodictive, not predictive. A 18 year old freshman with the same numbers as a 24 year old senior is a far better prospect! Also, Favors (for instance) had poor point guards, so he was rarely given the ball in position to score. Note how much better Cousins' stats were!
Post Reply