APBR-DraftExpress 2015 NBA Draft Project

Home for all your discussion of basketball statistical analysis.
Post Reply
ampersand5
Posts: 262
Joined: Sun Nov 23, 2014 6:18 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by ampersand5 »

nrestifo wrote:I guess I'll go first. Let me know if this is appropriate as a summary/bio. I tried my best to keep it short, general, and accessible.



My name is Nick Restifo. In my basketball life, I write for Nylon Calculus and am a special assistant for the D2 powerhouse that is the University of New Haven Chargers. If you like, you can follow me on Twitter at @itsastat.
My overall predictions are based on an ensemble of four base models predicting a two year career peak blend of RAPM and Win Shares. The four models include a regression and a bagged neural network (to help with stability) trained on two different subsets of data; all prospects with statistics listed on DraftExpress since 2001-2002 and just those prospects that were actually drafted since 2001-2002. I use standing reach, RSCI high school rank, standing vertical leap, lane agility test time, true shooting percentage, points per 40 minutes pace adjusted, total rebounds per 40 minutes pace adjusted, assists per 40 minutes pace adjusted, steals per 40 minutes pace adjusted, blocks per 40 minutes pace adjusted, turnovers per 40 minutes pace adjusted, personal fouls per 40 minutes pace adjusted, minutes per game, age on February 1st of a player’s draft year, strength of schedule, and percentage of points from three (to account for some spacing benefits). I average an entire player's pre-NBA career, each year weighted by minutes played. Unlike other models, I do not assign any extra weight to the most recent years. The most recent years only get weighed more if the player played more in those years, (and this is usually the case anyway). For the vast amount of missing data for the players who did not participate in the combine, I impute regression based estimates of body dimensions (hand length, body fat, etc) based on listed height and weight. Body dimensions are mostly very easy to estimate, for obvious reasons. For the vertical and agility tests, I impute missing values via decision trees trained on a player's age and body dimensions.
Each model in my ensemble plays an individual role conceptually. By applying two of the base models to some 20,000 plus prospects since 2002, I set the framework for my model to have the ability be applied to any basketball player anywhere, not just those who make top 100 prospects lists. For those prospects that never play in the NBA and don’t have RAPM and Win Shares values, I fill those missing values with -4 and 0 respectively, which are each very close to the absolute minimum career peaks of all NBA players since 2001-2002. Only a handful of NBA players have ever peaked at below -4 RAPM or 0 Win Shares. For those prospects without an RSCI high school rank, I fill those missing values with 600, which is a very rough estimate of what the average high school rank would be for the remaining unranked prospects each year. The problem with developing these models on every potential player is that, in conjunction with imputing all these missing values, the models become not only a reflection of NBA success, but also a reflection of whether or not a player will be drafted, which isn’t always the same thing.
To counter these effects, I trained two additional base models just on players who were drafted. Though some of these players also never played in the NBA, these models are able to get a better handle on whether or not a player will actually succeed in the NBA, more independent of the sometimes clouding effects of what will get a player drafted in the NBA.
In comparison to other models, since I include high school ranking as a variable, my model will favor those highly heralded high school players significantly more than other models. High school rank is an especially important predictor in the regression model trained on all available prospects This results in additional predicted value for the highly ranked high schoolers that might not be as favored in other models, players like Cliff Alexander and Myles Turner, and less predicted value for unranked high school players that do a little better in other models, like Frank Kaminsky. (In Frank Kaminsky’s case in particular, he does not do very well by the models trained on all prospects, but does considerably better by the models trained on just those prospects who were drafted.)
With regards to methodology, my ensemble has its strengths and weaknesses like any other prediction system. I used neural networks as part of my ensemble because they were the most accurate out of sample prediction method on my data, and accuracy is obviously valuable. Neural networks are flexible and often better than other methods like regression at teasing out complex, non-linear relationships amongst the training data, and with regards to draft prospects, neural networks are also good at capturing just how much better the premier players are than the middle class. While neural networks are powerfully accurate, they also tend to overfit training datasets and attach themselves to noise in data in their pursuit of accuracy. To alleviate these concerns, I applied a process known as bagging to my neural networks, which helps to increase the stability of the predictions by taking the consensus of several neural networks over subsets of the training data, rather than a single neural network over the complete training data, as the latter is more likely to interpret noise as signal.
Thanks. Would it be possible to cut this down further (somewhere closer to Layne's length?)? If you prefer, I would be happy to do the trimming...
I think this is a great write up and It should definitely be linked to from the main article so readers can learn more. However, I think for our purposes (due to both length restrictions and attention spans of the average reader), it's too in depth.
ampersand5
Posts: 262
Joined: Sun Nov 23, 2014 6:18 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by ampersand5 »

Statman wrote:
ampersand5 wrote:Just a reminder that tomorrow is May 20th - the day each contributor's write up is due.

If people need to make further edits after tomorrow but before the 25th, that will likely be ok. I just need to start getting the full article/write up organized and making everything look uniform and cohesive.

While there is no template, here is what I wrote to one poster:
a quick bio followed by what your model is conceptually trying to display (and why this is important), how it actually works (any problems it might have), players who do really well in it/players who fair really poorly in it (and why).
I suggest making the write up short, and if possible, provide a link to your website where readers can learn more about you/the model.
I'm sorry I won't be able to make that deadline. I still plan on being involved, but it'd be closer to week 2 of June if it happens.

I really appreciate your helping pull together these different models - hopefully the info you pull together will be a real go to for fans come draft time (twitter?) who are looking for something much more than mocks that are merely trying to mirror the draft as oppose to ranking based on projected relative pro production/performance.
No problem. Hopefully this provides increased exposure to both the APBR community and all the brilliant work done here.
ampersand5
Posts: 262
Joined: Sun Nov 23, 2014 6:18 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by ampersand5 »

Nathan wrote:Sorry guys, I don't think I'll have something by the deadline. Last weekend was graduation weekend for me and I won't be getting home until the 23rd at earliest. I'll post my results here when I have them though, hopefully within the next two weeks or so. Looking forward to seeing the DX article though!
Congratulations Nathan!
I look forward to seeing your rankings regardless.
DSMok1
Posts: 1119
Joined: Thu Apr 14, 2011 11:18 pm
Location: Maine
Contact:

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by DSMok1 »

Here is my writeup:
Daniel Myers:
I am a structural engineer by trade, born and raised in Oklahoma but now living in Maine. I have always been a math nerd (and Excel whiz), and started dabbling in advanced sports statistics around 2007. I started posting on the APBRmetrics forum in 2009, and currently am the acting administrator. My focus is to be open with my work and very aware of the limitations and weaknesses of our statistics.

Box Plus/Minus (BPM) is my contribution to this project, but it is not a projection system at all. Rather, it is perhaps the best public metric for measuring actual production at the college level. The ranking published here is simply a ranking by BPM, which evaluates player production per possession. The full derivation and methodology is available at http://www.basketball-reference.com/about/bpm.html.

BPM was developed by regressing advanced box score stats onto long term Regularized Adjusted Plus/Minus (RAPM). This was done using NBA data (no RAPM is available for the NCAA), but the values of each statistic should be valid at the NCAA level as well. BPM is adjusted for context and strength of schedule.

Full BPM data for the NCAA are available through Sports Reference / College Basketball’s Player Season Finder.

Treat BPM not as a projection of NBA ability, but rather context for the other models: has the player produced in college? Why or why not would we expect that to translate to the NBA? If they produce well in the NCAA as a freshman, that's a great indicator.
And here is my listing:

Code: Select all

╔══════╦═════════════════════════╦══════════════════════════╦══════════╦═══════╦══════╦══════╦══════╗
║ Rank ║  Basketball Reference   ║          School          ║ Position ║ Class ║ OBPM ║ DBPM ║ BPM  ║
╠══════╬═════════════════════════╬══════════════════════════╬══════════╬═══════╬══════╬══════╬══════╣
║    1 ║ Karl-Anthony Towns      ║ Kentucky                 ║ PF/C     ║ FR    ║ 6.5  ║ 10.8 ║ 17.3 ║
║    2 ║ Delon Wright            ║ Utah                     ║ PG       ║ SR    ║ 8.8  ║ 7.4  ║ 16.2 ║
║    3 ║ Frank Kaminsky          ║ Wisconsin                ║ PF       ║ SR    ║ 9.5  ║ 5.8  ║ 15.3 ║
║    4 ║ Willie Cauley-Stein     ║ Kentucky                 ║ C        ║ JR    ║ 4.7  ║ 10   ║ 14.7 ║
║    5 ║ Seth Tuttle             ║ Northern Iowa            ║ F        ║ SR    ║ 7.8  ║ 5.2  ║ 13   ║
║    6 ║ T.J. McConnell          ║ Arizona                  ║ PG       ║ SR    ║ 7    ║ 5.7  ║ 12.7 ║
║    7 ║ Justin Anderson         ║ Virginia                 ║ SF       ║ JR    ║ 7.6  ║ 4.6  ║ 12.2 ║
║    8 ║ Aaron White             ║ Iowa                     ║ PF       ║ SR    ║ 8.5  ║ 3.3  ║ 11.8 ║
║    9 ║ D'Angelo Russell        ║ Ohio State               ║ PG/SG    ║ FR    ║ 8.7  ║ 3    ║ 11.7 ║
║   10 ║ Rondae Hollis-Jefferson ║ Arizona                  ║ SF       ║ SO    ║ 4.4  ║ 6.8  ║ 11.2 ║
║   11 ║ Jahlil Okafor           ║ Duke                     ║ C        ║ FR    ║ 6.8  ║ 4.1  ║ 10.9 ║
║   12 ║ Derrick Marks           ║ Boise State              ║ SG       ║ SR    ║ 7.9  ║ 2.9  ║ 10.9 ║
║   13 ║ Justise Winslow         ║ Duke                     ║ SF       ║ FR    ║ 5.1  ║ 5.3  ║ 10.4 ║
║   14 ║ Joshua Smith            ║ Georgetown               ║ C        ║ SR    ║ 6    ║ 4.4  ║ 10.4 ║
║   15 ║ Stanley Johnson         ║ Arizona                  ║ SF       ║ FR    ║ 4.8  ║ 5.4  ║ 10.2 ║
║   16 ║ Dakari Johnson          ║ Kentucky                 ║ C        ║ SO    ║ 3    ║ 7.2  ║ 10.2 ║
║   17 ║ Darrun Hilliard         ║ Villanova                ║ SF       ║ SR    ║ 6.7  ║ 3.5  ║ 10.2 ║
║   18 ║ Sam Dekker              ║ Wisconsin                ║ SF       ║ JR    ║ 8.2  ║ 1.8  ║ 10   ║
║   19 ║ Quinn Cook              ║ Duke                     ║ PG       ║ SR    ║ 8.2  ║ 1.5  ║ 9.6  ║
║   20 ║ Rakeem Christmas        ║ Syracuse                 ║ PF/C     ║ SR    ║ 3.4  ║ 6.2  ║ 9.6  ║
║   21 ║ Branden Dawson          ║ Michigan State           ║ SF       ║ SR    ║ 2.2  ║ 7.3  ║ 9.5  ║
║   22 ║ Tyus Jones              ║ Duke                     ║ PG       ║ FR    ║ 6.6  ║ 2.8  ║ 9.4  ║
║   23 ║ Wesley Saunders         ║ Harvard                  ║ SG       ║ SR    ║ 6    ║ 3.3  ║ 9.3  ║
║   24 ║ Richaun Holmes          ║ Bowling Green State      ║ PF       ║ SR    ║ 3.6  ║ 5.7  ║ 9.3  ║
║   25 ║ Montrezl Harrell        ║ Louisville               ║ PF/C     ║ JR    ║ 4.6  ║ 4.7  ║ 9.2  ║
║   26 ║ Bobby Portis            ║ Arkansas                 ║ PF       ║ SO    ║ 5.8  ║ 3.1  ║ 9    ║
║   27 ║ Andrew Harrison         ║ Kentucky                 ║ PG/SG    ║ SO    ║ 4.9  ║ 4.1  ║ 9    ║
║   28 ║ Trey Lyles              ║ Kentucky                 ║ PF       ║ FR    ║ 3.9  ║ 5.1  ║ 8.9  ║
║   29 ║ Devin Booker            ║ Kentucky                 ║ C        ║ FR    ║ 6.9  ║ 2    ║ 8.9  ║
║   30 ║ Cameron Payne           ║ Murray State             ║ PG       ║ SO    ║ 8.6  ║ 0.3  ║ 8.9  ║
║   31 ║ Jerian Grant            ║ Notre Dame               ║ PG       ║ SR    ║ 8    ║ 0.9  ║ 8.9  ║
║   32 ║ Aaron Harrison          ║ Kentucky                 ║ SG       ║ SO    ║ 5.2  ║ 3.6  ║ 8.8  ║
║   33 ║ Myles Turner            ║ Texas                    ║ C        ║ FR    ║ 1.5  ║ 7.3  ║ 8.7  ║
║   34 ║ Kelly Oubre             ║ Kansas                   ║ SF       ║ FR    ║ 4.8  ║ 3.7  ║ 8.6  ║
║   35 ║ J.P. Tokoto             ║ North Carolina           ║ SG       ║ JR    ║ 3.3  ║ 5.3  ║ 8.6  ║
║   36 ║ Terry Rozier            ║ Louisville               ║ PG       ║ SO    ║ 4.7  ║ 3.7  ║ 8.4  ║
║   37 ║ Michael Frazier         ║ Florida                  ║ SG       ║ JR    ║ 5.7  ║ 2.8  ║ 8.4  ║
║   38 ║ Corey Hawkins           ║ California-Davis         ║ SG       ║ SR    ║ 8.5  ║ -0.1 ║ 8.4  ║
║   39 ║ Ryan Boatright          ║ Connecticut              ║ PG       ║ SR    ║ 7.5  ║ 0.8  ║ 8.3  ║
║   40 ║ Kevon Looney            ║ UCLA                     ║ PF       ║ FR    ║ 4.2  ║ 4    ║ 8.2  ║
║   41 ║ Pat Connaughton         ║ Notre Dame               ║ SG       ║ SR    ║ 6.1  ║ 2.1  ║ 8.2  ║
║   42 ║ Cliff Alexander         ║ Kansas                   ║ PF/C     ║ FR    ║ 3.2  ║ 5    ║ 8.1  ║
║   43 ║ Robert Upshaw           ║ Washington               ║ C        ║ SO    ║ -0.3 ║ 8.3  ║ 8    ║
║   44 ║ Josh Richardson         ║ Tennessee                ║ G        ║ SR    ║ 5.8  ║ 2    ║ 7.8  ║
║   45 ║ Jonathan Holmes         ║ Texas                    ║ SF       ║ SR    ║ 4.4  ║ 3.4  ║ 7.8  ║
║   46 ║ TaShawn Thomas          ║ Oklahoma                 ║ PF       ║ SR    ║ 1.8  ║ 5.8  ║ 7.7  ║
║   47 ║ Travis Trice            ║ Michigan State           ║ G        ║ SR    ║ 6.9  ║ 0.8  ║ 7.7  ║
║   48 ║ Chasson Randle          ║ Stanford                 ║ SG       ║ SR    ║ 6.9  ║ 0.3  ║ 7.3  ║
║   49 ║ D.J. Newbill            ║ Penn State               ║ SG       ║ SR    ║ 6.2  ║ 1    ║ 7.3  ║
║   50 ║ Anthony Brown           ║ Stanford                 ║ F        ║ SR    ║ 5.1  ║ 1.9  ║ 7    ║
║   51 ║ Brandon Ashley          ║ Arizona                  ║ PF       ║ JR    ║ 3.2  ║ 3.6  ║ 6.9  ║
║   52 ║ Treveon Graham          ║ Virginia Commonwealth    ║ SG       ║ SR    ║ 6.6  ║ 0.3  ║ 6.9  ║
║   53 ║ Chris McCullough        ║ Syracuse                 ║ PF       ║ FR    ║ -1.4 ║ 8.2  ║ 6.8  ║
║   54 ║ R.J. Hunter             ║ Georgia State            ║ SG       ║ JR    ║ 5.4  ║ 1.3  ║ 6.7  ║
║   55 ║ Larry Nance             ║ Wyoming                  ║ PF       ║ SR    ║ 2.4  ║ 4.3  ║ 6.7  ║
║   56 ║ Michael Qualls          ║ Arkansas                 ║ SG       ║ JR    ║ 5.5  ║ 0.6  ║ 6.2  ║
║   57 ║ Keifer Sykes            ║ Green Bay                ║ PG       ║ SR    ║ 5.2  ║ 1    ║ 6.2  ║
║   58 ║ Joseph Young            ║ Oregon                   ║ SG       ║ SR    ║ 6.7  ║ -0.7 ║ 6.1  ║
║   59 ║ Olivier Hanlan          ║ Boston College           ║ PG/SG    ║ JR    ║ 6.4  ║ -0.3 ║ 6    ║
║   60 ║ Norman Powell           ║ UCLA                     ║ SG       ║ SR    ║ 3.5  ║ 2.4  ║ 5.8  ║
║   61 ║ Alan Williams           ║ California-Santa Barbara ║ C        ║ SR    ║ 1.3  ║ 4.5  ║ 5.8  ║
║   62 ║ Shannon Scott           ║ Ohio State               ║ PG       ║ SR    ║ 3    ║ 2.7  ║ 5.7  ║
║   63 ║ Marcus Thornton         ║ Georgia                  ║ F        ║ SR    ║ 1.7  ║ 4    ║ 5.7  ║
║   64 ║ Terran Petteway         ║ Nebraska                 ║ SG       ║ JR    ║ 3.1  ║ 2.5  ║ 5.6  ║
║   65 ║ Dezmine Wells           ║ Maryland                 ║ SG       ║ SR    ║ 2.6  ║ 3    ║ 5.6  ║
║   66 ║ Jordan Mickey           ║ Louisiana State          ║ PF/C     ║ SO    ║ -0.8 ║ 6.3  ║ 5.5  ║
║   67 ║ Christian Wood          ║ Nevada-Las Vegas         ║ PF       ║ SO    ║ 2.2  ║ 3.2  ║ 5.4  ║
║   68 ║ Jarell Martin           ║ Louisiana State          ║ PF       ║ SO    ║ 3.1  ║ 2.2  ║ 5.3  ║
║   69 ║ Vince Hunter            ║ Texas-El Paso            ║ PF       ║ SO    ║ 0.9  ║ 3.8  ║ 4.7  ║
║   70 ║ Juwan Staten            ║ West Virginia            ║ PG       ║ SR    ║ 4.1  ║ 0    ║ 4.1  ║
║   71 ║ Tyler Harvey            ║ Eastern Washington       ║ G        ║ JR    ║ 7.6  ║ -4.7 ║ 2.9  ║
║   72 ║ Le'Bryan Nash           ║ Oklahoma State           ║ SF       ║ JR    ║ 0.7  ║ 2.1  ║ 2.8  ║
║   73 ║ Julian Washburn         ║ Texas-El Paso            ║ F        ║ SR    ║ 1    ║ 1.5  ║ 2.5  ║
║   74 ║ Chris Walker            ║ Florida                  ║ G        ║ SO    ║ -3.2 ║ 5.4  ║ 2.2  ║
║   75 ║ Rashad Vaughn           ║ Nevada-Las Vegas         ║ SG       ║ FR    ║ 3    ║ -1.8 ║ 1.3  ║
╚══════╩═════════════════════════╩══════════════════════════╩══════════╩═══════╩══════╩══════╩══════╝
[/size]
Developer of Box Plus/Minus
APBRmetrics Forum Administrator
Twitter.com/DSMok1
masseffectlenk
Posts: 3
Joined: Sun May 10, 2015 4:46 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by masseffectlenk »

Here is my writeup. I found a way to modify my models to reflect expectation.
Name is Masseffectlenk, and I am a graduate student in bioengineering in the West Coast.

The model uses basic box score stats that are on a pace-adjusted, per 40 minute basis. Rates are specifically used--three point rate, free throw rate, assist rate, usage rate--as well as height, weight, and age (with a more specific weighting based on age). The regressions are informed by nearly a thousand NBA players who have played over 100+ NBA minutes and were drafted post-2005 to get a decade wide span. The stats are mapped to the average offensive and defensive win shares/RPM, and similarity scores based on the overall stats are used to determine the weightings. The most recent model is informed by recent at-rim shots per 40 minutes and dunk rate for each player in the past three seasons, and adds on the athletic component--this only applies to NCAA players this past season. Spreadsheet for data can be found here: https://docs.google.com/spreadsheets/d/ ... 1967060028.

The athletic regression is used specifically to dock players who post deceptively athletic box score stats but lack athleticism otherwise (Jordan Adams) or elevate those who are more athletic than their box score stats indicate (Jordan Clarkson, Norman Powell this year).
And here is my listing:

Code: Select all

75 Name Rank:	Overall Rank:		OFF	DEF	TOT
1	1	Jahlil Okafor	1.510	1.510	3.020
2	2	Karl Towns	1.525	1.468	2.993
3	3	D'Angelo Russell	1.773	1.197	2.970
4	4	Justise Winslow	1.602	1.368	2.969
5	5	Emmanuel Mudiay	1.698	1.266	2.965
6	6	Stanley Johnson	1.612	1.350	2.962
7	8	Myles Turner	1.496	1.418	2.914
8	9	Willie Cauley-Stein	1.369	1.551	2.909
9	10	Tyus Jones	1.745	1.162	2.906
10	11	Christian Wood	1.475	1.426	2.901
11	12	Kevon Looney	1.493	1.386	2.879
12	13	Frank Kaminsky	1.540	1.335	2.875
13	14	Delon Wright	1.638	1.229	2.867
14	15	Bobby Portis	1.497	1.369	2.866
15	16	Robert Upshaw	1.296	1.560	2.856
16	17	Cameron Payne	1.745	1.110	2.855
17	20	Cliff Alexander	1.318	1.509	2.827
18	22	Rondae Hollis-Jefferson	1.457	1.351	2.809
19	23	Wesley Saunders	1.604	1.202	2.806
20	25	Dakari Johnson	1.321	1.482	2.802
21	26	Aaron White	1.499	1.297	2.797
22	27	Richaun Holmes	1.371	1.421	2.792
23	28	Chris McCullough	1.406	1.369	2.774
24	29	RJ Hunter	1.630	1.132	2.762
25	30	Trey Lyles	1.409	1.352	2.761
26	32	Kelly Oubre	1.506	1.250	2.756
27	33	Jerian Grant	1.600	1.155	2.755
28	34	Montrezl Harrell	1.354	1.391	2.745
29	35	Larry Nance Jr	1.444	1.296	2.740
30	37	Kennedy Meeks	1.305	1.432	2.737
31	39	Vince Hunter	1.383	1.345	2.728
32	40	Sam Dekker	1.482	1.231	2.713
33	42	Andrew Harrison	1.601	1.112	2.712
34	44	Joshua Smith	1.297	1.403	2.700
35	50	Derrick Marks	1.571	1.100	2.671
36	51	Josh Richardson	1.560	1.106	2.666
37	52	Darrun Hilliard	1.571	1.095	2.666
38	53	Devin Booker	1.585	1.081	2.666
39	57	Alan Williams	1.297	1.355	2.652
40	64	Branden Dawson	1.299	1.342	2.641
41	67	Jarell Martin	1.327	1.302	2.629
42	69	Terry Rozier	1.569	1.056	2.625
43	71	TJ McConnell	1.522	1.103	2.624
44	74	JP Tokoto	1.420	1.196	2.616
45	77	Chris Walker	1.218	1.380	2.598
46	78	Justin Anderson	1.486	1.109	2.595
47	79	Rashad Vaughn	1.497	1.096	2.592
48	80	Keifer Sykes	1.548	1.041	2.590
49	83	Norman Powell	1.455	1.116	2.571
50	85	Tashawn Thomas	1.229	1.340	2.569
51	87	Quinn Cook	1.537	1.023	2.560
52	88	Pat Connaughton	1.415	1.143	2.558
53	89	Jordan Mickey	1.198	1.358	2.556
54	93	Michael Qualls	1.463	1.071	2.534
55	95	Brandon Ashley	1.275	1.257	2.532
56	98	Tyler Harvey	1.520	1.006	2.526
57	99	Chasson Randle	1.521	1.002	2.523
58	100	Ryan Boatright	1.524	0.997	2.521
59	101	Michael Frazier	1.493	1.027	2.520
60	102	Rakeem Christmas	1.146	1.373	2.518
61	103	Travis Trice	1.502	1.009	2.511
62	104	Aaron Harrison	1.483	1.025	2.508
63	105	Shannon Scott	1.429	1.077	2.506
64	106	Olivier Hanlan	1.502	1.002	2.504
65	108	Juwan Staten	1.408	1.076	2.484
66	110	LeBryan Nash	1.242	1.229	2.472
67	111	Dez Wells	1.354	1.115	2.468
68	112	Marcus Thornton (GA)	1.169	1.299	2.468
69	113	Joseph Young	1.465	0.997	2.462
70	114	Anthony Brown	1.374	1.071	2.445
71	116	Jonathan Holmes	1.242	1.191	2.433
72	118	DJ Newbill	1.394	1.035	2.430
73	119	Corey Hawkins	1.423	0.996	2.419
74	121	Terran Petteway	1.314	1.091	2.405
75	124	Julian Washburn	1.281	1.078	2.359
					
					
	OTHERS:				
	7	George Lucas	1.614	1.305	2.919
	18	Seth Tuttle	1.505	1.327	2.833
	19	Corey Walden	1.659	1.168	2.827
	21	Aleksandar Vezenkov	1.589	1.229	2.822
	24	Alpha Kaba	1.409	1.397	2.805
	31	Nedim Buza	1.513	1.245	2.758
	36	Mario Hezonja	1.572	1.167	2.739
	38	Kristaps Porzingis	1.437	1.295	2.731
	41	Guillermo Hernangomez	1.397	1.316	2.713
	43	Nikola Milutinov	1.285	1.421	2.705
	45	Denzel Livingston	1.520	1.159	2.679
	46	Satnam Singh Bharama	1.291	1.388	2.678
	47	Wael Arakji	1.510	1.168	2.678
	48	Cedi Osman	1.463	1.214	2.677
	49	Rayvonte Rice	1.492	1.179	2.671
	54	Marc Garcia	1.497	1.162	2.660
	55	Andzejs Pasecniks	1.380	1.275	2.656
	56	Guillem Vives	1.522	1.133	2.655
	58	Timothe Luwawu	1.506	1.145	2.651
	59	Lucas Dias	1.417	1.232	2.649
	60	Ailun Guo	1.512	1.137	2.649
	61	 Simone Fontecchio 	1.472	1.175	2.647
	62	Dimitris Agravanis	1.384	1.261	2.646
	63	Luka Mitrovic	1.440	1.205	2.645
	65	Royce O'Neale	1.509	1.125	2.634
	66	Oriol Pauli	1.431	1.201	2.632
	68	 Charles Jackson 	1.197	1.428	2.626
	70	Mouhammadou Jaiteh	1.247	1.378	2.625
	72	JJ Avila	1.347	1.270	2.617
	73	Rade Zagorac	1.481	1.136	2.617
	75	Kevin Harley	1.427	1.188	2.614
	76	Moussa Diagne	1.243	1.363	2.605
	81	Adin Vrabac	1.351	1.235	2.587
	82	Andrey Desyatnikov	1.217	1.363	2.580
	84	Arturas Gudaitis	1.194	1.376	2.570
	86	Nikola Radicevic	1.458	1.103	2.561
	90	Kevin Pangos	1.545	1.007	2.552
	91	Treveon Graham	1.400	1.142	2.542
	92	Briante Weber	1.435	1.105	2.540
	94	Cady Lalanne	1.120	1.413	2.534
	96	Paul Zipser	1.359	1.173	2.532
	97	Daniel Diez	1.352	1.177	2.529
	107	Beka Burjanadze	1.374	1.114	2.488
	109	Walter Pitchford	1.238	1.236	2.475
	115	Andre Hollins	1.419	1.018	2.436
	117	Chad Frazier	1.394	1.036	2.430
	120	Jherrod Stiggers	1.365	1.044	2.409
	122	Todd Mayo	1.292	1.079	2.372
	123	Aaron Thomas	1.272	1.087	2.359
	125	Trevor Lacey	1.323	0.974	2.297
[/size]
ampersand5
Posts: 262
Joined: Sun Nov 23, 2014 6:18 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by ampersand5 »

Amazing stuff everyone.

We are just waiting now for Jesse (who will be posting tonight) and Steve Shea (who I'm emailing now).

Some questions:
We will be showing everyone's ranking and a consensus rank. Do you think we should also be showing how many spots a players gains/loses in comparison to the DE mock? Further, do you think we should create an additional ranking that is a blend between the consensus rank and the DE mock?

I guess this is directly addressed to Dsmok1 - but everyone's thoughts are welcome. Do you want your ranking to be included in the same chart as everyone else?
I thought your piece would be better served at either the beginning or end of the article, and actually independently display all of the rankings and values (likely in image formatting). Thoughts?

Lastly, I have written a brief introduction to the article. Once I have everything submitted, I would like to create a Google Doc with anonymous editing enabled. That way, people can add the things they think are missing/necessary, while taking away the stuff they think doesn't work. I find tremendous value in crowdsourcing these sorts of things, so I would appreciate everyone's contributions.
Barncore
Posts: 25
Joined: Wed Mar 13, 2013 7:16 am

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by Barncore »

Long time lurker chiming in here...
ampersand5 wrote:Do you think we should also be showing how many spots a players gains/loses in comparison to the DE mock? Further, do you think we should create an additional ranking that is a blend between the consensus rank and the DE mock?
For sure, i would like to see that.
Although i think it would make more sense to compare it to the DE top 100 big board, as that is more of a ranking. Whereas the mock is more focused on satisfying team needs.

Also, i don't mean to be disrespectful or anything, only trying to give feedback (from the perspective of your target market - a hardcore draft nut who doesn't make regression models himself). It's gonna be hard to trust the overall average ranking of all the models together, if one of them isn't actually a projection model, but only a ranking of college production (DSMok1). There have been plenty of players who were productive in college but not in NBA (Derrick Williams, Charles Jenkins, Jon Leuer, Marshon Brooks, Jajuan Johnson, etc, there are thousands of examples). Having a college production model will skew the results.
nrestifo
Posts: 52
Joined: Tue Oct 07, 2014 1:23 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by nrestifo »

ampersand5 wrote:Thanks. Would it be possible to cut this down further (somewhere closer to Layne's length?)? If you prefer, I would be happy to do the trimming...
I think this is a great write up and It should definitely be linked to from the main article so readers can learn more. However, I think for our purposes (due to both length restrictions and attention spans of the average reader), it's too in depth.
Does this work? If it doesn't, please feel free to edit as you see fit. Or I can shorten some more. Either way.

My name is Nick Restifo. In my basketball life, I write for Nylon Calculus and am a special assistant for the D2 powerhouse that is the University of New Haven Chargers. If you like, you can follow me on Twitter at @itsastat.
My overall predictions are based on an ensemble of four base models predicting a two year career peak blend of RAPM and Win Shares. The ensemble takes input from a regression based model and a bagged neural network trained on two different subsets of data; all prospects with statistics listed on DraftExpress since 2001-2002 and just those prospects that were actually drafted since 2001-2002 (a total of four base models). I use RSCI high school rank, combine measurements and tests, pace and per minute adjusted box score statistics, minutes per game, age on February 1st of a player’s draft year, strength of schedule, and percentage of points from three (to account for some spacing benefits). I average an entire player's pre-NBA career, each year weighted by minutes played. For the vast amount of missing data for the players who did not participate in the combine, I impute regression based estimates of body dimensions (hand length, body fat, etc) based on listed height and weight. For the vertical and agility tests, I impute missing values via decision trees trained on a player's age and body dimensions.
In comparison to other models, since I include high school ranking as a variable, my model will favor those highly heralded high school players significantly more than other models. High school rank is an especially important predictor in the regression model trained on all available prospects and this results in additional predicted value for the highly ranked high schoolers that might not be as favored in other models, players like Cliff Alexander and Myles Turner, and less predicted value for unranked high school players that do a little better in other models, like Frank Kaminsky.
jessefis
Posts: 15
Joined: Sun Aug 24, 2014 5:18 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by jessefis »

Here is initial summary. I might need to change it a bit after I completely finish my models this weekend (at which point I will post my projections). I initially wrote a ton more but I cut most all of it out and will assume we can link a more thorough explanation of things on my blog.

My name is Jesse Fischer. I work as a Senior Software Engineer at Amazon. My academic background includes a degree in Computer Engineering with a minor in Mathematics from the University of Washington. The last couple of years I have started to explore the sports analytics field a bit and I'm still figuring out where that takes me - when the Sonics come back to town I may have no other choice. I blog on www.tothemean.com, please follow me on twitter @jessefischer33 if you like my work.

My "Longevity" draft model optimizes for "long term value" as defined by a players max 5 year "Value over Replacement Player" (VORP) which is based on BPM: http://www.basketball-reference.com/about/bpm.html. VORP accounts for playing time allowing injuries/durability to be factored in which is important when measuring longevity. To account for players who are still playing (and most importantly the players who haven't hit their 5 year peak yet) I have a separate "Predicted VORP" model (based on age, VORP trajectory, playing time trajectory, max single season VORP, etc) which predicts a players max 5 year value based upon data from his career thus far.

The actual "Longevity" model is based upon typical public data: college stats (multiple years), team stats, combine data, etc. It also includes actual/expected draft position to take into account real life scouting as a factor rather than only blindly using numbers. Data was transformed in intelligent ways to try to account for pace, competition, playing time, teammate quality, among other things I won't mention. The model uses more than just the players who were drafted but also any which have had a remote chance of playing in the NBA (assigning a replacement player value if they didn't make it). To automate this pre-filtering there is an additional "Predicted NBA Player" model which projects the probability of a NCAA player playing in the NBA.

Using a filtered dataset, the "Longevity" model runs as a blend of many different models. The individual models consist of different machine learning algorithms all optimized and tuned in different ways. The other major thing to mention is that my overall model is not at all limited strictly to linear relationships like most public models. There are a lot of details and discussion left out of this summary which can be read about in future posts on my blog: www.tothemean.com.
DSMok1
Posts: 1119
Joined: Thu Apr 14, 2011 11:18 pm
Location: Maine
Contact:

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by DSMok1 »

Cool, a model based on BPM!

As someone noted, my BPM ranks should not be used in the average, but rather used purely as a point of reference.

On another note, I think all of the different write ups should include the following information:
1. The dependent variable, and whether it is peak or first few years, etc.
2. What source of data is used, of which I think there are three classes:
A. Combine and measurement data
B. Boxscore data
C. Recruiting rankings and other scouting data
3. What algorithms were used for the analysis; for instance, BPM used linear regression.
Developer of Box Plus/Minus
APBRmetrics Forum Administrator
Twitter.com/DSMok1
vjl110
Posts: 37
Joined: Sun Sep 23, 2012 4:05 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by vjl110 »

Good idea DSmok

I made some minor changes to fit those criteria:
My name is Layne Vashro. I am presenting my simple “Estimated Wins Peak” (EWP) here, but I've put together a number of different projection models and tools to help evaluate incoming talent. These include several NCAA/International models, a player-season comparison finder, a tool that shows how each statistic has historically translated to the NBA for players under different coaches, and a tool that allows you follow each prospect's progression/regression throughout the season. You can find these over at NylonCalculus under the “Our Stats” tab [http://nyloncalculus.com/stats/].

The goal of the EWP model is to project how good each prospect will be at the peak of his NBA career. In order to do that, I must quantify “peak NBA performance” in some acceptable way. I do this by calculating the number of wins a player is responsible for in each season of his career using a blend of Win Shares (box-score metric) and RAPM (+/- metric). I then use a two-year rolling average and select the highest value as that player's “wins peak”. Here is a link to the list of previously drafted players included in the sample [https://goo.gl/yYmeEQ]. If this list largely agrees with the order in which you would select these players in a redraft, you can at least be comfortable with my model's validity.

To build the model itself, I take the above list of players along with collegiate box-score statistics pulled from draftexpress.com and basketball-reference.com, play-by-play statistics pulled from hoop-math.com, anthropometric information from draftexpress, and a selection of team statistics pulled from sports-reference.com. I then use linear regression to identify what each bit of pre-NBA information says about a player's future peak production in the NBA. This knowledge is then applied to current prospects whose future is still unknown.
Crow
Posts: 10536
Joined: Thu Apr 14, 2011 11:10 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by Crow »

Did any the models use NBA playoff data in any way? If no, that is something I think deserves more consideration next time. Teams are built in win in the playoffs, or should be imo.

I understand the logic of looking at players from last 10-13 years to calibrate the model but has what helps teams win in the playoffs shifted a bit since the beginning of the period? I think it has, towards 3pt shooting and away from offensive rebounding at minimum. And with more importance for three point defense and defending without much fouling. Maybe it doesn't matter that much but some adjustment for recent trends might help.
ampersand5
Posts: 262
Joined: Sun Nov 23, 2014 6:18 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by ampersand5 »

Hi everyone,

Also, if anyone has ideas on where the charts/numbers should be displayed, please share - also, do you think beside each user's summary, there should be a separate image just ranking their top 14?

We are only waiting on Steve's info and Jesse's rankings. I have been in communication with both of them and they should be here shortly.

I will likely be off the internet until Sunday, so I look forward everyone's changes.
Last edited by ampersand5 on Tue May 26, 2015 1:36 pm, edited 1 time in total.
DSMok1
Posts: 1119
Joined: Thu Apr 14, 2011 11:18 pm
Location: Maine
Contact:

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by DSMok1 »

That's a great first draft. Someone needs to go through and do a full copy edit.

I think there should be several articles involved, and this be just the introduction. I would include each model's top 10 as a sidebar beside each writeup.

Further analysis and the overall ratings should be in another article.
Developer of Box Plus/Minus
APBRmetrics Forum Administrator
Twitter.com/DSMok1
steveshea
Posts: 91
Joined: Wed Oct 23, 2013 8:17 pm

Re: APBR-DraftExpress 2015 NBA Draft Project

Post by steveshea »

Sorry for the delay. Here is my writeup. Rankings to follow soon.


College Prospect Rating

College Prospect Rating (CPR) uses a college player’s box score statistics and his class (freshman, etc.) to approximate his NBA potential. It differs significantly from other objective draft models in at least the following ways:

1. CPR does not use regressions. Thus, CPR does not have to make a choice of a dependent variable. This is nice, but not the primary motivation for not using regressions. A typical regression uses information of what has worked in the past to predict what will work in the future. Implicit in the prediction is the assumption that the context of the past will be similar enough to the context of the future. This may not be true in the NBA. The NBA is changing in very measurable ways (such as the percentage of a team’s offense that comes from 3-point shots). CPR hopes to project the players that will succeed in 2016 and beyond, not pick the players that will thrive in the ‘90s.
2. College players are inconsistent. This is most problematic in the freshman season, which is the last college season for some of the top prospects. Some freshmen improve dramatically over the course of the season. Others simply “don’t show up” on occasion. These inconsistencies blur season average numbers. CPR gets around this by focusing only on each individual’s top 10 performances in each statistic.
3. There are no weights on statistics or adjustments for scarcity of position. CPR weights all statistics the same. It simply looks for excellence. Anthony Davis was excellent. Kevin Durant was excellent. The two were superior prospects in very different ways. CPR leaves it to the team to decide what positions they need or perceive to be scarce at the time, and what type of player they want.

In spite of its nonstandard construction (or maybe because of it), CPR has been effective projecting both high picks that busted and late picks that surprised.
Post Reply