permaximum wrote:
One thing first, I pretty much get the same RMSE for all in-sample seasons.
Well, I'm wondering whether you could post such numbers, say from 2001 to 2014, just to get a grasp on what you mean by "pretty much the same". Last time we talked about something like that, you proclaimed an SD of 5.7 to be not significantly different from an SD of 2.4.
I just checked the SD of the RMSE for a metric with (A) and without (B) in-sample information over that 2001 to 2014 interval (using 2000 information to predict the 2001 performances, up to 2013 data for the 2014 results). Both A and B come out at 0.4; in fact, A and B have an R² of 0.79 at the team level and about 0.94 for the overall league-level RMSE (season to season). When I compare the predictive power via RMSE for metric A in y1 over 2001 to 2014 vs. 1987 to 2000, where the latter interval would be out-of-sample, the RMSE differs by just 0.06 (worse for the out-of-sample interval) while the SD is 0.41. Looking at such numbers, I can't say with great certainty whether A contains in-sample information for a specific interval or not.

When I compare the 2nd-year performances, though, B becomes slightly better than A, and substantially better in the 3rd year. If you don't have a good metric without in-sample data and therefore need that in-sample data to become relevant, your test can be enough; but if the metric isn't that bad at predicting y1 without in-sample data (so, better than PER or WP48 for that matter), the boost from the in-sample data wouldn't be noticed "easily". To put a number on it: the best 7 years in terms of RMSE for metric A in the 1987 to 2000 sample show, on average, an RMSE that is 0.45 better than the worst 7 years of the 2001 to 2014 sample (mind you, 2001 to 2014 would be in-sample). Overall, the boost from the in-sample information might be just enough to "win" such a contest for y1, maybe even do very well in y2, but it would get exposed by y3 and y4.
The test was based on adjusted per 100 possession scoring differences.
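In case it is unclear what kind of check I mean, here is a minimal sketch in Python, assuming predicted and actual team ratings per season sit in simple arrays (the container names pred_A, pred_B, actual are just placeholders for however the data is stored):

```python
import numpy as np

def season_rmse(pred, actual):
    """RMSE of predicted vs. actual team ratings for a single season."""
    pred, actual = np.asarray(pred, float), np.asarray(actual, float)
    return np.sqrt(np.mean((pred - actual) ** 2))

def rmse_profile(pred_by_season, actual_by_season, seasons):
    """Mean and SD of the season-level RMSEs over a range of seasons."""
    rmses = np.array([season_rmse(pred_by_season[s], actual_by_season[s])
                      for s in seasons])
    return rmses.mean(), rmses.std(ddof=1)

# Hypothetical usage, e.g. metric A vs. metric B on the 2001-2014 interval:
# mean_A, sd_A = rmse_profile(pred_A, actual, range(2001, 2015))
# mean_B, sd_B = rmse_profile(pred_B, actual, range(2001, 2015))
```

Using the sample SD over seasons (ddof=1) is just one choice here; over 14 seasons it barely matters.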
From my perspective, going by your comments so far, either you haven't checked that on a bigger sample at all and are thus pulling some stuff out of thin air here, or you not only used in-sample information to derive one set of coefficients but actually have different coefficients for each season depending on the fit, or you have a weird understanding of what "pretty much the same" means.
permaximum wrote:
Even at 15 games into 2014-15 you can differentiate better metric from worse pretty easily via RMSE check.
Pretty bold statement, given that I know for sure that a metric which is better on average may still show a worse RMSE in a particular season than a metric which is worse on average. Though I'm talking about metrics here which are a bit closer in terms of overall predictive power (say, 2.4 vs. 2.7 RMSE).
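To illustrate that with a toy example (purely hypothetical numbers, treating the two metrics' season RMSEs as independent normal draws, which real data of course isn't exactly):

```python
import numpy as np

rng = np.random.default_rng(0)
n_seasons = 100_000  # simulated single seasons

# Hypothetical long-run behaviour: mean RMSE 2.4 vs. 2.7, season-to-season SD of 0.4 each
rmse_better = rng.normal(2.4, 0.4, n_seasons)
rmse_worse = rng.normal(2.7, 0.4, n_seasons)

# Share of single seasons in which the on-average worse metric posts the lower RMSE
print(np.mean(rmse_worse < rmse_better))  # roughly 0.3 under these assumptions
```

Correlation between the two metrics' season RMSEs would reduce that flip rate, but a single season, let alone 15 games, just isn't much evidence.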
permaximum wrote:
Back to the real thing, well, we finally found out where the problem lies with this in-sample, out-of-sample thing. Miscommunication. I always assumed Neil would use Season-1 to predict Season-0. If he'll go back to the point of S-4, that's another story. Besides, I don't think going that far back is a good thing. He will have to use too many average, replacement or actual values for missing players, which is going to skew the test imo.
Ok, but how would 2014 be out-of-sample in such a case, if you "believed" it would be used to "predict" the 2013 results? Given your strong comment about that specific season earlier, you may understand that something seems to be very off with your "explanation". But let us work with the assumption that you indeed just misunderstood the design of the test: can you agree that, from an outside perspective, it looked like you had no clue what in-sample and out-of-sample actually mean?
permaximum wrote:
And for your claim about "logical fallacies"... Prove it or.. you know. Come on pal, do something besides talking. All I see is failure from you... Failure in prediction contests, failure in calculating a simple ridge regression via cross validation, failure in actually showing a real work here.
What I sometimes wonder is what people like you get out of such crap. What do you want to achieve with such comments? Should I feel bad because I made mistakes, or should I feel deeply offended? Life is a myriad of mistakes; handling them in an honest way, with the will to improve by learning from them, seems to me a better approach than feeling bad about yourself or getting overly emotional over comments made by random people on the internet. Or do you have the feeling that writing things like that earns you bonus points with others who may have shown a dislike towards me in the past?
Btw, I suggest using a blend built from this equation:
- build z-score-like values for WS/48 and BPM, using this:
WS-z = (WS/48-0.1)/0.075
BPM-z = BPM/3
then plug that into this:
player_rating = 0.7*WS-z + 1.2*BPM-z + 0.1*mpg - 2.5
Pretty simple solution for a blend, I found. I also tested it against "your metric" and found that it covers about 96.5% of the variance (unweighted; about 99% minute-weighted) when using the numbers you presented in this thread for 2015 at two different stages. The blend can also be given a team-level adjustment and be normalized to a desired SD afterwards.
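For convenience, here is that blend as a small Python function; the input names (ws48, bpm, mpg) are just placeholders for however the columns are stored on your end:

```python
import numpy as np

def blend_rating(ws48, bpm, mpg):
    """Blend of WS/48 and BPM as given above; inputs are arrays of equal length."""
    ws_z = (np.asarray(ws48) - 0.1) / 0.075   # z-score-like scaling of WS/48
    bpm_z = np.asarray(bpm) / 3.0             # z-score-like scaling of BPM
    return 0.7 * ws_z + 1.2 * bpm_z + 0.1 * np.asarray(mpg) - 2.5
```

The team-level adjustment and the rescaling to a desired SD would then be applied on top of that output; a sketch of the rescaling follows below.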
Also, one other thing: the likely better predictive results of "your metric" in comparison to BPM are clearly driven by regression to the mean, where you have some sort of individual "shrinkage" based on the other stuff included. If you bring the BPM SD down to the SD of your metric (controlled via a WS-like approach), the difference in "predictive power" will shrink.
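As a crude stand-in for that kind of shrinkage (a uniform, optionally minute-weighted pull toward the league mean, not BPM's actual construction), something like this would bring the BPM spread down to the SD of another metric:

```python
import numpy as np

def rescale_to_sd(values, target_sd, minutes=None):
    """Shrink (or stretch) ratings around their mean until their SD matches target_sd.

    minutes: optional weights for a minute-weighted mean/SD; None means unweighted.
    """
    values = np.asarray(values, float)
    mean = np.average(values, weights=minutes)
    sd = np.sqrt(np.average((values - mean) ** 2, weights=minutes))
    return mean + (values - mean) * (target_sd / sd)
```

Comparing the next-season RMSE of raw BPM against such an SD-matched BPM should show the gap to "your metric" narrowing, if the shrinkage is indeed doing most of the work.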