Skip to main content

2015 NCAA Basketball Over and Under Achiever Analysis

(This was originally posted to several internet sources in March of 2015)
Hi everyone!
I am new here, but I was encourage by a friend of mine to post some data that I have compiled related to whether certain college coaches are over-rated or under-rated based on NCAA tournament performance. The original question that I tried to answer was "Is Bo Ryan Over-rated." This analysis was primarily meant to answer that question, but I uncovered a lot more.
I have compiled a rather large NCAA tournament database over the years where I essentially have the results of every game since seeding began in 1979 in a spreadsheet such that I can performance various analyses on the data if I wish. It occurred to me over the weekend that it would be possible to measure if a coach is doing better than expected or not based on one of two different methods.
The first method is to compare a coach's record as a certain seed against the historical performance of all teams of that seed. For example, since 1979, 1-seeds have a win-loss record of 447-124 (0.783). Tom Izzo's lifetime record as a 1-seed is 16-3 (0.842). So, by this metric,Izzo does better than average as a 1-seed. More quantitatively, if an average coach playing 19 games as a 1-seed would be expected to win 78.3% of the time, you would expect that coach to win 14.87 of those games. Since Izzo has won 16, he is 1.13 games "above average" as a 1-seed. If you have the right set of data (which I do), you can then perform the same calculation for Izzo as a 2-seed, 3-seed, etc. and obtain on overall number of games above or below average for his career in the NCAA tournament. You can also perform the same calculation for all 566 coaches who have coached at least 1 game in the NCAA tournament.
The histogram summarizing this calculation is shown here:
All_20seeds_zpshr4y3u56.0.jpg
As you can see, the distribution is roughly a bell curve centered at zero with an upper limit close to 6 games and a lower limit of around -4 games
However, this calculation troubles me a little bit because it does not take into account that some 1-seeds have to play a 4-seed and then a 2-seed to make the Final Four (like MSU did in 2000) while some other 1-seeds may draw a 12-seed and then a 11-seed (like MSU did in 2001). In other words, not all tournament draws are created equally. In order to correct for this, I instead compared each coach's performance to the average based on seed differential and not just raw seed. As it turns out, the data for this comparison is very linear. If the difference in seed between two teams is only 1 (a 1-seed vs. a 2-seed), the higher seed team only wins about 56% of the time, all the way up to a maximum differential of 15 (a 1-seed vs. a 16-seed) where the probability of winning is 100%. In case you are curious, this correlation looks like this (where the size of the data label roughly corresponds to the number of data points for each seed differential. As you can guess, some differentials are much less likely than others).
Correlation_zpstay3omtt.0.jpg
So, using the same methodology as described above, I was able to calculate each coach's performance against the average in terms of seed differential. When I do this, the histogram looks like this:
All_20differential_zpsfk5u5nzs.0.jpg
Again, it looks like a bell curve centered at zero, this time with a maximum value of 7 games (above average) and a minimum value of -5 (below average)
So, where go some of the notable coaches fall on these scales? Well, I removed all the data for coaches with less than 10 games in the Big Dance to simplify things, which cuts the number down to 121 coaches. In terms of performance just based on seed, the simplified histogram now looks like this, including the names of notable coaches:
10_20seeds_zpstgyknezi.0.jpg
Using this format, it is pretty easy to separate the men from the boys. Based on this metric, Tom Izzo has out-performed all other coaches in the history of the tournament since 1979 with a very impressive "score" of 5.65 games (i.e. wins over his career) above the expected/average performance . You can also see that there is a pack of 10 coaches or so, including Billy Donovan, Jim Calhoun, Rick Pitino, Callipari, Brad Stevens, and Coach K that are head and shoulders above everyone else. To get back to the original question, Bo Ryan is slightly above average at 1.15, just behind Bill Self at 1.34. Also notable are that coaches like Thad Matta and Mark Few have scores below 0.5, while legends such as Lute Olson and Bob Huggins score below average. Worse yet are Gene Keady at -3 and Rick Barnes at almost -3.5.
But, as I mentioned, I think a more fair comparison is the one using seed differentials. The simplified and labeled histogram based on this calculation is shown here:
10_20differential_zpsuhnnhptv.0.jpg
By this metric, Izzo finishes 4th all time at 6.27, just behind Roy Williams, Villanova legend Rollie Massimino, and Rick Pitino. I suppose that this means that historically, Izzo has gotten a slightly easier draw than Pitino and Roy Williams. I must admit that I am surprised to see Roy Williams so high in this calculation. It seems like he needs a full NBA roster to be successful in the tournament. But, I guess if nothing else, this shows that Ol' Roy's teams are properly placed in the tournament, Furthermore, even though he has been upset 11 times, 7 of those have been in situations where the seed differential is 3 or less. That is better than most coaches do in those situations.
Other notable observations are that Billy Donovan does not so quite as well in this analysis, which suggests he has also benefited from some good higher seeds teams being upset before they reach his team. Also, Bielien checks in at a very respectable 3.23, while other notable coaches such as Jim Boehiem (0.40), Bill Self (0.23), and Thad Matta (0.63) are quite average.
That brings us back to the original question: is Bo Ryan underachiever? Well, based on this metric, his score is -0.47, significantly lower than his score using just the raw seed. This confirms our concern that he has benefited for a couple soft tournament draws. So, yes, I think that we can definitively say that he is an under-achiever, Now, that is not to say that he is a bad coach any more so than Lute Olson (-0.23) is a bad coach. But, Bo only ranks 82nd among the 121 coaches with over 10 tournament games, by this metric.
As a final note, I have always had a gut feeling that Coach K has been a bit of an under-achiever over the past 10-15 years. Based on the data above, that does not seem to be the case. However, I decided to see what would happen if I re-did the calculation using only games played in the "Tom Izzo era" of the tournament (since 1998). When I do this, Coach K numbers do, in fact, take a pretty big hit. His score based on raw seed is actually negative (-0.98), which his score based on seed differential is only 0.27, which is in the Bill Self range. Color me not so impressed.
Anyway, I hope that you all find this analysis as enjoyable as I did in putting it together.

Comments

Popular posts from this blog

Dr. Green and White Helps You Fill Out Your Bracket (2024 Edition)

For as long as I can remember, I have loved the NCAA Basketball Tournament. I love the bracket. I love the underdogs. I love One Shining Moment. I even love the CBS theme music. As a kid I filled out hand-drawn brackets and scoured the morning newspaper for results of late night games. As I got older, I started tracking scores using a increasing complex set of spreadsheets. Over time, as my analysis became more sophisticated, I began to notice certain patterns to the Madness I have found that I can use modern analytics and computational tools to gain a better understanding of the tournament itself and perhaps even extract some hints as to how the tournament might play out. Last year, I used this analysis to correctly predict that No. 4 seed UConn win the National Title in addition to other notable upsets. There is no foolproof way to dominate your office pool, but it is possible to spot upsets that are more likely than others and teams that are likely to go on a run or flame out early....

2024 Week Eight Preview: OK Computer

Playing the first game after a bye week is like waking up from a nap. It is a little tough to predict how the body will respond. If a nap comes at just the right time and lasts for just the right length of time, it can be very refreshing and rejuvenating. But sometimes waking up for a nap can be rough. It can cause a disorienting, groggy feeling like suddenly two plus two equals five and that down is the new up. Based on the way the three weeks prior to the bye week went, last week's break at the midpoint of the season came at exactly the right time for the Spartans. Facing one top five team is challenging enough. Facing two top five teams on consecutive weekends including almost 5,000 miles of travel is something else entirely. But how will the rested Spartans look on the field come Saturday night? It is hard to predict what we are going to get. It is the classic "rest versus rust," million dollar question.  I prefer to be optimistic and to believe that the Spartans will...

2024 Week Seven Preview: Intermission

It is hard to believe that we are already halfway through the Michigan State Spartans' 2024 season. The Green and White currently sit at 3-3, having just lost two games straight to teams both ranked in the top three nationally.  Despite the current losing streak, Michigan State is actually slightly ahead of schedule. While the Spartans' schedule currently grades out to be harder than expected when I conducted the analysis this summer (by 0.7 games), Michigan State's current odds to go to a bowl game (46%) are 10 percentage points higher than what I projected.  In Week Seven, Michigan State has drawn a much needed bye. Think about it as an intermission of sorts. The Spartans' mission this weekend is to rest, heal, reflect on the first half of the season, and prepare for back half of the schedule with the goal of qualifying for the bowl game. Michigan State's team and staff may be taking it easy, but data and Vegas never sleep. Today's piece will focus more on the...