Thursday, January 29, 2026

Belichick for the Hall

 

“You know he got a chance he can win. White dudes like to do shit like that...vote for the wrong dude as a goof. They get drunk and shit and go like:

‘Let's vote for Jesse Jackson!’

‘I just voted for Jesse Jackson!’
And next day would be like this: ‘He f---in' won?’" – Eddie Murphy in Delirious, talking about Jesse Jackson running for President.

In POTUS elections, many people “vote their conscience”. Instead of supporting the crazy Republican, they vote for the Libertarian candidate. Or instead of supporting the crazy Democrat, they vote for the Green Party candidate. They know that their own vote won’t affect the outcome since the election is “safe” (or unwinnable) in their red or blue state, so might as well make a statement.

Or sometimes, as Mr. Murphy suggests, they’ll vote non-optimally for a goof.

So Bill Belichick didn’t get into the Pro Football Hall of Fame. We still don’t know who did get elected, but we do know that he did not.

DISCLAIMER: I am not a Pats fan.

Background: A 50 person committee (mix of NFL people and media) has a slate of five finalists (each selected from separate semifinalist groups of 8-10): one Coach, one Contributor, and three Senior Players (last played no later than 1999; two of the three had their primes in the 1970s and the third in the 1980s, retiring after 1993 season). Each committee member votes for 3 of the 5. Anyone getting 80% is in (if no one gets 80%, the top vote getter is in). (The Hall specifies that it’s “top three getting 80%”, but it’s impossible for four candidates to each get 80%.) Last year, only one of the five was picked. This year's class hasn't been announced yet, but it was leaked that Belichick was not selected.

Belichick had "spirit" issues, as he was directly implicated in one cheating scandal (Spygate) and was there for another (Deflategate), plus was innovative with finding loopholes in the rules. He was known to be gruff with the media. Other coaches or owners may have resented him. So that could explain some of the non-votes (at least 11 of the 50 did not vote for him).

So there are many possible reasons for not getting to 80%:

1.       The Contributor was Bob Kraft, owner of the Patriots. One could imagine a voter saying, "i'm not going to use two of my three votes on candidates from one team." And some might actually think Kraft was more worthy, or was friends with Kraft but disliked Belichick and chose Kraft.

1a. Same logic, but “I’m not going to use two of my three votes on non-Players.”

2. It's not a given that the Senior Players will have another shot so voters may have prioritized voting for them.  They might still be eligible but there are many other worthy candidates in the category. In fact, one voter has come forward and cited this factor One reason Bill Belichick isn’t first-ballot Hall of Famer. | Kansas City Star

3. Voters may have thought that Belichick was getting in regardless so they used their precious votes on other people they felt worthy. What’s it matter if he gets 94% or 96%? But bumping someone else from 78% to 80% is huge. The problem is when other voters act this way, too. (He f---in’ lost?!) I think this was true of some of the voters but no one is going to cop to that this year.

4. The method matters and the quality of the other candidates matters. This exact process was only put in place last year and resulted in only one of the five being elected. If all candidates are exactly equal, each one gets 60%, far from the 80% threshold. If there is one guarantee on the slate, then the other four only have a 50% starting point. (Conversely, if one of the five is clearly not qualified and no one votes for them, the baseline is 75%.) So it wouldn’t be surprising if the votes are split and one or even zero candidates are elected. If you had a Yes/No on each candidate, that would reduce this problem. (Voters might feel reluctant to flood the Hall with five new inductees so might vote No on the least qualified even if that candidate is above the bar.)

5. It's hard to compare across groups and eras. Was Bob Kraft a more impactful owner than Roger Craig was as a running back? How can you even start to quantify this? Even comparing players, was LC Greenwood the defensive lineman better than Ken Anderson the quarterback? Greenwood’s teams were more successful (Go Steelers!) but there are nine Steelers from that era (plus the coach and owner) in the Hall. There are two Bengals, one of whom came on after Anderson’s prime.

5. Then there’s jealousy and general dislike. Bill Polian of the long-time rival Colts was one of the voters and claims not to remember whether he voted for Belichick or not but was rumored to have spoken out against Belichick in the committee meeting (it’s a violation of the rules for anyone to reveal what happened in that meeting other than revealing their own votes if they wish). The Colts suffered often, occasionally unfairly, at the hands of the Patriots. To paraphrase Mencken, he couldn’t get his hand upon the collar of his superior, so he got his thumb into his eyes.

6. And, of course, the scandal(s). Belichick was implicated in Spygate (filming other team’s practices), which cost them a big fine and a top draft pick. He was coach during Deflategate which led to Tom Brady (who will certainly get in first ballot, though through a different voting process) being suspended for four games but was never himself implicated. And he never hesitated to exploit a loophole in the rules no matter how unfair it seemed. Plus he was a bit gruff and didn’t suffer fools in the media. But football has had an interesting relationship with rule-breakers, certainly when compared to baseball or ultimate. Players have been suspended for steroid use without getting much flack (including possible GOAT Lawrence Taylor, who was first-ballot Hall inductee). (Take a look here List of suspensions in the NFL - Wikipedia and see how many of your favorite players are named.) Two Hall of Famers were even suspended for a year for gambling on football! I acknowledge that this should cost him some points, but his resume is just so far above the line that he’d had to go full OJ to fall below the line (and even then, I’m not sure; one of the Senior Player finalists last year killed his wife and himself but CTE had driven him mad). So football seems to be a little inconsistent on this one.

Then there are the ripple effects, at least if the current process holds. Mike Holmgren (coach for the Packers and Seahawks) was a finalist last year but the Coach committee chose Belichick as its sole finalist this year. Presumably Belichick will be chosen again next year and get elected after serving his penance, so Holmgren and other top coaches Coach Blue-Ribbon Committee names 9 Semifinalists for Hall of Fame Class of 2026 | Pro Football Hall of Fame will get bumped. Additionally, next year’s Contributor and Senior Players will be on the ballot with a near-unanimous choice, leaving fewer available votes to get to 80%, leading to more of a backlog.

The scandals alone are not enough to explain the failure to elect, so most analyses which focus on just that are misguided. If they don’t fix the process, it’s just going to get worse. You might not agree with the rationale of #1-5, but that's how some people see it.

 

Thursday, October 31, 2024

Some observations on club Nationals 2024

 This was the first time in a while that I watched a lot of club Nationals (mostly the Men’s Division). I attended in 2018 but was more just taking in the atmosphere than watching with a critical eye. I played in two local leagues, an open beach tournament, and Men's Sectionals this year after being almost exclusively in age-limited play since at least 2010 and noticed some differences, both strategically and culturally.

Wow, what a performance by Rhino in the finals. Everything worked, and everyone made plays. And other than maybe one or two of the long shots being somewhat contested, I don’t think they were even close to turning it over. I remember when Sockeye had a turnover free first half c. 2006 in the finals, that included a greatest and a block that ended up on the prone defender’s back only to be picked up by the Sockeye receiver. There were a few passes caught very close to the lines, but those throws went exactly where they aimed. But you could just see and feel their sense of invincibility.

There were so many laser backhand hucks. Seems like back in the day backhands used to be more towering, which required the long cutter to be closer to the thrower at release so that the disc could still be in front of the receiver when it came down to catching height. I wonder how much of this is due to the marker having to be further away from the disc.  With the mark tight, the thrower has to step around more and keep the arm low to avoid the mark, and the disc naturally has more float. Forehands also come out a little hotter than before but there were always some laser long forehands.

Rhino claimed small ball is dead but I still saw plenty of two foot passes and immediate give and gos. I don’t find that style of play aesthetically pleasing to watch though it is fun to play and there is no doubt that it works and is hard to stop without doing something illegal (a little extra bodying on the mark or the cut) or possibly unethical (calling travel for minuscule changes in direction) or with a rule change (how’s this: once you get into the red zone, the stall count drops by one after every complete pass, to a minimum of 3).

Interesting parallel between this game and the 1995 finals, which was the last final with a larger margin of victory (though in a game to 21).  DoG 21, Sockeye 10. DoG only had 126 passes that game (six of them long, including all three turnovers). Stats haven’t come out yet but I would imaginer there were significantly more passes than that (especially on a per point basis). D line forced 10 turnovers and scored every time. Sockeye’s O line had a clean hold rate of 50% and their scoring percentage for the game (goals/possessions) was higher than their tournament average. We would have been the favorites going into the final but not heavy favorites. We had won in 1994, our first, but had a poor year, including a loss in the finals of Worlds to Double Happiness (SF) and a 17-6 beating by Cojones (NY) at Regionals. This was Sockeye’s first time at Nationals. They won their pool as a 3 seed and beat Double in semis.

1995 format: 2 pools of 6, games to 19. Two pool play games Thursday and Friday, final pool play game Saturday morning, then all semis at the same time (possibly staggered 30 or 60 minutes), finals on Sunday. Semis and finals to 21. DoG played about 225 points over seven games, or 83 points per player (19 players, which was smaller than most rosters at the time). All games now are to 15, except when they’re shorter, so Rhino played 151 points over six games, or 41 points per player (26 of them), just under half of what DoG averaged.

One of the announcers remarked how different the end zone offenses are now compared to even five years ago. Several things stood out to me. Offenses seemed to have three players standing spread out across the back of the end zone while their defenders were 10 yards in front of them facing the throwers. While occasionally they would get hit for easy goals, these seemed to come high in the stall count and there were also some blocks made. What struck me more was how many goals were scored in the middle of the field, a yard into the end zone, with the receiver cutting toward the thrower. I never actually took stats on this, but I used to say that 90% of non-long goals were scored near the cone. The prototypical end zone offense was a regenerative strike from the back, sometimes followed by a cut from near the back to the same side but a few yards inside the cone, if not open swing to the other side and repeat. Even when offenses updated and had cuts coming from the front and going straight across the field, they were still leading passes toward the sideline. But I don’t know how many times I saw a little backhand flip on a soft inside break pass. Not criticizing it, but I was almost shocked. Even when we ran a ho or a side stack, we’d still revert to a vert in the end zone. I guess this is somewhat driven by the defenses, which are forcing flat or middle more often.

But I do remember c.2006 we had an end zone offense with the setup being one cutter at each cone and two cutters at the back of the end zone spread apart, with the three handlers outside the end zone. It was devastating in practice when we were able to line up that way but in games we found it too hard to get into the set and abandoned it. I wonder now if we couldn’t have tried tweaking it a bit to keep the center of the field open and attack there.

Inside out forehands: These throws have been a pet peeve of mine for some time. Though sometimes it comes across as “all i/o are bad”, there are certain rules I have for them that just didn’t seem to be met. To reduce it to a single heuristic, they’re ok if the receiver is able to square up and catch it in the chest, but if they have to keep running and catch it in front, there are so many things that can go wrong with it. For many years, I’ve seen players try to jam that pass into a tight window and seen blocks, drops, and passes zing by. I really didn’t see that this year. The throws were a little softer, the cutters were to the middle of the field a little more, and there was more distance from the cutter defender. I saw Molly Brown hit three consecutive passes that were i/o forehands, and they all looked like good choices. I don’t know if there has been a collective wising up or if the defenses have shifted such that there is more margin on the throws or if it’s just selection bias, but it sure seemed different this year.

Travels: the casual extra steps taken by so many players just drove me crazy. Handlers catching it flat-footed, and then just stepping upfield. Cutters catching it while running (but not really sprinting) and then taking five or seven chop steps, often using their momentum to deliver their next pass without truly stopping until after the disc is released. I didn’t really notice excessive foot movement once the pivot was set, though.

Spikes: we get it, you scored. Maybe you’re excited, but more often it felt “look at me”. I did once write, “You want to spike the disc?  Spike it!! “, so maybe I shouldn’t be the one pointing this out, but it got a bit tiresome to watch. So many were disc-damaging as well.

Manu: hellvua thrower, great competitor. I watched two of the Molly Brown games, plus some of the Worlds. I often heard the commentators talk about the great players and ask, “what can they do to get even better?”  To me, the #1 thing isn’t for Manu to learn how to do anything. It’s to learn how to not do something. There were two admittedly sweet-looking breakmark backhands in the semis that soared out of bounds. Some long backhands looked forced (there were several holsters, but whether that’s because they were just fakes or if the marker prevented the throw or if Manu actually decided it wasn’t a good look, I couldn’t tell). It’s a fine line between creating and forcing, and I think Manu needs to explore where that line is and how it changes based on the environment.

Technology: I remember getting a phone call about two weeks after one of the 1980s Nationals I didn’t make and hearing, “you won’t believe how badly the Tunas did at Nationals”. And this was much quicker than usual, since I usually had to wait for the UPA Newsletter to arrive at my house sometime around Christmas. Now, there are complaints (some of them from me!) that not all the games are streamed, there isn’t a single page that automatically shows all the scores, all games should be free, etc. You don’t know how lucky you have it. (Still, the USAU website is not ideal.)

Format: I’m still not a fan of stadiums, turf fields, fields with lots of non-ultimate lines, and six separate rounds of semis (with only a tiny amount of overlap between two of the games). In particular, I abhor the night game. Though many players now play night games through the semi-pro leagues, it’s still a change from every other game of the club season. The Brute Squad/Scandal semis started at 10:15 PM ET. That’s an unfair disadvantage which extends into the finals. Scandal wouldn’t have been able to get to bed until  probably 3 am ET, and their bodies were probably still on East Coast time. And hearing that teams were probably fatigued because they had to play two games with their 26 players the previous day. Sure, the game is faster now, but that’s due in part to only having to play half as much. Pat King (NYNY) once told me he played 72 points in the semis and finals on one day.

Now get off my lawn.

(Cross-posted to rec.sport.disc, I mean, the ultimate subreddit. Thinking also about putting it on X.)

Tuesday, February 11, 2020

The Lake Wobegon Effect, or Why Most People Are Above Average


Welcome to Lake Wobegon, where all the women are strong, all the men are good-looking, and all the children are above average."-- Garrison Keillor 

More generally, illusory superiority is the tendency to overestimate one’s ability compared to others. For instance, it’s widely quoted that 80% of drivers think they are above average in driving skill, and we all pooh-pooh those other dumb people who erroneously think they can somehow defy simple math. However, there is a perfectly rational reason for this to be the case.

For something like height that is easy to measure and easy to compare, you would probably find that close to 50% of people think they are above average. But what about more complex skills like driving or intelligence or soccer-playing? There are multiple dimensions in each of those. Person A hasn’t been in an accident in 10 years. Person B easily maneuvers through city traffic and can merge onto a highway seamlessly. Person C would win a race around a closed course. In soccer, some players score goals, some pass the ball well, and some play defense. Who’s the best? Well, the mother of the soccer player who scores goals thinks that goal-scoring is the most important skill, while the father of the defender thinks that defense wins championships, and the grandfather of the passer loves the beautiful game and wants to see the ball passed around. Who is best? “My kid!”

Let’s say that each player can accurately assess how good everyone is in each attribute. However, each of them thinks that the attribute they are best at is twice as important as the attribute they are second-best at, and four times as important as the one they are worst at (so, a 4x-2x-x weighting). What happens now?

Here’s a simple example with three players. Each is really good at one skill, average in another, and terrible in the third, but is average if each skill is equally important. Fairly evaluating each skill (but not their importance to “soccer playing ability”), each player thinks that they are the best. Similarly, each player is considered by one of the others to be the worst.

Player
Scoring
Passing
Defending
Equal assessment
Scorer’s assessment
Passer’s assessment
Defender’s assessment
Scorer
9
5
1
5
6.7
4.4
3.9
Passer
1
9
5
5
3.9
6.7
4.4
Defender
5
1
9
5
4.4
3.9
6.7





What about the broader case? I did a simple simulation for this. I have 10 000 individuals whose skills in three attributes are independent and are uniformly distributed. As you would expect, 50% of them are above average on each attribute and on the average of the three attributes. However, each weights the importance of the skill in accordance with their ability in that skill.            What happens now? As it turns out, 74% of these people are now above average! The median person is now in the 63rd percentile.

Now let’s add in the illusory superiority bias and see where we get. Let’s assume each person overestimates their skill by a mere 5%, so if they are truly average, they think they are in the 55th percentile. Now, 81% of the people are above average.

How about a real-life example? Who was the best hitter in the American League in 2017? I looked at three stats (batting average, home run rate, and strikeout avoidance) for all 78 qualifiers that year, and compared each player to the average qualifier in that stat. Using an equal weight for the three metrics, there were about an equal number of above-average and below-average hitters, as we would expect. But what if each qualifier got to choose what stat was most important and which was least important, like in the hypothetical example above? Then, the average player was suddenly 12% better than average, and 57 of 79 (73%) are above average. Welcome to Lake Wobegon.

Friday, April 10, 2015

How my team has done in every tournament I've played in since 1992

At Nathan Wicks' request, I made a chart of all 239 tournaments I've played in the last 24 years and how my team did (1st place is at the top). It's split out by division, and Nationals and Worlds are highlighted.

Friday, January 17, 2014

Live at skydmagazine.com

I am now writing a monthly feature at skydmagazine.com. The first feature is "Why I'm Still Around". In it, I trace my interest in playing over the years, starting out as an 18 year old who didn't have anything else better to do before going away to college, as a 20-something developing my game, as a 30-something at the top of my game, and as a 40-something waiting for death. One of the things I touch on but didn't get to develop enough is that I get a chance or take the opportunity to chat more. For instance, I played with Zip for several years, but I've probably had more serious conversations with him the past two years (bar run, party at his house I crashed, goaltimate) than I did during our time as teammates. DoG Masters and Beach Worlds 2011 also led to getting to know more people, and those have led to additional opportunities like playing with Los Zodiac at Paganello, where I got to team with modern luminaries Beau Kittredge and Bart Watson. Two quotes from Bart: "Beau on sand is like the rest of you on grass" and "So, what was it like to play before there was strategy?" I (and Alex) got on that team through our connection with Greg Husak of the Condors, who (along with Steve Dugan and Mike Namkung) played with DoG Masters at Worlds in 2008. Good quote from Husak after playing with us for a few days: "So, let me get this straight. Your offense is 'this guy cuts, then that guy cuts, and if he doesn't get it, this other guy cuts'." I guess the Condors offense was a bit more formally structured. If you have any suggestions for future articles, leave them in the comments.

Monday, August 26, 2013

Stat post for The Huddle

With all the stat posts at ultiworld.com, I thought I'd post an article I wrote for a stats issue of The Huddle which never got published.

Stats

In the early 1990s, my teams (Earth Atomizer and Big Brother) recorded every pass of the season, entering them in a notebook using a shorthand notation during games, and some friends and I would compile them afterwards. Among other things, we found that forehands were thrown away about 50% more frequently than backhands, about 60% of hucks were complete (except for a certain anti-stat hothead who went 4 for 16), and 1.5% of passes were dropped. We did use that first piece of knowledge (coupled with “scouting” observations) about forehands to decide to force forehand most of the time. But what else did we gain for all that time spent?

I have come around to believe that for the time being, the concrete value of tracking individual statistics to predict or to evaluate is doomed by two things, context and sample size. We tried to make one adjustment for context, namely, separating out “tough games” from “chump games”. But then that fed into the second issue, sample size, since we now had fewer games to draw from. And was the line between “tough” and “chump” in the right place? Some of the games were tough because of bad playing conditions, others because we just played badly or exceptionally well in a game that would normally be a blowout, and still others because we had a skeleton crew. Oh, and some teams played zone and forced the handlers to pile up twice as many throws as usual (without, I hope you realize, playing twice as well). But we counted them all equally.

(I should also add that “opportunities” is something that must be accounted for when trying to analyze. But it’s not always as simple as dividing by the number of touches. In the seminal basketball analytics book “Basketball on Paper”, Dean Oliver (now in charge of stats at ESPN) highlighted that player efficiency decreases with increased usage as the players who bear the brunt of the offensive load have to make plays that are closer to the margin. Furthermore, these players will also draw the toughest defenders.)

But we could still tell who was good at completing passes, right? Well, as I like to say at my job where I analyze my company’s engineering performance, it depends. They recorded individual stats on last year’s NexGen tour. I was very excited to get this dataset, because every game was against a quality opponent, almost everyone played almost every game, and each game was a showcase and not just one of many in a long weekend. As it turns out, this dataset too suffers from some confounders such as the first half of the tour beings spent figuring out how to play together and what roles to settle into, but it was still the purest dataset I know of. For the complete tour, turnover percentage* of the players ranged from 3.4% to 12.4%. But for the most part, the guys at the higher end of the turnover range also threw a higher percentage of their passes for goals, while the low-turnover guys didn’t throw as many goals. Here’s the graph for all of them, split out by how often they touched the disc per point: *They didn’t separate out drops from throwaways so we’ll have to use this instead of incompletion rate.

Note also that the high touch players were in the lower left corner. I can think of two explanations for this besides them being conservative handlers. One, an in-bounds pull almost always results in an uncontested completed pass. Two, passes in general in that half of the field are typically easier to complete because the defense has to respect the threat of the long pass, and I think that handlers have a higher percentage of the touches there than they do closer to the endzone, where pass frequency is more evenly distributed. At the other end of the graph, deeps are going to be catching more of their passes near the endzone, resulting in relatively more opportunities for goal throws but also with each completion a little more difficult because of the reduced space. The risk/benefit of a few extra yards changes near the goal line as well. I wrote in “Ultimate Techniques and Tactics”, a book co-authored with Eric Zaslow and published by Human Kinetics and still available through your favorite Internet reseller, that being in the endzone instead of just on the goal line increases your chance of scoring as much as being 10 yards closer elsewhere on the field.

I once set up a simulation of an offense where the players were equally talented (i.e., had the same incompletion rate per yard of throw) but had different roles in the offense and different throw choices. The first thing I noticed is that a particular player sometimes had MVP-level tournaments and sometimes had tournaments where he would have been benched. The more important point, though, was that the players’ stat lines resembled those of real teams such as NexGen, with some players racking up the goals and turnovers while others had lots of touches but few fantasy league stats. This leads me to conclude that much of the difference between the stat lines of any two players is not a difference in effectiveness but simply a matter of taste. (Note that there are still some players who stand out, either good or bad, but you generally don’t need a calculator to know that.) Two equally-efficient players can have drastically different stat lines due not to any difference in skill or on-field decision-making but to the difference in their roles.

So what do these detailed individual stats (at this stage in our history, where we have only the stats of our own team against a wide range of opponents in vastly different environments) bring to the table? Accountability and self-awareness. Lord Kelvin wrote, “If you cannot measure it, you cannot improve it.” Simply being aware of your actual completion percentage on hucks should force you to contemplate whether you are making good choices. I remember going over each of my turnovers in a weekend (with the aid of the stat pad) and being shocked to learn how many of them were simply poor risk/reward decisions, and I was able to eliminate some of those.

Lest you think I’ve given up on stats, I haven’t. But I think the payoff for now would come on analyzing team decisions. The first priority would be to get realistic baselines for performance. I routinely see people write that five turnovers in a game is typical or that drops never happen or that hucks are completed 75% of the time. While there are certainly examples of these happening, I would guess that they aren’t the typical performance. The other area I would like to quantify the value of particular scenarios. For instance, how much harder is it for a team to score off a deep, high pull vs a low pull vs a brick, and how consistent are good pullers at achieving good pulls? Could someone who is an otherwise bad defender still be a good D player simply by virtue of his pulls? On the offensive side, exactly how costly is it to rest one of your top players? How deadly is it to turn it over in your own half of the field? Might the Huck-‘n’-Hope offensive style actually be a reasonable strategy due to the long field left after a turnover? We might have opinions about those now, but until we measure these, we don’t know.

Friday, March 01, 2013

Future of Ultimate

I'll be attending the Ultimate Players and Coaches Conference tomorrow as a panelist for "the future of ultimate". Back in 2005, I blogged about trends in ultimate over the previous 10 years, over the upcoming 10 years, and that didn't happen. I have a feeling that the focus on tomorrow's discussion will be around the further professionalization of ultimate. This doesn't mean only whether the pro leagues will take off and how widespread they'll be, but what elite-level ultimate will look like. Will top players in 10 years still be attending Potlatch and Paganello and Poultry Days or will it just be unacceptable for them to risk getting hurt while wearing overalls? How much will decisions be about the players on the field versus the rest of the ultimate world or the spectating world? There has already been a movement toward less freedom, but how far will it go? Will Men's split off from Women's (and both say goodbye to Mixed) if a sponsor comes calling? Those were the kinds of questions that immediately popped into my head, and I brought up the topic at skydmagazine But then after I got into a conversation with one of the other commenters, I realized that once again I neglected 95% of the ultimate out there by thinking only about the competitive season in the US (or in those places where they could reasonably expect to challenge the top US teams). So maybe I need to think some more about how summer leagues, youth camps, fun coed tournaments, and semi-competitive (i.e., playing tournaments without devoting all your spare time to the game) ultimate.