A Firsthand Account of Microsoft’s Employee-Ranking System

Commentary about business and finance.
Aug. 26 2013 11:56 AM

Tales of an Ex–Microsoft Manager

Outgoing CEO Steve Ballmer’s beloved employee-ranking system made me secretive, cynical, and paranoid.

low angle view of a team of business executives sitting at a table
Time to stack rank. Don't tell nonmanagers.

Photo by George Doyle/Stockbyte/Thinkstock

There were seven or eight of us managers in the conference room, all peers, as well as our own manager. The conference rooms all had large tables, whether heavy varnished oak or cheap plywood. The chairs were the sort that let you lean back with increasing tension, and they had a few levers underneath the seat that raise and lower the seat and adjust the back. I’m a fidgeter, so I played with them a lot during meetings.

David Auerbach David Auerbach

David Auerbach is a writer and software engineer based in New York. His website is http://davidauerba.ch.

We were mostly white, and all men. Each of us had between three and six “direct reports”: nonmanager programmers who we oversaw. We were the direct reports of our manager. There were lots and lots of managers at Microsoft—it was the only path to advancement, so the company structure became more and more steeply vertical. Once or twice a year, we would all get together and decide how good each of our reports was, by ordering them from best to worst.

The system was called the stack rank.

Following Friday’s news of Microsoft CEO Steve Ballmer’s imminent retirement, postmortems of his lackluster 13-year reign have pointed to stack ranking—which, to be entirely fair, predated him—as both a cause and a symptom of the corporation’s decline. As a software developer and later development lead at Microsoft between 1998–2003, I had to evaluate others and be evaluated myself under this system. And I can say that yes, stack ranking is as toxic for innovation and integrity and morale as media reports made it out to be, and then some.

Each report’s name was written on an index card and put on the table. It was a two-step process. First, reports were broadly sorted into four buckets: excellent, good, mediocre, and awful. Then, within each bucket, people were paired for comparison and bubbled up or down. Managers would argue whether a particular report was better or worse than some other manager’s report in the same bucket. Our manager would adjudicate the debates. Some managers were better fighters than others.

A manager’s goals here are twofold. First, you want to place your best people as high as possible on the ladder. This will help them get big bonuses, promotions, and raises, and thus keep them happy and less likely to leave your group or the company. Second, if you’re unfortunate enough to have weak reports, you want either to help them by placing them sufficiently high that they don’t get dinged too badly come the annual review, or to throw them to the wolves and let them get ranked low. If you give up on them, they’ll be put on a dead-end track that marks them as more or less useless. The object is to get them out from under you and make them someone else’s problem.

I was lucky not to have any weak reports. I had been encouraged to take one at an earlier point, but frenetically talked my way out of it, using all the managerese I could summon to argue that he didn’t belong on my team.

Advertisement

My reports seemingly did fine. I got them all into the top two buckets. After the stack rank was over, our boss, who’d overseen the whole meeting, took our stack rank and then went to another, similar meeting one level up, where now we would be stacked and ranked behind our backs, then mixed in with the stack we’d just created, cruelly shuffled like a deck of human cards.

Eventually, the vice presidents would have to bargain among themselves for how many bonuses and raises they would get for their entire organizations, then ration them out according to the stack rank. People would then be assigned one of three grades: 4.0 (Above Average), 3.5 (Average), and 3.0 (Below Average). The very rare 4.5 got you a set of steak knives; a 2.5 meant you were fired (more or less). I’m not sure if Alan Turing himself could have gotten a 5.0.

The stack rank was harmful. It served as an incentive not to join high-quality groups, because you’d be that much more likely to fall low in the stack rank. Better to join a weak group where you’d be the star, and then coast. Maybe the executives thought this would help strong people lift up weak teams. It never worked that way. More often, it just encouraged people to backstab their co-workers, since their loss entailed your profit.

The stack rank was a zero-sum game—one person could only excel by the amount that others were penalized. And it was applied at every level of the organization. Even if you were in a group of three high performers, it was very likely that one of you would be graded Above Average, one Average, and one Below Average. Unless your manager was a prick or an idiot or both, the ordering would reflect your relative skills, but that never came as too much comfort to the hard-working schlub who just wasn’t as good as the other two.

TODAY IN SLATE

Culturebox

The Ebola Story

How our minds build narratives out of disaster.

The Budget Disaster That Completely Sabotaged the WHO’s Response to Ebola

PowerPoint Is the Worst, and Now It’s the Latest Way to Hack Into Your Computer

The Shooting Tragedies That Forged Canada’s Gun Politics

A Highly Unscientific Ranking of Crazy-Old German Beers

Education

Welcome to 13th Grade!

Some high schools are offering a fifth year. That’s a great idea.

Culturebox

The Actual World

“Mount Thoreau” and the naming of things in the wilderness.

Want Kids to Delay Sex? Let Planned Parenthood Teach Them Sex Ed.

Would You Trust Walmart to Provide Your Health Care? (You Should.)

  News & Politics
Politics
Oct. 22 2014 9:42 PM Landslide Landrieu Can the Louisiana Democrat use the powers of incumbency to save herself one more time?
  Business
Continuously Operating
Oct. 22 2014 2:38 PM Crack Open an Old One A highly unscientific evaluation of Germany’s oldest breweries.
  Life
Gentleman Scholar
Oct. 22 2014 5:54 PM May I Offer to Sharpen My Friends’ Knives? Or would that be rude?
  Double X
The XX Factor
Oct. 22 2014 4:27 PM Three Ways Your Text Messages Change After You Get Married
  Slate Plus
Tv Club
Oct. 22 2014 5:27 PM The Slate Walking Dead Podcast A spoiler-filled discussion of Episodes 1 and 2.
  Arts
Culturebox
Oct. 22 2014 11:54 PM The Actual World “Mount Thoreau” and the naming of things in the wilderness.
  Technology
Future Tense
Oct. 22 2014 5:33 PM One More Reason Not to Use PowerPoint: It’s The Gateway for a Serious Windows Vulnerability
  Health & Science
Wild Things
Oct. 22 2014 2:42 PM Orcas, Via Drone, for the First Time Ever
  Sports
Sports Nut
Oct. 20 2014 5:09 PM Keepaway, on Three. Ready—Break! On his record-breaking touchdown pass, Peyton Manning couldn’t even leave the celebration to chance.