A better way to pick the best team in baseball.

A mathematician's guide to the news.
Oct. 29 2004 7:21 AM

Building a Better World Series

Why the Fall Classic isn't the best way to find the best team in baseball.

The Red Sox, as the whole world knows, won the World Series in commanding fashion by dispatching the St. Louis Cardinals in four straight games. Few people would dispute that the Red Sox are the best team in baseball. But is the best-four-out-of-seven World Series really the best way to figure that out?

A mathematician views the World Series as an algorithm—that is, a formal procedure for solving a problem. In this case, the problem is to determine which of two baseball teams is better. The World Series algorithm goes like this: "Play until either the Red Sox or Cardinals win four games. Once either the Red Sox or the Cardinals win four games, conclude that they are the better team."

If the World Series were "best 51 out of 101" rather than "best four out of seven," the better team would win almost all the time; that's the way probabilities shake out in the long run. But a hundred-game World Series would test the patience of even the most die-hard baseball junkies. Ideally, we'd like an algorithm that

  • has a very high chance of picking the better team, and
  • picks a winner in as short a time as possible.


This is the same fundamental trade-off that designers of clinical trials face. A study involving a huge number of patients is much more reliable than a smaller trial but also takes longer and costs more.

It's hard to meet both goals at once. Some sports, like football, prioritize the second criterion by holding a one-game-for-all-the-marbles championship. Others, like tennis, emphasize the first: The requirement that you have to win a game by two points makes it harder for the weaker player to luck out. The problem is that, in theory, a game can go on forever.

The World Series falls somewhere in between the football model and the tennis model. But is it possible that we can do even better than the current World Series algorithm? That is, can we construct a model that both picks the better team more often and, on average, picks a winner in a shorter number of games?

Yes, we can. Suppose we play an "Alternate World Series" that ends when a team is up 3-0, 4-1, 4-2, 5-3, or 5-4. The AWS turns out to be shorter than the standard World Series; the chance of the series going eight or nine games is balanced out by the possibility of a 3-0 skunking. What's more, the AWS has a slightly better chance of selecting the stronger team than does the real World Series. That makes sense—with apologies to the 2004 Red Sox, a series where one team goes up three games to none is pretty much a blowout, and we won't lose much by pulling the plug. For every 2004 ALCS, there are a lot more copies of the 2004 World Series. On the other hand, the AWS lets us see at least two more games if the series goes to 3-3, a circumstance where we might really benefit from seeing a few more results.

Now, let's see a bit of the computation hiding behind the statements we just made. Suppose the chance that the Red Sox will beat the Cardinals in any given game is p. (For the moment, we ignore the difference between Pedro Martinez and Tim Wakefield and take p to be the same for each game.) If the Red Sox are better than the Cardinals, p will be bigger than 1/2; otherwise, p is smaller than 1/2. Let's suppose p = .55, which is to say the Red Sox are quite a bit better than the Cardinals. After one real World Series game, there's a 55 percent chance the Red Sox are up 1-0 and a 45 percent chance the Cardinals are up 1-0. The chance the Red Sox will sweep the first two games is .55 x .55 = .3025, while the chance the Cardinals go up 2-0 is .45 x .45 = .2025. The Series will be 1-1 if the Red Sox go up 1-0 and the Cardinals win Game 2 (probability .55 x .45), or the Cardinals go up 1-0 and the Red Sox respond with a win (probability .45 x .55). So, we get .55 x .45 + .45 x .55 = .495 as the probability that the series stands at 1-1 after two games.

Continuing this type of analysis, we find that the probabilities break down as follows:



The Ebola Story

How our minds build narratives out of disaster.

The Budget Disaster That Completely Sabotaged the WHO’s Response to Ebola

PowerPoint Is the Worst, and Now It’s the Latest Way to Hack Into Your Computer

The Shooting Tragedies That Forged Canada’s Gun Politics

A Highly Unscientific Ranking of Crazy-Old German Beers


Welcome to 13th Grade!

Some high schools are offering a fifth year. That’s a great idea.


The Actual World

“Mount Thoreau” and the naming of things in the wilderness.

Want Kids to Delay Sex? Let Planned Parenthood Teach Them Sex Ed.

Would You Trust Walmart to Provide Your Health Care? (You Should.)

  News & Politics
Oct. 22 2014 9:42 PM Landslide Landrieu Can the Louisiana Democrat use the powers of incumbency to save herself one more time?
Continuously Operating
Oct. 22 2014 2:38 PM Crack Open an Old One A highly unscientific evaluation of Germany’s oldest breweries.
Gentleman Scholar
Oct. 22 2014 5:54 PM May I Offer to Sharpen My Friends’ Knives? Or would that be rude?
  Double X
The XX Factor
Oct. 22 2014 4:27 PM Three Ways Your Text Messages Change After You Get Married
  Slate Plus
Tv Club
Oct. 22 2014 5:27 PM The Slate Walking Dead Podcast A spoiler-filled discussion of Episodes 1 and 2.
Oct. 22 2014 11:54 PM The Actual World “Mount Thoreau” and the naming of things in the wilderness.
Future Tense
Oct. 22 2014 5:33 PM One More Reason Not to Use PowerPoint: It’s The Gateway for a Serious Windows Vulnerability
  Health & Science
Wild Things
Oct. 22 2014 2:42 PM Orcas, Via Drone, for the First Time Ever
Sports Nut
Oct. 20 2014 5:09 PM Keepaway, on Three. Ready—Break! On his record-breaking touchdown pass, Peyton Manning couldn’t even leave the celebration to chance.