Why IBM's next target should be a machine that plays poker.
"For two-player games, there still is fundamentally a right answer" as to what move to make, says Michael Bowling, the leader of the Computer Poker Research Group at the University of Alberta. "It's a fair game—I'm guaranteeing, in the long run, that I don't lose money. Every time my opponent makes a mistake, it's only to my advantage." That is to say, it's possible to develop an optimal strategy such that there's always a best move that can be made. One needn't worry about the fact that a computer can't read its opponent's body language or demeanor. A two-player game of poker is essentially a math problem.
Adding a third player is the equivalent of going from a 2-D world to a 3-D world: The one-on-one matchup unfolds into a trio of one-on-two relationships. The concept of a "right move" for any given situation goes out the window, and the delicate strategic equilibrium is ruined. The strategies of the three players become hopelessly intertwined—and now hinge on both statistical assessments and psychological ones. Bowling described one experiment in which three computers—or "bots"—had reached a virtual stalemate in a simplified version of three-player Hold 'Em. To test this delicate balance, they switched one of the machine players over to a simple (and generally unadvisable) "always raise" strategy. To their surprise, the always-raise bot didn't lose much ground. The bot to its left, however, cleaned up—and the one to its right had to mortgage the farm.
That's why poker is such a useful problem: To develop an excellent multiplayer bot, programmers would have to model people as well as probabilities. "To be able to handle these ring games, we're going to have to work in behavior," Bowling says. This is outside the realm of traditional game theory, and outside the sort of brute-force calculations and strategizing that made up Deep Blue's DNA. Even if a computer couldn't read body language, it would glean a lot of information from more explicit patterns of human behavior. UCLA computer scientist Leonard Kleinrock, whose most famous student is poker pro Chris Ferguson, says the timing and speed with which someone bets, for example, could be exploited by a machine as a different sort of tell. (Perhaps someone bets instantaneously when bluffing but takes forever with a borderline hand.) So could things like how often an opponent is caught bluffing, which hands he likes to pursue most often, and how his betting patterns change over time.
So far, most of the cross-disciplinary work is being done with economists, who have long incorporated game theory into their own models. There's interest in poker from both directions. One economist I spoke with, Roger McCain, immediately suggested AI researchers focus on three-card draw, a more strategic game than Hold 'Em, and begin from the perspective of human frailty. "We know people don't always choose the best strategy," he said. Other models approach the game as though the opponent were an extremely savvy player. Indeed, the very process of sussing out the competition—in Jeopardy! terms, is he a Ken Jennings or a Cliff Clavin?—could yield some important computing innovations down the line. As with so many of these emerging technologies, the most obvious applications would be in finance: Since both poker and investing are about managing risk with money on the line, it's easy to imagine how a model for playing poker would serve as a powerful tool for playing the markets. In a broader sense, understanding the nature of gamblers may help economists to understand rapid, and even irrational, shifts in behavior—say, when people move their money, sell their houses, buy government bonds, or do anything else to try to stay a step ahead.
Building such a program will be a major undertaking, and we won't get anywhere by locking a bunch of economists and computer scientists in a room for a month. For one thing, many in the AI crowd consider existing economic models of behavior to be unrealistic, as they expect people to act strategically after considering thousands of options. "Basically, [computer science] criticizes economic theories because they require a lot of computational power from the players," says Constantinos Daskalakis, a game theorist at MIT. There is a more promising line of research in economics known as "bounded rationality," which takes into account that humans have only so much processing power, but that field will need to develop further before it can be of much use to the programmers.
The irony is that, unlike checkers or chess, a poker program that studies behavior begins to resemble the very thing that Deep Blue did not: a machine that plays games like a human. Without a doubt, the first computer to win the National Heads-Up Poker Championship will rely on its share of brute force computations. It will also understand a thing or two about personality—who its opponents are, the sort of mistakes they're likely to make, and the moment when they're at their weakest. I can see it now: Watson on Poker After Dark, announcing to his human components in that treadmill voice of his: "All in."
Chris Wilson is a Slate contributor.
Photo by Ben Hider/Getty Images.