News Archive (1999-2012) | 2013-current at LinuxGizmos | Current Tech News Portal |    About   

Linux supercomputer beats humans in Jeopardy match

Jan 14, 2011 — by LinuxDevices Staff — from the LinuxDevices Archive — views

IBM showed off its Linux-based Watson supercomputer in an exhibition “Human vs. Machine” game of Jeopardy, while discussing potential practical uses of its natural-language AI in the IT industry, especially in health care and tech support. Watson beat two human challengers in this practice round, but the real winner will be proven in a televised competition in February.

Watson, IBM's latest DeepQA supercomputer, defeated its two human challengers during a demonstration round of Jeopardy on Jan. 13. The supercomputer will face former Jeopardy champions Ken Jennings and Brad Rutter in a two-game, men-versus-machine tournament to be aired in February. According to Novell, Watson runs on its SUSE Linux Enterprise Server distribution.

At the media event, Jennings and Rutter played a practice round with Watson to answer 15 questions across categories such as "Chicks Dig Me" and "MC." While Watson came out strong, Jennings and Rutter (pictured) held their own, with the final score of Watson with $4,400, Jennings (right) at $3,400 and Rutter (left) at $1,200. 

As the game's host Alex Trebek (pictured in the middle) often says on real games, "It's still anyone's game."

The Jeopardy match-up was not the "culmination" of four years of work by IBM Research scientists that worked on the Watson project, but rather, "just the beginning of a journey," Katharine Frase, vice president of industry solutions and emerging business at IBM Research, told eWEEK.

Supercomputers such as Watson that can understand natural human language — complete with puns, plays on words and slang — to help them answer complex questions will have applications in areas such as health care, tech support, and business analytics, said Frase.

Watson analyzes "real language," or spoken language, as opposed to simple or keyword-based questions, to understand the question, according to IBM. It then looks at the millions of pieces of information it has stored to find a specific answer.

"The hard part for Watson is finding and justifying the correct answer, computing confidence that it's right, and doing it fast enough," said David Ferrucci, the lead researcher and principal investigator on the Watson project, at the media event showcasing Watson at IBM's Yorktown Heights Research Lab.

This is where Jeopardy comes in. The quiz show covers a broad range of topics, and the questions can be asked in a variety of ways, whether it's quirky, straightforward, or downright strange. Creating a machine that can take on human challengers on Jeopardy became a "rally cry" for researchers to think about question and answer processing in a "more open and different way," said Frase.


IBM's Watson

"Grand challenges are a big deal to IBM," said John Kelly III, IBM's senior vice president and director of IBM Research. IBM's last major challenge was Deep Blue, the supercomputer that took on Chess Grandmaster Garry Kasparov in 1987. Many of the supercomputers used by the Department of Defense are the "sons and grandsons" of Deep Blue, Frase said.

Jeopardy is significantly more complicated than chess, said Ferrucci. Chess can be broken down mathematically and there are finite combinations, he said, while Jeopardy has "infinite ways" to extract data. Watson needs to understand the clues, pick which categories to choose, decide how confident it is that the answer is correct, and decide how to wager during "Daily Doubles" questions or for the final round.

Healthcare, tech support, and legal search apps beckon

The technology has to process natural language to understand "what did they mean" versus "what did they say," which has a lot of implications in the health care sector, said Frase. Patients are not using the terms doctors learned in medical school to describe their ailments, but more likely the terms they picked up from their parents growing up, she said.

A Watson-like system can take that information and co-relate it against all the medical journals and relevant information, and say, "Here's what I think and why," while showing its evidence for how it came up with the conclusion, according to Frase.

The machine won't be making a diagnosis or treatment decisions — a doctor would — but can present information to help the doctor. A similar situation exists for tech support, where the system would be able to figure out what the problem is, she added.

Watson could also look up legal cases. While that's possible with a search engine, just matching on keywords is "not always clear" what the case "was really about," Frase said. The system would be able to absorb the nuances and subtleties of a case and present a better set of results when researching case precedents.

Researchers trained Watson with 200 million pages of text, or about one million books, ranging from sources such as encyclopedias, movie scripts, newspapers, and even children's book abstracts. Watson uses the data to analyze contextual clues and figure out how the information relates to each other.

"Watson is not just storing all that information," said Bernie Spang, director of strategy for the software group at IBM Research. Instead of just storing data — which would make it a glorified search engine — Watson's algorithms evaluate the context to understand and correlate the information with other things it knows.

For example, Ferrucci noted that Watson knew BusinessWeek's quote about former General Electric CEO Jack Welch, "If leadership is an art, then surely Jack Welch has proved himself a master painter." Faced with a Jeopardy-style question, "Welch ran this," Watson would have figured out that Welch led GE (rather than being employed there as a painter).

Jeopardy was a "great application" to disprove, or prove, whether the DeepQA algorithms used to build Watson work for this kind of information learning, understanding, and retrieval, said Spang.

Watson's Jeopardy practice round on YouTube
Source: ZDNet
(Click to play) 

Open source Linux underpinnings

Watson is powered by 10 racks of Power 750 servers running Linux, containing 15 terabytes of RAM and 2,880 processor cores operating at 80 teraflops, IBM said. Each Power7 system can run thousands of simultaneous analytics algorithms to sift through more than 15 terabytes of information stored in Watson's "brain." The data is stored in a DB2 database.

Under the "hood," Watson is all open-source, using Eclipse as the tools platform along with Apache's Hadoop and Unstructured Information Management Architecture (an "IBM Research creation") to analyze unstructured data, said Spang. 

The advances in processing and computing technology built into the Power7 chips were a result of the work that IBM did while building Deep Blue, Spang said. Earlier record-breaking supercomputers that ran Linux have included IBM's Blue Gene (pictured).

The Top500 report issued by UC Berkeley last May showed that some 470 of the world's 500 fastest supercomputers — or 91 percent — ran Linux, compared to 75 percent in 2006. Oak Ridge National Laboratory's Cray XT5 Jaguar was the fastest computer in the world last year, with a sustained performance of 1.76 PFlop/s, said Top500. Yet, a new Chinese entrant called Nebulae was said to have achieved second place, helped out by its higher peak performance of 2.98 PFlop/s, compared to Jaguar's peak of 2.33 PFlop/s.

Stay tuned

Taping for the two-game tournament begins Jan. 14, and will be broadcast Feb. 14, 15 and 16. Instead of shipping Watson out to Los Angeles, where the show is usually taped, IBM spent $1 million to build a replica set at its Yorktown Heights Research Center. The game board is much smaller than the original, but the host and players' podiums are exactly the same, said Harry Friedman, executive producer of Jeopardy.

The first place winner will win $1 million, second place gets $300,000, and third place nets $200,000. IBM will donate 100 percent of Watson's winnings to a charity, World Vision, while Rutter and Jennings said they will each donate half of their prizes to their respective charities.

Fahmida Y. Rashid is a writer for our sister publication eWEEK.


This article was originally published on LinuxDevices.com and has been donated to the open source community by QuinStreet Inc. Please visit LinuxToday.com for up-to-date news and articles about Linux and open source.



Comments are closed.