News Archive (1999-2012) | 2013-current at LinuxGizmos | Current Tech News Portal |    About   

IBM embedded speech technologies evolve

Jan 26, 2006 — by LinuxDevices Staff — from the LinuxDevices Archive — views

[Updated 4:00 PM PST] — IBM last week updated its speech recognition technology, adding features aimed at allowing commands to have a more flexible syntax. Embedded ViaVoice 4.4 is available for Linux, and is commonly used in a variety of embedded systems, such as in-car “infotainment” devices.

According to an article at PC Magazine, ViaVoice 4.4 supports “free-form commands,” including command synonyms and flexible ordering of command parameters. Previous voice technology required users to stick to fairly limited scripts when issuing voice commands.

The PC Magazine article touts the ViaVoice 4.4 release as a major advance, and a landmark on IBM's five-year goal of achieving “super-human” speech recognition; that is, computer speech recognition that equals and even surpasses human capabilities.

Also touted in the new release are improved acoustic models, which help sift the wheat of voice from the chaff of background noise, as well as better filtering of transient noises, such as honking horns blown at drivers distracted by conversations with their gadgets.

Still, IBM claims that the more natural conversational patterns permitted by ViaVoice 4.4 are less distracting to drivers than trying to recall more rigid command syntaxes. Jim Holland, product line manager of embedded speech at IBM, stated, “Speech recognition has gotten a bad rap in recent years, but it has quietly been maturing into a truly useful technology. The ability to use natural language commands allows the device to become a more instinctive part of a user's daily routine.”

Typical applications for the new ViaVoice release include car radio tuning, phone dialing, and on-the-fly translations of multimedia content, for example to create subtitles for foreign films, PC Magazine says.

In a related story, eWEEK has written about a Wake Forest University project that aims to combine IBM's voice recognition technology with web services, allowing students with cell phones to perform such feats as browsing class schedules or the library card catalog, or checking washing machine availability.

The PC Magazine article can be found here, and the eWEEK article is available here.


 
This article was originally published on LinuxDevices.com and has been donated to the open source community by QuinStreet Inc. Please visit LinuxToday.com for up-to-date news and articles about Linux and open source.



Comments are closed.