AT&T developing voice-controlled iPhone apps (video)
AT&T has developed a software trick that will let modern mobile handsets, including Apple Inc's iPhone, recognize voice commands without the need for specialized voice recognition software.
The research project is based on a new version of AT&T's WATSON speech recognition engine, dubbed Speech Mashups, that puts the entire feature on the web as a service that can be called upon from anywhere a high-speed Internet connection is possible.
As long as the software used to access Speech Mashups obeys certain web standards, particularly an AJAX framework and JavaScript, the technology can capture voice commands, interpret them at a remote server, and send them back to the device in a language a website or program can understand -- all without installing a dedicated app or plugin.
The telecoms company says the technology can be used for IP-based TV boxes as well as BlackBerries and smartphones, but draws most of its focus to the iPhone -- a device which (unlike the BlackBerry) has no native voice recognition of its own and, until the release of iPhone 2.0 firmware, had no support for the feature even through isolated native apps.
In a prototype mobile version of the YellowPages website, AT&T in a research video shows an iPhone user entering the business name and location into text fields on the page just by speaking them at the appropriate times. *While typing would work in such a case, the company claims that voicing the information is faster and more convenient -- especially when driving.
This solution is limited and excludes iPhones without a sufficiently fast connection to AT&T's servers or for native applications that don't include web code; many of Apple's own applications, for example, wouldn't function with the feature. *As-is, the technology doesn't satisfy frequent requests for voice dialing or other direct speech recognition features.
Still, while the development is limited in scope and remains in AT&T's labs, the development potentially opens up both web apps and some native iPhone apps to a feature that even Apple itself has yet to program into its own devices.
The research project is based on a new version of AT&T's WATSON speech recognition engine, dubbed Speech Mashups, that puts the entire feature on the web as a service that can be called upon from anywhere a high-speed Internet connection is possible.
As long as the software used to access Speech Mashups obeys certain web standards, particularly an AJAX framework and JavaScript, the technology can capture voice commands, interpret them at a remote server, and send them back to the device in a language a website or program can understand -- all without installing a dedicated app or plugin.
The telecoms company says the technology can be used for IP-based TV boxes as well as BlackBerries and smartphones, but draws most of its focus to the iPhone -- a device which (unlike the BlackBerry) has no native voice recognition of its own and, until the release of iPhone 2.0 firmware, had no support for the feature even through isolated native apps.
In a prototype mobile version of the YellowPages website, AT&T in a research video shows an iPhone user entering the business name and location into text fields on the page just by speaking them at the appropriate times. *While typing would work in such a case, the company claims that voicing the information is faster and more convenient -- especially when driving.
This solution is limited and excludes iPhones without a sufficiently fast connection to AT&T's servers or for native applications that don't include web code; many of Apple's own applications, for example, wouldn't function with the feature. *As-is, the technology doesn't satisfy frequent requests for voice dialing or other direct speech recognition features.
Still, while the development is limited in scope and remains in AT&T's labs, the development potentially opens up both web apps and some native iPhone apps to a feature that even Apple itself has yet to program into its own devices.
Comments
Steve
Wait....Sounds like a new feature for future update.=) lol
Is it one of those web terminals I've heard about?
It might be really cool, if they make it work well and across the whole iPhone. Still, the better solution would be for Apple to just write in Voice Commands into the next software release. Add MMS and Video Recording, then (almost) all the critics will be silenced!
Steve
Except the critics who want a good camera in their phone.
This isn't really relevant, but it looks like the guy has a terminal on his iPhone. It seems doubtful though that an official video (From AT&T nonetheless) would feature a jailbroken iPhone. Can anyone shed some light on this?
Is it one of those web terminals I've heard about?
I can. The video is from AT&T. That's all you need to know.
I want the voice activated feature where I can say, "Unlock my F*KING iPHONE 3G you BASTARDS"!!!
iPwnageTool, but you have to click while you talk.
I want the voice activated feature where I can say, "Unlock my F*KING iPHONE 3G you BASTARDS"!!!
its about time someone said just that
voice dialing, how about fingerprint reading so you don't have to swipe then put in you pin then talk, come on, the ui needs some buff.
yea
voice dialing, commands, finger print reader app--now only $200
It might be really cool, if they make it work well and across the whole iPhone. Still, the better solution would be for Apple to just write in Voice Commands into the next software release. Add MMS and Video Recording, then (almost) all the critics will be silenced!
This has the potential of a LOT more than just voice commands. Imagine the ability to dictate entire e-mails by voice rather than typing. See my previous post about the company Vlingo that's already doing this. Which would you rather do, type in a 200 character e-mail or simply speak it to your phone and have it translated directly into text? I've seen the Vlingo stuff in action on a BlackBerry and even in a rather noisy public environment it can be remarkably accurate.
This has the potential of a LOT more than just voice commands. Imagine the ability to dictate entire e-mails by voice rather than typing. See my previous post about the company Vlingo that's already doing this. Which would you rather do, type in a 200 character e-mail or simply speak it to your phone and have it translated directly into text? I've seen the Vlingo stuff in action on a BlackBerry and even in a rather noisy public environment it can be remarkably accurate.
So we've got instant messaging and texting that gets used many times where email could have worked and now people want to use email with voice recognition where voicemail could work. Most people struggle with what apps to use for what function...nothing like confusing them more.
And, yeah, I understand the reasoning some want this....just being a bit sarcastic about it. It all sounds as crazy as using a web browser to write a paper.
This is easy to do from applications created with the SDK but I didn't think it was possible from within Safari. Thanks.
It might be really cool, if they make it work well and across the whole iPhone. Still, the better solution would be for Apple to just write in Voice Commands into the next software release. Add MMS and Video Recording, then (almost) all the critics will be silenced!
Steve
And the ability to forward contacts as address cards/business cards via texts/mms/bluetooth. A feature that is greatly missed.
How about proper editing features for emails so that I don't have to forward an entire conversation, just the attachment?
They should implement something similar to that.
Can I have a Sadim wisa woenuts and diet coke to go?!
Sadim wisa woenuts = Shrimp with walnuts...
Flied Lice anyone? Cheesburgah, Cheesburgah!
Oops, back to work