|
|||||||
| Register | Members List | New Posts | Mark Forums Read |
![]() |
|
|
Thread Tools | Display Modes |
|
|
#1 |
|
Kasper's Automated Slave
Join Date: Nov 1997
Posts: 6,151
|
AT&T developing voice-controlled iPhone apps (video)
AT&T has developed a software trick that will let modern mobile handsets, including Apple Inc's iPhone, recognize voice commands without the need for specialized voice recognition software.
The research project is based on a new version of AT&T's WATSON speech recognition engine, dubbed Speech Mashups, that puts the entire feature on the web as a service that can be called upon from anywhere a high-speed Internet connection is possible. As long as the software used to access Speech Mashups obeys certain web standards, particularly an AJAX framework and JavaScript, the technology can capture voice commands, interpret them at a remote server, and send them back to the device in a language a website or program can understand -- all without installing a dedicated app or plugin. The telecoms company says the technology can be used for IP-based TV boxes as well as BlackBerries and smartphones, but draws most of its focus to the iPhone -- a device which (unlike the BlackBerry) has no native voice recognition of its own and, until the release of iPhone 2.0 firmware, had no support for the feature even through isolated native apps. In a prototype mobile version of the YellowPages website, AT&T in a research video shows an iPhone user entering the business name and location into text fields on the page just by speaking them at the appropriate times. *While typing would work in such a case, the company claims that voicing the information is faster and more convenient -- especially when driving. This solution is limited and excludes iPhones without a sufficiently fast connection to AT&T's servers or for native applications that don't include web code; many of Apple's own applications, for example, wouldn't function with the feature. *As-is, the technology doesn't satisfy frequent requests for voice dialing or other direct speech recognition features. Still, while the development is limited in scope and remains in AT&T's labs, the development potentially opens up both web apps and some native iPhone apps to a feature that even Apple itself has yet to program into its own devices. |
|
|
|
|
|
#2 |
|
Registered User
Join Date: Sep 2007
Location: California
Posts: 87
|
Might Be Cool...
It might be really cool, if they make it work well and across the whole iPhone. Still, the better solution would be for Apple to just write in Voice Commands into the next software release. Add MMS and Video Recording, then (almost) all the critics will be silenced!
Steve
20" Aluminum iMac (August 2007) - Leopard 10.6.1
13" MacBook Pro (2.53 Ghz) - Leopard 10.6.1 32 GB iPhone 3GS 8 GB iPhone (Original) 2 iPod Minis (Blue, 4GB) |
|
|
|
|
|
#3 |
|
Registered User
Join Date: Jun 2008
Location: Central Florida
Posts: 74
|
Completely irrelevant, but interesting...I was very confused as to how this showed up at "12:00am EST," when my computer displays 11:27PM (Eastern). Not only that, but we're now on Eastern DAYLIGHT Time which is EST+1...so actual EST is 10:27 PM. What's wrong AI, these Panasonic ToughBook, Kodak printer, and T-mobile ads not garnering enough clicks to afford decent software?
|
|
|
|
|
|
#4 |
|
Registered User
Join Date: Nov 2006
Posts: 2,077
|
Null.
Ş & ş are called "Thorn" & şey represent şe sound you've associated "th" wiş since şe 13ş or 14ş century. I'm bringing it back.
<(=_=)> (>=_=)> <(=_=<) ^(=_=^) (^=_=)^ ^(=_=)^ +(=_=)+ Last edited by Slewis; 11-09-2008 at 07:56 AM.. |
|
|
|
|
|
#5 |
|
Registered User
Join Date: Jul 2007
Location: Reston, VA
Posts: 367
|
Aggg...why ATT. Apple can do it twice as better. Its already done in OS X. If only Apple had some more time and engineers the whole iPhone would listen to you.
Wait....Sounds like a new feature for future update.=) lol ![]() |
|
|
|
|
|
#6 |
|
Registered User
Join Date: May 2005
Posts: 8,453
|
Might be nice if at&t opened this up as an online Mac app also, perhaps a widget?
"The natural progress of things is for liberty to yield, and government to gain ground."
—Thomas Jefferson Proud AAPL stock owner. |
|
|
|
|
|
#7 |
|
Registered User
Join Date: Jun 2007
Location: Boise, ID among others
Posts: 529
|
Voice Dialing really isn't difficult, I'm actually very surprised if there is not a popular program in the app store for that? Is this forbidden in the app store or something??? If not, I guarantee one is in development..
|
|
|
|
|
|
#8 |
|
Registered User
Join Date: Jun 2008
Posts: 30
|
This isn't really relevant, but it looks like the guy has a terminal on his iPhone. It seems doubtful though that an official video (From AT&T nonetheless) would feature a jailbroken iPhone. Can anyone shed some light on this?
Is it one of those web terminals I've heard about? |
|
|
|
|
|
#9 | |
|
Registered User
Join Date: Feb 2006
Location: Ireland
Posts: 8,559
|
Quote:
Collecting my SSD iMac Fry-die. :D
|
|
|
|
|
|
|
#10 | |
|
Registered User
Join Date: Feb 2006
Location: Ireland
Posts: 8,559
|
Quote:
Collecting my SSD iMac Fry-die. :D
|
|
|
|
|
|
|
#11 |
|
Registered User
Join Date: Feb 2007
Posts: 3,700
|
I want the voice activated feature where I can say, "Unlock my F*KING iPHONE 3G you BASTARDS"!!!
![]() |
|
|
|
|
|
#12 |
|
Registered User
Join Date: Feb 2006
Location: Ireland
Posts: 8,559
|
iPwnageTool, but you have to click while you talk.
Collecting my SSD iMac Fry-die. :D
|
|
|
|
|
|
#13 | |
|
Registered User
Join Date: Nov 2002
Location: ASHLAND, KY
Posts: 1,818
|
Quote:
![]() ![]() voice dialing, how about fingerprint reading so you don't have to swipe then put in you pin then talk, come on, the ui needs some buff. yea voice dialing, commands, finger print reader app--now only $200 ![]()
I APPLE THEREFORE I AM
|
|
|
|
|
|
|
#14 |
|
Registered User
Join Date: May 2008
Posts: 12
|
AT&T is playing catchup... A company called Vlingo has been doing this sort of thing for a while. The company has a bunch of speech scientists from MIT whose previous speech rec company was acquired by Nuance. They've already got a full speech app available for BlackBerry's (you can download it from their home page) and I've heard rumors that they have an early iPhone version being tested. In a nutshell, if you want to send an e-mail, reply to an e-mail, etc. just press & hold the "send" button and start talking. It streams your audio to a server that converts the speech to text and returns the text back to the e-mail application (or whichever app you happen to be running at the time). It's pretty slick stuff. I've seen it demoed a few times. If you've got a BlackBerry I'd suggest downloading the Vlingo app and trying it out.
Last edited by Iphtashu Fitz; 07-23-2008 at 09:12 AM.. |
|
|
|
|
|
#15 | |
|
Registered User
Join Date: May 2008
Posts: 12
|
Quote:
|
|
|
|
|
|
|
#16 | |
|
Registered User
Join Date: Sep 2005
Posts: 42
|
Quote:
And, yeah, I understand the reasoning some want this....just being a bit sarcastic about it. It all sounds as crazy as using a web browser to write a paper. ![]() |
|
|
|
|
|
|
#17 |
|
Registered User
Join Date: Dec 2007
Posts: 182
|
Anyone notice that that iPhone is running an extremely old firmware (1.0-1.02, check the calculator icon) and it's been jailbroken (where'd they get Terminal from)?
|
|
|
|
|
|
#18 |
|
Registered User
Join Date: Jul 2008
Posts: 13
|
Silly question
So how did AT&T manage to record audio from within HTML?
This is easy to do from applications created with the SDK but I didn't think it was possible from within Safari. Thanks. |
|
|
|
|
|
#19 | |
|
Registered User
Join Date: Jan 2007
Posts: 49
|
And...
Quote:
How about proper editing features for emails so that I don't have to forward an entire conversation, just the attachment? |
|
|
|
|
|
|
#20 |
|
Registered User
Join Date: May 2008
Posts: 570
|
The whole purpose of voice-dialing, for me at-least, is so I don't have to click-around on my iPhone while I'm driving. Maybe if they did a double-click-and-hold on on the lanyard then speak the name or the actual number, and after releasing the button it will search the contact. It will then verify the contact by speaking it back, and with a single click allows you to accept or a repeated double-click-and-hold allows you to repeat.
They should implement something similar to that. |
|
|
|
|
|
#21 |
|
Registered User
Join Date: Jul 2008
Posts: 64
|
How about people with accents?>
And if one calls with a foreign accent, they'll have a nervous breakdown
! Never mind someone with a really thick accent! Can I have a Sadim wisa woenuts and diet coke to go?! Sadim wisa woenuts = Shrimp with walnuts... Flied Lice anyone? Cheesburgah, Cheesburgah! Oops, back to work ! |
|
|
|
|
|
#22 |
|
Global Moderator
Join Date: Jun 2004
Location: .US
Posts: 9,127
|
|
|
|
|
|
|
#23 | |
|
Global Moderator
Join Date: Jun 2004
Location: .US
Posts: 9,127
|
Quote:
Accents will probably always be a problem, which is probably part of why speech recognition needs to be trained to the user. |
|
|
|
|
|
|
#24 |
|
Registered User
Join Date: Jan 2008
Posts: 457
|
It'll be ready in 5 years and they'll charge us for it.
![]() |
|
|
|
|
|
#25 | |
|
Registered User
Join Date: Nov 2002
Location: ASHLAND, KY
Posts: 1,818
|
Quote:
I APPLE THEREFORE I AM
|
|
|
|
|
![]() |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
| Display Modes | |
|
|