Apple reportedly putting together team of speech recognition experts for neural network-powered Siri

Posted:
in General Discussion edited July 2014
A report on Monday claims Apple is putting together a team of A-list speech recognition researchers, including high-ranking employees from Nuance, to create an in-house Siri engine based on neural networking.

Siri


According to Wired, Apple has created a group of software engineers and researchers from Nuance, the firm responsible for Siri's voice-recognition functionality, as well as other companies to work toward a next-generation backbone for the virtual assistant.

The publication points to a number of Apple hires over the past few years, including Nuance's former vice president of research Larry Gillick and Gunnar Evermann, who is currently working as Siri's speech project manager.

Speaking to the publication, Microsoft research division head Peter Lee said Apple hired Alex Acero away from the Redmond, Wash. software giant in 2013. Acero is now a senior director on the Siri team.

As for neural networks, a term for machine learning algorithms that work in a manner similar to the brain's neurons, Wired said IBM, Microsoft and Google have deployed the deep learning technology in various speech-related applications.

Microsoft, for example, is using a neural network to power the real-time translation feature set to debut in Skype later this year. Google is dabbling in neural nets with Android's "Google Now" speech recognition functionality.

With the reported hires, Lee guesses Apple is likely planning a neural net-powered Siri backbone built entirely in-house.

"All of the major players have switched over except for Apple Siri," Lee said. "I think it's just a matter of time."

Last year, a report claimed that Apple had assembled a small team of experts, including former employees of a firm called VoiceSignal Technologies, to develop speech recognition technology specifically for the Siri personal assistant. The group supposedly works out of the company's Boston office.

Earlier in June, The Wall Street Journal reported that Nuance was exploring a sale to either partner Apple or Samsung. It appears that Apple may be step ahead of Nuance after poaching talent from the company for its own in-house team.

"Apple is not hiring only in the managerial level, but hiring also people on the team-leading level and the researcher level," said Abdel-rahman Mohamed, a University of Toronto researcher who was supposedly asked to join Apple's team. "They're building a very strong team for speech recognition research."

Current builds of Apple's next-generation iOS 8 still include a Nuance-powered Siri, though the virtual assistant has a few tricks up its sleeve with Google Now-style real-time speech-to-text and smart home product control with HomeKit integration, among other enhancements.

Comments

  • Reply 1 of 10
    solipsismxsolipsismx Posts: 19,566member
    I sure hope they do something by next iOS release because I feel like Siri really hasn't become better and speech recognition since its debut, only more services (that are great) have been added to its usage list.
  • Reply 2 of 10
    Earlier in June, <em>The Wall Street Journal</em> reported that Nuance was <a href="http://appleinsider.com/articles/14/06/16/apple-partner-nuance-exploring-sale-reportedly-in-talks-with-samsung">exploring a sale</a> to either partner Apple or Samsung. It appears that Apple may be step ahead of Nuance after poaching talent from the company for its own in-house team.

    "Apple is not hiring only in the managerial level, but hiring also people on the team-leading level and the researcher level," said Abdel-rahman Mohamed, a University of Toronto researcher who was supposedly asked to join Apple's team. "They're building a very strong team for speech recognition research."

    Current builds of Apple's next-generation iOS 8 still include a Nuance-powered Siri, though the virtual assistant has a few tricks up its sleeve with Google Now-style real-time <a href="http://appleinsider.com/articles/14/06/02/apple-unveils-ios-8-with-interactive-notifications-quicktype-keyboard-group-text-enhancements">speech-to-text</a> and smart home product control with <a href="http://appleinsider.com/articles/14/06/20/first-look-siri-gains-smart-home-controls-with-homekit-in-ios-8">HomeKit integration</a>, among other enhancements.

    Now that the zombie has been sucked dry of talent, it's time for Google to swoop in and overpay for what's left.

    It's hard to believe Siri's out of Beta and all grown up now... I wonder how long before she's given a chip of her own?
  • Reply 3 of 10
    shogunshogun Posts: 362member
    Siri needs a grammar loop in that neural net. She makes dictation choices seemingly based on what words are used most often, not what makes sense. For example, in a dictation to a musician I said something like, "I really enjoyed hymn 345." Of course Siri made it "him 345." These mistakes happen all the time. It really hobbles the usefulness of dictation when you have to comb back over it and fix a problem every 10-15 words or so.
  • Reply 4 of 10
    mjtomlinmjtomlin Posts: 2,673member
    Quote:

    Originally Posted by Shogun View Post



    Siri needs a grammar loop in that neural net. She makes dictation choices seemingly based on what words are used most often, not what makes sense. For example, in a dictation to a musician I said something like, "I really enjoyed hymn 345." Of course Siri made it "him 345." These mistakes happen all the time. It really hobbles the usefulness of dictation when you have to comb back over it and fix a problem every 10-15 words or so.

     

    Siri plays no part in dictation. That's Nuance's speech recognition engine.

  • Reply 5 of 10
    solipsismxsolipsismx Posts: 19,566member
    mjtomlin wrote: »
    Siri plays no part in dictation. That's Nuance's speech recognition engine.

    I'm sure he knows that, but Siri ncludes the Nuance/Dragon Dictation-based engine so I think it's fine to simply use the term Siri even when only referring to the dictation aspect of the service.
  • Reply 6 of 10
    As those of us who support Speech Magic know, there's a whole lot more to Nuance than just supporting Siri. They pretty much own speech rec these days, and I'd love it if Apple could own them!
  • Reply 7 of 10
    SpamSandwichSpamSandwich Posts: 33,407member
    Keep hiring, Apple. Keep hiring. Bring over some of the folks behind Google's speech recognition and throw in some of the brilliant minds behind IBMs Watson while you're at it. ????
  • Reply 8 of 10
    19831983 Posts: 1,225member
    So maybe that's why Google Now seems to understand what I say to it far better than Siri does? Maybe when Siri becomes fully 'neuralized' I'll start using it again.
  • Reply 9 of 10
    So when do we get Siri for Mac? The speech recognition and dictation on the Mac is woefully behind the iPhone/iPad. The main thing that's missing is the "give and take" of Siri's interaction. The Mac's current system only allows for a single command without the ability that Siri has to query for more information. The only advantage of the Mac's system is that it is extensible via AppleScript. Any word on this? Rumors even?
  • Reply 10 of 10
    mjtomlinmjtomlin Posts: 2,673member
    Quote:
    Originally Posted by SolipsismX View Post





    I'm sure he knows that, but Siri ncludes the Nuance/Dragon Dictation-based engine so I think it's fine to simply use the term Siri even when only referring to the dictation aspect of the service.

     

    Siri doesn't "include" the speech engine ... it just happens to be used to feed text into Siri, so they are in fact two separate things. Would you say OS X Mavericks has Siri? No, you wouldn't, because it doesn't. But it does have the same dictation abilities as iOS.

Sign In or Register to comment.