AppleInsider AppleInsider Forums


Go Back   AppleInsider > Applications
Register Members List New Posts Mark Forums Read

Reply
 
Thread Tools Display Modes
Old 01-17-2008, 08:50 PM   #1
AppleInsider
Kasper's Automated Slave
 
Join Date: Nov 1997
Posts: 6,151
MacSpeech's Dictate: high quality voice recognition for the Mac

MacSpeech at this week's Macworld Expo unveiled Dictate, its new speech recognition and voice command software currently in beta and slated for release mid February. The new product replaces and improves upon the existing iListen.

Dictate is now based upon the highly accurate speech recognition engine developed by Naturally Speaking; iListen was based upon technology licensed from Philips. MacSpeech supplies the user interface and rich integration with AppleScript and other Mac technologies.

A $29 crossgrade is available for any registered iListen customers who have purchased or obtain a copy of iListen in 2008. Any registered iLife customer from 2007 and earlier can pre-order a crossgrade for $79.

Speech Recognition Accuracy

Representatives demonstrated the accuracy and intelligence of the new system by dictating live into the system. After being switched on, the system allows the user to both dictate and issue voice commands. It determines which you are doing by analyzing the context of words. Dictate only requires a 5 minute profile creation session, which profiles the mic used and then analyzes the speaker's speech patterns and diction. In addition, the user can supply text that the software will analyze for unfamiliar words, and then speak those words to expand the system's dictionary.

The software's advanced recognition engine allows the software to accurately present natural speech dictation, correctly interpreting text such as "the patient was in a coma, comma" or "the end of the medieval period period." It also correctly formatted phone numbers and currency amounts, complete with a dollar sign, a thousands comma, and a decimal point, even when spoken in different ways, such as "five thousand dollars and twenty cents."

Dictate can enter text into any application that supports text entry from the keyboard, even including Windows apps running in a virtual environment such as Parallels or Fusion. To take a quick dictation without opening another application, Dictate also provides a simple text entry window of its own.

The software will support a variety of English language families, including American English, UK English, and Australian, Indian, and SE Asian variants. MacSpeech also has immediate plans to release German, Italian, Spanish, and French versions, and can match developments in new speech engine models released by Naturally Speaking.



Voice Control

In addition to entering text, Dictate can also be used to control the desktop interface. Reps demonstrated the software being used to launch applications, edit entered text, even open Safari bookmarks.

When a new application is installed, Dictate rapidly scans it to set up a table of commands, allowing the user to launch it by name and then activate any of its menu commands by voice. The voice command features can also be extended using AppleScript. Among other features, Dictate can also be used to launch Spotlight and rapidly search the system.

Dictation Hardware

Dictate ships with a microphone, but can be used with any standard mic. Company reps recommended against using a Bluetooth mic because that protocol limits the bandwidth of sound input to 8 KHz, reducing the overall accuracy of dictation. Other wireless microphones, such as professional quality RF equipment, can be used at full quality.
AppleInsider is offline   Reply With Quote
Old 01-17-2008, 09:11 PM   #2
coolfactor
Registered User
 
Join Date: Jul 2004
Location: Van Isle, BC, Canada
Posts: 208
Glad to see developments in this area. I expect the next version of OS X to better support speech recognition, which falls in line with the need for voice dialing on the iPhone.
coolfactor is offline   Reply With Quote
Old 01-17-2008, 09:23 PM   #3
Ireland
Registered User
 
Join Date: Feb 2006
Location: Ireland
Posts: 8,559
Apple should buy these guys up right away.


Collecting my SSD iMac Fry-die. :D
Ireland is online now   Reply With Quote
Old 01-17-2008, 09:50 PM   #4
solipsism
Registered User
 
Join Date: Apr 2006
Location: The Ansible
Posts: 11,779
Quote:
Originally Posted by Ireland View Post
Apple should buy these guys up right away.
Agreed.


Can it also read text back to you using Apple's software of their own voice synthesizers?
solipsism is offline   Reply With Quote
Old 01-17-2008, 10:07 PM   #5
Bageljoey
Registered User
 
Join Date: Jun 2006
Location: Jersey (new)
Posts: 1,001
I too am glad to see progress, but I, personally, am waiting for subvocal input...


Progress is a comfortable disease
--e.e.c.
Bageljoey is offline   Reply With Quote
Old 01-17-2008, 10:54 PM   #6
Delfoniq
Registered User
 
Join Date: Jun 2007
Location: Dallas, TX
Posts: 92
Buy, who?

Quote:
Originally Posted by Ireland View Post
Apple should buy these guys up right away.
MacSpeech is using technology from Nuance and essentially the same technology currently present in the Dragon Naturally Speaking engine. Nothing too novel there I think

I wish Apple would realize the importance of speech recognition too and start investing money in it like it did back in the 90s. The potential of speech recognition for enabling voice commands and accurate dictation in devices like the iMac, iPhone and iPod is huge .


Last edited by Delfoniq; 01-17-2008 at 11:17 PM..
Delfoniq is offline   Reply With Quote
Old 01-17-2008, 11:07 PM   #7
Delfoniq
Registered User
 
Join Date: Jun 2007
Location: Dallas, TX
Posts: 92
Here's an easy to do idea that Jobs will like!

Quote:
Originally Posted by Bageljoey View Post
I too am glad to see progress, but I, personally, am waiting for subvocal input...
I can't believe there is already a patent out for that!

Delfoniq is offline   Reply With Quote
Old 01-17-2008, 11:27 PM   #8
penchanted
Registered User
 
Join Date: May 2007
Posts: 122
There were rumors that Microsoft's 1997 $150,000 investment in Apple came with some conditions including that Apple not compete in the area of voice recognition. I could imagine Apple agreeing to these terms given the state of the technology at that time. However, it's hard to believe, even if there was such an agreement, that there is not some sunset on the period of time until Apple can enter this arena. Hopefully, we will see voice recognition addressed by Apple soon.
penchanted is offline   Reply With Quote
Old 01-17-2008, 11:30 PM   #9
solipsism
Registered User
 
Join Date: Apr 2006
Location: The Ansible
Posts: 11,779
Quote:
Originally Posted by penchanted View Post
There were rumors that Microsoft's 1997 $150,000 investment in Apple came with some conditions including that Apple not compete in the area of voice recognition. I could imagine Apple agreeing to these terms given the state of the technology at that time. However, it's hard to believe, even if there was such an agreement, that there is not some sunset on the period of time until Apple can enter this arena. Hopefully, we will see voice recognition addressed by Apple soon.
Since they no longer install IE as the default browser I'd say whatever deal was made is now complete.

edit: It was a 150,000 shared then valued at $150 million. MS sold those shares pretty much as soon as they could.
solipsism is offline   Reply With Quote
Old 01-17-2008, 11:42 PM   #10
Delfoniq
Registered User
 
Join Date: Jun 2007
Location: Dallas, TX
Posts: 92
Quote:
Originally Posted by penchanted View Post
There were rumors that Microsoft's 1997 $150,000 investment in Apple came with some conditions including that Apple not compete in the area of voice recognition. I could imagine Apple agreeing to these terms given the state of the technology at that time. However, it's hard to believe, even if there was such an agreement, that there is not some sunset on the period of time until Apple can enter this arena. Hopefully, we will see voice recognition addressed by Apple soon.
That would explain why Microsoft has gotten really good on speech recognition lately

http://www.youtube.com/watch?v=2Y_Jp6PxsSQ

"I think it's picking up a little bit of echo here."

Delfoniq is offline   Reply With Quote
Old 01-17-2008, 11:53 PM   #11
hmurchison
Global Moderator
 
Join Date: Nov 2001
Location: Seattle, WA
Posts: 10,457
It's nice to see iListen drop that turd of a engine and move to Nuance technology. If Apple isn't interested in Spech Rec at a serious level they're on crack. I wince everytime I see a mini chiclet qwerty keyboard on a phone. Stone Age comes to mind.

I like the price of Dictate. It leads me to believe that they are basically delivering Dragon Preferred on Mac. However I'd love to see features that come in Professional. There needs to be robust support for scripting and creating Macros. That's where the fun...and efficiency really kick in.


Mac mini - 2 , iPod Nano- 1
G4 Cube - 5 , iPod Shuffle -1
Bloggity Blog
hmurchison is offline   Reply With Quote
Old 01-18-2008, 12:52 AM   #12
Carson O'Genic
Registered User
 
Join Date: Nov 2002
Location: San Francisco
Posts: 1,183
I'm also glad to see things improving in this area, although all my Macs still use IBM chips and won't work with the new software.

I use to use IBM's software for speech recognition. It was very good at learning new words, in fact the accuracy was often better for some of the long words I had to teach the program than for small words. Having to dictate into a separate app and copy over into a word processor etc was a pain.

I own iListen, but have never found it useful. It worked very well for simple language but it completely failed to learn some of the jargon I use my writing. That killed it for me.

Look forward to trying the new version one day, which combined with the ever increasig speeds of modern Macs should one day make this software actually useful.
Carson O'Genic is offline   Reply With Quote
Old 01-18-2008, 01:30 AM   #13
jbrowdy
Registered User
 
Join Date: Aug 2006
Posts: 22
Macros

Quote:
Originally Posted by hmurchison View Post
There needs to be robust support for scripting and creating Macros. That's where the fun...and efficiency really kick in.
Doesn't the old iListen (and presumably Dictate) support the use of Macros?
jbrowdy is offline   Reply With Quote
Old 01-18-2008, 01:48 AM   #14
davidf01
Registered User
 
Join Date: Nov 2005
Posts: 16
CBT: yes, ESL reco for itouch/iphone! ... but foreign language support is needed too!

1) cbt:

apple leaves soooo much money on the table by ignoring the no-brainers like cbt -- especially for second language learning!

apple needs to created a common reference point for language learning by licensing the Dragon voxreco engine (as well as pay the NRE costs for cepstral to port their engine to asian languages!) ... along with oem-ing the various pen input tools (cf: http://www.yale.edu/chinesemac/pages/palm.html) -- and yes, a BT stylus is inevitable (hopefully like an Annato "digital pen"!)

then hopefully the cbt big players - like rosetta stone and plecto - will deliver their desktop experience in the pre-eminent mobile platform!

however, the biggest opportunity is for ASIAN LANGUAGES to be added to reco engines on the mac!

-- however dont hold your breath: apple has a LONG history of pissing away golden opportunities.

2) mic:

it is unclear if the current limitations of BT (in terms of audio quality) will be solved in the new version 2.1 (which the Airbook supports!).

the info on wikipedia does not stress whether the 8khz encoding is hard-wired into the spec or not ...

certainly the higher bitrates of EDR (2.1 Mbps) are entirely ample to support high fidelity stereo (i'm looking at you iMuffs!), so one would presume that high quality audio-in (22Khz @ 16 bit = 2.2 Mbps) would also not be a problem for EDR!? (if it were custom hardware programmed to using the whole available channel, so it did mot necessarily not be limited to a pre-defined max link rate for audio).

so, hey RF bitheads! -- some clarification would be useful!

cf: http://en.wikipedia.org/wiki/Bluetooth
cf: http://www.bluetooth.com/Bluetooth/T...__Baseband.htm

note: the current spec does specifically state that BT hardware is expected to provide for hard-wired support for audio "at least" at 64kbps (or equivalent quality) ...

"On the air-interface, either a 64 kb/s log PCM (Pulse Code Modulation) format (A-law or μ-law) may be used, or a 64 kb/s CVSD (Continuous Variable Slope Delta Modulation) may be used. The latter format applies an adaptive delta modulation algorithm with syllabic companding. The voice coding on the line interface is designed to have a quality equal to or better than the quality of 64 kb/s log PCM. The table below summarizes the voice coding schemes supported on the air interface."

However, the Av profiles seem to support a wide variety of codecs that could make good use of that constrained audio bandwidth (64k?) ... especially mpeg4 audio (AAC ... which is actuallly an enhancement of mpeg2 audio, but let's not quibble ;-)

"This (stereo) profile relies on GAVDP. It includes mandatory support for low complexity subband codec (SBC) and supports optionally MPEG-1,2 Audio, MPEG-2,4 AAC and ATRAC.

The audio data is compressed in a proper format for efficient use of the limited bandwidth. Surround sound distribution is not included in the scope of this profile."

soooo, the upshot of a quick perusal of the BT spec seems hopeful but not definitive: 8khz sampling is used only for legacy encoding (ie for PCM equivalents such as alpha/mu-law) ... the physical link rate reserved for audio -- (64k?) -- would be enough to handle a reasonable sample rate (16Khz) and a reasonable quantization (16 bits) => 196Kbps raw with at least 3X compression for AAC produces a bitrate ≤ 64Kbps! (ie within the link rate reserved for audio).

if this is correct - then the optional codecs in the BT spec could deliver sufficent quality for vox reco!?

again, the RF bitheads can help us all understand the options (and the commercially available chipsets) ;-)
davidf01 is offline   Reply With Quote
Old 01-18-2008, 05:17 AM   #15
8CoreWhore
Registered User
 
Join Date: Jan 2008
Posts: 457
Dictate for the iPhone?

Considering that Nuance uses this engine for SMS on mobiles, I hope MacSpeech adapts this to the iPhone. It would require the use of the mic for this purpose... an Apple update for the mic to function while phone inactive?
8CoreWhore is offline   Reply With Quote
Old 01-18-2008, 05:47 AM   #16
Sedicivalvole
Registered User
 
Join Date: Feb 2007
Location: London
Posts: 41
My dreams are coming true.

Computer, hello computer!

http://uk.youtube.com/watch?v=v9kTVZiJ3Uc

Apple seem far more alligned with the physical manuipulation of a machine not verbal at the moment
Sedicivalvole is offline   Reply With Quote
Old 01-18-2008, 10:26 AM   #17
willrob
Registered User
 
Join Date: Nov 2006
Posts: 164
MacSpeech claims (on their web site) that bluetooth is not currently accurate for dictation. The second generation iPod Nano however can be fitted with a mic and used as a portable dictation unit which iListen can "type." [the Nano actually comes with voice recording capabilities built it] Progressing to the iPhone would be an obvious step, IF Apple allows it. The Touch unfortunately has no mic and currently MacSpeech doesn't support it, so it's not clear if it has the same internal ability to record. There is a third party app (jailbreak required) that records the voice if one has a specially made mic— but no way to translate that into type automatically (so far).

If you order iListen now (Buy.com has lowest price of $122.99, with mic), you can crossgrade to the new engine for $29 once it ships. I'm not sure where this article is getting the $140 price; there's no sign of that offer on the MacSpeech website.
willrob is offline   Reply With Quote
Old 01-18-2008, 11:53 AM   #18
paulgreen
Registered User
 
Join Date: Jun 2003
Posts: 13
Quote:
Originally Posted by willrob View Post

If you order iListen now (Buy.com has lowest price of $122.99, with mic), you can crossgrade to the new engine for $29 once it ships. I'm not sure where this article is getting the $140 price; there's no sign of that offer on the MacSpeech website.
I bought/pre-ordered "Dictate" for the $149 price at the MacWorld show earlier this week. I don't know whether this was a show-only price, or whether you can order it at this price directly from MacSpeech.com (you could try calling them?). I would guess that if you can order it, the price may only be good during MWSF (i.e. not after today).
paulgreen is offline   Reply With Quote
Old 01-18-2008, 11:58 AM   #19
doemel
Registered User
 
Join Date: Jan 2006
Posts: 75
Quote:
Originally Posted by Delfoniq View Post
That would explain why Microsoft has gotten really good on speech recognition lately

http://www.youtube.com/watch?v=2Y_Jp6PxsSQ

"I think it's picking up a little bit of echo here."

Hahaha, I almost crapped my pants when I saw that! You made my day! Maybe I should give Vista a try, seems to be really funny...
doemel is offline   Reply With Quote
Old 01-18-2008, 12:15 PM   #20
ecking
Registered User
 
Join Date: Feb 2005
Location: Toronto
Posts: 1,564
This is pretty cool, I'd love something like this to use with final draft.


Apple Gear: Mini G4, Pro 2.66, MacBook(Alu)
iPhone 3G, Nano 4th Gen, Classic 120GB

Quote:
Originally Posted by appleinsider vBulletin Message
You have been banned for the following reason:
Three personal attacks in one post. Congratulations.
Date the ban will be lifted: 08-15-2006, 03:00 PM
ecking is offline   Reply With Quote
Old 01-19-2008, 11:44 AM   #21
kkerst
Registered User
 
Join Date: Mar 2005
Location: California
Posts: 15
Hah

Roger Roger, what's your vector Victor, do we have clearance Clarance?

Let's hope it would know how to intrepret that.
kkerst is offline   Reply With Quote
Old 01-21-2008, 03:44 PM   #22
rebbi
Registered User
 
Join Date: Jan 2008
Posts: 2
Great news

I've been using iListen for a couple of years at home, and Dragon NS at the office, and even being something of an Apple fanboy, I have to admit that Dragon has it all over iListen in accuracy, ease of correction, interface, and ease of training. I've been hoping for years that MacSpeech would license the Nuance core technology for a. Mac-only product. This is great news for the Mac market, as the Mac platform desperately needs a first-class speech recognition and transcription solution. Shame it's only for Intel, but it's another reason to sell my Powerbook on eBay...
rebbi is offline   Reply With Quote
Old 01-25-2008, 03:22 AM   #23
webmail
Registered User
 
Join Date: Jun 2003
Posts: 585
I actually use Dragon on my mac (sort of) inside of VMware Fusion to dictate text to word, then I just copy/paste or drag it off windows onto the mac desktop.

I love this software. It's not practical in an office environment all the time. But I've written a lot more freely when I'm alone. It's nice to be able to have your thoughts written out.
webmail is offline   Reply With Quote
Old 03-17-2008, 05:47 AM   #24
JonasLondon
Registered User
 
Join Date: Feb 2008
Location: Londonistan
Posts: 6
Perhaps great product, not so great company?

I bought iListen in Version 1.6 and it just did not work as advertised. When I heard they were integrating Dragon's engine, I was excited. Really excited. Apparently it now works very well.

However, I am very disappointed with the company MacSpeech. After offering the $79 Upgrade for loyal customers, it turns out they wanted to charge another 70 or 79 USD for shipping. I have ordered a lot of things from the US, but for a box of a DVD and a microphone, that is silly money.

-> Email to their support 1

Then I got the information that they are working on the UK store. Great.
They send out another marketing mail, with no link to the UK store. I still "find it", try and order my software. Warning that upgrade code can only be used once. Error on their cart page.

-> Email to their support 2

I then receive an email stating they've fixed it and re-issued my upgrade code.
Again, I log in to upgrade. Seems to work now, but instead they have upped the price to GBP 69.95. Let me translate that for you: That currently equates to $140 USD!!

I don't need any BS from them regarding "transfer pricing" etc., the UK Sales Tax stands at 17.5%, Import Duty (it is low for software and computers, if applicable at all!), and shipping can't bring it to this silly price point.

-> Email to their support 3


At the end of the day, as much as I'd like to use it, I have now wasted time again and again for them to get it right, only to be hit with a ridiculous price for an upgrade. At this price, I might as well have ordered it two weeks ago from the States with the ridiculous shipping cost. Anyway, if I ever get tempted again, I'll just buy it used from Amazon.com and have it shipped here for 29 USD, not a problem at all.

"Higher costs of selling in the UK" would be a BS answer as well, as I am already, cash in hand so to speak, willing to do the final round of financing to get iListen into a working state. BNo need to spend marketing GBP on me, I am already won over.

However, I am not stupid to pay double the price for a silly update. I'd rather type.

If they don't care to attend to their loyal client base, then that is their prerogative. And it is mine to discuss this with other potential clients.

Best thing perhaps would be for Apple to buy the company and replaced the operations and sales teams, and sell it at a sensible price point. Worked great in Shake, Color and other apps.

A former, disappointed MacSpeech customer.


"People are either charming or tedious.” - Oscar Wilde


Last edited by JonasLondon; 03-17-2008 at 05:49 AM.. Reason: typed too fast for iListen. Just kiddin', iListen couldn't do this. Dictate might.
JonasLondon is offline   Reply With Quote
Old 03-19-2008, 10:45 PM   #25
polar315
Registered User
 
Join Date: Jun 2007
Posts: 47
macspeech

Bought it today and installed the microphone and the software. Went to create a profile and poof the Dictate software abends.

Go to the macspeech website...the support part of the site is down...arghhhh
polar315 is offline   Reply With Quote
Old 03-22-2008, 07:22 PM   #26
polar315
Registered User
 
Join Date: Jun 2007
Posts: 47
Macspeech Dictate a victim of it's own success ?

After getting some emails back from Macspeech to try some changes (none of which worked). I am back at square one with software that does not work and out $200 dollars.

No word out of Macspeech but, with the website taking down it's help section can only imagine this is a very wide spread issue. Lots of references to this app crashing exactly as mine does.

Maybe AppleInsider might get a better answer than I am getting from Macspeech.

Not too impressed with taking down the help section of your site when you appear to have a significant issue. Would like to see them say something like "We know there is an issue and we are working on it". Problem for them is that they finally get out the product for shipping and it appears they will be needing to recall product on the shelf or at the very least need an update. Worse yet might require redistribution of discs to people that already have the product. Get picked best of show might go down as the Sports illustrated cover jinx of the IT industry.
polar315 is offline   Reply With Quote
Old 03-24-2008, 11:49 AM   #27
polar315
Registered User
 
Join Date: Jun 2007
Posts: 47
Macspeech has identified that the disk replicator has had some issues. Appears that the data disks are having a higher than normal failure rate http://www.macspeech.com/article_inf...rticles_id=293

Have requested my replacement disks and hope to get a response soon.
polar315 is offline   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -5. The time now is 11:21 AM.


Powered by vBulletin® Version 3.8.4
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.