PDA

View Full Version : Apple proposes acoustic separation for iPhone conference calls


AppleInsider
04-10-2008, 02:10 PM
Apple in the second of two interesting patent filings revealed this week discusses techniques for improving the iPhone's ability to serve as a multi-party communication environment, in which participants on conference calls can be assigned to virtual position in order to improve clarity.

The technique is particularly suited for communication devices with at least two speakers available for audio output, such an iPhone with a connected pair of earphones or headset.

When a conference call is initiated, participants would be presented with a graphical user interface on the iPhone for use in managing the virtual locations for the plurality of participants.

"The visual indication for at least one of the participants can be assigned to a different one of the visually distinct regions, thereby causing an audio sound associated with the participant to be spatially adapted to originate from a virtual location corresponding to the visually distinct region," Apple said in the filing.

"To assist the user of the device in determining and distinguishing the different participants in the multi-party call, directional audio processing can be utilized so that the different sources of audio for the call can be directionally placed in a particular location with respect to the headset. As a result, the user of the device hears the other participants in the multi-party call as sound sources originating from different locations. "

In one implementation, Apple said the assignment to the default positions is automatic, either based on the participants' position geographically or in the order at which the participants joined the multi-party call.

"Next, a participant position screen is displayed," Apple continued with is explanation. "The participant position screen can enable a user to alter the position of one or more of the participants to the multi-party call. Here, the participant position screen is displayed such that a user of the portable communication device can manipulate or otherwise cause one or more of the positions associated with the participants to be changed. In doing so, the user, in one embodiment, can cause the physical movement of a representation of a participant on the participant position screen. Here, a decision determines whether a reposition request has been made. When the decision determines that a reposition request has been made, the associated participant is moved to the specified position."

All the participants on an iPhone conference call could also share media items such as "songs, albums, audiobooks, playlists, movies, music videos, photos, computer games, podcasts, audio and/or video presentations, news reports, and sports updates."



In particular, the patent filing contains considerable discussion of multi-party voice calls with concurrent audio playback. "One aspect of the invention pertains to a wireless system that supports both wireless communications and media playback," Apple said. "The wireless communications and the media playback can be concurrently supported. Consequently, a user is able to not only participate in a voice call but also hear audio playback at the same time."

In such instances, another graphical user interface would be presented on the iPhone's screen to allow each user to "blend" the two audio sources to their individual liking, independent of one another.

"The display screen includes a blend control. The blend control allows a user of the portable electronic device to alter the blend (or mixture) of audio from audio playback and audio from a voice call. [...] The blend control includes a slider that can be manipulated by a user towards either an audio end or a call end. As the slider is moved towards the audio end, the audio playback output gets proportionately greater than the voice call output. On the other hand, when the slider is moved towards the call end, the voice call output gets proportionally greater than the audio playback output. For example, the position of the slider can represent a mixture of the audio playback output and the voice call output with each amplified similarly so that the mixture is approximately 50% audio."



"The audio for each can be altered such that the audio from the incoming call and the audio from the media playback are perceived by a listener (when output to a pair of speakers, either internal or external) as originating from different virtual locations. The different virtual locations can be default positions or user-specified (during playback or in advance). [...] The sender or recipient of the audio sounds pertaining to a media item can be permitted to separately control the volume or amplitude of the audio sounds pertaining to the media item. As a result, the mixture or blend of the audio sounds pertaining to media items as compared to audio sounds pertaining to the voice call can be individually or relatively controlled."

The September 2006 filing, titled "Audio processing for improved user experience," is credited to Apple employees Michael Lee and Derek Barrentine.[ View this article at AppleInsider.com ] (http://www.appleinsider.com/article.php?id=3946)

SpamSandwich
04-10-2008, 02:17 PM
Must've come out of their work with 3D stereo separation with Soundtrack/Final Cut Pro or Steve's movie involvement. This would be interesting applied to stereo audio over a phone call. Very interesting. I wonder what kind of bandwith/data rate is needed for effective spatial audio over a phone?

solipsism
04-10-2008, 02:32 PM
Looks like Apple is utilizing their strengths to take another chunk out of corporate. As much as I hate the cost and, IMO, the pointlessness of conference calls they are very popular. This method could be done on the cheap using 3G and WiFi with ease and perhaps even allow for simple keynote presentations and images to be sent to the device like with iChat A/V.

quinney
04-10-2008, 02:40 PM
iDucker, iDe-esser, iLimiter, iSurroundMixer

Cubert
04-10-2008, 02:45 PM
Jesus, did some kid scrawl on a few napkins when they came up with this patent?

That being said, the best just keeps on (potentially) getting better.

Cubert
04-10-2008, 02:49 PM
Looks like Apple is utilizing their strengths to take another chick out of corporate. As much as I hate the cost and, IMO, the pointlessness of conference calls they are very popular. This method could be done on the cheap using 3G and WiFi with ease and perhaps even allow for simple keynote presentations and images to be sent to the device like with iChat A/V.

What chick would that be? Eliot Spitzer's Kristen?

;)

Booga
04-10-2008, 02:49 PM
Yes! Yes! OMG YES FTW!

As someone who spends a couple hours a day in audio conferences, lack of positional audio is a huge, huge frustration. It makes a lot of conversations turn into an unintelligible jumble. Giving each member a position is a great first step, but I'd love to see stereo/surround microphones specially built for audioconferencing and a protocol to match.

mydo
04-10-2008, 02:54 PM
Must've come out of their work with 3D stereo separation with Soundtrack/Final Cut Pro or Steve's movie involvement. This would be interesting applied to stereo audio over a phone call. Very interesting. I wonder what kind of bandwith/data rate is needed for effective spatial audio over a phone?

Can't be all that 3D with only two speakers. At best it's 2D.

aplnub
04-10-2008, 03:03 PM
OmniGraffle?

McHuman
04-10-2008, 03:04 PM
Theres nothing sexy about conference calls.

echosonic
04-10-2008, 03:06 PM
iDucker, iDe-esser, iLimiter, iSurroundMixer


iCompressor, iDelay, iGate, iVerb

echosonic
04-10-2008, 03:09 PM
iCompressor, iDelay, iGate, iVerb

iPan, iGain, iChorus, iPitch

gbrandt
04-10-2008, 03:22 PM
Not patentable. R & D from all major telco and cell phone companies has been ongoing for years.

The ARM is fully capable of positional audio, and many companies already provide 3d audio optimized for ARM (and other) processors including QSound and Beatnik.

Someone asked about bandwidth requirements...there are no extra bandwidth requirements for 3d audio All you need is two speakers and a position and the filters do the rest.

Gregor

physguy
04-10-2008, 03:26 PM
Must've come out of their work with 3D stereo separation with Soundtrack/Final Cut Pro or Steve's movie involvement. This would be interesting applied to stereo audio over a phone call. Very interesting. I wonder what kind of bandwith/data rate is needed for effective spatial audio over a phone?

As I understand this no bandwith/data rate changes are required. This is basically like assigning a different balance level to each source on the conference call. I assume this can only work if the iPhone is the aggregator of the caller - i.e. you call one person, put on hold, call another, etc. Then it can assign each source to a different virtual location. If you a simply part of another conference call then I don't see how the iphone could do anything about this as it would have to 'recognize' voices or have a tag sent by the conference center each time a speaker changed.

mydo
04-10-2008, 03:30 PM
Not patentable. R & D from all major telco and cell phone companies has been ongoing for years.

The ARM is fully capable of positional audio, and many companies already provide 3d audio optimized for ARM (and other) processors including QSound and Beatnik.

Someone asked about bandwidth requirements...there are no extra bandwidth requirements for 3d audio All you need is two speakers and a position and the filters do the rest.

Gregor

From what I understand from the picture they are patenting the concept and user interface that allows someone to manage conference calls this way.

gbrandt
04-10-2008, 03:30 PM
Well, its a bit more complex than simple balance control per person. Sound can actually be place behind a person (although not perfectly) but directly above and below works quite well.

gbrandt
04-10-2008, 03:33 PM
From what I understand from the picture they are patenting the concept and user interface that allows someone to manage conference calls this way.
You are probably right. But they may still have issues. I've seen demo programs that allow you to move a speaker (person) around at will.

Gregor

Gustav
04-10-2008, 03:35 PM
Can't be all that 3D with only two speakers. At best it's 2D.

Really? How many ears do you have?

It's quite possible to do 3D sound with two audio sources. Clever frequency and harmonic processing will give very good 3D spatialization. Speakers will work ok, but headphones should much better as they know the sources are directly at your ears.

This is a pretty cool idea.

mydo
04-10-2008, 03:35 PM
Yea. I think most of these software patents are "obvious" but patents haven't been about protecting ideas for a long time;)

physguy
04-10-2008, 03:43 PM
Well, its a bit more complex than simple balance control per person. Sound can actually be place behind a person (although not perfectly) but directly above and below works quite well.

Hence the word "basically". The main point is that the iPhone has to be the aggregator of the conference call for this to be useful otherwise it has no idea which sound to assign to which position.

SpamSandwich
04-10-2008, 03:51 PM
SRS simulates 3D audio with only 2 speakers, so be advised.

CREB
04-10-2008, 03:51 PM
I have been on far too many conference calls to count. I see no redeeming value with acoustic separation. I am listening for what is important in a conference call—not whether Betty or Bob are pleasantly acoustically separated. Business has fundamentals; this is just bordering on the ridiculous. Now, in an entertainment situation...that's an entirely different matter.

Feynman
04-10-2008, 03:53 PM
Bye bye RIM :D

physguy
04-10-2008, 04:00 PM
I have been on far too many conference calls to count. I see no redeeming value with acoustic separation. I am listening for what is important in a conference call—not whether Betty or Bob are pleasantly acoustically separated. Business has fundamentals; this is just bordering on the ridiculous. Now, in an entertainment situation...that's an entirely different matter.

If this could actually be offered by real conferencing system I would disagree strongly. On conference calls you not only need to hear 'what' but also 'by whom'. Without that information a tremendous about of context of the meaning is often lost leading to miscommunication. If everyone has significantly difference vocal characteristics that all is well but if two, more or several pairs of people of similar vocal characteristics you find yourself asking 'who was that' or, if you don't want to interrupt the flow simply letting it go. This, in principle, would be extremely valuable but, with standard telephony you don't have even the possibility of two-channel transmission to make this possible.

CREB
04-10-2008, 04:09 PM
If this could actually be offered by real conferencing system I would disagree strongly. On conference calls you not only need to hear 'what' but also 'by whom'. Without that information a tremendous about of context of the meaning is often lost leading to miscommunication. If everyone has significantly difference vocal characteristics that all is well but if two, more or several pairs of people of similar vocal characteristics you find yourself asking 'who was that' or, if you don't want to interrupt the flow simply letting it go. This, in principle, would be extremely valuable but, with standard telephony you don't have even the possibility of two-channel transmission to make this possible.

You're using a bloody mobile phone! C'mon get real. It's a bloody mobile phone used in various environments—because it's mobile—not some cozy conference room where I'd use the office phone versus a mobile phone for obvious reasons. How many people here have actually worked in major corporate environments?

physguy
04-10-2008, 04:20 PM
You're using a bloody mobile phone! C'mon get real. It's a bloody mobile phone used in various environments—because it's mobile—not some cozy conference room where I'd use the office phone versus a mobile phone for obvious reasons. How many people have actually worked in major corporate environments?

Do you read before you write?:)

I'm talking about in a 'corporate environment', and yes I'm familiar. That said, having this in 'stereo' on a mobile phone with headsets would still be VERY valuable. I am often on a conf. call in a lounge or similar where it is quiet enough to utilize what this type of approach would offer. But, again, current standard telephony would not allow this as it is a single channel of audio. (Along with other limitations such a frequency range, phase alignment, etc. I am aware of our '3D audio' works).

dsmudger
04-10-2008, 04:29 PM
Don't know about anyone else but before I saw the image showing the UI, I was thinking of a more touchflo style interface for positioning the sources than the top-down diagram they have there?

Sort of like iChat, which I think has a sort of 3d layout when you share a presentation or document - imagine that, but with you and two other participants in a conference - one could be video, the other maybe a contact picture if it was voice only (or any combination video/contact picture obviously..) - tap, hold and drag the video/picture to swap positions -audio doesn't jump across or cut out as you drag, but is 3d positioned from start to finish, a-la Creative EAX, but maybe simpler and in software - and the overall appearance is like everyone is facing inwards in a triangle as you look into the screen? (with all the usual touchflo Appley black reflect-i-ness going on around it :))

Bit of a tangent, but I'm wondering whether current 3G infrastructure can even support a handset aggregating a video call with two others either voice or video..? Would the video participants have to receive half-resolution video on the downlink to fit two video channels into one call (assuming we're not talking about building two radios into the handset)? and how would you get two separate channels of audio coming down as others have mentioned? I always thought GSM (and I guess 3G) conference calling was handled at the operator end and you always got the pre-mixed audio down a mono audio channel? (assuming they can't go in and rewrite the audio codecs and GSM protocols at this point)

Who knows/anyway, if not, they could always just do all this over wifi instead - wonder if the SDK allows the necessary kind of access to do this eh? :)

CREB
04-10-2008, 04:32 PM
Do you read before you write?:)

I'm talking about in a 'corporate environment', and yes I'm familiar. That said, having this in 'stereo' on a mobile phone with headsets would still be VERY valuable. I am often on a conf. call in a lounge or similar where it is quiet enough to utilize what this type of approach would offer. But, again, current standard telephony would not allow this as it is a single channel of audio. (Along with other limitations such a frequency range, phase alignment, etc. I am aware of our '3D audio' works).

With all due respect...I simply do not buy it. Given the myriad of corporate environments, all the way from the plush office to the being in the most adverse of field conditions, I prefer something more purpose-built. In the field I carry a military spec mobile phone (because is has to work for all the right reasons); at the office I carry a different phone. It is what being said versus whom the hell said it that is important to most serious business people. I wonder what Warren Buffet would have to say about all this nonsense? For that matter I dare you to ask Steve Jobs if he gives a true rat-arse about this as he runs Apple (I seriously doubt it as have read about Jobs, and in speaking with the friends I have that have worked with him).

shamino
04-10-2008, 04:51 PM
I have been on far too many conference calls to count. I see no redeeming value with acoustic separation. I am listening for what is important in a conference call—not whether Betty or Bob are pleasantly acoustically separated. Business has fundamentals; this is just bordering on the ridiculous. Now, in an entertainment situation...that's an entirely different matter.
Think of what happens when you're on a many-way conference call and a few people all try to speak at once. The voices all get muddled and you can't make out anything that was said.

With decent stereo separation, it will be much easier to separate the voices - just like you can do in a face-to-face meeting.

The real interesting thing here is going to be getting carriers involved. When you make a conference call over land lines, the sound from the various parties is multiplexed in the central office (or at a PBX or a conference bridging-center). Under that circumstance, then the phone won't be able to separate the streams and reposition them.

If, however, you receive each party's sound as a separate data stream, then this system shouldn't be that hard to implement. I've already seen this feature in standalone video conferencing systems. (Doesn't iChat also do this to some extent when you have a multi-way video chat?)

Does anyone know where the audio is mixed for GSM-based conference calls? If they're mixed at a centralized location, then I think this feature will require changes to the carrier's infrastructure in order to make it all work.

CREB
04-10-2008, 04:59 PM
Think of what happens when you're on a many-way conference call and a few people all try to speak at once. The voices all get muddled and you can't make out anything that was said.

With decent stereo separation, it will be much easier to separate the voices - just like you can do in a face-to-face meeting.
Simply a matter of basic etiquette versus trying to assimilate garbled information. A good read and use of Robert's Rules provides for better meetings, and conference calls than this iPhone feature will ever provide.

Rot'nApple
04-10-2008, 05:30 PM
Yes! Yes! OMG YES FTW!

As someone who spends a couple hours a day in audio conferences, lack of positional audio is a huge, huge frustration. It makes a lot of conversations turn into an unintelligible jumble. Giving each member a position is a great first step, but I'd love to see stereo/surround microphones specially built for audioconferencing and a protocol to match.

You beat me to my post.

Insteand of "mano a mano" confence calls they will be "mono a mono":lol:

So much for that "stereo" effect one enjoys out of their earplugs/headsets.

Alright "mano a mano" is Spanish for hand to hand. What?! you wanted the Spanish version of "ear to ear"? "oido a oido" for you men or "oreja a oreja" for you women. It just wouldn't sound right "pun" wise.

solipsism
04-10-2008, 05:41 PM
Alright "mano a mano" is Spanish for hand to hand. What?! you wanted the Spanish version of "ear to ear"? "oido a oido" for you men or "oreja a oreja" for you women. It just wouldn't sound right "pun" wise.

Technically it's oral to aural.

GregAlexander
04-10-2008, 05:47 PM
As I understand this no bandwith/data rate changes are required. This is basically like assigning a different balance level to each source on the conference call. I assume this can only work if the iPhone is the aggregator of the caller

Sounds right to me. Only the aggregator gets to hear the separated voices... the others on the call get it all combined. Of course, if the iPhone is the only phone in the conference with stereo headset then who cares... but the long term plan would have to be to separate them out for all participants - much like an iChat video conference.

addicted44
04-10-2008, 10:49 PM
Simply a matter of basic etiquette versus trying to assimilate garbled information. A good read and use of Robert's Rules provides for better meetings, and conference calls than this iPhone feature will ever provide.

Directional sound is not new technology, and has been implemented by many companies. How would having directional sound "hurt" in any ways? First of all, it can easily be made completely optional. Secondly, you dont necessarily need to place someone behind you, and someone else in front of you, but instead if you are in a conference call with two people, and instead of both persons sounding like they are speaking from the same place (e.g. front), if one sounds like he/she is speaking from slightly left of front, and the other slightly right of front, how would this be any worse than what you have now? On the other hand, it will make it very easy to identify who is speaking what even if the voices sound similar.

Btw, if this issue prevents a person from reading a 400+ book just to have a conference call, then more power to them! Additionally, a lot of conference calls are not even done with people from your own company. Are you gonna hang up on your client who is giving you half your business because he is not courteous? Also, basic etiquette does not help identify who is speaking when you are speaking to 3 or 4 complete strangers, whose voices possibly sound similar (very common especially in international calls).

JeffDM
04-11-2008, 12:02 AM
Simply a matter of basic etiquette versus trying to assimilate garbled information. A good read and use of Robert's Rules provides for better meetings, and conference calls than this iPhone feature will ever provide.

That's a good suggestion for advanced civilizations that aren't based on naked apes in suits, but this is Earth here, that's the best we have. I doubt that you'll get a whole lot of the suited naked apes to go along with it.

solipsism
04-11-2008, 12:08 AM
That's a good suggestion for advanced civilizations that aren't based on naked apes in suits, but this is Earth here, that's the best we have. I doubt that you'll get a whole lot of the suited naked apes to go along with it.

It was also Earth when the apes were suited.
http://www.avclub.com/content/files/images/planet-of-the-apes.article.jpg

BTW, how can something be in a suit and be naked at the same time?

JeffDM
04-11-2008, 12:29 AM
It was also Earth when the apes were suited.

BTW, how can something be in a suit and be naked at the same time?

Make that hairless apes.

mydo
04-11-2008, 10:47 AM
Really? How many ears do you have?

It's quite possible to do 3D sound with two audio sources. Clever frequency and harmonic processing will give very good 3D spatialization. Speakers will work ok, but headphones should much better as they know the sources are directly at your ears.

This is a pretty cool idea.

Not when they are stuck in your ear. In that situation the sound is only coming from a single point that is fixed with respect to the head. So the only parameter that can change is the blend between right and left and hence 2D only.

JeffDM
04-11-2008, 11:05 AM
Not when they are stuck in your ear. In that situation the sound is only coming from a single point that is fixed with respect to the head. So the only parameter that can change is the blend between right and left and hence 2D only.

That's not really true because of the psychoacoustics used. The audio spectrum can be adjusted because sounds that come from above and below sound different because of how it hits our ears, the lobes reflect and scatter sounds in different ways depending on position. The software mimics that effect to make it sound like it's "out there".

mydo
04-11-2008, 12:47 PM
That's not really true because of the psychoacoustics used. The audio spectrum can be adjusted because sounds that come from above and below sound different because of how it hits our ears, the lobes reflect and scatter sounds in different ways depending on position. The software mimics that effect to make it sound like it's "out there".

Yes but when you have an ear bud stuck in your ear there is no "above" or "below". Your lobes are out of the equation because the buds are stuck in the ear past the lobe.

JeffDM
04-11-2008, 01:03 PM
Yes but when you have an ear bud stuck in your ear there is no "above" or "below". Your lobes are out of the equation because the buds are stuck in the ear past the lobe.

You clearly aren't getting the point. The incoming audio signal coming to the earbuds is readjusted to simulate that effect.

mydo
04-11-2008, 01:09 PM
You clearly aren't getting the point. The incoming audio signal coming to the earbuds is readjusted to simulate that effect.

I would say the same about you.


Given that the speak is placed in the ear canal a few centimeters from the ear drum and never moves (relative to the ear drum) how does it create a spacial effect to simulate a sound coming from a distant place?

What "adjustment" is being done other than volume? The best you'll get is right left stereo which is not "3D". It's actually only 2Dis.

JeffDM
04-11-2008, 01:12 PM
Given that the speak is placed in the ear canal a few centimeters from the ear drum and never moves (relative to the ear drum) how does it create a spacial effect to simulate a sound coming from a distant place?

What "adjustment" is being done other than volume? The best you'll get is right left stereo which is not "3D". It's actually only 1D.

It's not just a simple volume adjustment. They change various parts of the audio spectrum, like a very high-tech equalizer, though there's other trickery in there too to, digitally simulate the same effect. In higher math terms, I think they apply some sort of convolution to the signal.

You can probably go to the Dolby labs site and listen to examples of Dolby Headphone to try for yourself.

solipsism
04-11-2008, 01:16 PM
I would say the same about you.


Given that the speak is placed in the ear canal a few centimeters from the ear drum and never moves (relative to the ear drum) how does it create a spacial effect to simulate a sound coming from a distant place?

What "adjustment" is being done other than volume? The best you'll get is right left stereo which is not "3D". It's actually only 2Dis.

• http://en.wikipedia.org/wiki/3D_audio_effect

mydo
04-11-2008, 02:09 PM
• http://en.wikipedia.org/wiki/3D_audio_effect

Yea that's exactly how I thought it would work. When do you linear combinations of only two point source adjacent to the receiver (ear drum) you get something 2Dish. Fun tricks can be played by adding opposite ear echo to voice but in the end it will be 2Dis and not 3D.


Listen to this example.

http://gprime.net/flash.php/soundimmersion

To me all the sounds are "behind" me. Even when I swap my ear buds L/R. The shaking match box is either in my ear or just behind it. The foot falls are dislocated with the shaking match box and don't seem to have any real location other then off to the left or right. So I wouldn't call that 3D.

It would be great for a conference call but beyond 3 speakers I think it would get muddled. If my brain hears it like everyone is behind me then I'd get annoyed. I see no way to get in front of or behind location out of this technology.

solipsism
04-11-2008, 02:16 PM
Yea that's exactly how I thought it would work. When do you linear combinations of only two point source adjacent to the receiver (ear drum) you get something 2Dish. Fun tricks can be played by adding opposite ear echo to voice but in the end it will be 2Dis and not 3D.


Listen to this example.

http://gprime.net/flash.php/soundimmersion

To me all the sounds are "behind" me. Even when I swap my ear buds L/R. The shaking match box is either in my ear or just behind it. The foot falls are dislocated with the shaking match box and don't seem to have any real location other then off to the left or right. So I wouldn't call that 3D.

It would be great for a conference call but beyond 3 speakers I think it would get muddled. If my brain hears it like everyone is behind me then I'd get annoyed. I see no way to get in front of or behind location out of this technology.

I don't know why you don't consider that 3D. The link you posted is completely 3 dimension with my in-ear headphones on. I can easily point out when the matches are being shook in any area around me. My eyes even half expect to see something in front of me I know isn't there. It's quite freaky.

mydo
04-11-2008, 02:26 PM
I don't know why you don't consider that 3D. The link you posted is completely 3 dimension with my in-ear headphones on. I can easily point out when the matches are being shook in any area around me. My eyes even half expect to see something in front of me I know isn't there. It's quite freaky.

I agree that it's very good but it's not 3D. To me the sound comes from a 2D plane that's around me. I didn't hear any above or below. The brain can fill in a lot of gaps but the fact that I heard it from behind and you heard it from the front highlight the problem.

solipsism
04-11-2008, 02:30 PM
but the fact that I heard it from behind and you heard it from the front highlight the problem.

I heard it from behind too. It went in several circles around my head from left to right (or clockwise if you were looking down on me) and then did the left side up and down and then the right side up and down. I have good in-ear speakers so perhaps that is the difference.

I only referenced "in front of me" when referring to my vision as my eyes currently only face forward... but I'm working on that.

shamino
04-11-2008, 05:14 PM
Simply a matter of basic etiquette versus trying to assimilate garbled information. A good read and use of Robert's Rules provides for better meetings, and conference calls than this iPhone feature will ever provide.
In other words, you are saying "the feature is useless because nobody I talk to ever talks out of turn and nobody I talk to ever gets into an argument".

Congratulations. You are the only one who lives in this dream world. The rest of us have to deal with normal human beings and a piece of technology that makes it easier to understand them when they're not trying to bend their natures to arbitrary rules is useful.

It's amazing. Any time someone announces a new product or concept, there's always someone like you who comes along and says "it's garbage because I, personally don't need it", and will get violently angry with anyone who disagrees. Perhaps you should consider the possibility that the world has people that are not clones of yourself.

randomroyalty
04-12-2008, 10:16 AM
http://goldtent.com/wp_gold/wp-content/uploads/2007/10/cone_title.gif

datamodel
04-14-2008, 05:55 AM
OmniGraffle?

But why, when scribbling on the back of an envelope is so much quicker?

Or perhaps they have the little known "Back of Fag Packet" stencil set we use to design our networks with...

dfiler
04-14-2008, 09:09 AM
Can't be all that 3D with only two speakers. At best it's 2D.Humans also only have two ears... but hear in 3D. There are tonal shifts corresponding to angle at which sounds are heard. These tonal shifts are now possible via software. Granted, you're right that Apple will probably only attempt to apply the effect in 2 dimensions, or is that one dimension? ;) :lol:

gloss
04-14-2008, 10:02 AM
Yes but when you have an ear bud stuck in your ear there is no "above" or "below". Your lobes are out of the equation because the buds are stuck in the ear past the lobe.

I can only assume you've never listened to binaural audio, or you would know better than this.

And by the way, that link you posted is a good example. If you can't figure out exactly where the noise is in 3D space at all times, then you need to have your ears checked. I'm not even kidding. The dimensionality was spectacular, and if you can't perceive it then there's something wrong with your spacial sense.

mydo
04-15-2008, 08:55 AM
Humans also only have two ears... but hear in 3D. There are tonal shifts corresponding to angle at which sounds are heard. These tonal shifts are now possible via software. Granted, you're right that Apple will probably only attempt to apply the effect in 2 dimensions, or is that one dimension? ;) :lol:

Humans can tilt their head and use parts of the skull to hear too. How often do you hear something and can't tell where it's coming from. ONLY 2 EARS!

mydo
04-15-2008, 08:57 AM
I can only assume you've never listened to binaural audio, or you would know better than this.

And by the way, that link you posted is a good example. If you can't figure out exactly where the noise is in 3D space at all times, then you need to have your ears checked. I'm not even kidding. The dimensionality was spectacular, and if you can't perceive it then there's something wrong with your spacial sense.

I assume you haven't had classes in advanced math and linear systems. Otherwise you'd know that two point sources next to two receivers can't produce real 3D sound.

dfiler
04-15-2008, 10:08 AM
Mydo, just what is it that you're trying to disprove? :???:

You seem to be arguing against something but I can't quite figure out what. Is it that you believe binaural sound is a complete sham?

JeffDM
04-15-2008, 10:28 AM
But why, when scribbling on the back of an envelope is so much quicker?

Or perhaps they have the little known "Back of Fag Packet" stencil set we use to design our networks with...

It's like saying a napkin drawing of a part design is faster than drawing it in CAD. It's true, but that's not the point.

Scribbling on a piece of scratch paper is great for brainstorming, but I wouldn't a accept it as a substitute for proper documentation.

gloss
04-15-2008, 10:30 PM
I assume you haven't had classes in advanced math and linear systems. Otherwise you'd know that two point sources next to two receivers can't produce real 3D sound.

You are so misguided I don't even really want to waste my time. By mimicking the aural shadow of a human head and ear, a binaural recording deposited directly into your ear canal (having been RECORDED directly within a simulated ear canal) produces something that is about as close to 'real 3D sound' as you're ever going to get. The recording itself contains all of the modified and distorted frequencies that your head/ears would normally generate, and in using earbuds you are bypassing the physical and replacing it with the virtual.

And as previously stated, if you can't hear it in that recording you linked to, you have a hearing problem. The accuracy of the dimensionality is so eerie that I can pick out exactly where behind, in front of, above, or off to the side the noise is coming from, and several others here have attested to the same thing. Your inability to hear it is not a disproof of the concept.

mydo
04-16-2008, 10:53 PM
You are so misguided I don't even really want to waste my time. By mimicking the aural shadow of a human head and ear, a binaural recording deposited directly into your ear canal (having been RECORDED directly within a simulated ear canal) produces something that is about as close to 'real 3D sound' as you're ever going to get. The recording itself contains all of the modified and distorted frequencies that your head/ears would normally generate, and in using earbuds you are bypassing the physical and replacing it with the virtual.

And as previously stated, if you can't hear it in that recording you linked to, you have a hearing problem. The accuracy of the dimensionality is so eerie that I can pick out exactly where behind, in front of, above, or off to the side the noise is coming from, and several others here have attested to the same thing. Your inability to hear it is not a disproof of the concept.

OMG. How exactly does one do that with a teleconference? Three people are talking into a cheap phone mic' and ... some magic occurs ... to create something "having been RECORDED directly within a simulated ear canal" and ....

We're talking about phones here people. Not a recording studio.

JeffDM
04-16-2008, 10:57 PM
OMG. How exactly does one do that with a teleconference? Three people are talking into a cheap phone mic' and ... some magic occurs ... to create something "having been RECORDED directly within a simulated ear canal" and ....

We're talking about phones here people. Not a recording studio.

The effect is simulated with special transforms applied to the incoming signals to simulate how sound bounces off the ear from various places, it's like an audio version of a 3D rendering. It's not really 3D, but we see it that way, even if it's synthetic. This audio treatment has been available for use in products for at least a decade now. And yes, the quality does vary from person to person, and from headphone to headphone. But generally, it does work.

gloss
04-16-2008, 11:36 PM
OMG. How exactly does one do that with a teleconference? Three people are talking into a cheap phone mic' and ... some magic occurs ... to create something "having been RECORDED directly within a simulated ear canal" and ....

We're talking about phones here people. Not a recording studio.

I was mostly disputing the fact that you seem to imply that binaural audio is somehow an impossibility, when clearly it's not.

On this conference-calling thing, though I concur with you. It may be a nifty feature, but I don't see it being anything particularly game-changing, and I certainly don't buy that it will produce anything more than a slight delineation between parties.

ByronVanArsdale
04-17-2008, 11:02 AM
Yes! Yes! OMG YES FTW!

As someone who spends a couple hours a day in audio conferences, lack of positional audio is a huge, huge frustration. It makes a lot of conversations turn into an unintelligible jumble. Giving each member a position is a great first step, but I'd love to see stereo/surround microphones specially built for audioconferencing and a protocol to match.

Have you tried having people say their name first before speaking? Etiquette is essential on a conference call of any size. Can't wait for stereo microphones!

ByronVanArsdale
04-17-2008, 11:06 AM
Theres nothing sexy about conference calls.

After six hours on the tarmac at DFW waiting to de-ice, conference calls were looking VERY sexy to me!:lol:

ByronVanArsdale
04-17-2008, 11:28 AM
If this could actually be offered by real conferencing system I would disagree strongly. On conference calls you not only need to hear 'what' but also 'by whom'. Without that information a tremendous about of context of the meaning is often lost leading to miscommunication. If everyone has significantly difference vocal characteristics that all is well but if two, more or several pairs of people of similar vocal characteristics you find yourself asking 'who was that' or, if you don't want to interrupt the flow simply letting it go. This, in principle, would be extremely valuable but, with standard telephony you don't have even the possibility of two-channel transmission to make this possible.

I have been on far too many conference calls to count. I see no redeeming value with acoustic separation. I am listening for what is important in a conference call—not whether Betty or Bob are pleasantly acoustically separated. Business has fundamentals; this is just bordering on the ridiculous. Now, in an entertainment situation...that's an entirely different matter.

Agree strongly with physguy. CREB - you may be right about no net gain from acoustic separation. I've spent thousands of hours on conference calls use my binaural headset to cut out noise and distraction. The idea of being able to position say Betty on my left, Bob on my right is useful IF we are talking small number of call participants. I said it previously, call etiquette works well ONCE you get everyone trained to follow it. Psyguy is right - information without context = miscommunication.

Personally, I always wanted one of those gizmos that would make me sound like Darth Vader when leading my calls - something to mix it up a bit!:D

ByronVanArsdale
04-17-2008, 11:46 AM
With all due respect...I simply do not buy it. Given the myriad of corporate environments, all the way from the plush office to the being in the most adverse of field conditions, I prefer something more purpose-built. In the field I carry a military spec mobile phone (because is has to work for all the right reasons); at the office I carry a different phone. It is what being said versus whom the hell said it that is important to most serious business people. I wonder what Warren Buffet would have to say about all this nonsense? For that matter I dare you to ask Steve Jobs if he gives a true rat-arse about this as he runs Apple (I seriously doubt it as have read about Jobs, and in speaking with the friends I have that have worked with him).

Do you read before you write?:)

I'm talking about in a 'corporate environment', and yes I'm familiar. That said, having this in 'stereo' on a mobile phone with headsets would still be VERY valuable. I am often on a conf. call in a lounge or similar where it is quiet enough to utilize what this type of approach would offer. But, again, current standard telephony would not allow this as it is a single channel of audio. (Along with other limitations such a frequency range, phase alignment, etc. I am aware of our '3D audio' works).

CREB - depends on the type of and frequency of the conference call. Say you are meeting the same project team for the past year - the group knows each other and has a flow to it. Add a new member to the team and they'd be lost trying to track who is saying what. A huge time waster on calls starts with something like this: "I can't remember who said this but...." which is quickly followed up by "oh, that was Betty" and as phsyguy said, the flow of the call is broken.

How many times has a project team had to "redo" something based on a simple miscommunication during a conference call. Very expensive to everyone. Now, if I ever AM in a position to query Steve Jobs or Warren Buffet, I fully promise I'll query them and let you know what I learned.

ByronVanArsdale
04-17-2008, 11:49 AM
Simply a matter of basic etiquette versus trying to assimilate garbled information. A good read and use of Robert's Rules provides for better meetings, and conference calls than this iPhone feature will ever provide.

Thanks CREB - great resource that I didn't know about.

ByronVanArsdale
04-17-2008, 12:18 PM
Think of what happens when you're on a many-way conference call and a few people all try to speak at once. The voices all get muddled and you can't make out anything that was said.

With decent stereo separation, it will be much easier to separate the voices - just like you can do in a face-to-face meeting.

The real interesting thing here is going to be getting carriers involved. When you make a conference call over land lines, the sound from the various parties is multiplexed in the central office (or at a PBX or a conference bridging-center). Under that circumstance, then the phone won't be able to separate the streams and reposition them.

If, however, you receive each party's sound as a separate data stream, then this system shouldn't be that hard to implement. I've already seen this feature in standalone video conferencing systems. (Doesn't iChat also do this to some extent when you have a multi-way video chat?)

Does anyone know where the audio is mixed for GSM-based conference calls? If they're mixed at a centralized location, then I think this feature will require changes to the carrier's infrastructure in order to make it all work.

Directional sound is not new technology, and has been implemented by many companies. How would having directional sound "hurt" in any ways? First of all, it can easily be made completely optional. Secondly, you dont necessarily need to place someone behind you, and someone else in front of you, but instead if you are in a conference call with two people, and instead of both persons sounding like they are speaking from the same place (e.g. front), if one sounds like he/she is speaking from slightly left of front, and the other slightly right of front, how would this be any worse than what you have now? On the other hand, it will make it very easy to identify who is speaking what even if the voices sound similar.

Btw, if this issue prevents a person from reading a 400+ book just to have a conference call, then more power to them! Additionally, a lot of conference calls are not even done with people from your own company. Are you gonna hang up on your client who is giving you half your business because he is not courteous? Also, basic etiquette does not help identify who is speaking when you are speaking to 3 or 4 complete strangers, whose voices possibly sound similar (very common especially in international calls).

Simply put, etiquette is everything when leading a cc.

Establishing etiquette and maintaining it throughout the call is essential for leading an effective call, especially when you have cc's with internal and external participants. It is simple - have everyone say their name first. If you are interested, here's a post I've written about handling Skype based cc's where there is time delay in the responses.

http://www.conferencecalltraining.com/power/?p=89

Hanging up on clients does not sound like a sound business strategy! The basic problem with cc's is that no one has ever set a consistent standard for how they are conducted. We are in the middle of the evolution of how to run effective cc's (webinars, web-based meetings, etc). Meeting management went through a similar evolution in the 80's and 90's.

With travel costs and delays (sorry to anyone caught in the recent AA mess), there will be more and more people utilizing cc's. Thanks Apple for pushing the envelope. Now if Jobs can just get AT&T to keep up....

ByronVanArsdale
04-17-2008, 12:44 PM
In other words, you are saying "the feature is useless because nobody I talk to ever talks out of turn and nobody I talk to ever gets into an argument".

Congratulations. You are the only one who lives in this dream world. The rest of us have to deal with normal human beings and a piece of technology that makes it easier to understand them when they're not trying to bend their natures to arbitrary rules is useful.

Shamino - I don't think CREB said no one talks out of turn or gets into an argument.... it was about listening only for the important bullets. Sounds smart to me if you are the participant and not the leader. The leader has a bigger challenge.

It is not the role of technology (aside from say a taser :D) to derail politically motivated people who use cc's to further their personal agenda. It IS the role of the leader to develop the skills to make the cc as productive for everyone on the call as possible.

Any tool, tech or otherwise, that make it easier to lead the call is a step in the right direction. The better the technology, the more the leader & participants can focus on being productive.

ByronVanArsdale
04-17-2008, 01:12 PM
OMG. How exactly does one do that with a teleconference? Three people are talking into a cheap phone mic' and ... some magic occurs ... to create something "having been RECORDED directly within a simulated ear canal" and ....

We're talking about phones here people. Not a recording studio.

I was mostly disputing the fact that you seem to imply that binaural audio is somehow an impossibility, when clearly it's not.

On this conference-calling thing, though I concur with you. It may be a nifty feature, but I don't see it being anything particularly game-changing, and I certainly don't buy that it will produce anything more than a slight delineation between parties.

Not sure I agree gloss - no one has mentioned the bane of background noise. (can' t wait to get a jawbone so I can call from any environment without distracting the call). If you could control/eliminate things like background noise (or music on hold!) without a full global/individual mute (currently available on most teleconference lines), this WOULD be game-changing.

Perhaps Apple will provide an equalizer with settings like: Cut out background noise; Boost volume of a specific caller; Slow rate of speech; and my two personal favorites: Deliver small electric shock to a specific caller :lol: and the much needed TIVO function that automatically puts the call on hold and allows you to back up and re-listen to what someone just said!