Marvin

About

Username: Marvin
Joined: February 2006
Visits: 131
Last Active: July 24
Roles: moderator
Points: 7,013
Badges: 2
Posts: 15,588

Reactions

2.7KLike19Dislike859Informative

Apple to hold first in-person summit in years, focused on AI

Marvin

February 2023

kestral said:

What is Apple doing about ChatGPT / Bard and the AI wars? This is huge. Apple hired an AI guy, time for him to earn his pay.

The top AI guy that Apple hired left due to Apple's remote work policy and went to work for Google DeepMind last year:

https://appleinsider.com/articles/22/05/07/apples-director-of-machine-learning-exits-over-return-to-office-policy
https://www.linkedin.com/in/ian-goodfellow-b7187213

He was at Apple for over 3 years. Still, a lot of this stuff is open source now.

This could be Apple's entrance into online search that was rumored a while ago. It could behave much like Siri with conversational search but obviously more advanced and a bit more accurate due to typing instead of speaking.
New Mac Pro may not support PCI-E GPUs

Marvin

February 2023

tenthousandthings said:

programmer said:

I haven't seen it mentioned here that there is a fundamental architectural difference between the Apple Silicon GPU and pretty much every other GPU architecture (except Imagination/PowerVR): it is a tile-based architecture. If you look at the Metal programming manual, there are numerous differences in how the two types of architectures need to be dealt with by the application. Those differences will be even more severe at the OS/driver level. Apple is quite aggressive at dropping old hardware in order to reduce their software burden (given, as mentioned above, software lifecycles are actually longer than hardware delivery cycles), and I would imagine they want to get to a place where (in Metal 4 or 5) they can completely focus on their hardware's tile-based architecture, and not have to accommodate AMD/nVidia designs. Same applies to the rumoured "ray tracing" support [...]

Great post, really helpful, thank you for taking the time to write it. Pardon me if I'm wrong about this, but isn't everyone moving toward tile-based/chiplet-based architectures, not just Apple? I know AMD's RDNA 3 architecture uses "chiplets" (AMD's term for tiles). So even while Apple is moving toward dropping support in Metal 4 or 5 for older architectures, the rest of the industry is moving along with them. So accommodating AMD's current, chiplet-based 7000-series designs (and future designs) in the Apple Silicon lattice Mac Pro wouldn't necessarily be too much of a burden?

Apple's tile-based GPU refers to the way it renders images, not the way the physical chips are structured:

https://developer.apple.com/documentation/metal/tailor_your_apps_for_apple_gpus_and_tile-based_deferred_rendering
https://www.rastergrid.com/blog/gpu-tech/2021/07/gpu-architecture-types-explained/

AMD GPUs use physical chiplets but they use immediate mode rendering rather than tile-based rendering.

This means that GPU drivers have to accommodate each rendering architecture. If Apple writes the drivers, they have to maintain both.

They've already done this as Metal software runs ok on AMD GPUs. In the Blender Metal features, Apple added support for AMD and Intel:

https://developer.blender.org/T92212

They have an active install base over 100m who still have Intel/AMD and this will be the case for a few years.
Apple's Studio Display doesn't shine in the light of competition

Marvin

January 2023

mikethemartian said:

It doesn’t really matter since there are many Mac users who won’t use a display unless it is officially anointed by Apple.

Not just for the brand name and build quality, it works better with the Mac and iPad.

What would be nice to have is an option without the computing parts:

https://www.ifixit.com/News/58242/studio-display-teardown-is-this-secretly-an-imac
https://www.apple.com/studio-display/

It has an A13 Bionic chip with 64GB storage for Spatial Audio, Siri and Center Stage. A basic model (Studio Display SE) without compute chip and camera at $999 with the tilt stand included would be a nice option. The basic model doesn't even need speakers, all those parts are usually on the connected device anyway.
New Mac Pro may not support PCI-E GPUs

Marvin

January 2023

tht said:

cgWerks said:

tht said:
… The big issue, how do they get a class competitive GPU? Will be interesting to see what they do. The current GPU performance in the M2 Max is intriguing and if they can get 170k GB5 Metal scores with an M2 Ultra, that's probably enough. But, it's probably going to be something like 130k GB5 Metal. Perhaps they will crank the clocks up to ensure it is more performant than the Radeon Pro 6900X …

The devil is in the details. I’ve seen M1 Pro/Max do some fairly incredible things in certain 3D apps that match high-end AMD/Nvidia, while at the same time, there are things the top M1 Ultra fails at so miserably, it isn’t usable, and is bested by a low-mid-end AMD/Nvidia PC.

I suppose if they keep scaling everything up, they’ll kind of get there for the most part. But, remember the previous Mac Pro could have 4x or more of those fast GPUs. Most people don’t need that, so maybe they have no intention of going back there again. But, I hope they have some plan to be realistically competitive with more common mid-to-high end PCs with single GPUs. If they can’t even pull that off, they may as well just throw in the towel and abandon GPU-dependant professional markets.

If only they can keep scaling up. Scaling GPU compute performance with more GPU cores has been the Achilles heel of Apple Silicon. I bet not being able to scale GPU performance is the primary reason why the M1 Mac Pro was not shipped or got to validation stage. On a per core basis, 64 GPU cores in the M1 Ultra is performing at little over half half (GB5 Metal 1.5k points per core) of what a GPU core does in an 8 GPU core M1 (2.6k per core). It's basically half if you compare the Ultra to the A14 GPU core performance. And you can see the scaling efficiency get worse and worse when comparing 4, 8, 14, 16, 24, 32, 48 and 64 cores.

The GPU team inside Apple is not doing a good job with their predictions of performance. They have done a great job at the smartphone, tablet and even laptop level, but getting the GPU architecture to scale to desktops and workstations has been a failure. Apple was convinced that the Ultra and Extreme models would provide competitive GPU performance. This type of decision isn't based on some GPU lead blustering that this architecture would work. It should have been based on modeled chip simulations showing that it would work and what potential it would have. After that, a multi-billion decision would be made. So, something is up in the GPU architecture team inside Apple imo. Hopefully they will recover and fix the scaling by the time the M3 architecture ships. The M2 versions has improved GPU core scaling efficiency, but not quite enough to make a 144 GPU core model worthwhile, if the rumors of the Extreme model being canceled are true (I really hope not).

If the GPU scaling for the M1 Ultra was say 75% efficient, it would have scored about 125k in GB5 Metal. About the performance of a Radeon Pro 6800. An Extreme version with 128 GPU cores at 60% efficiency would be 200k in GB5 Metal. That's Nvidia 3080 territory, maybe even 3090. Both would have been suitable for a Mac Pro, but alas no. The devil is in the details. The Apple Silicon GPU team fucked up imo.

There are some tests where it scales better. Here they test 3DMark at 7:25:

Ultra gets 34k, Max gets 20k so 70% increase.

Later in the video at 20:00, they test Tensorflow Metal and Ultra GPU scales to as high as 93% faster than Max.

https://github.com/tlkh/tf-metal-experiments

Some GPU software won't scale well due to CPU holding it back, especially if it's Rosetta translated.

It's hard to tell if it's a hardware issue or software/OS/drivers, likely a combination. If the performance gain is more consistent with M2 Ultra and doesn't get fixed with the same software on M1 Ultra, it will be clearer that it's been a hardware issue. The good thing is they have software like Blender now that they can test against a 4090 and figure out what they need to do.

The strange part is that the CPU is scaling very well, almost double in most tests. The AMD W6800X Duo scales well for compute too and those separate GPU chips are connected together in a slower way than Apple's chip. There are the same kind of drops in some tests but OctaneX is near double:

https://barefeats.com/pro-w6800x-duo-other-gpus.html

As you say, the GPU scaling issue could have been the reason for not doing an M1 Extreme. M2 Ultra and the next Mac Pro will answer a lot of questions.

An Extreme model wouldn't have to be 4x Ultra chips, the edges on the Max chips could both be connected to a special GPU chip that only has GPU cores. Given that the Max is around 70W, they can do 4x Max GPU cores just on the extra chip (152) plus 76 on the others for 228 cores. This would be 90TFLOPs and would be needed to rival a 4090 and this extra chip can have hardware RT cores. This should scale better as 2/3 of the GPU cores would be on the same chip.
New Mac Pro may not support PCI-E GPUs

Marvin

January 2023

Hreb said:
The claim of "up to 76 cores" is interesting. Currently the macos kernel has a hard limit to 64 cores. Apple could obviously raise that, but I doubt they'd do that outside a major release of macos. But macos 14 could certainly have a higher core thread limit.

This is for GPU cores. CPU cores in M2 Ultra would be 24.

commentzilla said:

If this is true what is the point of a bigger Mac Studio Pro? I don’t see a justification.

Some people like to have lots of storage, Ultra only goes to 8TB. The Mac Pro currently has 32TB PCIe modules (4x 8TB drives):

https://www.apple.com/shop/product/HMUE2ZM/A/promise-pegasus-r4i-32tb-raid-mpx-module-for-mac-pro

There are 16TB drives now so probably 64TB per module and it can take two for 128TB storage.

There's also network cards for optical IO to be able to work with the Mac Pro remotely.

If people need the storage or IO capability and it's similarly priced to the Ultra, it's an easy purchase to make. Most people only buy the mid-range Mac Pros so it doesn't need to reach the highest performance level.

Mid-range is 16-core Xeon plus 15TFLOPs W6800X ($10k). The M2 Ultra will be faster than the 28-core Xeon and GPU will be close to 30TFLOPs for $5k.