Advertisement
Advertisement
⚡ Community Insights
Discussion Sentiment
84% Positive
Analyzed from 4237 words in the discussion.
Trending Topics
#apple#pro#local#more#ram#ultra#max#don#chip#mac
Discussion Sentiment
Analyzed from 4237 words in the discussion.
Trending Topics
Discussion (141 Comments)Read Original on HackerNews
That means they're actually incentivized at least short term, to benefit PCs becoming strong enough to do local LLMs. Which makes this play make even more sense. Though, I've been saying for a while that the local AI inflectiom point is the death knell for these frontier labs.
"Death knell" is a touch hyperbolic. Hardware that can only run quantized models that take up GBs in VRAM falls short of even an A100 (by almost an order of magnitude[0]), which in turn falls short of what an 8xH100 cluster can do (also by another order of magnitude[0]).
I'm an avid believer in local LLMs, but I cannot deceive myself - data center accelerators will win on power dissipation numbers alone[1], even when giving generous allowances for higher efficiency on Apple chips - and assuming the Apple-efficiency advantage persists on the same TSMC process node.
0. Based on my unscientific fine-tuning training experiments across local and rented GPUs. YMMV for inference.
1. Unless Apple surprises everyone and brings back the XServe with M7, if not, then laptop and desktop for factors simply can't dump heat fast enough to compete head-to-head, and will be designed for lower input wattage.
The frontier models are faster, and better at coding, but not so much that i’ll pay $200/month for them.
At some point there will be diminishing returns towards the "just throw more RAM at it" approach the current frontier models are taking. Commoditization is just as inevitable as it ever was... and in doing so will enable actual leaps of what AI/ML is capable of. That's not to say there won't be a place for 99.999999% accurate vs 99.99999% but those cases will be limited and likely prime to disruption based on real innovation vs access to capital.
Enjoy paying $1000 or more for a little 4 GiB cloud terminal that connects you to all your online accounts where all your actual work gets done. This is the future.
We need one of those specialized inference chip startups to succeed and a PC manufacturer willing to bet on them against Nvidia for the local AI to find mass market appeal.
Apple has pretty good competition in every segment with the exception of maybe the iPad, but I'm not a tablet user.
M1 had 70 GB/s, M1 Pro: 200, M1 Max 400, M1 Ultra 800.
Modern RTX 6000: ~1,600 or so.
If we get a 1,200-1,500 GB/s bandwidth M7 variant in late 2027 with 512GB of RAM, that will be a very interesting chip. Tracking LLM size and performance improvements, I can imagine that being a sort of inflection point for local inference. I wonder what the power budget would be in desktop format.
You're look at about 100 tokens/s for a 1T MoE 37B active 4bit model.
It'd probably cost $30k or more I'm guessing if memory prices do not come down. Even at $30k, it could still be a relative bargain since an RTX Pro 6000 Blackwell 96GB card costs $12k today. The M3 Ultra with 512GB was around $8k before Apple discontinued it. I expect an M7 Ultra to have 768GB or 1024GB.
Apple Silicon Macs were on their way to becoming cheap local LLM machines relative to professional GPUs before this memory crisis. It may still emerge as such in a few years.
Here's some interesting math: At 512GB, an Ultra chip could make 42 pro iPhones. Assume a 55% profit margins, and $1200 ASP, you're looking at $28,160 in profit from making iPhones instead. No wonder Apple discontinued the M3 Ultra 512GB. If they only have a limited supply of RAM for all their products, it makes no sense to produce an $8000 M3 Ultra 512GB when you can produce 42 pro iPhones. You can only configure an M3 Ultra up to 96GB today as of June 2026.
Apple would have to raise the price of a 512GB Ultra Mac to around $50k to match iPhone profits.
How would that work? They purchase 512GB from Samsung and then it doesn't matter if that's like 128x 4GB or 4x 128GB?
Edit: for those of you downvoting I don’t celebrate this prospect. I’m merely realistic about where things are going given the rapid vibe shift from the administration on AI since the start of June.
(This is assuming Apple will deliver, but this area is one of the biggest ones they have in AI, and they need the developer ecosystem to exist and survive)
Maybe this strategy works, even in that world.
Remember when we all thought (were told we thought) the world was heading to 3D views of our 2D lived experience like a solid Cube of GUI we could rotate around and live inside? Well Apple took the simple 2D square pane of virtual desktops and .. made it a SONY strip. One variable: sideways.
So here we are being told AI is the future. Apple seems to be saying "yes but it will run local" which might be a safe bet if AI comes true but I wonder how many of us want the AI outcome, which is morally speaking the 3D immersive GUI cube here: what if we don't want that?
So I think Apple has the right instinct. In fact, I've had the thought multiple times that I really want a lot of workflows just running on my device. Workflows like fast vector search (already fast on the m4, but I want it more common place), or realtime transcription and summarization to be even faster, on device, etc.
there isn't a future where we all just decide that nah, we don't want AI anymore. usefuly things don't disappear.
I think reducing the die area dedicated to ai stuff is not going to be a problem.
And in fairness apple already has essentially ai-less hardware in the form of the MacBook neo and it’s been an astonishing success.
I have one and it’s a very good laptop, particularly for the price i paid it.
Do we have a choice? It's being forced upon us by folks who have the power to distort any market they want. Energy prices are rising, and the PC industry is about to be destroyed by component prices. It will be dumb clients that run the software our feudal overlords of the data centers will have the grace to grant us. And the government lets it happen because it furthers their interests.
If they are pulling out all the stops to make the M7 more competitive.. guess I can wait for that?
It's not simply marketing since the Pro/Max chips of a generation use the same cores as the regular version, just more of them or different combinations of performance and efficiency cores.
The claim is that M6 will be released, but the only variants will be lower end.
When they get to the M7 generation, they will make high end variants.
It's a real distinction because each generation of parts shares an architecture.
The article has an entire section speculating what the M6 parts will be, but says they'll top out around 200GB/s memory bandwidth and 12 graphics cores.
Why would it? Each generation of the M series has an architectural improvement on their chipsets. The difference between an M1 and an M1 Pro is the allocation and arrangement not the architecture. M6 to M7 presumably will have architectural changes.
Or did this announcement also add an M6 chip, and they're just skipping pro?
It can still be a very real, not made-up distinction, if the actual facts on the ground are that Apple designed an M6 line, but then scrapped that design and asked the team to create a new design with emphasis on AI-focused specs.
It's not the name that's important (the M7 could still come out as M6), is them skipping a design, or cpu "Tick-Tock model" step.
It's the same thing as how the Mac Studio got an M4 Max refresh, but they didn't make an M4 Ultra so if you want the 28+ core CPU or 60+ core GPU, that's still using an M3 Ultra.
This time it'll be across all the Pro, Max, and Ultra versions, if you want those they'll stay at the previous generation for the M6 cycle.
Not that weird - Apple has a huge set of chips and hardware and software products. Putting every single thing on a fixed identical update cycle together won't always make sense.
Are you thinking Apple is leaking that there will be a long wait for much more expensive chips in order to… what?
https://bontechlabs.com/news/apple-is-reportedly-using-intel...
Given the risks involved in establishing Apple Silicon designs with a new fab, I would expect early M7 parts to be in test production right now.
The fundamental M7 design is already set in stone.
Mark Gurman's Bloomberg article does not mention fabrication partners or processes.
I was holding out for one until I decided to switch from an M1 Pro 16" MBP to an M5 Air 15" due to the expected price increase. I think many M1 Pro/Max generation people were waiting to upgrade this year.
They can release a redesigned MBP with the base M6 chip.
They don't want to tell the world how the new redesigned MBP is the best laptop in the world but it's slower than the older MBPs.
> What sets the A20 apart isn’t just the node shrink—it’s the revolution in packaging. Apple is transitioning to Wafer-Level Multi-Chip Module (WLCM) integration, meaning that RAM will no longer be situated beside the chip, but rather on the chip wafer itself, integrated alongside the CPU, GPU, and Neural Engine.
This shift eliminates the need for silicon interposers and substrates, thereby enhancing signal integrity, improving thermal dissipation, and facilitating faster memory access with lower latency. The benefits? Better multitasking, smoother AI processing (hello, Apple Intelligence), improved battery life, and potentially a smaller chip footprint—freeing up space for other components.
https://hwbusters.com/news/apples-a20-chip-ushers-in-a-new-e...
It's entirely possible that TSMC is ramping up more slowly than expected.
And their explanation isn't really passing the smell test for me for other reasons, for instance the fact that DRAM processes are pretty radically different than bulk logic processes, which wouldn't really let you put it all on the same wafer, much less the same die. Even back in the day when you had eDRAM blocks (like the Xbox 360's eDRAM die), that was really a DRAM process with a bit of logic cells that wouldn't be competitive if they weren't sitting right next to the DRAM blocks.
I could be wrong here though, my examples are more than a bit long in the tooth.
> CoWoS (Chip-on-Wafer-on-Substrate)
https://semiwiki.com/wikis/industry-wikis/cowos-chip-on-wafe...
It's a more advanced update from their older InFO tech.
As someone who wants to run effective llms locally for many things their other big benefit has been the unified memory studios for a small bit.
hyperscalers better all IPO in the next 8 quarters
some kind of private-public partnership
sorry if thats already happening in some capacity, like i said - "stupid question"
I wonder how much the rumored 768GB RAM version will cost.
But in terms of “noticing it” you are correct. You won’t pay attention after a day or two.
EDIT: this menu managing app will need permissios to make screen captures. So much for the privacy. Forgot to mention.
They need to pull out of this half assed bandwagon approach.
They don't need to pull out of this approach.
Do you really think the average Apple user will use it when there’s already better AI provided by OpenAI and Anthropic which don’t require advanced local hardware?
I guess it should be https://www.bloomberg.com/news/articles/2026-06-25/apple-to-...
EDIT: gift link if paywalled (archive.is capture is truncated): https://www.bloomberg.com/news/articles/2026-06-25/apple-to-...