Apple to Position On-Device AI as Key WWDC 2026 Focus
Apple plans to present on-device AI as a competitive advantage at WWDC 2026, drawing on 15 years of custom silicon work to favor local processing over cloud alternatives. The strategy involves a Google Gemini partnership, potential acquisitions, and a shift away from prior Private Cloud Compute commitments as the company seeks to reframe its Apple Intelligence efforts after a troubled rollout.

According to people familiar with Apple's plans who spoke to The Information, the company is expected to showcase how the chips it designs for iPhones, Apple Watches, and Macs give it an edge in processing AI queries directly on devices. While cloud-based processing will remain necessary for complex queries, Apple will position local inference as a privacy-preserving, cost-saving alternative to the massive data center buildouts its rivals have pursued.
As part of its agreement with Google, Apple is set to use a large version of Google's Gemini model to train a smaller, distilled version capable of running locally on Apple hardware. Apple is also scouting acquisitions to help advance its model-shrinking work. One company it has considered is Liquid AI, a Massachusetts startup focused on running AI locally on devices.
Some queries will still require cloud processing. Apple has approved the use of Nvidia's confidential compute technology within Google Cloud to handle processing of the larger Gemini-based model. The security feature encrypts data and AI models during processing, adding a modest performance cost but offering stronger privacy protections.
The arrangement represents a noticeable departure from Apple's original Apple Intelligence announcement, in which the company said all cloud-bound queries would be handled exclusively by its own Private Cloud Compute infrastructure running on Apple silicon. Apple is likely to retain the Private Cloud Compute branding despite the change.
There are material limits to how far Apple can push on-device processing. Google's full Gemini model runs into the trillions of parameters, and Apple has struggled to run it on its own Private Cloud Compute infrastructure, which uses the same Apple silicon chips found in Mac computers.
Apple Intelligence was first announced at WWDC 2024, but the rollout has been hampered by a tepid response to initial features and a protracted delay to the more personal version of Siri. The company is now expected to use WWDC 2026, which begins on June 8, to reframe the narrative, reintroduce the delayed features, and debut new ones.