Apple’s AI roadmap is no longer just about software. According to new reporting from Bloomberg’s Mark Gurman, the company is also preparing a hardware shift that would let its assistants “see” in a way they can’t today—starting with a rumored pair of AirPods equipped with cameras, and potentially followed by a second folding iPhone aimed at the same 2027-era window.
The timing matters. Gurman’s latest details come after Apple has moved past WWDC and the initial wave of AI announcements and platform updates. In other words, this isn’t speculation floating in a vacuum; it’s tied to internal testing schedules and the cadence of iOS releases that Apple uses to roll out major new capabilities. And if the rumors are right, Apple’s next leap won’t be limited to phones and tablets. It will extend into the most intimate computing device many people already wear every day: earbuds.
Camera-equipped AirPods: the “visual context” bet
The core of the rumor is straightforward but potentially transformative: Apple is working on AirPods with cameras mounted in their stems. The idea isn’t simply to add a novelty camera feature to a headset. Gurman describes a system designed to provide Siri with upgraded “visual context” about your surroundings—essentially giving the assistant more information than audio alone can provide.
That distinction is important. Today’s Siri improvements, even when powered by advanced AI models, still begin with what you say. If you ask for help finding something, identifying an object, or understanding what’s happening in front of you, the assistant can only infer so much from your description. A camera-equipped wearable changes the input channel. It turns Siri from a conversational assistant into something closer to an ambient interpreter of the world around you.
Gurman also adds a detail that signals Apple is thinking about user awareness and privacy boundaries: lights on the earbuds that indicate when data is being uploaded to the cloud. That kind of visible status indicator is exactly the sort of design element Apple tends to favor when it introduces sensors that could feel intrusive. It’s also a clue about how Apple may architect the experience. If the cameras are used for AI tasks that require heavy processing, the system likely relies on cloud connectivity at least part of the time. The lights would then serve as a real-time “you’re being analyzed” cue—something users can understand instantly without needing to dig into settings.
What “visual context” could actually look like in practice
It’s easy to say “Siri can see,” but the more interesting question is what Apple would do with that capability. Gurman’s framing suggests the goal is not constant surveillance or continuous recording. Instead, it points toward targeted assistance: moments where visual understanding unlocks a better answer.
Imagine asking Siri, “What is this?” while looking at a product label, a plant, a piece of equipment, or a sign you don’t recognize. With camera input, Siri could interpret the scene and respond with a direct identification rather than asking you to describe it. Or consider navigation and everyday problem-solving: “How do I set up this charger?” If the assistant can view the device and its ports, it can guide you through steps with far less back-and-forth.
There’s also a more subtle use case: reducing the cognitive load of explaining what you’re seeing. People often struggle to describe visual details accurately. Even when you know what you want, you might not have the vocabulary to communicate it. A camera-based assistant can translate your intent into a visual reference, then respond with instructions that match what’s actually in front of you.
This is where the “visual context” concept becomes more than a gimmick. It’s a shift in interaction design. Instead of treating the user’s voice as the primary source of truth, Apple would treat the environment as a shared context layer. Your speech becomes the request; the camera becomes the grounding.
And because the cameras are on earbuds, the context is tightly coupled to the user’s attention. You’re not holding a phone up to the world; you’re wearing a device that can capture what you’re looking at while you move through daily life. That could make the experience feel more natural than “open an app, point the camera, wait for results.”
The cloud upload signal: privacy theater or meaningful control?
The mention of lights indicating when data is being uploaded is one of the most telling parts of the rumor. Apple has long positioned itself as privacy-forward, but privacy isn’t just about encryption and policy—it’s also about transparency. Users need to know when sensors are active and when their data leaves the device.
If these AirPods include cameras, the biggest trust hurdle won’t be whether Apple can technically protect data. It will be whether users feel confident that the system is behaving as expected. A visible upload indicator is a practical solution to that problem. It gives immediate feedback: the system is doing something beyond local processing.
Still, there’s a nuance worth watching. A light that indicates cloud upload doesn’t necessarily tell you what’s being captured, how long it’s stored, or whether the camera is always on versus triggered by specific actions. Apple could implement the feature in a way that’s respectful and minimal—only capturing when needed, only uploading when required, and only retaining data under strict controls. Or it could implement it in a way that feels more ambiguous to users.
The rumor doesn’t settle those questions. But the fact that Apple is reportedly planning a visible indicator suggests it understands the optics and the user experience implications. In a world where wearables increasingly blur the line between helpful and creepy, transparency cues are becoming as important as the underlying AI.
Testing timelines: iOS 27 betas now, iOS 28 internally
Gurman’s update includes a schedule detail that makes the rumor feel more grounded. While Apple is currently working through beta releases for iOS 27, the camera AirPods are reportedly being tested internally with iOS 28.
That implies Apple is treating this as a platform-level feature, not just a hardware add-on. For camera-based AI assistance to work reliably, it needs tight integration with system frameworks: permissions, sensor management, audio routing, background behavior, and the assistant’s orchestration logic. It also needs a consistent user interface for consent and status indicators.
In other words, the earbuds aren’t just “a new accessory.” They’re likely part of a broader ecosystem update where iOS 28 provides the necessary plumbing for camera input, AI request handling, and user-facing controls.
This also hints at why the launch window is late 2027. Apple typically doesn’t rush hardware that requires deep software integration and careful privacy design. If the feature depends on new OS capabilities, Apple has to align hardware readiness with the software release cycle. That alignment takes time—especially when the experience must feel seamless and safe.
Why AirPods are the right (and risky) place to start
From a product strategy perspective, AirPods are a clever starting point. They’re already a mainstream Apple category, they’re worn constantly, and they’re socially acceptable in a way that camera glasses might not be. Earbuds also have a built-in “always with you” advantage. If Apple wants to make AI assistance feel ambient, it needs a device that’s present during everyday moments—not something you pull out only when you remember to.
But there’s also risk. Cameras on earbuds raise different concerns than cameras on phones. With a phone, users control the camera angle and can clearly see what’s being recorded. With earbuds, the camera’s perspective is less obvious to bystanders and sometimes less obvious to the wearer too. That’s why the upload indicator matters, and why Apple would likely need robust permission controls and clear user feedback.
There’s also the question of social acceptance. Even if Apple designs the feature responsibly, people may still react strongly to the idea of being near someone wearing camera-equipped earbuds. Apple will likely need to consider how the device communicates its state to others, not just to the user. The lights could serve that purpose, but the effectiveness will depend on visibility, brightness, and placement.
A second folding iPhone: the 2027-era companion move
Alongside the camera AirPods rumor, Gurman also mentions a “second folding iPhone” as part of Apple’s next wave of products, with 2027 as the target window.
This is notable because it suggests Apple may be pursuing two parallel strategies at once. On one track, it’s expanding AI capabilities into new interaction modes—potentially using camera-equipped wearables to provide richer context. On another track, it’s continuing to explore form factors that can change how people consume content and interact with apps.
A second folding iPhone could be interpreted as Apple refining its approach after learning from the first generation. Folding devices are notoriously difficult to get right: durability, crease visibility, battery life, software optimization, and app behavior all matter. If Apple is planning a second foldable, it likely believes it can improve the experience enough to justify the category’s complexity.
But there’s also a deeper connection to the AI story. Foldables can support more flexible layouts, which can make AI features feel more integrated. If Apple’s AI assistant is going to rely on visual context, a larger screen and adaptable UI could help present results in a way that feels natural—think side-by-side guidance, contextual overlays, or interactive explanations that respond to what the user is looking at.
Even if the folding iPhone rumor is separate from the AirPods rumor, the timing suggests Apple may be building a coordinated ecosystem where AI features are distributed across devices: earbuds for capture and grounding, phones for display and control, and iOS for orchestration.
A unique take: Apple’s next interface might be “context-first,” not “command-first”
Most AI assistants today are still command-first. You ask a question, the assistant responds. Even when the assistant is “smart,” the interaction model often remains: user prompts, AI answers.
If camera-equipped AirPods arrive as rumored, Apple’s next interface could become context-first. The assistant would start from what it can observe—your environment, your objects, your situation
