Perplexity has announced the next step for its Personal Computer platform: the first hybrid local-server inference orchestrator, designed to automatically route AI workloads between a user’s local device and cloud-based agents based on the nature of each task. The feature, developed in partnership with Intel and compatible with NVIDIA’s RTX Spark and other local silicon, is set to arrive in July and represents what Perplexity describes as the first product to make hybrid AI a practical reality rather than an industry ambition.
The core challenge the orchestrator addresses is a three-way tension at the heart of modern AI systems: accuracy demands the most capable frontier models, which are expensive to run in the cloud; privacy demands that some data never leave a user’s device; and cost and energy efficiency demand that routine tasks not consume frontier-model compute.
Perplexity’s orchestrator solves this by reasoning automatically about which work should run locally and which should go to cloud-based agents, task by task, without requiring users to make that decision themselves. A compact model runs locally to detect when sensitive data — such as financial records, health information, or personal files — should remain on-device, while work requiring a frontier model’s full capability routes to the server.
The announcement carries broader implications for the AI infrastructure landscape. As local chips advance in their inference capability, Perplexity said sensitive and routine workloads moving onto devices people already own could meaningfully reduce demand for centralized data center infrastructure. It also reframes data sovereignty: important data can remain within its jurisdiction without requiring a government or organization to build dedicated data center capacity to achieve it. Perplexity framed the architecture as a natural extension of its core business model — one centered on accurate AI rather than maximizing tokens sold — arguing that optimizing value per watt rather than compute consumption puts its incentives in alignment with users.

