
In a decisive move that reshapes the competitive landscape of artificial intelligence infrastructure, Microsoft has secured a three-year, $750 million agreement with AI search startup Perplexity. The deal, finalized late last week, sees the "answer engine" unicorn committing to Microsoft Azure for a significant portion of its cloud computing needs.
This partnership arrives simultaneously with Microsoft’s unveiling of the Maia 200, a next-generation custom AI chip designed specifically for inference workloads. The confluence of these two events signals a strategic pivot in Redmond: Microsoft is not merely funding the AI revolution; it is vertically integrating the hardware and software required to sustain it, effectively challenging the dominance of Nvidia and Amazon Web Services (AWS) in one stroke.
The agreement marks a critical diversification for Perplexity, a company that had previously described itself as "all-in" on AWS. While Perplexity maintains that AWS remains a preferred partner, the magnitude of the Microsoft deal—three-quarters of a billion dollars over 36 months—suggests a substantial shift in operational gravity.
Under the terms of the deal, Perplexity will leverage Microsoft Foundry, a service that provides unified access to a suite of frontier models. This includes not only OpenAI’s GPT-series but also models from competitors like Anthropic and xAI, all hosted on Azure’s infrastructure. For Perplexity, whose product relies on synthesizing answers from multiple models to improve accuracy and reduce hallucinations, this flexibility is paramount.
The timing is particularly notable given the cooling relationship between Perplexity and Amazon. Following a lawsuit filed by Amazon regarding Perplexity’s "Buy with Pro" feature, the startup’s move to secure capacity with Microsoft serves as both a strategic hedge and a declaration of independence. By locking in compute capacity with Azure, Perplexity ensures it can scale its query volume—which has exploded to over 100 million per week—without being beholden to a single infrastructure provider.
While the headline figure is the $750 million contract, the underlying technological enabler is Microsoft’s new silicon. The Maia 200 accelerator is an Application-Specific Integrated Circuit (ASIC) engineered specifically for AI inference—the process of running a model to generate answers, rather than training it.
Inference is the primary cost center for user-facing AI products like Perplexity. Every time a user asks a question, the system must process vast amounts of data in real time. Standard GPUs are often overkill or power-inefficient for these specific tasks. The Maia 200, built on a 3-nanometer process by TSMC, claims to offer 30% better performance per dollar compared to existing commercial solutions.
For a client like Perplexity, access to Maia 200 instances on Azure could theoretically lower the "cost per query," a metric that currently challenges the unit economics of AI search engines compared to traditional keyword search.
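The "cost per query" arithmetic can be made concrete with a back-of-envelope sketch. All figures below are illustrative assumptions (token counts and per-token pricing are not disclosed in the deal); only the 30% performance-per-dollar claim and the 100-million-weekly-query volume come from the article itself.

```python
# Hypothetical back-of-envelope model of "cost per query" for an AI answer
# engine. Every number here is an assumption for illustration, not disclosed
# Perplexity or Azure pricing.

def cost_per_query(tokens_per_query: int, cost_per_million_tokens: float) -> float:
    """Blended inference cost of answering a single query, in dollars."""
    return tokens_per_query / 1_000_000 * cost_per_million_tokens

# Assumed workload: ~3,000 tokens per answer (prompt + retrieval context +
# generated text) at an assumed blended rate of $1.00 per million tokens.
baseline = cost_per_query(3_000, 1.00)

# Hardware claiming 30% better performance per dollar cuts the effective
# token price to roughly 1/1.3 of the baseline.
improved = cost_per_query(3_000, 1.00 / 1.3)

weekly_queries = 100_000_000  # weekly query volume cited in the article
savings = (baseline - improved) * weekly_queries
print(f"baseline ${baseline:.6f}/query, improved ${improved:.6f}/query")
print(f"weekly savings at 100M queries: ${savings:,.0f}")
```

Even under these toy assumptions, a 30% efficiency gain compounds quickly at search-engine scale, which is why custom inference silicon matters more to a high-volume consumer product than peak training throughput does.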
The deal underscores the intensifying battle between the "Big Three" cloud providers to capture high-growth AI startups. By securing Perplexity, Microsoft denies its rivals exclusive claim to one of the few consumer AI apps with true mass-market traction.
The following table illustrates the current alignment of major cloud providers with key AI players and their hardware strategies:
| **Cloud Provider** | **Primary AI Partners** | **Strategic Hardware Focus** |
|---|---|---|
| Microsoft Azure | OpenAI, Perplexity, Mistral, xAI | Maia Series: Focused on inference cost-reduction and reducing Nvidia dependency. |
| Amazon Web Services (AWS) | Anthropic, Hugging Face, Cohere | Trainium & Inferentia: Mature custom silicon aimed at training and inference. |
| Google Cloud | DeepMind (Internal), Anthropic | TPU (Tensor Processing Unit): The longest-running custom AI silicon project. |
This table highlights a critical trend: partnerships are no longer just about credits; they are about hardware alignment. AWS has long touted its Trainium chips to partners like Anthropic. Now, Microsoft is using the Maia 200 as a lure to show cost-conscious startups that Azure is the most economical place to run their businesses at scale.
Perplexity’s valuation has skyrocketed to nearly $9 billion, driven by its promise to replace the "ten blue links" of traditional search with direct, cited answers. However, this model is computationally expensive. Generating a paragraph of text requires orders of magnitude more energy and processing power than retrieving a database row.
To sustain its growth, Perplexity must solve the latency and cost equation. The Microsoft deal provides a pathway to do both. By utilizing Azure’s global data center footprint and potentially the Maia 200’s inference capabilities, Perplexity can push its "Pro" features—such as deep research and file analysis—to more users without eroding its margins.
Furthermore, access to Microsoft Foundry simplifies the model orchestration layer. Instead of managing separate API integrations for GPT-4, Claude 3, and Grok, Perplexity can route these requests through Azure’s managed service, improving reliability and reducing engineering overhead.
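The orchestration pattern described above can be sketched as a single client routing to many models behind one endpoint. The class, endpoint URL, and method names below are hypothetical stand-ins, not the actual Microsoft Foundry SDK; the point is the shape of the abstraction, with the network call stubbed out so the sketch stays runnable.

```python
# Minimal sketch of unified model routing: one credential and one base URL
# replace separate per-vendor API integrations. Names are hypothetical, not
# a real SDK.

from dataclasses import dataclass


@dataclass
class ModelResponse:
    model: str
    text: str


class UnifiedModelClient:
    """Routes completion requests for any hosted model through one managed
    endpoint, rather than maintaining a distinct integration per vendor."""

    def __init__(self, base_url: str, api_key: str):
        self.base_url = base_url
        self.api_key = api_key

    def complete(self, model: str, prompt: str) -> ModelResponse:
        # A real deployment would issue one authenticated HTTPS call here;
        # the transport is stubbed to keep the example self-contained.
        return ModelResponse(model=model, text=f"[{model}] answer to: {prompt}")


client = UnifiedModelClient("https://example-foundry-endpoint", api_key="...")
# The same call shape works regardless of which vendor's model is requested.
for model_name in ("gpt-4", "claude-3", "grok"):
    print(client.complete(model_name, "Who won the 2022 World Cup?").text)
```

The engineering win is that model choice becomes a string parameter rather than a separate codepath, which is what makes multi-model synthesis cheap to maintain.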
The $750 million deal sends ripples through the broader AI ecosystem.
For Microsoft investors, the deal is a welcome signal that the company’s massive capital expenditure (CapEx) on AI infrastructure is yielding revenue-generating contracts. Concerns had been mounting regarding the return on investment for the billions spent on data centers. A $750 million commitment from a single startup helps validate the demand side of the equation.
Moreover, the introduction of Maia 200 addresses the "margin compression" fear. If Microsoft can serve AI workloads on its own chips rather than expensive third-party GPUs, its gross margins on AI services will improve over time.
Microsoft’s $750 million agreement with Perplexity is more than a standard vendor contract; it is a strategic maneuver that reinforces Azure’s position as the operating system for the AI age. By combining massive capital allocation with the deployment of the Maia 200 chip, Microsoft is building a vertically integrated fortress that appeals to the next generation of tech giants. For Perplexity, the deal offers the computational firepower needed to continue its assault on traditional search, securing its future in an increasingly resource-constrained digital economy.