A Cloud Collaboration for the Future of AI
CoreWeave Partners with Perplexity to Supercharge AI Inference Workloads
CoreWeave and Perplexity are teaming up on AI inference. Under a multi‑year agreement, CoreWeave will power Perplexity's inference workloads using NVIDIA GB200 NVL72 clusters. The collaboration aims to strengthen Perplexity's API ecosystem, with CoreWeave's AI‑first cloud infrastructure supporting its growth and scalability.
Introduction to CoreWeave and Perplexity Partnership
Significance of AI Inference in Modern Cloud Computing
AI inference is central to modern cloud computing because it is the step that brings data‑driven decision‑making into real‑time applications. Inference means running a pre‑trained AI model on new data to make predictions or generate outputs, which matters most where responses must be immediate, as in virtual assistants or recommendation systems. The partnership between CoreWeave and Perplexity underscores how much the AI ecosystem depends on high‑performance inference. By running on dedicated NVIDIA GB200 NVL72‑powered clusters, the companies aim for the speed and scalability that high‑demand services such as Perplexity's Sonar and Search APIs need: low latency and consistent performance [1].
Overview of NVIDIA GB200 NVL72‑powered Clusters
Role of Perplexity Enterprise Max in CoreWeave's Strategy
Initial Deployment Technologies: Kubernetes and W&B Models
Comparing CoreWeave with AWS and Azure
Perplexity's Multi‑Cloud Strategy and Its Benefits
Key Personalities and Their Views
Market and Financial Implications of the Partnership
Public Reactions to the Strategic Partnership
Future Trends and Predictions in AI Cloud Infrastructure
Sources