June 03, 2026
Data Platforms Are AI Platforms: Beyond Vendor Messaging
Native AI on Snowflake, Databricks, Fabric and BigQuery has matured quickly. The 2026 architecture question: which workloads belong on the platform and which need inference closer to the data?
The data platform is the AI platform. That label held up through 2025 and continues to strengthen in 2026. The harder questions are: Where does the platform actually run? And is your architecture designed to benefit the vendor?
Snowflake Cortex, Databricks Mosaic AI, Microsoft Fabric and BigQuery have moved beyond basic LLM functionality into agent primitives, document AI, model serving and vector search at production scale. For most enterprise workloads, the platform is where AI happens.
But for the vast amount of enterprise data that lives outside structured tables, the platform isn’t always where AI runs.
That distinction is central to architecture and FinOps decisions in 2026.
The Platform AI Story Keeps Getting Better
A year ago, native AI on the data platforms felt like a hedge against the broader AI tooling ecosystem. Today it’s the default.
Cortex now spans embedded LLM functions, Document AI for unstructured extraction, Cortex Search for hybrid retrieval and agent frameworks operating over governed data. Databricks has pushed Mosaic AI into foundation model fine-tuning, custom model serving with GPU-backed compute and Vector Search at scale. Unity Catalog and Snowflake Horizon extend governance to AI assets directly.
For analytical workloads, copilots over governed data and standard retrieval-augmented generation (RAG) patterns, the platform-native approach wins on time-to-value. Governance, observability and lineage come built in. The skills your team already has translate. Inference costs stay predictable at expected volumes.
We see this work consistently across our practice. When the use case fits the platform, the platform earns the workload.
Unstructured at Scale Changes the Math
Three patterns push past what platform-native economics handle well.
- Scale of the unstructured corpus: Generating embeddings for hundreds of millions of documents, processing video archives or running multimodal extraction across decades of content tips the cost curve. Platform inference is priced for steady-state retrieval, not bulk processing of dormant data.
- Sovereignty and sensitivity: Regulated content in healthcare, defense and financial services often can’t move into a multi-tenant managed service, even one with strong logical isolation. Logical isolation might satisfy the engineering team. Whether it satisfies the auditor is a different conversation.
- Latency at the edge: Manufacturing floors, retail stores and field operations need inference where the data is generated. Cloud round trips don’t meet response time requirements and bandwidth back to the platform is often the constraint.
In these scenarios, inference moves closer to the data. The platform still plays a role. The footprint just gets bigger.
The Platforms Are Part of the Hybrid Answer
Both Snowflake and Databricks have been investing in exactly this extension.
Snowpark Container Services lets workloads run inside Snowflake’s compute fabric, which means custom models and inference services can operate against governed data without exporting it. Snowflake’s external table and Iceberg support extend governance to data that lives outside the platform. The footprint is no longer bounded by Snowflake-managed storage.
Databricks has gone further on serving flexibility. Custom model deployment on GPU-backed compute, foundation model fine-tuning and Unity Catalog governance over models, vector indexes and feature stores. Lakehouse Federation pulls remote data into the same governance model. Mosaic AI is built to run wherever the workload needs to live.
The pattern across both is consistent. The data platform stays the system of record and the governance plane. Inference happens where the workload demands. The platforms are defining what hybrid architecture looks like, not retreating from it.
The On-premises Side of the Picture
When workloads run on dedicated infrastructure, the requirements are well understood: accelerated computing platforms purpose-built for inference, enterprise inference stacks that expose consistent endpoints across cloud and on premises, reference architectures from major OEMs that combine compute, networking and storage for AI workloads at scale and model customization frameworks that allow tuning without rebuilding the stack.
What ties this back to the data platform is the connection model. Models are served as microservices behind consistent endpoints, callable from Cortex or Mosaic AI. Embeddings are stored and governed centrally. The semantic layer doesn’t care where inference physically runs. Observability and FinOps span both the cloud platform and on-premises capacity as a single operational picture.
This is where CDW’s role gets practical. Hybrid architecture is a delivery discipline across data platforms, accelerated infrastructure, OEM hardware, networking and security. The challenge most enterprises face isn’t choosing the components. It’s making them function as one system the data platform can govern end to end.
Practical Guidance
For CIOs and CTOs evaluating the architecture, the move is to draw the line consciously.
Map AI workloads to three categories: structured analytics, copilots over governed data and standard RAG up to a few million documents stay on the platform. Bulk unstructured processing, regulated content and edge inference get evaluated for placement closer to the data. Custom model serving with specific performance or compliance requirements falls into the same review.
Then look at the boundary cases honestly. Where is your unstructured corpus going? If you’re activating 80% of your data, inference economics matter. If you’re piloting RAG on a quarterly report, they don’t.
Platform AI is right for most things. On-premises inference is right for specific things. The architecture that works spans both without forcing false trade-offs in either direction.
The Foundation Is Still the Foundation
AI capabilities are only as good as the data feeding them. Governance, semantic context and observability determine whether the inference produces something useful or something confidently wrong.
What’s changing in 2026 is where the inference happens. Right fit architecture now extends to right fit inference. The foundation question doesn’t move. The work of making both sides function as one architecture is what separates AI that scales from AI that stalls.
Learn how to align AI workloads with the right mix of data platform and infrastructure.
Rex Washburn
Chief Architect and Head of Engineering – Data