Databricks Co-Founder Explains What Kills Enterprise AI Deals at TechCrunch Disrupt 2026

At TechCrunch Disrupt 2026, Databricks co-founder brought the conversation back to a point that many enterprise buyers have been circling for months, but rarely say out loud in plain terms: the enterprise AI deal is no longer primarily about whether the technology is impressive. It’s about whether it can be trusted—reliably, safely, and repeatedly—once it leaves the controlled environment of demos, pilots, and proof-of-concepts.

That shift changes everything about how deals get evaluated, negotiated, and ultimately approved or killed. In the early wave of enterprise AI, the question was often framed as excitement: Can this model generate better answers? Can it summarize faster? Can it automate a workflow that used to take hours? Those questions still matter, but they’re increasingly table stakes. The real gate now is deployment readiness—whether an organization can roll AI out broadly without creating new operational risk, compliance exposure, or reputational damage.

The co-founder’s core message at Disrupt was simple: enterprise AI is entering a different phase. Enterprises are moving from “Is this exciting?” to “Is it safe to deploy broadly?” And when that second question becomes the dominant one, the reasons deals fail become more specific, more technical, and more governance-heavy than many vendors expect.

What kills deals in this phase isn’t usually a lack of capability. It’s uncertainty.

When buyers can’t clearly predict how an AI system will behave across the messy reality of production—different users, edge cases, changing data, ambiguous inputs, shifting business rules—they hesitate. That hesitation doesn’t show up as a single objection. It shows up as a pattern: procurement slows down, legal asks for additional assurances, security requests more documentation, and stakeholders demand evidence that goes beyond benchmarks.

In other words, the deal dies not because AI can’t do something impressive, but because the enterprise can’t confidently answer the question: what happens when it’s wrong?

And “wrong” in enterprise AI isn’t just a quality issue. It can be a safety issue, a compliance issue, a financial issue, or a customer trust issue. A model that hallucinates a policy citation might be annoying in a consumer app. In an enterprise context, it can trigger incorrect decisions, improper disclosures, or downstream failures that are expensive to unwind.

That’s why the evaluation criteria are changing. Buyers are increasingly treating AI systems like critical infrastructure rather than like experimental software.

A useful way to think about the new buying process is that enterprises are now evaluating AI through three lenses at once: reliability, risk, and operational control. Each lens has its own set of questions, and each question can become a deal blocker if the vendor can’t provide credible answers.

Reliability: not just accuracy, but consistency under pressure

Reliability used to mean “Does it work on our test set?” Now it means “Does it behave consistently when conditions change?”

Enterprises are learning that model performance isn’t static. It shifts with data drift, with user behavior, with prompt patterns, with retrieval quality, and with the evolving context of the business itself. Even if a model looks strong during a pilot, the pilot may not expose the full range of real-world inputs. Production introduces noise: incomplete information, conflicting documents, ambiguous requests, and adversarial or accidental misuse.

So buyers start asking for evidence that the system can handle variability. They want to know how the system behaves when it doesn’t know. They want guardrails that prevent confident nonsense. They want measurable thresholds for when the system should refuse, escalate, or route to a human.

This is where many deals stall. Vendors can often demonstrate impressive outputs. But enterprises want to see how the system fails—and whether those failure modes are acceptable.

Risk management: the enterprise version of “trust, but verify”

Risk management is where the conversation becomes less about model quality and more about governance.

Enterprises are increasingly concerned with the full lifecycle of AI: training and fine-tuning practices, data provenance, access controls, logging, monitoring, incident response, and auditability. They also care about how the system interacts with sensitive data and whether it can leak information, infer private attributes, or produce outputs that violate internal policies.

In the early days, some organizations treated these concerns as compliance checkboxes. Now they’re treated as operational requirements. If a system can’t be monitored effectively, it can’t be governed effectively. If it can’t be audited, it can’t be defended. If it can’t be rolled back or constrained, it can’t be safely deployed broadly.

The co-founder’s framing points to a key reality: enterprises don’t just want AI that works. They want AI that can be managed.

Confidence in behavior: understanding the boundaries

Confidence is the missing ingredient in many enterprise AI discussions. Buyers want to know not only what the model can do, but what it won’t do reliably.

This is where “safety” becomes practical rather than philosophical. Safety isn’t only about preventing harmful content. It’s also about ensuring the system’s outputs are grounded in the right context, that it respects enterprise constraints, and that it can be tuned to the organization’s tolerance for error.

Confidence also includes transparency. Enterprises want to understand why the system produced a particular output. They want traceability—especially when AI is used to support decisions that affect customers, employees, or regulated processes.

When vendors can’t provide that level of visibility, enterprises struggle to justify broad rollout. They may still run pilots, but they hesitate to scale.

Operational control: the ability to steer the system over time

One of the most underappreciated aspects of enterprise AI deals is that deployment isn’t a one-time event. It’s a continuous process. Data changes. Policies change. Business priorities change. User behavior changes. Models and retrieval systems need updates. Systems need monitoring and recalibration.

So buyers ask: who owns the system after launch? How quickly can issues be detected? How quickly can fixes be deployed? What happens when the system starts drifting? How do you ensure that improvements don’t break existing workflows?

This is where platform thinking matters. Enterprises increasingly want AI capabilities embedded into a broader data and governance stack, rather than bolted on as a standalone tool. The reason is straightforward: if AI is tightly integrated with the data layer, access controls, and observability tooling, then the enterprise can manage it like any other critical system.

If AI is isolated, it becomes harder to govern. And when governance becomes the deciding factor, isolation is a liability.

Why “safe to deploy broadly” is the new ROI question

The phrase “safe to deploy broadly” sounds like a risk statement, but it’s also an ROI statement.

Broad deployment is where value compounds. Pilots can be useful for learning, but they don’t transform operations at scale. Enterprises want to move from experimentation to transformation. That requires confidence that the system won’t create unacceptable costs—financially, legally, or operationally.

In practice, “safe to deploy broadly” means the enterprise believes it can:

1) Prevent catastrophic failures
2) Detect problems quickly
3) Contain impact when something goes wrong
4) Prove compliance and accountability
5) Maintain performance as conditions evolve

If those five items aren’t credible, the enterprise can’t justify scaling spend. The deal doesn’t die because the AI is weak. It dies because the enterprise can’t quantify the downside.

And quantifying downside is hard when vendors rely on vague assurances.

The unique twist in this phase: governance is becoming product

A notable implication of the Disrupt discussion is that governance is no longer a separate layer added after the fact. It’s becoming part of the product experience.

In earlier cycles, vendors could win deals by focusing on model performance and developer experience. Now, buyers expect the vendor to help them operationalize AI: to provide the tooling and frameworks that make safety and reliability measurable.

That includes things like:

– Controls for data access and permissions
– Mechanisms to constrain outputs to enterprise-approved knowledge
– Monitoring and alerting for quality and safety signals
– Logging and audit trails for compliance
– Processes for evaluation, testing, and regression checks
– Clear pathways for human review and escalation

When these capabilities are missing or require heavy custom engineering, enterprises often decide the total cost of ownership is too high. Not just in dollars, but in time and organizational effort.

The result is a new kind of buyer skepticism: “Can we actually run this at scale without building a whole governance team around it?”

This is also why procurement and security teams are increasingly central to AI deals. They’re not just sign-off gates anymore. They’re co-authors of the deployment plan.

The hidden reason deals stall: misalignment between pilot success and production reality

Many enterprise AI projects fail to scale because pilot success doesn’t translate cleanly to production.

Pilots often use curated datasets, limited user groups, and simplified workflows. Production introduces unpredictable inputs and higher stakes. Even small differences—like how documents are updated, how retrieval is configured, or how users phrase requests—can change outcomes.

So buyers start demanding evaluation that mirrors production conditions. They want stress tests, scenario coverage, and clear metrics for acceptable performance.

This is where the “exciting” phase ends. Excitement can be generated in a demo. But safety and reliability require systematic measurement.

The co-founder’s comments reflect a broader market maturation: enterprises are learning to treat AI evaluation like engineering, not like marketing.

They want to see repeatable processes, not one-off results.

What vendors can learn from this shift

For vendors, the lesson is not simply “be safer.” It’s to align the sales narrative with the enterprise’s decision framework.

If the enterprise is asking, “Is it safe to deploy broadly?” then the vendor needs to answer with evidence and operational detail. That means being able to discuss:

– How the system is evaluated before rollout
– What monitoring exists after rollout
– How incidents are handled
– How the system is constrained to reduce risk
– How governance is implemented in practice
– How updates are managed without breaking trust

It also means