The AI market is full of impressive demos, big promises, and consultants who can't explain why 80% of deployments deliver no bottom-line impact. Here's what the landscape really looks like from the buyer's side — and the one gap nobody's managing.
Read those numbers together and the picture is clear: almost everyone has bought into AI, almost nobody is seeing the returns they were promised, and a third of their own staff are working against them.
That's not a technology problem. That's a gap between what AI delivers and what the business needs. The market is flooded with consultants showing impressive demos that don't survive contact with a real Tuesday afternoon when three people are on leave and the compliance deadline is tomorrow.
IBM's Institute for Business Value tracked two distinct failure modes. 42% of companies abandoned most of their AI initiatives. Of the deployments that continued, 74% delivered zero tangible business value. Not low value — zero. The technology worked. The demos were impressive. But in most of these deployments, nobody had checked whether the AI output was complete — whether it covered the full requirement surface, or just the 40–60% that language models handle most easily. The bottleneck was never capability. It was that nobody built the judgment to catch what AI leaves out.
Everyone's worried about AI hallucinating — making things up. That's a real risk, and it's getting smaller with every model update. The risk that isn't shrinking is the one nobody's watching: AI that gives you a perfectly accurate answer to 40% of the problem and presents it as the whole picture. It doesn't simplify on purpose. Language models pattern-match toward the most probable answer, not the most complete one. They drop requirements, ignore edge cases, and skip the parts that are hard to reason about. Then they hand you something polished and confident. Your team looks at it, thinks "this is good," and executes. Nobody checks what's missing because nothing looks missing. That's the completeness gap.
Security researchers have already demonstrated real exploits against the protocols connecting AI to business systems. This isn't theoretical — a GitHub exploit via prompt injection exfiltrated private repository data. The consultants showing you AI demos rarely mention this. The ones who should be helping you are the ones who lead with it, because they're the ones designing for it.
AI models are being replaced faster than most businesses can deploy them. If your system is built around a specific model, you're buying a dependency that has a shelf life. The architecture matters more than the model — systems built on open standards survive the model churn. Systems built on vendor demos don't.
Every one of these concerns is legitimate. And every one of them points to the same structural gap: AI is producing output that is correct and incomplete, and nobody has built the judgment layer to catch what's missing. The businesses that get AI right aren't the ones with the best models — they're the ones with organised oversight designed for how AI actually performs, not how the marketing says it performs.
Every AI consultant walks in and asks the same question: “What do you want to automate?” It sounds reasonable. It's the wrong question.
The right question is: “What is AI leaving out — and what should you stop doing entirely?”
Most businesses that try to automate a workflow should first eliminate 30–40% of the steps in that workflow. They've accumulated processes that made sense five years ago and nobody's questioned since. Reports nobody reads. Approvals that don't add value. Data entered in three places when one would do. And now, on top of that, AI is producing output that covers part of the work with total confidence — and nobody's checking whether it covered the whole thing.
Automating a bad process doesn't fix it — it just makes it a faster bad process. And layering AI onto work without a judgment layer doesn't make the output reliable — it makes the gaps invisible.
| | × What most consultants do | ✓ What actually works |
|---|---|---|
| First question asked | “What can we automate?” | “What is AI leaving out — and what should stop?” |
| How they demonstrate value | Impressive demos of AI capabilities | Honest assessment of AI completeness gaps |
| What they check | Whether the AI output is accurate | Whether the AI output is complete |
| Architecture approach | Built around one AI vendor | Works across models — the judgment layer survives vendor change |
| Documentation | Instructions separate from the docs your team uses | The document your team approves IS what drives the AI |
| Timeline to results | Months of pilots and proofs of concept | Working systems in 2–4 weeks |
| When things change | Rebuild required | Update one file. System adapts. |
The AI market right now is in the messy middle. The hype has peaked, the disappointments are stacking up, and the unreliable vendors are starting to exit. That sounds like a bad time to invest. It's actually the best time.
Every major technology shift follows the same pattern. The companies that built during the confusion — when cloud computing was unreliable, when digital was overpromised — those are the ones who dominated the next decade. The infrastructure is solidifying right now: open standards, falling costs, proven patterns for what works and what doesn't. The businesses that build the judgment layer during this window get a structural advantage that latecomers can't replicate.
In 12–18 months, platforms will bundle AI capabilities that make simple implementations trivial. The advantage won't be in having AI — everyone will have it. The advantage will be in having your operations already running with organised judgment — the instinct to catch what AI leaves out, built into how your team works. That's the gap that doesn't close by waiting.
I build operational systems that close the gap between AI output and business reality. Every AI output gets audited for completeness before your team acts on it. The system forces the AI to plan without acting, then runs a second evaluator layer against your actual business logic — the rules, constraints, and exceptions your team knows but the AI doesn't. Gaps get flagged and filled before anyone sees the output.
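Here's a minimal sketch of that loop, assuming a generic chat-completion client passed in as `call_model`; the prompt wording, the `audited_output` helper, and the rules format are illustrative assumptions, not a specific product's API.

```python
# Minimal sketch of the plan-then-audit loop. `call_model` stands in for
# any chat-completion client (string in, string out); names and prompts
# here are hypothetical, not a vendor API.

PLAN_PROMPT = (
    "Do not produce the final output yet. List, as numbered steps, every "
    "requirement you intend to cover for this task:\n\n{task}"
)

AUDIT_PROMPT = (
    "You are a completeness auditor. Compare the plan below against these "
    "business rules and list every rule the plan fails to address.\n\n"
    "Rules:\n{rules}\n\nPlan:\n{plan}\n\n"
    "Reply COMPLETE if nothing is missing; otherwise list the gaps."
)

def audited_output(task: str, rules: str, call_model) -> str:
    # 1. Force the model to plan without acting.
    plan = call_model(PLAN_PROMPT.format(task=task))

    # 2. Second evaluator pass: check the plan against the business
    #    logic (rules, constraints, exceptions) the first pass ignored.
    audit = call_model(AUDIT_PROMPT.format(rules=rules, plan=plan))

    # 3. Close the flagged gaps before any human sees the output.
    if "COMPLETE" not in audit:
        plan = call_model(
            "Revise the plan to close these gaps, then produce the "
            f"final output.\n\nGaps:\n{audit}\n\nPlan:\n{plan}"
        )
    return plan
```

The structure matters more than the exact prompts: the evaluator pass sees the rules the first pass never optimised for, so omissions surface as explicit text instead of staying invisible.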
The architecture is simple:
The document your team approves is the same artifact that drives the AI. One file to update. When the team changes the document, the AI's behaviour updates with it. When the next model ships, the system keeps running because it was never dependent on one vendor.
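A sketch of that single-artifact pattern, assuming the approved document lives in a markdown file (the `ops_playbook.md` name and `build_prompt` helper are hypothetical stand-ins):

```python
from pathlib import Path

# The document your team approves is the same file that drives the AI.
# "ops_playbook.md" is a placeholder for whatever artifact your team
# already reviews and signs off.
PLAYBOOK = Path("ops_playbook.md")

def build_prompt(task: str) -> str:
    # Re-read the file on every call: when the team edits the document,
    # the AI's behaviour updates with it, with no redeploy and no
    # separate instruction set drifting out of sync.
    playbook = PLAYBOOK.read_text()
    return (
        f"Follow these approved operating rules exactly:\n\n{playbook}\n\n"
        f"Task:\n{task}"
    )
```

Because the prompt builder emits vendor-neutral plain text, the same file keeps working when the underlying model changes; only the thin client behind it gets swapped.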
What that looks like in practice:
- 3 hours → 15 minutes · Crew briefing packs, with completeness audit
- Skipped → Every job · Compliance checks, automatic and consistent
- Full day → 1 hour · Proposals, with gaps caught before submission
I've built 100+ of these specialised AI workflows across deep client engagements. Before that, I spent 20 years at BHP in CIO and VP roles covering mining, rail, ports, and energy. I know what operational pressure feels like from the inside.
I won't sell you technology you don't need. I won't lock you into one AI vendor. And I won't pretend AI output is trustworthy by default — it's correct and incomplete, and the systems I build are designed for that reality.
The AI Instinct Diagnostic is a structured assessment that maps where AI is creating value in your operations and where it's creating invisible risk. You'll see the completeness gaps in your own workflows. Concrete findings, not a slide deck. If the honest answer is “not yet,” I'll tell you.
Book the AI Instinct Diagnostic →