Insights

Why AI Training Doesn't Change How People Work

By Dan Gildoni · 20 June 2026 · 13 min read

In short

Enterprises have given everyone AI and trained them on it, and behaviour has barely moved. The reason is not the tool, the budget, or the appetite of the workforce. Most AI training teaches the features of the tool, not the habit underneath the keystroke, and it is delivered as a one-off event that fades within weeks. Access is not behaviour. This essay names why that gap persists, and what installs a reasoning habit that actually holds.

On this page

The adoption paradox
Why training is the reflex, and why it fails
The habit nobody trains
The two failures
What actually installs a habit
One question worth asking
Frequently asked questions
References

01 · The paradox

Everyone has it. Almost nobody works differently

There is a number from the Microsoft Copilot rollouts that should stop any transformation lead in their tracks. When Gartner studied enterprise deployments, 90% of the people given the assistant said they would fight to keep it, and at the same time 72% said they struggled to fit it into their daily routine (Gartner, via Computerworld, 2024). Read those two facts together. People love the tool and cannot make it part of how they work. Only six percent of the organisations studied had finished a pilot and were planning a real scale-up. The licences went out. The behaviour stayed put.

This is not a Microsoft story; it is the shape of enterprise AI everywhere. MIT's NANDA initiative looked across hundreds of generative-AI deployments and found that around 95% of pilots delivered no measurable impact on the profit-and-loss statement, with only about five percent producing rapid acceleration (MIT NANDA, 2025). Their diagnosis is worth holding onto: the failure is not the quality of the models, it is the approach. Generic tools, they note, do not learn from or adapt to how people actually work. The technology arrived. The way of working did not move to meet it.

Faced with that gap, most organisations reach for the same instrument: training. More courses, more literacy modules, more prompt libraries. It is the reasonable response, and it mostly does not work. Understanding why is the difference between a programme that produces a completion certificate and one that produces a workforce that decides differently.

02 · The reflex

Why training is the reflex, and why it fails

Training is the reflex because it is legible. It can be scheduled, assigned, and counted, and a completion rate is a clean thing to show a board. The skills gap is real and the pressure to close it is real: the World Economic Forum estimates that 39% of workers' core skills will be transformed or outdated by 2030, and 63% of employers name skills gaps as the single biggest barrier to transforming their business (World Economic Forum, 2025). So the organisation does the legible thing and buys training at scale.

Then two forces quietly drain it. The first is what the training teaches. A feature tour shows people the buttons: how to summarise a document, draft an email, build a deck. It leaves them more fluent with the tool and no different in how they reason, so they point a more capable machine at exactly the work they were already doing. The second is memory. Human retention of new information collapses fast without reinforcement, the forgetting curve that learning science has described for over a century, and a one-off course is the textbook case. Even the part that lands drains away within a month. Put the two together and you get a workforce that has been trained and a way of working that has not changed. Deloitte's survey of manufacturers caught this precisely: even as firms deploy AI, they rated human capital the lowest-maturity area of all the smart-manufacturing categories they measured (Deloitte, 2025). The tools are maturing faster than the people using them, because the people are being trained in a way that does not stick.

This is not a failure of effort, and it is not unique to AI. RAND found that more than 80% of AI projects fail, roughly twice the rate of other IT projects, and that the leading cause is people and process, not the technology (RAND, 2024). The instinct to fix a people problem with a content delivery is where the money quietly goes to waste.

03 · The habit

The habit nobody trains

There is a layer underneath the keystroke that no feature tour reaches, and it is the layer that decides whether AI is worth anything. It is the habit of how a person engages the machine in the first place. Most people use AI like a vending machine. They have a task, they type a request, they take what comes back, and they repeat. No expert framing, no challenge to the request, no examination of whether the task was the right one. The machine does exactly what it is told, and what it is told is almost always the symptom rather than the cause, because the thinking behind the request was never examined.

The small minority who get real value operate a different default. They treat the machine as a thinking partner. They give it a specific expert role. They state the real goal rather than the surface task. And, the move that does most of the work, they make the machine question them before it answers, surfacing what they have not specified and challenging whether the task is even the right one. We call that move the Flip, and it is the hinge of the whole thing. Most people instruct AI to produce. The Flip instructs it to interrogate the request first.

Training shows people the buttons. It almost never changes the question they ask before they press one.

This is why training fails to change behaviour even when it is well delivered. The vending-machine habit is invisible to a feature curriculum, because the curriculum is about the tool and the habit is about the person. You can teach someone every capability of an assistant and leave the way they reason with it completely untouched. The gap is not technical. It is cognitive, and a course aimed at the tool will keep missing it.

04 · The diagnosis

The two failures, named

If AI training is going to change behaviour, it has to beat the two forces that drain the standard version. Naming them is the start of fixing them.

1. It trains the tool, not the thinking. A feature course raises fluency with the interface and leaves the reasoning habit underneath it untouched. People come out able to operate the machine and still aiming it at the same low-value work, faster. The result is what we call High-Speed Waste: polished output that accelerates activity without changing a single outcome that matters. To change behaviour, the unit of training has to be the habit, the persona, the goal, and the Flip, not the button.

2. It is an event, not a practice. A one-off workshop runs straight into the forgetting curve, and most of what was taught is gone within weeks. Behaviour change needs reinforcement: the new habit has to be practised in real work, repeatedly, until it is the default rather than something recalled from a course. A single event, however good, cannot do that. What holds is a rhythm.

Notice that neither failure is about the technology, the budget, or whether people want to use AI. They want to; the Copilot numbers prove it. Both failures are about the design of the learning. That is good news, because design is the most fixable thing in the building.

05 · The fix

What actually installs a habit

The evidence on what works points the same way the diagnosis does. Harvard Business Impact studied how AI-fluent people actually became fluent and found they were twice as likely to have learned through experimentation rather than instruction. Their conclusion is blunt: without hands-on practice, learning "stays in the realm of theory, important, but inert" (Harvard Business Impact, 2025). Capability is built by doing the real work differently, not by being told about it. The same study found that the binding constraint on AI fluency was a lack of organisational support and reinforcement, not a lack of employee motivation.

That is what the Havruta Methodology is built to install, and it answers the two failures directly with two pillars. The Cognitive Pillar replaces the feature list with one portable habit, the 4-Lines: give the machine a specific expert persona, state the real goal, invoke the Flip so it questions you before it answers, and run the exchange one step at a time. It is simple enough that a finance analyst, a plant supervisor and a marketer can all run it on their own real work the same afternoon, which is the hands-on practice the evidence calls for.

The Brain Pillar answers the second failure. It replaces the one-off event with persistent, shared context, the verified ground a team reasons from again and again, so the habit is reinforced by the work itself rather than left to memory. This is also where Ground Truth lives, the discipline of anchoring the machine in your verified internal reality instead of its generic average. The two pillars together give people a test they can apply to their own output, what we call the Mirror Principle: if the output is generic, the reasoning was generic. A workforce that has internalised that has stopped needing the course, because it has learned to notice the gap itself.

None of this requires new tools or new budget. It works with whatever AI the organisation already has on its desks. The variable it changes is not what people can access. It is how they reason before they press a key.

06 · The question

One question worth asking

Do not ask what proportion of your workforce has been trained on AI. That question has a flattering answer and it tells you nothing. Ask instead: has the reasoning changed? Not seats activated, not modules completed, not prompts sent. Is the work that comes out of your teams less generic, and are the decisions sharper, than they were before the programme ran? If the honest answer is no, the training was not too small. It was aimed at the wrong layer.

The reason this matters for whoever owns AI transformation is that the two halves of the mandate stand or fall on the same point. Broad upskilling that changes behaviour and a deep AI Builders team only compound if they share one reasoning discipline; otherwise you get a workforce that was trained and did not change, and an expert core that pulls away from it. Both are versions of the same failure: capability that was delivered and never installed.

The most fixable gap in the enterprise is not a gap in tools or in budget. Those are largely closed; the spend has been made and the machines are on every desk. The gap is in how people think before they use them, and unlike infrastructure it does not take years or millions to close. It takes a habit, practised on real work, reinforced until it holds. The course that fades is the expensive option. The habit that compounds is the cheap one. The only question is which one your programme is actually building.

07 · Frequently asked

Frequently asked questions

Why doesn't AI training change how people work?

For two reasons. Most training teaches the features of the tool rather than the reasoning habit underneath the work, so people become fluent with the interface and keep the way of working they already had. And it is delivered as a one-off event, which runs straight into the forgetting curve, so even what lands fades within weeks. The fix is to install a portable reasoning habit on real work and reinforce it through an ongoing rhythm, not a single course.

If people love tools like Copilot, why isn't behaviour changing?

Because wanting a tool is not the same as reasoning with it. Gartner found 90% of people given Microsoft Copilot would fight to keep it, while 72% could not fit it into their daily routine (Gartner, via Computerworld, 2024). The enthusiasm is real and the access is real. What is missing is the habit of making the machine interrogate the work before it produces, which no licence and no feature tour supplies on its own.

What is the actual enterprise AI failure rate?

MIT's NANDA initiative found that around 95% of generative-AI pilots delivered no measurable impact on the profit-and-loss statement, with about 5% producing rapid acceleration (MIT NANDA, 2025). RAND separately found more than 80% of AI projects fail, roughly twice the rate of other IT projects, with people and process the leading cause, not the technology (RAND, 2024). The exact figure varies by study; the direction does not.

What is the difference between AI literacy and changing behaviour?

AI literacy is knowledge of what AI is and what it can do, which is necessary and not sufficient. Changing behaviour is a habit: the expert persona a person assigns, the real goal they hold the machine to, and the discipline of letting it question the work before it produces. Literacy can be taught in a module. A habit has to be practised on real work and reinforced, which is why literacy programmes raise awareness and rarely change output.

How do you make AI upskilling stick?

Teach a single portable habit on each person's own live work, then reinforce it. The Harvard Business Impact study found AI-fluent people were twice as likely to have learned through experimentation, and that the binding constraint was organisational support, not motivation (Harvard, 2025). In practice that means hands-on practice over instruction, the 4-Lines as the habit, and persistent shared context so the practice is repeated rather than remembered. We set out the build in enterprise AI upskilling that changes behaviour.

How do you measure whether AI training worked?

By the work, not the dashboard. Seats activated, completion rates and prompt counts measure access and attendance, not capability. The signal that matters is whether the output is less generic and the decisions sharper than before, the test the Mirror Principle names. If the reasoning in the room is no different from a month ago, the training has not changed behaviour, whatever the completion rate shows.

References

Gartner, on Microsoft 365 Copilot enterprise rollouts (90% would fight to keep it; 72% struggle to integrate it; 6% planning scale-up), reported by Computerworld, 2024. computerworld.com
MIT NANDA, The GenAI Divide: State of AI in Business 2025 (around 95% of pilots show no measurable P&L impact). nanda.media.mit.edu
World Economic Forum, Future of Jobs Report 2025 (39% of core skills transformed or outdated by 2030; 63% of employers name skills gaps the biggest barrier). reports.weforum.org
Deloitte, 2025 Smart Manufacturing Survey (human capital rated the lowest-maturity smart-manufacturing category). deloitte.com
RAND Corporation, The Root Causes of Failure for Artificial Intelligence Projects, 2024 (more than 80% of AI projects fail; people and process the leading cause). rand.org
Harvard Business Impact, Learning Through Experimentation, 2025 (AI-fluent people twice as likely to have learned hands-on; organisational support the binding constraint). harvardbusiness.org

The course that fades is the expensive option. Install the habit that compounds instead.

Request a Strategic Briefing →

AI for the Head of AI Transformation → Read the methodology →