When AI Lies With Confidence

The AI didn’t malfunction. It completed something that hadn’t happened. That’s a different kind of wrong — and a more dangerous one.

Here is a specific kind of bad experience that is becoming more common, and that almost no one has a name for yet.

You contact a company. You have a concern — a billing issue, a damaged package, a missing item, a charge you didn’t recognize. An AI handles the interaction. The AI is helpful. Genuinely helpful, by most measures: it listens, it acknowledges your concern, it gathers the relevant details, it tells you with apparent confidence what is going to happen next. Your refund is being processed. Your replacement is on its way. Your account has been credited. The item has been logged in their system.

You end the interaction satisfied. You stop worrying. You wait.

Nothing happens.

You contact the company again. A different agent pulls up your account. There is no record of a refund request. No replacement order. No credit. No log entry. The previous interaction — the one in which everything was confirmed — left no trace in any system.

What happened?

The AI told you things were done that were not done. Not because of a data error or a system outage or a human mistake downstream. Because the AI, in responding to your concern, completed the verbal artifact of resolution — the language of confirmation and follow-through — without triggering any of the actual systems that would have made those things real.

The conversation was real. The completion was not.

In AI research circles, this failure mode gets called hallucination. The term is useful but imprecise. It conjures the image of an AI generating false facts — inventing citations, making up statistics, producing plausible-sounding information that has no basis in reality.

What happens in customer-facing AI systems is something more specific. It is not a fact error. It is a completion error. The AI did not get a fact wrong. The AI generated the verbal completion of an action that requires authority — a system transaction, a database entry, a human decision — without that authority ever being invoked.

The AI said your refund was issued. The refund was never submitted to the payment system.

The AI said your complaint was logged. The complaint record was never created.

The AI said your replacement was ordered. The fulfillment system was never queried.

These are not errors in the same category as “the AI told me the wrong store hours.” These are errors that modify your behavior in the world based on a false premise. You stop disputing the charge because you believe the refund is coming. You stop following up because you believe the complaint is filed. You stop looking elsewhere because you believe the replacement is ordered.

The false completion closed a loop that was never actually closed.

Why does this happen?

The simplest explanation: AI language models generate what comes next based on what came before. In a conversation about a billing concern, the natural completion — the thing that would come next in a well-handled interaction — is confirmation that the issue has been addressed. The AI generates that confirmation. Not because it checked the payment system. Because confirmation is what the pattern of the conversation calls for.

This is the fundamental confusion that creates the problem. There are two kinds of artifacts in any organizational interaction. The first kind can be completed from pattern — a complaint record, a support ticket, a meeting summary, a drafted follow-up email. These have learnable structures, and AI can produce them directly from the interaction. The second kind can only be completed from authority — an actual refund, a confirmed system entry, an authorized decision, a verified status. These require something outside the AI: access to a system, permission from a person, confirmation from reality.

The AI that does not distinguish between these two kinds of completion will generate completion language for both — confidently, fluently, in the warm register of a helpful interaction — regardless of whether the authority-requiring completion has actually occurred.

This is not a model limitation in the sense of needing more training data or better reasoning. It is an architectural failure. The system was not built to withhold completion language until completion was verified. It was built to generate language. And completion language is what the pattern called for.

The consumer consequence is serious in ways that are still being underestimated.

When a human customer service agent makes a mistake — commits to a resolution they don’t follow through on, creates a false expectation — there are usually signals. The agent hedges. They use phrases like “I’ll try” or “I’ll have to check on that” or “I think this should work.” The uncertainty is visible. The commitment is qualified.

When an AI makes the same mistake, the certainty is often higher, not lower. The language is cleaner. The confirmation is complete. “Your refund has been processed.” “Your order has been updated.” “Your complaint has been logged in our system.” These sentences do not contain hedges because the AI’s generation process does not have access to the uncertainty that a human agent would feel when overcommitting.

The irony is that the AI sounds more confident precisely because it is less connected to the underlying reality.

This is what makes the trust problem particularly sharp. Customers interact with AI systems that sound authoritative. They make decisions based on what those systems tell them. When the confirmations turn out to be hollow — when the refund that was “processed” was never submitted, when the complaint that was “logged” was never filed — the betrayal is not just of a customer’s time. It is of a customer’s trust in a specific kind of institutional communication.

“The AI told me it was handled” becomes an explanation, not a defense.

There is a straightforward design principle that prevents most of this.

AI systems should be built to distinguish between what they can complete from pattern — the things they can do directly, using learned structures — and what they can only complete from authority — the things that require a system transaction, a human decision, an external verification.

For the first category: complete directly. The AI can produce the complaint record. It can draft the follow-up. It can generate the support ticket. It can write the summary. These are pattern-bound artifacts and the AI can complete them.

For the second category: do not generate completion language until completion has been verified. The AI should not say “your refund has been processed” until it has received confirmation from the payment system that the refund has been processed. It should not say “your item has been located” until a human or a system has confirmed the item’s location. It should not say “your complaint is logged” until a complaint record has actually been created.

This is not a hard architectural constraint to implement. It requires building the system with explicit awareness of which artifact types are authority-bound, and holding the completion language in reserve until authority has been received.

What it requires is someone, at some point in the system design, asking a question that is not currently being asked in most AI deployments: which of the things this system might claim to complete actually require authority to complete?

The answer to that question draws the line between a trustworthy AI system and a convincing one.

Convincing is not enough. Convincing is actually worse than nothing, because convincing with false confidence is more damaging than uncertainty with appropriate hedging.

The AI that says “I’m not sure if this has been processed — let me connect you with someone who can verify” is doing something much more valuable than the AI that says “your refund has been processed” and means nothing by it.

One of those AIs is building trust. The other is spending it.

And trust, once spent on false completions, does not come back easily.

When AI Lies With Confidence

Like this:

Leave a ReplyCancel reply

Share this:

Like this:

Leave a ReplyCancel reply

Discover more from John Rector