Strategy

Agentic AI vs AI Agents vs Chatbots: Why the Distinction Matters at Procurement

|10 min read

At procurement and supplier-evaluation conversations through 2026, three terms keep getting used interchangeably: chatbot, AI agent, and agentic AI. Vendors use them inconsistently, RFP responses blur them deliberately, and the result is that mid- market UK firms regularly buy one thing thinking they are buying another. The distinction matters a great deal, for governance, for cost, for what the system is realistically capable of, and for whether the work product the firm needs will actually emerge.

This piece is the plain-English version of the distinction, written for the procurement, operations, and second-line audiences who have to evaluate what they are being sold.

Chatbots

A chatbot is a conversational interface to a model. The user asks; the model answers; the conversation lives in a UI. Modern chatbots can be impressively capable, drawing on retrieval-augmented generation, citing sources, and handling multi-turn dialogue, but they share a fundamental property: the user has to be there. Nothing happens without a prompt. The work does not happen on its own.

Chatbots are right for: internal knowledge retrieval, drafting support, research help, customer self-service. They are wrong for: workflows that need to run without a human at the keyboard. If the value proposition you have been pitched is “our AI does the work for you” and the demo involves someone typing prompts into a chat window, you are looking at a chatbot, not an agent.

AI agents

“AI agent” is the term that has done most of the damage in vendor copy, because it gets used to mean almost anything. In its narrowest, most useful sense, an AI agent is a single AI step inside a defined workflow, read this document, classify this case, draft this response, route this email. This is what we call AI automation. The model does one well-bounded thing as part of a larger process, with clear inputs, clear outputs, and clear quality criteria. It runs without a user prompt because it is triggered by an event, not by a question.

AI agents in this narrow sense are the highest-ROI starting point for most UK mid-market firms. The technical bar is lower than for full agentic systems, the governance work is more contained, and the time-to-value is faster. The mistake firms make is buying something pitched as an “AI agent” that turns out to be a chatbot in a wrapper, or, at the other end, buying something pitched as an “AI agent” that is actually a multi-step agentic system the firm is not yet ready to govern.

Agentic AI

Agentic AI is what people mean when they say “the agent decides what to do next.” The model plans across steps. It chooses which tool to call. It uses the result of one step to decide the next. It checks its own work and asks for help when it is uncertain. The unit of work is not the prompt or the function call; it is the goal.

Agentic AI is the right pattern when the work is genuinely multi-step, multi-system, and judgement-heavy, client onboarding orchestration, conduct-risk investigation, multi-stage research synthesis. It is the wrong pattern for single-step work, where it adds governance cost without adding capability. The most expensive mistake in this space is buying agentic AI for work that should have been single-step automation.

Why the distinction matters at procurement

Three reasons it matters in real engagements.

Governance scope. A chatbot is governed broadly like any other software with AI in it, usage policy, output review for client-affecting communications, log retention. Single-step AI automation needs the controls plus an eval harness, structured logging of model and prompt version, and an escalation path for low-confidence cases. Agentic AI needs all of that plus step-level audit trails, defined tool boundaries, designed-in human-in-the- loop checkpoints, and a rehearsed rollback path. The governance bar is genuinely higher with each step up the ladder. Buying an agentic system thinking you are buying a chatbot leaves you under-governed in a way the FCA, ICO, or your own second line will eventually find.

Cost shape. Chatbots are typically per-seat or per-conversation. AI automation is typically per-workflow, with inference and infrastructure as ongoing operating costs and a meaningful build-and-eval investment up front. Agentic AI carries those costs plus a substantially larger investment in eval, observability, and governance scaffolding - and the running costs are typically higher because each agentic task involves more model calls. Buying one thinking you have priced for another leaves the second-year budget conversation in difficult shape.

Capability fit. Buying a chatbot when you needed AI automation means buying a tool your team has to drive rather than a tool that does the work. Buying agentic AI when you needed AI automation means paying for governance and orchestration capability you are not going to use. Buying AI automation when you needed agentic AI means underdelivering on the coordination problem that was the actual goal. Each mismatch is recoverable, but each one loses you a quarter or two.

Three procurement-stage questions to ask

These are the questions that separate the three categories quickly:

1. “Walk me through how this runs without a user typing.” If the answer is “a user types,” you are looking at a chatbot. If the answer is “an event triggers it and it does one well-defined thing,” AI automation. If the answer is “the agent receives a goal and plans the steps,” agentic AI.

2. “Show me the audit trail of what happened on a real case.” If the audit trail is conversation logs, chatbot. If it is model and prompt version per case, plus inputs and outputs, single-step automation. If it is step-level, every tool call, every input, every output, every decision the agent made, agentic AI. Vendors who cannot show you any of these are selling unfinished product.

3. “What happens when the system is uncertain?” Chatbots typically answer anyway. Well-built AI automation flags low-confidence cases for human review with the model's reasoning attached. Production-grade agentic AI escalates to a human-in-the-loop checkpoint with the agent's reasoning, the data it gathered, and the decision it would have taken. The answer to this question reveals more about the maturity of the underlying engineering than any other.

What to do at procurement

Match the category to the workflow rather than the other way round. Run a structured discovery, the Evolve Workflow Audit is one approach, there are others, to understand which category your candidate workflows actually need. Then evaluate vendors in that category. The mistake is letting a vendor's pitch define the category before the workflow has been understood; that is how firms end up with a chatbot pitched as an agent and a procurement decision they later regret.

For a deeper look at agentic AI specifically, see Agentic AI explained: a UK operator's guide. For the cost model on a single AI automation, The true cost of AI automation for a 200-person financial services firm is the practical companion piece. And for the broader question of where AI automation lands next to RPA in a typical mid-market estate, AI automation vs RPA is worth the read.

Ready to transform your business with AI?

Book a free strategy session to discuss how Evolve AI can help your organisation harness AI safely and compliantly.

Book Strategy Session