This article is solely my personal opinion, and does not represent my employer’s position.
What are AI Agents? It seems like every software company in the world is now describing their software as an agent. There’s a long tradition of attaching each new technology buzzword to a company’s software, with or without making any changes to it whatsoever. “Agent” seems to primarily be the replacement for “AI”, “Big Data,” “Analytics,” or “Responsive.”
But AI Agents are a real technology. Even if most are not, some products are being genuine when they use the term. This article lays out the functionalities and capabilities that make up AI Agents – and introduces a complexity hierarchy so that you can evaluate yourself software that claims to be agentic.
The most accurate (and least precise) definition of an AI Agent is software that has agency to act on your behalf. That is, it can “decide” what to do. This comes from the real world: when you hire a real estate or talent agent, they do the work to execute your will. I don’t think this definition is helpful, because my furnace can “decide” to shut off the heater because it is warm enough, and the computer from WarGames can “decide” to start thermonuclear war. Most definitions, including Wikipedia’s, cover this entire gamut. Let’s be more precise.
When honest marketers describe their AI software as an Agent, they generally mean a Large Language Model (LLM) application that is more capable than early days of ChatGPT. The early ChatGPT interface already had several important features that would not be called argentic:
This already rules out plenty of software that’s billed as an AI Agent. If the AI is applied via button-click, it has less agency than ChatGPT did in 2021. This doesn’t make it bad though! Teams Meeting Recap is one of my favorite AI features, despite being the antithesis of an agent.
Moving on from what is not an agent, let’s discuss those capabilities that can be added to an LLM application that do increase agency. I’ll give a definition of each functionality, and then explain a complexity hierarchy.
This is certain to be an incomplete list, but this is what I see on the horizon. And there are other useful AI capabilities that are not related to agents, such as a natural voice interface or RAG. “AI Agent” isn’t a synonym for “advanced”!
A program being agentic isn’t the same thing as being very good. Anthropic Claude Sonnet 3.5 is considered by many today to be the LLM product that produces the best responses, but it has few features of AI Agents. But if we arrange these capabilities across reasoning, tool-use, interaction, and learning and sort by difficulty to implement, we can get a fair picture of what AI Agents exist today.
Here’s a workbook you can use for the chart. Let’s compare a few AI products’ agency (click to embiggen):
You can see that each company has a very different view of what an AI Agent is. I like the products I just described, but many other software companies are misusing the term “agent.” If an “agent” is a single call with a single system prompt to reprocess user data, the AI has no agency at all. It is too late to fix the term in our industry. However, now you have the tools to determine how agentic a product is, and in what dimensions.
You can read my older post, How to Improve Your LLM Application, for tips on adding agency to your intelligent application. Get out there and start building!
It's an open secret which people will thrive during the AI wave. Here's how to…
A history of diffusers' faults and how transformers address them. Diffusers are still used by…
See how testing your agents with LLM-judged questions (evaluation) will improve their quality, prevent regressions,…
Tired of boring answer? See how to create AI agents with your own tastes and…
Agent instructions are critical to get right. Learn why to write in Markdown, go right…
Effective AI agents handle complex or menial workflows with instructions, knowledge, and skills. Learn to…