AI Agents: FAQ for Decision-Makers
An AI Agent is a software-based system built on a (Large) Language Model that autonomously pursues a defined goal: it perceives its environment, plans across multiple steps, independently selects and uses external tools (tools, APIs, data sources), observes the results, and iteratively adjusts its plan until the goal is reached or aborted. The key point for decision-makers: an agent only pays off when the solution path cannot be planned in advance.
Key Takeaways
- ✓An AI Agent differs from chatbots, RPA, and workflow automation through four mandatory properties: LLM-based control, multi-step planning (Perceive→Reason→Act→Observe), dynamic tool use, and goal-oriented autonomy within guardrails.
- ✓Agents only pay off when the solution path cannot be planned in advance. For fixed paths, workflow automation or RPA are cheaper, more robust, and lower in compliance burden.
- ✓Market maturity is muted: according to McKinsey State of AI 2025, only 23 percent scale at least one agentic use case; Gartner (June 2025) expects more than 40 percent of agentic AI projects to be cancelled by the end of 2027.
- ✓Compliance belongs in the picture from day 1: EU AI Act (Art. 50 transparency from 02 Aug 2026), GDPR DPIA (Art. 35), as well as the works council under BetrVG §87 (DE) or ArbVG §96 (AT). Note: informational, not legal advice.
- ✓Pragmatic start: 1 to 3 use cases with clear ROI, a low-risk pilot with human-in-the-loop, outcome KPIs instead of activity metrics, vendor-agnostic framework choice based on use case.
- ✓Costs are often underestimated: token and reasoning costs drive the effort; according to Bitkom 2026, 33 percent of companies report higher AI costs than expected. Routing and caching dampen this.
An AI Agent is a software-based system built on a (Large) Language Model that autonomously pursues a defined goal: it perceives its environment, plans across multiple steps, independently selects and uses external tools (tools, APIs, data sources), observes the results, and iteratively adjusts its plan until the goal is reached or aborted.
For decision-makers, the most important questions can be condensed into three core answers:
- What is different? An agent plans dynamically and selects tools itself, instead of following a predefined path. This is exactly what distinguishes it from a chatbot, RPA, and workflow automation.
- When does it pay off? Only when the solution path cannot be planned in advance. If the process is known, classic automation or RPA are cheaper and more stable.
- What matters when introducing it? A clear use case with ROI, human-in-the-loop for irreversible actions, DPIA and works council from the start, thinking vendor-agnostically.
A concrete example
A typical L4 agent (autonomous agent) is a research assistant tasked with answering an open question. It breaks the goal down into sub-steps, independently invokes a web search, reads results, formulates follow-up questions, checks for contradictions, and finally summarizes an evidence-backed answer. No one prescribed the exact order of the search queries. It is precisely this dynamic control within the reasoning loop (Perceive→Reason→Act→Observe) that makes the difference from a hard-wired pipeline.
Where does the market currently stand?
Expectations are high, scaling is muted. According to McKinsey State of AI 2025, only 23 percent of companies scale at least one agentic use case, with a further 39 percent experimenting; in no function does the share of scaled use cases exceed 10 percent. Gartner (June 2025) expects more than 40 percent of agentic AI projects to be cancelled by the end of 2027, often due to an unclear business case and underestimated effort. For decision-makers, this means: prioritize soberly instead of rolling out broadly.
Agent, chatbot, RPA, workflow, or assistant?
The following matrix classifies the terms that are frequently conflated in the market:
Criterion | AI Agent | RPA | Workflow automation | Assistant | |
|---|---|---|---|---|---|
Trigger | Goal event | Message | Time rule | Event | |
Reasoning | LLM, multi-step | Intent matching | none (rule script) | conditional logic | LLM, single-step |
Tool use | dynamic, many | usually none | UI bots, screen scraping | prebuilt connectors | limited |
Memory | Short- and long-term | Session | none | Workflow state | Session |
Autonomy | high | very low | medium, scripted | low | low to medium |
Maintenance | high | low to medium | medium to high (UI breaks) | low | low |
Key principle: workflow automation and RPA follow a predefined path, whereas an agent decides the path dynamically at runtime. This also entails the agent's higher maintenance and compliance effort.
How mature is "my" agent? The levels L1–L5
- L1 Reflex: rule-based, no LLM (FAQ bot, thermostat).
- L2 Augmented LLM: LLM plus one tool call, reactive (e.g. ChatGPT with web search).
- L3 Workflow agent: LLM in a deterministic pipeline (prompt chaining, routing).
- L4 Autonomous agent: LLM controls sequence and tool selection dynamically, full loop (e.g. Claude Code, Deep Research).
- L5 Multi-agent system: several autonomous agents coordinate via A2A (orchestrator plus specialists).
This scale helps in vendor conversations: anyone who says "autonomous agent" but only delivers L2/L3 is engaging in "agent washing". According to Gartner, only about 130 of thousands of providers have genuine agent capabilities.
Standards: MCP and A2A
Two open standards structure the ecosystem. MCP (Model Context Protocol) standardizes the agent↔tool connection; the 2025-11-25 specification was handed over to the Linux Foundation in December 2025, and there are more than 10,000 MCP servers. A2A (Agent-to-Agent) standardizes agent↔agent communication and has been with the Linux Foundation since June 2025, with more than 150 participating organizations. For decision-makers, both are an argument for vendor-agnostic architectures that reduce lock-in.
Compliance: what DACH decision-makers need to clarify early
Note: informational, not legal advice; legal deadlines partly provisional.
- EU AI Act: Transparency obligations under Art. 50 apply from 02 Aug 2026. The AI literacy obligation (Art. 4) has already applied since 02 Feb 2025. High-risk classifications under Annex III may, according to the Digital Omnibus (07 May 2026), be postponed to 02 Dec 2027; until formal adoption, 02 Aug 2026 remains legally decisive.
- GDPR: For agentic systems, a Data Protection Impact Assessment (Art. 35) is usually required; Art. 22 (automated decisions) and Art. 28 (processing on behalf) are also relevant.
- Co-determination: Before introduction, the works council must be involved — in Germany under BetrVG §87, in Austria under ArbVG §96.
Common mistakes when introducing agents
The most expensive pitfalls are rarely technical. Frequently the framework is chosen before the use case, data preparation is underestimated, the DPIA is skipped, or human-in-the-loop is omitted for irreversible actions. Equally risky: a token cost explosion without routing and caching, treating agents as deterministic, and a lack of observability. A pragmatic start addresses exactly these points: 1 to 3 use cases with measurable ROI, a low-risk pilot with human approval, outcome KPIs instead of activity metrics, as well as DPIA and works council from day 1.
FAQ
What is the difference between an AI Agent and a chatbot?
What is the difference between Agentic AI and an AI Agent?
Do we need our own LLM to deploy agents?
When does an agent pay off compared to workflow automation or RPA?
How do we choose the right agent framework?
What are MCP and A2A and why are they relevant for us?
Which legal obligations apply in the DACH region? (informational, not legal advice)
Is a Data Protection Impact Assessment (DPIA) required for agentic systems?
Does the works council need to be involved?
Why are agents often more expensive than expected and how do you dampen the costs?
Why do so many agent projects fail?
How do we measure the success of an agent?
Want to go deeper?
Get new analyses straight to your inbox – or see how we put this knowledge to work for companies.