AI Techniques

Agentic AI

Agentic AI refers to AI systems that autonomously pursue goals by planning multi-step actions, calling external tools, and adapting based on outcomes — with minimal human intervention at each step.

Agentic AI is the shift from AI as a passive responder to AI as an active executor. Rather than answering a single prompt and stopping, an agentic system receives a high-level goal, breaks it into steps, takes action, observes the result, and iterates — all with minimal human hand-holding between steps.

The underlying engine is still an LLM, but wrapped in a loop: plan → act → observe → re-plan. What makes it “agentic” is that the model decides what to do next at each turn, rather than waiting for a human to direct every step.

The Core Loop

Every agentic system, regardless of framework, runs a variant of the same cycle:

  1. Goal intake: A human provides a high-level objective (“research competitor pricing and draft a report”) rather than a step-by-step instruction.
  2. Planning: The LLM decomposes the goal into a sequence of sub-tasks and selects which tools or skills to invoke first.
  3. Action: The agent calls a tool — a web search, a code interpreter, a database query, an API, a file read — and receives a result.
  4. Observation: The model reads the tool output and evaluates whether it moved closer to the goal.
  5. Re-planning: Based on what it learned, the model decides the next action, revises its plan if needed, or declares the task complete.

This loop repeats until the goal is achieved or a stopping condition is met.

Key Components

LLM Core

The language model acts as the “brain” — reasoning about the goal, deciding which tool to use, interpreting results, and generating final outputs. More capable models (stronger reasoning, longer context, better instruction-following) produce more reliable agents.

Tools

Tools are functions the agent can call to interact with the world. Common examples:

  • Web search — retrieve up-to-date information
  • Code interpreter — write and execute code, process data
  • File I/O — read, write, and transform documents
  • API calls — interact with external services (CRMs, databases, communication platforms)
  • Browser control — navigate and extract data from websites

Memory

Agents need to track state across many steps. Memory typically comes in two forms:

  • In-context memory: The full conversation and tool history kept in the active context window
  • External memory: Vector stores or databases that persist facts between sessions, enabling long-running agents to pick up where they left off

Planning Module

Some architectures add an explicit planning step — generating a structured task graph before execution begins — to improve reliability on complex, multi-step objectives. Techniques like ReAct, Tree of Thoughts, and chain-of-thought prompting all influence how agents plan.

Orchestrator / Scheduler

In multi-agent systems, an orchestrator coordinates multiple specialised sub-agents, routes tasks to the right agent, and synthesises results. This is analogous to a project manager delegating to specialists.

Single-Agent vs. Multi-Agent

Single-agent systems use one LLM in a loop with access to a set of tools. Simpler to build and debug, appropriate for focused, well-defined tasks.

Multi-agent systems compose multiple specialised agents — each with its own tools, memory, and scope — orchestrated by a coordinator. Better suited to complex workflows where parallelism, specialisation, or cross-checking between agents adds value. The tradeoff is higher coordination complexity and more surfaces for failure.

What Agentic AI Can Do in 2026

Agentic systems in production today:

  • Ship code — write, test, and open pull requests autonomously
  • Run research — query databases, read papers, synthesise findings into reports
  • Manage outbound sales — enrich leads, draft personalised outreach, log CRM updates
  • Control browsers and desktops — fill forms, extract data, navigate UIs without APIs
  • Automate customer support — resolve tickets end-to-end, escalate only edge cases
  • Orchestrate business processes — coordinate approvals, data transforms, and notifications across systems

Reliability Challenges

Agentic systems amplify both the capabilities and the failure modes of the underlying LLM:

  • Compounding errors: A mistake in step 3 propagates through steps 4–10 before anyone notices
  • Tool misuse: The model may call the wrong tool, pass malformed parameters, or misinterpret results
  • Hallucinated actions: The model may “invent” tool results rather than actually calling the tool
  • Scope creep: Without clear stopping conditions, agents can over-execute or take unintended side effects

Mitigations include explicit human checkpoints for irreversible actions, sandboxed tool environments, step-level logging, and confidence-based escalation.

Agentic AI vs. Automation vs. Chatbots

ChatbotAutomationAgentic AI
TriggerUser messagePredefined eventGoal statement
Decision-makingNoneRule-basedLLM-driven
Tool useLimitedFixed pipelineDynamic, multi-tool
Handles novel situationsPoorlyNoYes
Requires pre-scriptingPartiallyFullyMinimally