What Is an Agent Framework? (and Do You Even Need One?)

Understand what agent frameworks actually do for you — and the honest cases where a plain API call and a loop are all you need.

BEGINNER11 MIN READUPDATED 2026-06-11

In plain English

An agent framework is a library that handles the repetitive plumbing of building an AI agent, so you can focus on what the agent should actually do. An agent is an LLM that runs in a loop: it thinks, picks a tool, runs it, reads the result, and repeats until the task is done. The framework gives you that loop, the tool wiring, and the message bookkeeping pre-built.

Think of building a custom car. You could forge your own engine, wheels, and brakes from scratch. Or you could start from a chassis kit that already has the engine mounted and the wiring run, and bolt on the parts that make your car yours. An agent framework is that chassis kit. The chassis isn't the interesting part of your car — but rebuilding it for every project is a waste of your time.

Popular examples include LangGraph, CrewAI, LlamaIndex, OpenAI Agents SDK, smolagents, and provider-native kits like the Claude Agent SDK. They vary wildly in size and philosophy, but they all answer the same question: what's the boilerplate I shouldn't have to write again?

Why it matters

Calling an LLM API once is trivial: one HTTP request, one answer. But an agent is not one call — it's many, in a loop, with state carried between them. The first time you build one by hand, you discover how much surrounding code you need: maintain a growing message history, describe your tools in the model's schema, parse the tool call it returns, run the function, feed the result back, and decide when to stop. Then add retries, timeouts, logging, and streaming.

Every team was writing that same loop from scratch. Agent frameworks packaged it up. Instead of hand-rolling tool use plumbing for every project, you register a function and the framework handles the round-trip. What used to be a few hundred lines of careful glue becomes a dozen lines of intent.

Who should care

Developers building anything multi-step — a research agent, a coding agent, a customer-support bot that calls real APIs — where the model loops and uses tools.
Teams that want portability — frameworks let you swap the underlying model without rewriting your whole app.
Anyone coordinating multiple agents — a multi-agent system needs orchestration, hand-offs, and shared state that a framework provides out of the box.

How it works

Under the hood, every agent framework wraps the same core loop. The model never "does" anything itself — it just emits text or a request to call a tool. The framework is the runtime that acts on those requests and loops back. Here's the cycle a framework runs for you:

// The agent loop a framework runs for you

Send messageshistory + tool definitions → modelModel decidesanswer, or call a toolRun the toolframework executes your functionAppend resulttool output back into historyCheck: done?stop on final answer, else loop↺ repeat

On top of that loop, frameworks add a fairly consistent set of building blocks. Different libraries name them differently, but the concepts repeat:

Building block	What it does	Why you'd reach for it
Tools / actions	Register a function the model can call	The whole point of an agent — give it hands
Memory / state	Carry context across loop turns	Planning and memory for long tasks
Orchestration	Route between steps, branch, retry	Control flow beyond a simple loop
Multi-agent	Spawn and coordinate sub-agents	Split a big job across specialists
Model adapters	One interface, many providers	Swap models without a rewrite
Tracing / eval hooks	Log every step, score outcomes	Observability and debugging

Frameworks split into two broad styles. Graph / workflow frameworks (like LangGraph) make you define explicit steps and transitions — more setup, but you can see and control exactly what runs next. Autonomous-loop frameworks (like smolagents or CrewAI) hand you a higher-level agent and run the loop with less ceremony. The trade-off is the eternal one: control versus convenience.

// Two framework philosophies

Graph / workflow

You define nodes and edges
Explicit, inspectable control flow
Easy to add human-in-the-loop pauses
More boilerplate up front
Example: LangGraph

Autonomous loop

You hand the agent a goal + tools
Framework runs the loop for you
Less code, faster to a demo
Harder to predict and constrain
Example: CrewAI, smolagents

The 'no framework' version (so you know what you're buying)

The best way to understand what a framework gives you is to write the loop by hand once. It's genuinely short. Here's a complete tool-using agent using only a provider SDK — no agent framework at all. Read it, and you'll see exactly which lines a framework would delete for you.

bare_agent.pypython

# pip install anthropic  — no agent framework, just the provider SDK
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from your env

# 1. Describe the tool to the model (name, purpose, input schema).
tools = [{
    "name": "get_weather",
    "description": "Get the current temperature for a city.",
    "input_schema": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

# 2. Your actual tool — a plain Python function.
def get_weather(city: str) -> str:
    fake_db = {"Paris": "19C", "Tokyo": "24C"}
    return fake_db.get(city, "unknown")

messages = [{"role": "user", "content": "What's the weather in Paris?"}]

# 3. The loop a framework would run for you.
while True:
    resp = client.messages.create(
        model="claude-sonnet-4-5",
        max_tokens=1024,
        tools=tools,
        messages=messages,
    )
    messages.append({"role": "assistant", "content": resp.content})

    if resp.stop_reason != "tool_use":
        print(resp.content[0].text)  # final answer — stop looping
        break

    # The model asked to call a tool: run it and feed the result back.
    for block in resp.content:
        if block.type == "tool_use":
            result = get_weather(**block.input)
            messages.append({"role": "user", "content": [{
                "type": "tool_result",
                "tool_use_id": block.id,
                "content": result,
            }]})

That while True block — sending messages, checking stop_reason, running the tool, appending tool_result, looping — is the agent. A framework hands you that loop pre-written, plus retries, parallel tool calls, streaming, and a tidier way to register tools. For one tool and one model, the raw version is honestly fine. For ten tools, three agents, and a need to swap providers, you'll want the framework.

Do you even need one?

Here's the honest answer the marketing won't give you: often, no. A surprising number of "agents" are really a single well-crafted prompt, or a fixed sequence of two or three LLM calls with code in between. Those don't need an agent framework — they need good prompt engineering and a few lines of plain code. Anthropic's own guidance is to start with the simplest thing that works and add agentic complexity only when simpler approaches fall short.

// A decision flow

One prompt enough?→ just call the APIFixed steps?→ chain calls in plain codeModel loops + picks tools?→ a framework starts to pay offMulti-agent / production?→ framework strongly worth it

Use this rough rule. The more uncertainty (the model decides what to do next) and the more moving parts (many tools, many steps, many agents), the more a framework earns its keep. The more deterministic and simple your task, the more a framework is overhead you'll fight.

Your situation	Reach for a framework?
Summarize text, classify, extract fields	No — one API call
RAG: retrieve, then answer	Usually no — see RAG fundamentals
Agent loops over many tools to finish a task	Yes — it saves real plumbing
Several agents coordinating, human approvals	Yes — orchestration is the hard part
You must audit every prompt and token	Maybe not — abstraction can hide them

The landscape: which framework does what

"Agent framework" is a loose label covering tools with different centers of gravity. Knowing what each is best at saves you from forcing the wrong one onto a problem. This category at agent SDKs and frameworks covers each in depth; here's the map.

Framework	Best at	Trade-off
LangGraph	Explicit graphs: full control of loops, branches, state	More setup; a bigger surface to learn
LlamaIndex	Data ingestion and RAG-centric agents	Less of a general orchestration tool
DSPy	Optimizing prompts as compiled programs	Different mental model; smaller ecosystem
Claude Agent SDK	Production agents the Claude Code way	Tied to one provider's patterns
CrewAI	Role-based multi-agent teams, fast to a demo	Opinionated; less low-level control
smolagents	Tiny code-writing agents, minimal deps	Deliberately small; fewer batteries included
No framework	Total control, zero abstraction	You write and maintain the glue yourself

These aren't mutually exclusive. A common real-world stack uses LlamaIndex for ingestion, LangGraph for orchestration, and calls a provider SDK directly inside a node. And if portability across models matters to you, a major reason to use any framework is the model adapter layer — write your agent once, then point it at a different model by changing a string.

One more axis worth naming: provider-native vs. provider-agnostic. The Claude Agent SDK and OpenAI Agents SDK are tuned to one provider's models and patterns — often the smoothest path if you're committed to that provider. LangGraph, CrewAI, and smolagents aim to work across providers, trading a little polish for portability.

Going deeper

Once the basics click, here's what separates a toy agent from one you'd trust in production — and where frameworks help, hurt, or simply can't save you.

Frameworks don't fix reliability

The hardest problem with agents isn't the loop — it's that errors compound. If each step is 95% reliable, ten steps in a row land near 60%. No framework fixes that for you. You still need solid evaluation, guardrails, and tight tool design. A framework can give you tracing hooks to see where a run derailed, but the reliability work is yours.

The abstraction tax, made concrete

Every framework sits between you and the raw model. When it works, that's a gift. When it doesn't, you debug two systems, and deep abstractions can obscure the actual prompt being sent — which matters enormously, because the prompt is where most agent behavior is won or lost. A good framework lets you drop down and inspect or override the raw messages. If a framework hides the prompt from you entirely, treat that as a red flag, not a feature.

Security is not optional

An agent that runs tools is an agent that can be hijacked. Prompt injection — malicious instructions hidden in a web page or document the agent reads — can trick it into calling tools you never intended. Frameworks add some guardrail hooks, but the discipline is on you: validate tool inputs, sandbox anything dangerous, and never let a tool do something irreversible without a check.

Standards are reducing lock-in

A newer reason the framework choice matters less than it used to: the Model Context Protocol (MCP) is becoming a shared standard for connecting agents to tools and data. As more frameworks speak MCP, the tools you build become portable across them — so you can change frameworks later without rewriting every integration. The honest mature take: learn the bare agent loop, pick the lightest framework that removes real pain, and never let any framework stop you from understanding what the model is actually doing.

FAQ

What is an agent framework in simple terms?

It's a library that handles the repetitive plumbing of building an AI agent — the loop that calls the model, runs the tools it picks, and feeds results back — so you write your agent's logic instead of re-implementing that loop every time. The intelligence stays in the model; the framework is just scaffolding.

Do I need an agent framework to build an AI agent?

No. You can build a working agent with just a provider's SDK and a short loop, and for simple tasks that's often the cleaner choice. Frameworks pay off when you have many tools, multiple agents, or need to swap models — basically, when the plumbing they remove outweighs the abstraction they add.

What is the difference between an agent framework and calling the API directly?

Calling the API directly means you write the agent loop yourself: maintain message history, describe tools, parse tool calls, run them, and decide when to stop. A framework gives you that loop pre-built plus retries, streaming, multi-agent orchestration, and model adapters — at the cost of a dependency and a layer of abstraction between you and the raw prompts.

Which agent framework should a beginner start with?

Start by building one agent with no framework at all so you understand the loop. Then, if you want graphs and explicit control, try LangGraph; if you want a small code-first agent, try smolagents; if you're committed to one provider, use its native SDK like the Claude Agent SDK. Match the tool to the problem, not the hype.

Are agent frameworks free and open source?

Most of the popular ones — LangGraph, CrewAI, LlamaIndex, smolagents, and DSPy — are open source and free to use. Some have companion paid products for tracing, hosting, or evaluation, but the core libraries you build with are free.

When is an agent framework overkill?

When your task is a single prompt, a fixed sequence of a couple of calls, or basic retrieval-then-answer RAG. Those are deterministic enough that a framework's loop and abstractions add overhead without saving much. Reach for a framework once the model genuinely decides what to do next across many steps or tools.

// In plain English

// Why it matters

Who should care

// How it works

// The 'no framework' version (so you know what you're buying)

// Do you even need one?

// The landscape: which framework does what

// Going deeper

Frameworks don't fix reliability

The abstraction tax, made concrete

Security is not optional

Standards are reducing lock-in

// FAQ

// Further reading

// Related

In plain English

Why it matters

How it works

The 'no framework' version (so you know what you're buying)

Do you even need one?

The landscape: which framework does what

Going deeper

FAQ

Further reading

Related