AgentOps

Observability and DevTool platform for AI agents — from prototype to production

github.com/AgentOps-AI/agentops★ 5.6k agentops.ai

Overview

AgentOps is an open-source SDK and platform that helps developers build, evaluate, and monitor AI agents. You add it to a Python program to capture analytics on your LLM calls, replay agent sessions step by step, and track how much each run costs.

It is aimed at developers building agentic apps who need visibility into what their agents actually do at runtime. Instead of guessing why an agent failed or how much a workflow spent, you get execution graphs, input/output recording, and cost tracking across foundation-model providers.

As an LLM observability tool, AgentOps sits alongside your agent framework rather than replacing it. It offers native integrations with CrewAI, LangChain, LangGraph, AG2 (AutoGen), the OpenAI Agents SDK, and others, and can run as a hosted dashboard or be self-hosted on your own cloud.

What it does

Replay analytics and debugging with step-by-step agent execution graphs
LLM cost management that tracks spend across foundation-model providers
Session replays initialized in two lines of code
Native integrations with CrewAI, AG2 (AutoGen), Agno, LangGraph, LangChain, and the OpenAI Agents SDK
Decorators (@session, @agent, @operation, @task, @workflow) for tracing agents, tools, and functions
Self-hosting option to run the full dashboard and API backend on your own cloud

Getting started

Install the SDK, then initialize the client at the start of your program to capture analytics on all your LLM calls.

Install AgentOps

Install the package from PyPI.

bashbash

pip install agentops

Get an API key

Create an API key from the AgentOps dashboard at app.agentops.ai/settings/projects.

Initialize and record a session

Call agentops.init() near the start of your program (for example in main.py) and end the session when your program finishes.

pythonpython

import agentops

# Beginning of your program (i.e. main.py, __init__.py)
agentops.init(<INSERT YOUR API KEY HERE>)

...

# End of program
agentops.end_session('Success')

Trace functions with decorators

Use AgentOps decorators to add observability to specific agents and operations. They support input/output recording, exception handling, and async functions.

pythonpython

from agentops.sdk.decorators import session, agent, operation

@agent
class MyAgent:
    @operation
    def main_operation(self):
        return "done"

@session
def my_session():
    agent = MyAgent()
    return agent.main_operation()

Commands and code are distilled from the project's own documentation — always check the official repo for the latest.

When to use it

Debugging why a multi-step agent failed by replaying its execution graph step by step
Tracking and controlling LLM spend across foundation-model providers in production
Monitoring agents built with CrewAI, LangChain, LangGraph, or the OpenAI Agents SDK through native integrations
Self-hosting the AgentOps dashboard and API backend to keep agent telemetry on your own cloud

How AgentOps compares

AgentOps alongside other open-source observability & llmops tools AI/TLDR tracks, ranked by GitHub stars.

Tool	Stars	What it does
Langfuse	★ 29.4k	A self-hostable platform for tracing LLM and agent calls, managing prompts, and running evaluations to debug and improve AI applications.
Opik	★ 19.7k	An open-source platform from Comet for tracing, evaluating, and monitoring LLM applications, RAG systems, and agent workflows with dashboards and LLM-as-judge metrics.
TensorZero	★ 11.7k	An open-source LLMOps platform that puts a single gateway in front of every major LLM provider and adds observability, evaluation, optimization, and A/B testing.
Evidently	★ 7.6k	A monitoring and evaluation framework for ML and LLM systems that tracks output quality, drift, and test results over time with reports and dashboards.
OpenLLMetry	★ 7.2k	An OpenTelemetry-based SDK that auto-instruments LLM providers, vector databases, and frameworks so traces flow into any existing observability backend.
Helicone	★ 5.8k	A proxy-based observability platform that logs, monitors, and evaluates LLM API calls by routing requests through its endpoint with one line of code.
AgentOps	★ 5.6k	Observability and DevTool platform for AI agents — from prototype to production
Pydantic Logfire	★ 4.3k	An observability platform from the Pydantic team that records LLM calls, agent runs, and tool invocations with tokens, cost, and latency attached.

// Overview

// What it does

// Getting started