Helicone

One line of code to log, trace, and route every LLM call

github.com/Helicone/helicone★ 5.8k helicone.ai

Overview

Helicone is an open-source AI gateway and LLM observability platform for AI engineers. You route your model requests through its endpoint by changing one line of code, and Helicone logs every call so you can inspect traces, track cost and latency, and debug agents, chatbots, and document pipelines.

It works as a drop-in change to the OpenAI client: you point the baseURL at Helicone's AI gateway and add your Helicone API key. The same gateway gives you access to 100+ models from providers like OpenAI, Anthropic, and Gemini through one API key, with intelligent routing and automatic fallbacks.

As an LLM observability tool, Helicone fits teams who need to see what their model calls are actually doing in production. You can use the hosted service with a free monthly tier, or self-host the whole stack with Docker or a Helm chart for full control over your logs.

What it does

AI Gateway: reach 100+ models through one OpenAI-compatible API key with intelligent routing and automatic fallbacks
One-line integration to log requests from OpenAI, Anthropic, LangChain, Gemini, and the Vercel AI SDK
Trace and debug sessions for agents, chatbots, and document-processing pipelines
Track cost, latency, and quality metrics, with one-line export to PostHog for custom dashboards
Prompt management: version prompts from production data and deploy them through the gateway without code changes
Self-hostable via Docker Compose or a production Helm chart; SOC 2 and GDPR compliant

Getting started

Sign up for a Helicone API key, then point your OpenAI client at the Helicone gateway. You can also self-host the full stack with Docker.

Get your API key

Route requests through the gateway

Update the baseURL in your OpenAI client and pass your Helicone API key. Every call is now logged, and you can swap the model to any provider Helicone supports.

tsts

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://ai-gateway.helicone.ai",
  apiKey: process.env.HELICONE_API_KEY,
});

const response = await client.chat.completions.create({
  model: "gpt-4o-mini",  // claude-sonnet-4, gemini-2.0-flash or any model from https://www.helicone.ai/models
  messages: [{ role: "user", content: "Hello!" }]
});

Self-host with Docker (optional)

To run Helicone yourself, clone the repo and start the services with the included docker-compose script.

bashbash

# Clone the repository
git clone https://github.com/Helicone/helicone.git
cd docker
cp .env.example .env

# Start the services
./helicone-compose.sh helicone up

Commands and code are distilled from the project's own documentation — always check the official repo for the latest.

When to use it

Debug a misbehaving agent or chatbot by inspecting its full request traces and sessions
Track LLM cost and latency across providers and export the metrics to PostHog dashboards
Use one API key and the gateway's automatic fallbacks to reach 100+ models without per-provider wiring
Self-host LLM logging on your own infrastructure when data must stay in-house for compliance

How Helicone compares

Helicone alongside other open-source observability & llmops tools AI/TLDR tracks, ranked by GitHub stars.

Tool	Stars	What it does
Langfuse	★ 29.4k	A self-hostable platform for tracing LLM and agent calls, managing prompts, and running evaluations to debug and improve AI applications.
Opik	★ 19.7k	An open-source platform from Comet for tracing, evaluating, and monitoring LLM applications, RAG systems, and agent workflows with dashboards and LLM-as-judge metrics.
TensorZero	★ 11.7k	An open-source LLMOps platform that puts a single gateway in front of every major LLM provider and adds observability, evaluation, optimization, and A/B testing.
Evidently	★ 7.6k	A monitoring and evaluation framework for ML and LLM systems that tracks output quality, drift, and test results over time with reports and dashboards.
OpenLLMetry	★ 7.2k	An OpenTelemetry-based SDK that auto-instruments LLM providers, vector databases, and frameworks so traces flow into any existing observability backend.
Helicone	★ 5.8k	One line of code to log, trace, and route every LLM call
AgentOps	★ 5.6k	An SDK for monitoring AI agents that tracks LLM cost, session replays, and performance across frameworks like CrewAI, LangChain, and the OpenAI Agents SDK.
Pydantic Logfire	★ 4.3k	An observability platform from the Pydantic team that records LLM calls, agent runs, and tool invocations with tokens, cost, and latency attached.

// Overview

// What it does

// Getting started