OpenClaw Observability¶

OpenTelemetry observability for OpenClaw AI agents — traces, metrics, and logs.

Two Approaches¶

This documentation covers two complementary approaches to OpenClaw observability:

1. Official Diagnostics Plugin (Built-in)¶

OpenClaw v2026.2+ includes built-in OTel support via diagnostics.otel config. Best for:

✅ Operational metrics (tokens, costs, durations)
✅ Gateway health monitoring
✅ Log forwarding
✅ Simple setup (config only)

2. Custom Hook-Based Plugin (This Repo)¶

A plugin that hooks into the agent lifecycle for deeper tracing. Best for:

✅ Connected distributed traces
✅ Per-tool-call spans
✅ Request lifecycle visibility
✅ Debugging agent behavior

Recommendation: Use both for complete observability.

Quick Start¶

Official Plugin (5 minutes)¶

Add to ~/.openclaw/openclaw.json:

{
  "diagnostics": {
    "enabled": true,
    "otel": {
      "enabled": true,
      "endpoint": "http://localhost:4318",
      "serviceName": "openclaw-gateway",
      "traces": true,
      "metrics": true,
      "logs": true
    }
  }
}

Then restart:

openclaw gateway restart

Custom Plugin (Additional)¶

Clone the repo
Add to plugins.load.paths
Configure in plugins.entries.otel-observability
Clear jiti cache and restart

See Getting Started for detailed instructions.

What Gets Captured¶

Official Plugin (`diagnostics-otel`, built into the Gateway)¶

Signal	Data
Metrics	`openclaw.tokens`, `openclaw.cost.usd`, `openclaw.run.duration_ms`, `openclaw.webhook.`, `openclaw.message.`, `openclaw.queue.`, `openclaw.session.` (all emitted by `diagnostics-otel`, not by this plugin)
Traces	Model usage, webhook processing, message processing, stuck sessions
Logs	All Gateway logs with severity, subsystem, code location

Custom Plugin (This Repo)¶

Signal	Data
Traces	`openclaw.request` → `openclaw.agent.turn` → `tool.*` (connected hierarchy)
Metrics	`openclaw.llm.tokens.{total,prompt,completion}`, `openclaw.llm.cost.usd`, `openclaw.tool.calls`, `openclaw.session.resets`, plus the OTel stable `gen_ai.*` set

Trace Structure Comparison¶

Official Plugin:

openclaw.model.usage (standalone span)
openclaw.webhook.processed (standalone span)
openclaw.message.processed (standalone span)

Custom Plugin:

openclaw.request (root span - full lifecycle)
├── openclaw.agent.turn (child)
│   ├── tool.Read (child)
│   ├── tool.exec (child)
│   └── tool.Write (child)

Supported Backends¶

Works with any OTLP-compatible backend:

Dynatrace — Direct OTLP ingest
Grafana — Tempo, Loki, Mimir
Jaeger — Distributed tracing
Prometheus + Grafana — Metrics
Honeycomb, New Relic, Datadog — Cloud platforms
Local OTel Collector — Self-hosted

Documentation¶

Getting Started — Setup in 5 minutes
Configuration — All options explained
Architecture — How it works
Limitations — Known constraints
Telemetry Reference — Metric/trace details

Security Monitoring¶

Security Detection — Real-time threat detection
Tetragon Integration — Kernel-level monitoring

The plugin includes real-time security detection for:

Detection	Severity	What It Catches
Sensitive File Access	Critical	Credentials, SSH keys, .env files
Prompt Injection	High	Social engineering attacks on the AI
Dangerous Commands	Critical	Data exfiltration, rm -rf, crypto mining
Token Spike Anomaly	Warning	Unusual usage patterns

Combined with Tetragon kernel monitoring, this provides defense-in-depth security observability.

Source¶

Official plugin: Built into OpenClaw v2026.2.0+
Custom plugin: github.com/henrikrexed/openclaw-observability-plugin