OpenTelemetry Collector

SideSeat includes a built-in OpenTelemetry collector optimized for AI agent development workflows. It receives OTLP traces via HTTP and gRPC, stores them in SQLite for efficient querying, and provides real-time streaming via SSE.

  • OTLP-compatible: Receives traces via standard OpenTelemetry protocol (HTTP JSON/Protobuf, gRPC)
  • Framework detection: Automatically detects LangChain, LlamaIndex, Strands, and other AI frameworks
  • GenAI field extraction: Extracts token usage, model info, and other GenAI-specific fields
  • Bounded memory: Configurable buffer limits prevent memory exhaustion
  • FIFO storage: Automatic cleanup when storage limits are reached
  • Real-time streaming: SSE endpoint for live trace updates
  • Efficient storage: SQLite with indexed columns for fast queries
| Endpoint | Method | Content-Type | Description |
|---|---|---|---|
| /otel/v1/traces | POST | application/json | OTLP JSON traces |
| /otel/v1/traces | POST | application/x-protobuf | OTLP Protobuf traces |
| localhost:4317 | gRPC | Protobuf | OTLP gRPC endpoint |
| Endpoint | Method | Description |
|---|---|---|
| /api/v1/traces | GET | List traces with filtering |
| /api/v1/traces/filters | GET | Get available filter options |
| /api/v1/traces/{trace_id} | GET | Get single trace details |
| /api/v1/traces/{trace_id} | DELETE | Delete a trace and all associated data |
| /api/v1/traces/{trace_id}/spans | GET | Get spans for a trace |
| /api/v1/spans | GET | Query spans with GenAI fields |
| /api/v1/spans/{span_id} | GET | Get span detail with events |
| /api/v1/spans/{span_id}/events | GET | Get events for a span |
| /api/v1/sessions | GET | List sessions with filtering |
| /api/v1/sessions/{session_id} | GET | Get single session details |
| /api/v1/sessions/{session_id} | DELETE | Delete a session and all associated data |
| /api/v1/sessions/{session_id}/traces | GET | Get traces for a session |
| Endpoint | Method | Description |
|---|---|---|
| /api/v1/traces/sse | GET | SSE stream of trace events |
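
For quick checks outside the UI, the query API can be called directly. A minimal sketch using Python and the requests library (an assumption, not a SideSeat dependency); the trace ID is a placeholder and responses are printed as returned:

import requests

BASE = "http://localhost:5001"

# List recent traces, optionally filtered (see the query endpoints above)
traces = requests.get(f"{BASE}/api/v1/traces", params={"service": "my-agent"})
traces.raise_for_status()
print(traces.json())

# Fetch the spans of one trace, using an ID taken from the listing above
trace_id = "abc123def456"  # placeholder trace ID
spans = requests.get(f"{BASE}/api/v1/traces/{trace_id}/spans")
spans.raise_for_status()
print(spans.json())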

All OTel settings are under the otel key in your config file:

{
  "otel": {
    "enabled": true,
    "grpc": {
      "enabled": true,
      "port": 4317
    },
    "retention": {
      "days": 30  // Optional: set to enable time-based cleanup
    }
  }
}

See Config Manager for the full configuration reference.
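
Python (OpenTelemetry SDK):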

from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter

# Configure exporter to send to SideSeat
exporter = OTLPSpanExporter(endpoint="http://localhost:5001/otel/v1/traces")
provider = TracerProvider()
provider.add_span_processor(BatchSpanProcessor(exporter))
trace.set_tracer_provider(provider)

# Create traces
tracer = trace.get_tracer(__name__)
with tracer.start_as_current_span("my-agent-operation"):
    # Your agent code here
    pass
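
Strands: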
from strands import Agent
from strands.models import BedrockModel
from strands.telemetry import StrandsTelemetry

# Configure telemetry to export to SideSeat
telemetry = StrandsTelemetry()
telemetry.setup_otlp_exporter(endpoint="http://localhost:5001/otel/v1/traces")

# Create agent with optional trace attributes
model = BedrockModel(model_id="us.anthropic.claude-haiku-4-5-20251001-v1:0")
agent = Agent(
    name="my-agent",
    model=model,
    trace_attributes={
        "session.id": "my-session-123",
        "user.id": "user-456",
    },
)

# Don't forget to flush telemetry before exit
# telemetry.tracer_provider.force_flush()
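
JavaScript (Node.js):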
const { NodeTracerProvider } = require('@opentelemetry/sdk-trace-node');
const { OTLPTraceExporter } = require('@opentelemetry/exporter-trace-otlp-http');
const { BatchSpanProcessor } = require('@opentelemetry/sdk-trace-base');

const exporter = new OTLPTraceExporter({
  url: 'http://localhost:5001/otel/v1/traces',
});

const provider = new NodeTracerProvider();
provider.addSpanProcessor(new BatchSpanProcessor(exporter));
provider.register();

For higher throughput, use the gRPC endpoint:

from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter
exporter = OTLPSpanExporter(endpoint="localhost:4317", insecure=True)

SideSeat automatically detects and normalizes spans from popular AI frameworks:

| Framework | Detection Method | Extracted Fields |
|---|---|---|
| LangChain | Scope name, attributes | Chain type, run ID |
| LangGraph | Scope name, attributes | Node, edge, state |
| LlamaIndex | Scope name, attributes | Query, response |
| Strands | Scope name, resource attrs | Cycle ID, agent info |
| OpenInference | Attribute prefix | Session ID, user ID |
| Generic GenAI | gen_ai.* attributes | Model, tokens, system |

The collector extracts and normalizes GenAI-specific fields:

| Field | Description |
|---|---|
| gen_ai_system | AI provider (openai, anthropic, etc.) |
| gen_ai_request_model | Requested model name |
| gen_ai_response_model | Actual model used |
| gen_ai_operation_name | Operation type (chat, completion) |
| gen_ai_agent_name | Agent name (for agent frameworks) |
| gen_ai_tool_name | Tool name (for tool calls) |
| usage_input_tokens | Input/prompt tokens |
| usage_output_tokens | Output/completion tokens |
| usage_total_tokens | Total tokens (computed if not provided) |
| usage_cache_read_tokens | Cache read tokens (Anthropic) |
| usage_cache_write_tokens | Cache write tokens (Anthropic) |
| time_to_first_token_ms | Time to first token (TTFT) |
| request_duration_ms | Total request duration |
| session_id | Session/conversation ID |
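
If your framework is not detected automatically, you can set GenAI attributes on spans yourself and the collector will still extract them. A minimal sketch with the OpenTelemetry Python SDK; gen_ai.system, gen_ai.operation.name, and gen_ai.request.model match the attribute names used elsewhere on this page, while the response-model and token-usage attribute names are assumptions taken from the OpenTelemetry GenAI semantic conventions:

from opentelemetry import trace

tracer = trace.get_tracer(__name__)

with tracer.start_as_current_span("chat my-model") as span:
    span.set_attribute("gen_ai.system", "openai")
    span.set_attribute("gen_ai.operation.name", "chat")
    span.set_attribute("gen_ai.request.model", "gpt-4o")
    # ... call the model here ...
    # Assumed attribute names, per the GenAI semantic conventions:
    span.set_attribute("gen_ai.response.model", "gpt-4o-2024-08-06")
    span.set_attribute("gen_ai.usage.input_tokens", 128)
    span.set_attribute("gen_ai.usage.output_tokens", 256)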

Span events (messages, tool calls, choices) are automatically categorized:

| Event Type | Role | Description |
|---|---|---|
| user_message | user | User input messages |
| assistant_message | assistant | Model responses |
| system_message | system | System prompts |
| tool_call | assistant | Tool/function calls |
| tool_result | tool | Tool execution results |
| choice | assistant | Completion choices with finish_reason |

Events include content_preview (first 500 chars) and tool correlation fields (tool_name, tool_call_id).
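
How these events are emitted depends on your framework or instrumentation. Purely as an illustration, the sketch below records a user message as a span event with the OpenTelemetry Python SDK; the event name gen_ai.user.message follows the GenAI semantic conventions and is an assumption here, not a name confirmed by SideSeat:

from opentelemetry import trace

tracer = trace.get_tracer(__name__)

with tracer.start_as_current_span("chat") as span:
    # Assumed event name, following the GenAI semantic conventions
    span.add_event(
        "gen_ai.user.message",
        attributes={"content": "What is the weather in Berlin?"},
    )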

All trace data is stored in a single SQLite database (data/sideseat.db) with indexed columns for efficient querying. Full span data is stored as JSON for complete access to all fields.

Storage is managed with optional time-based retention:

  • Time-based: If retention.days is set, data older than that many days is deleted automatically
  • Default: No retention limit (data kept forever)
  • Automatic: When enabled, retention check runs every check_interval_secs (default: 5 min)

Subscribe to trace events via Server-Sent Events:

const eventSource = new EventSource('http://localhost:5001/api/v1/traces/sse');

eventSource.onmessage = (event) => {
  const payload = JSON.parse(event.data);
  // Event types: NewSpan, SpanUpdated, TraceCompleted, HealthUpdate
  console.log('Event:', payload.event.type, payload.event.data);
};

The SSE endpoint enforces the following limits, all configurable:

  • Maximum connections: 100
  • Connection timeout: 1 hour
  • Keepalive interval: 30 seconds

SideSeat uses an Entity-Attribute-Value (EAV) storage pattern for flexible attribute filtering. This enables querying traces by any indexed attribute without schema changes.

Configure which attributes to extract and index:

{
  "otel": {
    "attributes": {
      "trace_attributes": [
        "environment",
        "deployment.environment",
        "service.version",
        "user.id",
        "session.id"
      ],
      "span_attributes": [
        "gen_ai.system",
        "gen_ai.operation.name",
        "gen_ai.request.model",
        "level"
      ],
      "auto_index_genai": true
    }
  }
}
  • trace_attributes: Attributes extracted from resource/span attributes and indexed at trace level
  • span_attributes: Attributes indexed per span
  • auto_index_genai: Automatically index all gen_ai.* attributes (default: true)
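
These filters only have something to match if your application emits the attributes in the first place. A minimal sketch with the OpenTelemetry Python SDK, using the attribute names from the config above; static values go on the resource, per-request values on individual spans:

from opentelemetry import trace
from opentelemetry.sdk.resources import Resource
from opentelemetry.sdk.trace import TracerProvider

# Static values go on the resource, which is attached to every span
resource = Resource.create({
    "service.name": "my-agent",
    "deployment.environment": "production",
    "service.version": "1.2.3",
})
provider = TracerProvider(resource=resource)
trace.set_tracer_provider(provider)

# Per-request values such as user.id and session.id go on individual spans
tracer = trace.get_tracer(__name__)
with tracer.start_as_current_span("handle-request") as span:
    span.set_attribute("user.id", "user-456")
    span.set_attribute("session.id", "my-session-123")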

The attribute filter API supports these operators:

| Operator | Description | Example |
|---|---|---|
| eq | Equals | {"key":"env","op":"eq","value":"prod"} |
| ne | Not equals | {"key":"env","op":"ne","value":"dev"} |
| contains | Contains substring | {"key":"user_id","op":"contains","value":"admin"} |
| starts_with | Starts with | {"key":"session_id","op":"starts_with","value":"sess_"} |
| in | In list | {"key":"env","op":"in","value":["prod","staging"]} |
| gt, lt, gte, lte | Numeric comparison | {"key":"latency","op":"gt","value":1000} |
| is_null | Attribute not present | {"key":"error","op":"is_null","value":null} |
| is_not_null | Attribute present | {"key":"user_id","op":"is_not_null","value":null} |

# List all traces
curl http://localhost:5001/api/v1/traces

# Filter by service name
curl "http://localhost:5001/api/v1/traces?service=my-agent"

# Filter by detected framework
curl "http://localhost:5001/api/v1/traces?framework=langchain"

# Filter traces where environment=production
curl "http://localhost:5001/api/v1/traces?attributes=%5B%7B%22key%22%3A%22environment%22%2C%22op%22%3A%22eq%22%2C%22value%22%3A%22production%22%7D%5D"
# Decoded: attributes=[{"key":"environment","op":"eq","value":"production"}]

# Filter by environment AND user_id
curl "http://localhost:5001/api/v1/traces?attributes=%5B%7B%22key%22%3A%22environment%22%2C%22op%22%3A%22eq%22%2C%22value%22%3A%22production%22%7D%2C%7B%22key%22%3A%22user_id%22%2C%22op%22%3A%22eq%22%2C%22value%22%3A%22user-123%22%7D%5D"
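
Hand-encoding the attributes parameter is error-prone, so it can help to build it programmatically. A minimal sketch in Python using the requests library (an assumption, not a SideSeat dependency); the filter objects follow the operator table above:

import json
import urllib.parse

import requests

filters = [
    {"key": "environment", "op": "eq", "value": "production"},
    {"key": "user_id", "op": "eq", "value": "user-123"},
]

# requests URL-encodes query parameters for you
resp = requests.get(
    "http://localhost:5001/api/v1/traces",
    params={"attributes": json.dumps(filters, separators=(",", ":"))},
)
print(resp.json())

# Or build the encoded query string yourself
encoded = urllib.parse.quote(json.dumps(filters, separators=(",", ":")))
print(f"http://localhost:5001/api/v1/traces?attributes={encoded}")
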
# Get a single trace by ID
curl http://localhost:5001/api/v1/traces/abc123def456

Discover available filter values for building UI dropdowns:

curl http://localhost:5001/api/v1/traces/filters

Response:

{
  "services": ["my-agent", "my-service"],
  "frameworks": ["langchain", "openai"],
  "attributes": [
    {
      "key": "environment",
      "key_type": "string",
      "entity_type": "trace",
      "sample_values": ["production", "staging", "development"]
    }
  ]
}

If traces are not showing up in SideSeat:

  1. Check that OTel is enabled: "otel": { "enabled": true }
  2. Verify the endpoint URL matches your exporter configuration
  3. Check the server logs for ingestion errors

If memory usage is too high, reduce the ingestion buffer sizes in your config:

{
  "otel": {
    "ingestion": {
      "channel_capacity": 500,
      "buffer_max_spans": 500,
      "buffer_max_bytes": 5242880
    }
  }
}

If the SQLite database grows too large:

  1. Reduce retention.days for a shorter retention period
  2. The database is then cleaned up automatically based on the retention settings