Telemetry Pipelines

A collector organizes telemetry into signal-specific pipelines, one each for traces, metrics, and logs. Each pipeline has receivers, optional processors, and exporters.

flowchart LR
  receiver["Receiver"]
  processor["Processor"]
  exporter["Exporter"]

  receiver --> processor
  processor --> exporter
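
The flow in the diagram can be sketched in a few lines of Python. This is only an illustrative model, not how any particular collector is implemented: the `Record`, `Processor`, and `Exporter` types and the `run_pipeline`, `drop_health_checks`, and `add_environment` functions are hypothetical names invented for this example.

```python
from typing import Callable, Iterable, List, Optional

# Hypothetical types for illustration; real collectors define their own.
Record = dict
Processor = Callable[[Record], Optional[Record]]  # return None to drop a record
Exporter = Callable[[List[Record]], None]


def run_pipeline(
    received: Iterable[Record],
    processors: List[Processor],
    exporters: List[Exporter],
) -> List[Record]:
    """Pass each received record through the processors in order, then fan out to every exporter."""
    kept: List[Record] = []
    for record in received:
        current: Optional[Record] = record
        for process in processors:
            current = process(current)
            if current is None:  # a processor filtered the record out
                break
        if current is not None:
            kept.append(current)
    for export in exporters:
        export(kept)
    return kept


def drop_health_checks(record: Record) -> Optional[Record]:
    """Filtering processor: discard records for the (assumed) health route."""
    return None if record.get("route_name") == "health" else record


def add_environment(record: Record) -> Record:
    """Modifying processor: attach an (assumed) environment attribute."""
    return {**record, "deployment.environment": "staging"}


exported: List[Record] = []
result = run_pipeline(
    received=[{"route_name": "checkout"}, {"route_name": "health"}],
    processors=[drop_health_checks, add_environment],
    exporters=[exported.extend],
)
```

Processor order matters in this model: a filter placed first keeps later processors from spending work on records that will be dropped anyway.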

Pipeline Components

Component	Role	Examples
Receiver	Accepts telemetry from apps	OTLP HTTP, StatsD TCP
Processor	Modifies, batches, filters, or samples telemetry	`batch`, `memory_limiter`, `attributes`
Exporter	Sends telemetry to a backend or exposes it for scrape	OTLP exporter, Prometheus exporter, debug exporter

Recommended Initial Pipelines

flowchart TB
  app["Application"]
  otlp["OTLP receiver :4318"]
  statsd["Optional StatsD TCP metrics receiver :9127"]
  traces_processors["Trace processors"]
  metrics_processors["Metrics processors"]
  logs_processors["Log processors"]
  traces["Trace exporter"]
  metrics_push["Metrics push exporter"]
  prometheus["Prometheus scrape exporter :9292"]
  logs["Log exporter"]

  app -->|"OTLP traces, metrics, logs"| otlp
  app -->|"StatsD metrics only"| statsd
  otlp -->|"traces"| traces_processors --> traces
  otlp -->|"metrics"| metrics_processors
  statsd -->|"metrics"| metrics_processors
  metrics_processors --> metrics_push
  metrics_processors --> prometheus
  otlp -->|"logs"| logs_processors --> logs

Start with direct application telemetry:

OTLP traces from the app SDK.
OTLP metrics from the app SDK, or StatsD TCP if that is what the app already supports.
Structured logs to stdout/stderr, or OTLP logs if your logging library supports them.
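
As a rough sketch, the topology in the diagram can be written down as one entry per signal and checked for obvious gaps. The component names below are placeholders copied from the diagram labels, not real collector component IDs, and `validate_pipelines` is a hypothetical helper rather than part of any collector.

```python
from typing import Dict, List

# One pipeline per signal, mirroring the diagram above.
# All component names are placeholders, not real component IDs.
PIPELINES: Dict[str, Dict[str, List[str]]] = {
    "traces": {
        "receivers": ["otlp"],
        "processors": ["trace_processors"],
        "exporters": ["trace_exporter"],
    },
    "metrics": {
        "receivers": ["otlp", "statsd"],  # statsd is optional
        "processors": ["metrics_processors"],
        "exporters": ["metrics_push", "prometheus"],
    },
    "logs": {
        "receivers": ["otlp"],
        "processors": ["log_processors"],
        "exporters": ["log_exporter"],
    },
}


def validate_pipelines(pipelines: Dict[str, Dict[str, List[str]]]) -> List[str]:
    """Return a list of problems; processors are optional, receivers and exporters are not."""
    problems: List[str] = []
    for signal, parts in pipelines.items():
        if not parts.get("receivers"):
            problems.append(f"{signal}: no receivers")
        if not parts.get("exporters"):
            problems.append(f"{signal}: no exporters")
    return problems


problems = validate_pipelines(PIPELINES)
```

Note that the metrics pipeline is the only one with two receivers and two exporters: both OTLP and StatsD metrics share the same processors before fanning out to the push and scrape exporters.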

Direct Metrics First

Prefer direct metrics from application code when you can change the code. Direct metrics are explicit, cheap, and easy to test.

Good direct metrics:

example.requests.completed
example.jobs.duration_ms
example.queue.depth
example.cache.hit

Use low-cardinality labels such as:

status
route_name
worker
deployment.environment
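
One way to keep label cardinality bounded is to reject label keys that are not on an agreed list. The sketch below uses only the Python standard library; `DirectCounter` is a hypothetical stand-in for whatever metrics SDK or StatsD client the application really uses, and the allowed keys are simply the labels listed above.

```python
from collections import Counter
from typing import Dict, Tuple

# Label keys taken from the list above; any other key is rejected.
ALLOWED_LABEL_KEYS = frozenset(
    {"status", "route_name", "worker", "deployment.environment"}
)

LabelSet = Tuple[Tuple[str, str], ...]


def _label_key(labels: Dict[str, str]) -> LabelSet:
    """Normalize labels so the same label set always maps to the same series."""
    return tuple(sorted(labels.items()))


class DirectCounter:
    """Hypothetical in-process counter, for illustration only."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._values: Counter = Counter()

    def add(self, amount: int, labels: Dict[str, str]) -> None:
        unknown = set(labels) - ALLOWED_LABEL_KEYS
        if unknown:
            raise ValueError(f"unexpected label keys: {sorted(unknown)}")
        self._values[_label_key(labels)] += amount

    def value(self, labels: Dict[str, str]) -> int:
        return self._values[_label_key(labels)]

    def series_count(self) -> int:
        """Number of distinct label sets seen so far."""
        return len(self._values)


requests_completed = DirectCounter("example.requests.completed")
requests_completed.add(1, {"status": "ok", "route_name": "checkout"})
requests_completed.add(1, {"status": "ok", "route_name": "checkout"})
requests_completed.add(1, {"status": "error", "route_name": "checkout"})
```

Because the counter is plain application code, a unit test can assert on its value directly, which is part of what makes direct metrics easy to test.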

Derived Metrics

Derived metrics can be useful, but they should not be the default for new work.

Source	Use when	Caution
Spans	You already have spans and need latency histograms or error counts	Span attributes used as metric labels can create high cardinality
Logs	You cannot change legacy code yet	Regex processing is expensive and fragile

When a derived metric proves valuable, prefer moving it into direct application instrumentation later.
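
To illustrate the span row of the table, here is a minimal sketch of deriving a latency histogram from finished spans. The span shape (a dict with `route_name`, `status`, and `duration_ms`), the bucket bounds, and the function name are all assumptions made for this example. Only two low-cardinality attributes become labels; a per-user attribute such as the hypothetical `user_id` is deliberately ignored.

```python
from bisect import bisect_left
from typing import Dict, Iterable, List, Tuple

# Hypothetical bucket upper bounds in milliseconds (inclusive).
# One extra overflow bucket catches everything above the last bound.
BUCKET_BOUNDS_MS = [50.0, 250.0, 1000.0]


def span_latency_histogram(
    spans: Iterable[dict],
) -> Dict[Tuple[str, str], List[int]]:
    """Count spans per (route_name, status) into latency buckets."""
    histogram: Dict[Tuple[str, str], List[int]] = {}
    for span in spans:
        labels = (span["route_name"], span["status"])
        counts = histogram.setdefault(labels, [0] * (len(BUCKET_BOUNDS_MS) + 1))
        # bisect_left gives the first bucket whose upper bound is >= the duration.
        counts[bisect_left(BUCKET_BOUNDS_MS, span["duration_ms"])] += 1
    return histogram


spans = [
    {"route_name": "checkout", "status": "ok", "duration_ms": 42.0, "user_id": "u-1"},
    {"route_name": "checkout", "status": "ok", "duration_ms": 300.0, "user_id": "u-2"},
    {"route_name": "checkout", "status": "error", "duration_ms": 1500.0, "user_id": "u-3"},
]
derived = span_latency_histogram(spans)
# derived[("checkout", "ok")] == [1, 0, 1, 0]
# derived[("checkout", "error")] == [0, 0, 0, 1]
```

If `user_id` were used as a label here, every user would produce a separate series, which is the cardinality problem the table warns about.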

Sampling

Sampling belongs in the collector when you need centralized control. A common policy shape is:

Keep all error traces.
Keep a small percentage of fast successful traces.
Keep a larger percentage of slow successful traces.

Record the sampling policy and its rationale in the collector config so that agents and humans can tell why some traces are absent.
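
The policy shape above can be sketched as a single decision function. This is an illustration under stated assumptions, not a real collector's sampling processor: the threshold, the two rates, and the function names are hypothetical. The decision needs the error status and total duration, so it can only be made once the trace has finished.

```python
import hashlib

# Hypothetical threshold and rates; tune them to your own traffic.
SLOW_THRESHOLD_MS = 500.0
FAST_SUCCESS_RATE = 0.05  # small percentage of fast successful traces
SLOW_SUCCESS_RATE = 0.50  # larger percentage of slow successful traces


def trace_fraction(trace_id: str) -> float:
    """Map a trace ID to a stable value in [0, 1).

    Hashing the ID (instead of calling random()) means every collector
    instance makes the same decision for the same trace.
    """
    digest = hashlib.sha256(trace_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") / 2**32


def keep_trace(trace_id: str, has_error: bool, duration_ms: float) -> bool:
    """Apply the three-rule policy: all errors, some fast successes, more slow successes."""
    if has_error:
        return True
    if duration_ms >= SLOW_THRESHOLD_MS:
        rate = SLOW_SUCCESS_RATE
    else:
        rate = FAST_SUCCESS_RATE
    return trace_fraction(trace_id) < rate
```

Keeping the rates as named constants next to a comment is one simple way to leave the rationale in the same place as the policy.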
