trace-find

Experimental: This operator is experimental and its semantics may change. Find traces matching structural patterns across spans and correlated logs. Evaluates parent-child relationships between spans within each trace and returns matching traces. Optional output clauses (summarize, project, where) control what data is extracted from each matching trace. Predicates inside { } blocks use standard KQL where-clause syntax.

Structural operators define the required relationship between spans: >> (descendant), > (child), << (ancestor), < (parent), ~ (sibling), :: (has correlated log — shorthand for > targeting log fields).

Logs as span children: When the input includes log records (e.g., union otel_logs, spans), logs are treated as children of the span they're attached to (via shared span_id). All structural operators work naturally with logs — >> finds logs as descendants, > finds them as direct children. The :: operator is convenient shorthand for > when the RHS predicate targets log-specific fields like body or severity_text. Logs are leaf nodes and cannot have children.

Composition operators combine independent structural checks at the trace level: and (both must hold), or (either must hold). so { A } and { B } returns traces that has a span matching A and a span matching B. Precedence (tightest first): :: > structural (>>, >, <<, <, ~) > and/or. Search predicates: The search keyword can be used inside { } blocks for full-text matching: { search "error" } is equivalent to { * has "error" }. All search syntax is supported, including column-scoped search ({ search body:"timeout" }).

Chaining is supported: { A } >> { B } >> { C } is desugared into { A } >> { B } and { B } >> { C } — each structural operator is evaluated independently. This works for any depth: { A } >> { B } >> { C } >> { D } becomes three independent structural checks.

Empty braces {} match all spans. Leading :: is shorthand for {} ::.

Time window (within): The within <duration> clause (default: 5 minutes) sets the time bin size for incremental trace processing. The engine divides the query range into bins of this size and processes them in a streaming fashion. To handle traces straddling bin boundaries, a 3-bucket coordinator window ensures that discovery for adjacent bins completes before collection — so with within 5m, the effective discovery window is 15 minutes. Set within to at least the expected duration of the traces you want to find. Shorter windows are faster and use less memory. Use within 1h for long-running traces, or within 30s for low-latency microservice traces. See Compared to TraceQL for details on the execution model.

Output clauses control what trace-find returns. They follow the structural predicates and are mutually exclusive (except where which composes with the others):

summarize agg1, agg2, ... — Aggregate all rows of each matching trace. Grouping by trace_id is implicit. Supports all KQL aggregate functions (count, countif, make_set, avg, min, max, take_anyif, arg_min, dcount, etc.).
summarize agg by col1, col2 — Group aggregations by additional columns beyond trace_id. Produces multiple rows per trace.
project col1, col2, ... — Emit individual span rows from matching traces with only the specified columns. Returns one row per span, not per trace. Mutually exclusive with summarize.
where agg() op literal — Filter traces by aggregate conditions before output. Supports count() > N, countif(...) > N, dcount(...) > N, etc. Multiple conditions can be combined with and/or. Composes with summarize or project.

When no output clause is given, the default output is equivalent to writing: summarize root_name=take_anyif(name, isnull(parent_span_id)), services=make_set(resource.attributes.service.name), spans=count(), start_time=min(start_time), end_time=max(start_time), duration=max(start_time) - min(start_time)

Gotchas:

Predicates in the same { ... } block apply to the same row/span. Splitting them across multiple blocks changes the meaning. For example, { resource.attributes.service.name == “user-service” and status.code == “ERROR” } means one span must satisfy both predicates (only the erroring user-service span), while { resource.attributes.service.name == “user-service” } and { status.code == “ERROR” } only means the trace contains a user-service span and an error span somewhere — they can be different spans.
and/or across separate { ... } blocks are trace-level existence checks, not row-level conjunctions. trace-find { A } and { B } means “there exists a row matching A and there exists a row matching B in the same trace.” It does not require the same row to match both.
Structural operators (>>, >, <<, <, ~) require an actual tree relationship. trace-find { A } >> { B } means the B match must have an ancestor matching A. This is not a general “filter the result set further” operator.
>> does not match when the RHS is itself a root span. Root spans have no ancestor. For example, trace-find { resource.attributes.service.name == “api-gateway” } >> { name == “GET /users” } does not match when GET /users is the root span of the trace, even though it belongs to api-gateway — the root span has no ancestor.
Excluding traces by root span is usually a post-filter on root_name, not another structural clause. If you want “error traces whose root is not POST /pay”, write: trace-find { status.code == “ERROR” } | where root_name != “POST /pay”. Writing { name != “POST /pay” } and { status.code == “ERROR” } is weaker (matches any non-root span with a different name), and { name != “POST /pay” } >> { status.code == “ERROR” } only works when the error span is a descendant of a non-POST /pay span.

This function is inspired by TraceQL, but uses KQL where-clause syntax for predicates. See Compared to TraceQL for a detailed comparison.

Syntax

trace-find within <duration> { pred1 } >> { pred2 }

Set the time window for trace collection. Traces whose spans span more than this duration may be incomplete. Shorter windows are faster and use less memory. Default is 5 minutes.

Parameters

Name	Description
duration	Maximum trace duration, e.g. `5m`, `30s`, `1h`. Default: `5m`

Syntax

trace-find { pred1 } >> { pred2 }

Find traces where a span matching pred1 has a descendant span matching pred2. The >> operator walks the parent chain at any depth.

Parameters

Name	Description
pred1	Predicate on ancestor span (KQL where-clause expression)
pred2	Predicate on descendant span (KQL where-clause expression)

Syntax

trace-find { pred1 } > { pred2 }

Find traces where a span matching pred1 has a direct child span matching pred2 (single parent-child hop).

Parameters

Name	Description
pred1	Predicate on parent span
pred2	Predicate on child span

Syntax

trace-find { pred } :: { log_pred }

Find traces where a span matching pred has correlated log records matching log_pred. Correlation is via shared span_id (OTel log-span link).

Parameters

Name	Description
pred	Predicate on span attributes
log_pred	Predicate on correlated log attributes

Syntax

trace-find { A } >> { B } and { C } >> { D }

Compose independent structural checks with and/or. The trace must satisfy both relationships. Use this to express complex multi-hop patterns.

Parameters

Name	Description
A, B	First structural relationship (ancestor-descendant)
C, D	Second structural relationship (independent check)

Syntax

trace-find { pred1 } >> { pred2 } summarize agg1, agg2, ...

Extract user-defined aggregations from each matching trace's rows. The summarize clause accepts any KQL aggregation expressions. Grouping by trace_id is implicit — do not include by trace_id. All rows for matching traces (spans and logs if unioned) feed into the aggregation, not just predicate-matched rows.

Parameters

Name	Description
pred1, pred2	Structural predicates (any operator)
agg1, agg2, ...	KQL aggregation expressions (count, make_set, countif, avg, etc.)

Syntax

trace-find { pred1 } >> { pred2 } summarize agg by col1, col2

Group aggregations by additional columns beyond trace_id. Produces multiple rows per trace — one per unique combination of (trace_id, col1, col2, ...). The by clause works exactly like in the regular summarize operator.

Parameters

Name	Description
agg	Aggregation expression
col1, col2	Columns to group by (in addition to implicit trace_id)

Syntax

trace-find { pred1 } >> { pred2 } where agg() op literal

Filter traces by aggregate conditions. Only traces where the aggregate value satisfies the comparison are included in the output. Multiple conditions can be combined with and/or. Composes with summarize or project — the where filter is applied first.

Parameters

Name	Description
agg()	An aggregate function (count, countif, dcount, min, max, sum, avg, etc.)
op	Comparison operator: `>`, `>=`, `<`, `<=`, `==`, `!=`
literal	Threshold value to compare against

Syntax

trace-find { pred1 } >> { pred2 } project col1, col2, ...

Emit individual span rows from matching traces with only the specified columns. Unlike summarize, this returns one row per span (not per trace). Mutually exclusive with summarize. Only projected columns are buffered in memory, keeping resource usage proportional to the projection width.

Parameters

Name	Description
col1, col2, ...	Column names to include in the output

Examples

Example 1 — Find traces where an api-gateway span has a downstream error

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	0	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 2 — Count spans and collect services per matching trace

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} summarize spans = count(), services = make_set(
  resource.attributes.service.name
)

trace_id (string)	spans (long)	services (dynamic)
aaaa1111aaaa1111aaaa1111aaaa1111	5	["api-gateway","user-service"]

Example 3 — Compute error ratio per matching trace

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} summarize errors = countif(
  status.code == "ERROR"
), total = count()

trace_id (string)	errors (long)	total (long)
aaaa1111aaaa1111aaaa1111aaaa1111	1	5

Example 4 — Extract root span name (earliest start_time)

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} summarize root_name = arg_min(
  name,
  start_time
)

Example 5 — Search predicate: full-text match inside predicates

spans
| trace-find {search "internal server error"} >> {status.code == "ERROR"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)

Example 6 — Logs as descendants (no :: needed)

union otel_logs, spans
| trace-find {resource.attributes.service.name == "user-service"} >> {body has "OutOfMemory"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	1	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 7 — Direct server-to-client hop

spans
| trace-find {kind == "SERVER"} > {kind == "CLIENT"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02
bbbb2222bbbb2222bbbb2222bbbb2222	POST /pay	["payment-service"]	2	2024-01-01T10:01:00Z	2024-01-01T10:01:01Z	00:00:01
cccc3333cccc3333cccc3333cccc3333	GET /health	["api-gateway"]	2	2024-01-01T10:02:00Z	2024-01-01T10:02:01Z	00:00:01

Example 8 — Error spans with correlated OOM log entries

union otel_logs, spans
| trace-find {status.code == "ERROR"} :: {body has "OutOfMemory"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	1	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 9 — Shorthand: any trace with an error log

union otel_logs, spans
| trace-find :: {severity_text == "ERROR"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)

Example 10 — Traces touching both api-gateway and user-service

spans
| trace-find {resource.attributes.service.name == "api-gateway"} and {resource.attributes.service.name == "user-service"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	0	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 11 — Same-row filtering: keep all predicates in one block

spans
| trace-find {resource.attributes.service.name == "user-service" and status.code == "ERROR"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	0	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 12 — Trace-level filtering: predicates in separate blocks match different spans

spans
| trace-find {resource.attributes.service.name == "user-service"} and {status.code == "ERROR"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	0	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 13 — Exclude traces by root span after trace-find

spans
| trace-find {status.code == "ERROR"}
| where root_name != "POST /pay"

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	0	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 14 — Descendant search with log correlation

union otel_logs, spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} :: {body has "OutOfMemory"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)

Example 15 — Three-level chain: gateway → service → database

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {resource.attributes.service.name == "user-service"} >> {name == "SELECT users"}

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)
aaaa1111aaaa1111aaaa1111aaaa1111	GET /users	["api-gateway","user-service"]	5	0	2024-01-01T10:00:00Z	2024-01-01T10:00:02Z	00:00:02

Example 16 — Count error logs per trace (logs + spans input)

union otel_logs, spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} summarize err_logs = countif(
  severity_text == "ERROR"
), spans = countif(isnotnull(span_id))

trace_id (string)	err_logs (long)	spans (long)
aaaa1111aaaa1111aaaa1111aaaa1111	0	5

Example 17 — Count spans per service within each matching trace

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} summarize spans = count() by resource.attributes.service.name

trace_id (string)	__groupby_0 (dynamic)	spans (long)
aaaa1111aaaa1111aaaa1111aaaa1111	"api-gateway"	3
aaaa1111aaaa1111aaaa1111aaaa1111	"user-service"	2

Example 18 — Only traces with more than 10 spans

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} where count() > 10

trace_id (string)	root_name (string)	services (dynamic)	spans (long)	logs (long)	start_time (datetime)	end_time (datetime)	duration (timespan)

Example 19 — Only traces with errors, then summarize

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} where countif(
  status.code == "ERROR"
) > 0 summarize services = make_set(resource.attributes.service.name)

trace_id (string)	services (dynamic)

Example 20 — Emit matching spans with selected columns

spans
| trace-find {resource.attributes.service.name == "api-gateway"} >> {status.code == "ERROR"} project name, resource.attributes.service.name, start_time, duration

trace_id (string)	name (dynamic)	resource_attributes_service_name (dynamic)	start_time (dynamic)	duration (dynamic)
aaaa1111aaaa1111aaaa1111aaaa1111	"GET /users"	"api-gateway"	2024-01-01T10:00:00Z	00:00:00.1500000
aaaa1111aaaa1111aaaa1111aaaa1111	"GET /users"	"user-service"	2024-01-01T10:00:01Z	00:00:00.1000000
aaaa1111aaaa1111aaaa1111aaaa1111	"SELECT users"	"user-service"	2024-01-01T10:00:02Z	00:00:00.0500000
aaaa1111aaaa1111aaaa1111aaaa1111	"cache lookup"	"api-gateway"	2024-01-01T10:00:00Z	00:00:00.0100000
aaaa1111aaaa1111aaaa1111aaaa1111	"call user-svc"	"api-gateway"	2024-01-01T10:00:01Z	00:00:00.1200000

On this page