Python: Add OpenTelemetry instrumentation to ClaudeAgent (#4278) by amitmukh · Pull Request #4326 · microsoft/agent-framework

amitmukh · 2026-02-26T17:18:30Z

Summary

Fixes Python: [Bug]: ClaudeAgent missing telemetry - inherits from BaseAgent instead of Agent #4278: ClaudeAgent inherits from BaseAgent directly, bypassing AgentTelemetryLayer, so enable_instrumentation() has no effect
Adds inline OpenTelemetry telemetry to ClaudeAgent.run() using the same observability helpers as AgentTelemetryLayer
Covers both streaming and non-streaming paths with invoke_agent spans, duration tracking, exception capture, and sensitive data support

Approach

Instead of changing the inheritance chain (which would cause MRO conflicts since ClaudeAgent defines its own run() and doesn't use a standard chat client), this PR integrates telemetry directly into ClaudeAgent.run() using the framework's existing observability helpers (_get_span, _get_span_attributes, _capture_messages, _capture_response, capture_exception). This is the safest approach with zero risk of breaking existing behavior.

Changes

python/packages/claude/agent_framework_claude/_agent.py: Add telemetry wrapping in run(), with two new private methods _run_with_telemetry() and _run_with_telemetry_stream()
python/packages/claude/tests/test_claude_agent.py: Add 5 new tests covering span emission, disabled instrumentation, streaming spans, exception capture, and provider name verification

Test plan

All 49 existing tests pass unchanged
5 new telemetry tests pass (54 total)
Verify invoke_agent span appears in App Insights when enable_instrumentation() is called
Verify no spans are emitted when instrumentation is disabled

Future Work: Tool-level telemetry granularity

This PR provides the outer invoke_agent span for ClaudeAgent. However, visibility into
individual tool calls (Read, Write, Bash, etc.) and per-LLM-call spans that happen inside
the Claude CLI subprocess is not possible at the MAF layer — these are opaque to the framework.

A related issue has been raised upstream: anthropics/claude-agent-sdk-python#611
requesting OpenTelemetry instrumentation in the Claude Agent SDK itself.

Once Anthropic ships SDK-level telemetry, a follow-up PR can wire those events as child spans
under the invoke_agent span created here. The trace context parent is already in place —
the integration should be a small, targeted change once the Anthropic SDK defines its
telemetry surface.

Note: Custom @tool-decorated functions already get tool-level spans today via
FunctionTool.invoke() in core. Only Claude's built-in CLI tools require the upstream fix.

) Add inline telemetry to ClaudeAgent.run() so that enable_instrumentation() emits invoke_agent spans and metrics. Covers both streaming and non-streaming paths using the same observability helpers as AgentTelemetryLayer. Adds 5 unit tests for telemetry behavior. Co-Authored-By: amitmukh <amimukherjee@microsoft.com>

markwallace-microsoft · 2026-02-26T17:21:35Z

Python Test Coverage Report •

File	Stmts	Miss	Cover	Missing
TOTAL	22222	2758	87%

report-only-changed-files is enabled. No files were changed during this commit :)

Python Unit Test Overview

Tests	Skipped	Failures	Errors	Time
4719	247 💤	0 ❌	0 🔥	1m 17s ⏱️

Copilot

Pull request overview

This PR adds OpenTelemetry span emission to ClaudeAgent.run() so enable_instrumentation() affects Claude agents the same way it affects core Agent implementations, covering both streaming and non-streaming execution paths.

Changes:

Wrap ClaudeAgent.run() with OTEL span creation, duration tracking, response capture, and exception capture.
Add streaming-specific span finalization via ResponseStream cleanup hooks and weakref.finalize.
Add new Claude telemetry unit tests for enabled/disabled instrumentation, streaming, exceptions, and provider name.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
python/packages/claude/agent_framework_claude/_agent.py	Adds inline OTEL instrumentation to `ClaudeAgent.run()` via new `_run_with_telemetry*` helpers.
python/packages/claude/tests/test_claude_agent.py	Adds unit tests validating ClaudeAgent telemetry behavior across key run paths.

python/packages/claude/agent_framework_claude/_agent.py

python/packages/claude/tests/test_claude_agent.py

- Add justification comment for private observability API imports - Pass system_instructions to capture_messages for system prompt capture - Use monkeypatch instead of try/finally for test global state isolation Co-Authored-By: amitmukh <amitmukh@users.noreply.github.com> Co-Authored-By: Claude <noreply@anthropic.com>

eavanvalkenburg · 2026-02-27T06:58:36Z

I would prefer this to adopt the agent telemetry layer instead is reimplementing a bunch, if that means we need to do some updates to it thats fine, but otel is a changing spec so we don't want to do double work

Restructure ClaudeAgent to inherit from AgentTelemetryLayer via a _ClaudeAgentRunImpl mixin, eliminating duplicated telemetry code and private API imports. MRO: ClaudeAgent → AgentTelemetryLayer → _ClaudeAgentRunImpl → BaseAgent - Remove inline _run_with_telemetry / _run_with_telemetry_stream methods - Remove private observability helper imports (_capture_messages, etc.) - Add default_options property mapping system_prompt → instructions - Net -105 lines by reusing core telemetry layer Co-Authored-By: amitmukh <amitmukh@users.noreply.github.com> Co-Authored-By: Claude <noreply@anthropic.com>

amitmukh · 2026-02-27T17:00:21Z

I would prefer this to adopt the agent telemetry layer instead is reimplementing a bunch, if that means we need to do some updates to it thats fine, but otel is a changing spec so we don't want to do double work

Restructured to adopt AgentTelemetryLayer via MRO instead of inlining telemetry.

Changes in latest commit:

ClaudeAgent now inherits from AgentTelemetryLayer via a _ClaudeAgentRunImpl mixin
MRO: ClaudeAgent → AgentTelemetryLayer.run() → _ClaudeAgentRunImpl.run()
Removed all private API imports (_capture_messages, _get_span, etc.)
Removed duplicated _run_with_telemetry / _run_with_telemetry_stream methods
Added default_options property to map system_prompt → instructions for telemetry layer compatibility
Net -105 lines

…ryLayer.run() Remove explicit `options` parameter from mixin's run() signature and extract it from **kwargs to match AgentTelemetryLayer's signature. Also align overload return types (ResponseStream, Awaitable) to match. Co-Authored-By: Claude <noreply@anthropic.com>

python/packages/claude/agent_framework_claude/_agent.py

Copilot AI review requested due to automatic review settings February 26, 2026 17:18

markwallace-microsoft added the python label Feb 26, 2026

Copilot started reviewing on behalf of amitmukh February 26, 2026 17:19 View session

Copilot AI reviewed Feb 26, 2026

View reviewed changes

amitmukh mentioned this pull request Feb 27, 2026

[FEATURE] OpenTelemetry distributed tracing support (spans) for Claude Agent SDK anthropics/claude-agent-sdk-python#611

Open

dmytrostruk and others added 2 commits February 27, 2026 11:53

Merge branch 'main' into fix/claude-telemetry

9dac5de

dmytrostruk reviewed Feb 27, 2026

View reviewed changes

python/packages/claude/agent_framework_claude/_agent.py Outdated Show resolved Hide resolved

python/packages/claude/agent_framework_claude/_agent.py Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: Add OpenTelemetry instrumentation to ClaudeAgent (#4278)#4326

Python: Add OpenTelemetry instrumentation to ClaudeAgent (#4278)#4326
amitmukh wants to merge 5 commits intomicrosoft:mainfrom
amitmukh:fix/claude-telemetry

amitmukh commented Feb 26, 2026 •

edited

Loading

Uh oh!

markwallace-microsoft commented Feb 26, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

eavanvalkenburg commented Feb 27, 2026

Uh oh!

amitmukh commented Feb 27, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

amitmukh commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Approach

Changes

Test plan

Future Work: Tool-level telemetry granularity

Uh oh!

markwallace-microsoft commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Python Unit Test Overview

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

eavanvalkenburg commented Feb 27, 2026

Uh oh!

amitmukh commented Feb 27, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

amitmukh commented Feb 26, 2026 •

edited

Loading

markwallace-microsoft commented Feb 26, 2026 •

edited

Loading