fix: capture stderr in ProcessError for better debugging (#641) by MaxwellCalkin · Pull Request #658 · anthropics/claude-agent-sdk-python

MaxwellCalkin · 2026-03-08T21:31:38Z

Summary

Fixes #641 — ProcessError now includes the actual CLI stderr output instead of a generic placeholder message.

Problem: When the CLI subprocess exits with a non-zero exit code, ProcessError was raised with stderr="Check stderr output for details" — a hardcoded string that provides no useful debugging information. Worse, stderr was only piped at all when the user explicitly provided a stderr callback or enabled debug mode, so in the common case the real error output was lost entirely.

Root cause in subprocess_cli.py:

stderr was conditionally piped (only with callback or debug mode)
When piped, stderr was consumed by _handle_stderr() for callbacks but never retained
ProcessError always used a hardcoded placeholder string

Fix:

Always pipe stderr from the subprocess so error output is available regardless of user configuration
Buffer all stderr lines in self._stderr_buffer (in addition to invoking existing user callbacks / debug output)
Use the captured buffer when constructing ProcessError, so callers see the real CLI error
Reset the buffer in close() to prevent memory leaks

Before

ProcessError: Command failed with exit code 1 (exit code: 1)
Error output: Check stderr output for details

After

ProcessError: Command failed with exit code 1 (exit code: 1)
Error output: Error: No conversation found with session ID ab2c985b

Test plan

ruff check passes — no lint issues
mypy passes — no type errors
Existing test suite passes (no behavioral changes to happy path)
Manual test: trigger a CLI error (e.g., resume non-existent session) and verify ProcessError.stderr contains the real error message

🤖 Generated with Claude Code

) Previously, stderr was only piped when the user provided a callback or enabled debug mode. When the CLI process exited with an error, ProcessError was raised with the generic message "Check stderr output for details" instead of the actual error text. This change: - Always pipes stderr from the subprocess - Buffers all stderr lines in _stderr_buffer alongside existing callbacks - Uses the captured stderr content in ProcessError when the process fails - Resets the buffer on close() to prevent memory leaks Fixes anthropics#641 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

c3-rambandi · 2026-03-10T23:05:33Z

amazing, let's merge this asap

b657bm7n98-hub · 2026-03-19T05:40:33Z

Is this the same as #529?

jmehnle · 2026-03-30T04:44:35Z

#641 was (inappropriately) closed, but this PR hasn't even been merged, let alone released. What's blocking this from being merged?

km-anthropic · 2026-03-31T19:36:04Z

@claude review

claude · 2026-03-31T19:40:36Z

src/claude_agent_sdk/_internal/transport/subprocess_cli.py


        # Use exit code for error detection
        if returncode is not None and returncode != 0:
+            captured_stderr = "\n".join(self._stderr_buffer) if self._stderr_buffer else None
            self._exit_error = ProcessError(
                f"Command failed with exit code {returncode}",
                exit_code=returncode,
-                stderr="Check stderr output for details",
+                stderr=captured_stderr,


🟡 Race condition: after await self._process.wait() returns (line 583), self._stderr_buffer is read immediately (line 589) without waiting for _handle_stderr to finish consuming remaining pipe data. The stderr task group is only awaited in close(), which runs after the ProcessError is already raised with potentially incomplete stderr. Consider awaiting/cancelling the stderr task group between process.wait() and reading the buffer.

Extended reasoning...

Bug Analysis

_handle_stderr runs as a separate task inside self._stderr_task_group, started at line 393 during connect(). In _read_messages_impl, after the stdout stream is exhausted, the code awaits self._process.wait() at line 583, then immediately reads self._stderr_buffer at line 589 to construct a ProcessError. There is no synchronization point between these two operations to ensure _handle_stderr has finished.

Code Path

When the subprocess exits, the write end of the stderr pipe is closed. However, there may still be unread data in the kernel pipe buffer. _handle_stderr needs at least one more scheduling turn (an await yield point) to read these final bytes via async for line in self._stderr_stream and append them to self._stderr_buffer. But after process.wait() returns, the code proceeds synchronously — if returncode is not None and returncode != 0: then "\n".join(self._stderr_buffer) — with no yield point, giving the stderr task no opportunity to run.

The _stderr_task_group is cancelled and awaited in close() (lines 460-464), but close() runs after the ProcessError has already been raised with the potentially incomplete buffer contents.

Step-by-step proof

CLI process writes an error message to stderr (e.g., "Error: No conversation found with session ID ab2c985b") and exits with code 1.

The stdout async iterator in _read_messages_impl finishes (no more stdout data).

await self._process.wait() returns with returncode=1.

The event loop does NOT schedule _handle_stderr between wait() returning and the next line executing.

self._stderr_buffer is read — it may be missing the final stderr lines that are still sitting in the pipe buffer.

ProcessError is raised with incomplete stderr.

Only later, in close(), is the stderr task group properly awaited.

Impact

In practice, the window for this race is small: _handle_stderr runs concurrently during the entire stdout-reading phase with many yield points, so for typical short error messages it will have already consumed them. The most likely scenario for data loss is when the process writes to stderr and exits immediately with little or no stdout output, creating a tight race. Since the entire purpose of this PR is to capture stderr for better error reporting, it would be worth closing this gap.

Suggested Fix

After await self._process.wait() and before reading self._stderr_buffer, cancel and await the stderr task group:

if self._stderr_task_group: self._stderr_task_group.cancel_scope.cancel() await self._stderr_task_group.__aexit__(None, None, None) self._stderr_task_group = None

This ensures all buffered pipe data has been consumed before constructing the ProcessError.

claude · 2026-03-31T19:40:36Z

src/claude_agent_sdk/_internal/transport/subprocess_cli.py

@@ -429,6 +425,9 @@ async def _handle_stderr(self) -> None:
                if not line_str:
                    continue



🟡 Nit: _stderr_buffer grows without bound since every stderr line is appended (line 427) but no size cap is applied, unlike the stdout JSON buffer which is guarded by _max_buffer_size (line 555). Since stderr is now always piped, consider capping the buffer (e.g., keep only the last N lines or apply _max_buffer_size) as a defensive measure.

Extended reasoning...

What the bug is

The new self._stderr_buffer list (initialized at line 59) appends every non-empty stderr line unconditionally at line 427, but there is no size limit enforced on this buffer. The existing _max_buffer_size field is only checked against the stdout JSON buffer (line 555), not the stderr buffer. The buffer is only cleared when close() is called (line 487).

Why the surface area expanded

Before this PR, stderr was only piped when a user-provided callback existed or debug mode was enabled (should_pipe_stderr = self._options.stderr is not None or "debug-to-stderr" in self._options.extra_args). Now stderr is always piped (stderr_dest = PIPE), meaning _handle_stderr() always runs and always appends to the buffer regardless of user configuration.

Step-by-step proof

User creates a SubprocessCLITransport and calls connect().

Since stderr_dest = PIPE (always), _handle_stderr() is started as an async task.

For each non-empty line read from stderr, self._stderr_buffer.append(line_str) executes at line 427.

There is no check like if len(self._stderr_buffer) > some_limit — every line is retained.

The buffer is only reset in close() at line 487 (self._stderr_buffer = []).

If the subprocess produces a large volume of stderr output before close() is called, the list grows without bound.

Practical impact

The practical risk is low. The subprocess is the Claude CLI, which produces limited stderr output proportional to conversation activity. Each transport instance is scoped to a single process_query() call, and close() is called when done. Even in debug mode, output is bounded by conversation length. This makes memory exhaustion unlikely in practice.

Recommended fix

Apply the same _max_buffer_size concept to the stderr buffer, or keep only the last N lines (since only the tail of stderr is typically useful for error diagnostics). For example, track the total byte length of the buffer and drop older entries when the limit is exceeded. This would be consistent with how the stdout JSON buffer is already protected and would add a simple defensive guard against any edge cases.

claude · 2026-03-31T19:40:36Z

src/claude_agent_sdk/_internal/transport/subprocess_cli.py


-            # Pipe stderr if we have a callback OR debug mode is enabled
-            should_pipe_stderr = (
-                self._options.stderr is not None
-                or "debug-to-stderr" in self._options.extra_args
-            )
-
-            # For backward compat: use debug_stderr file object if no callback and debug is on
-            stderr_dest = PIPE if should_pipe_stderr else None
+            # Always pipe stderr so we can capture it for error reporting.
+            # User callbacks and debug mode are still honored in _handle_stderr().
+            stderr_dest = PIPE


🟡 Behavioral regression: stderr output during successful CLI operations is now silently swallowed. Previously, when no stderr callback or debug mode was configured, stderr_dest was None so the subprocess inherited the parent stderr FD and CLI warnings/deprecation notices appeared in the terminal. Now stderr is always piped but _handle_stderr has no fallback — lines are buffered but never displayed, and the buffer is cleared in close(). Consider adding an else: sys.stderr.write(line_str + "\n") fallback in _handle_stderr when no callback/debug is configured.

Extended reasoning...

What the bug is

Before this PR, stderr_dest was conditionally set: it was PIPE only when a stderr callback was provided or debug mode was enabled, and None otherwise. When None, the subprocess inherited the parent process's stderr file descriptor, meaning any CLI warnings, deprecation notices, or diagnostic messages written to stderr would appear directly in the user's terminal.

After this PR, stderr_dest is unconditionally set to PIPE (line 372), and all stderr output flows through _handle_stderr(). However, _handle_stderr only has two branches for output: (1) invoke the user's stderr callback, or (2) write to debug_stderr when debug mode is active. There is no fallback else branch for the case where neither is configured.

Code path that triggers the issue

Consider a user who creates a ClaudeAgent with default options — no stderr callback, no debug-to-stderr in extra_args. The subprocess starts with stderr=PIPE. The _handle_stderr coroutine reads each line from stderr, appends it to self._stderr_buffer (line 429), then checks:

if self._options.stderr: — False, no callback set

elif "debug-to-stderr" in self._options.extra_args and self._options.debug_stderr: — False, no debug mode

No output is produced. The line sits in _stderr_buffer and is never displayed.

Step-by-step proof with a concrete example

User creates a ClaudeAgent with default options (no stderr callback, no debug mode)

The CLI subprocess is started with stderr=PIPE

During execution, the CLI writes a deprecation warning to stderr, e.g., "Warning: --verbose flag is deprecated, use --debug instead"

_handle_stderr reads the line, appends it to _stderr_buffer

Neither the callback branch nor the debug branch executes — the warning is silently buffered

The CLI completes successfully (exit code 0)

_read_messages_impl finishes without raising ProcessError (since exit code is 0)

close() is called, which resets self._stderr_buffer = [] (line 487)

The deprecation warning is permanently lost — it was never displayed and the buffer is now empty

Impact

Users who previously relied on seeing CLI stderr output in their terminal during normal (successful) operations will no longer see it. This includes warnings, deprecation notices, and any other diagnostic output the CLI emits to stderr. The stderr buffer is only used when constructing a ProcessError on non-zero exit, so all stderr from successful runs is silently discarded.

How to fix

Add a fallback else branch in _handle_stderr that writes to sys.stderr when no callback or debug mode is configured:

if self._options.stderr: self._options.stderr(line_str) elif "debug-to-stderr" in self._options.extra_args and self._options.debug_stderr: self._options.debug_stderr.write(line_str + "\n") if hasattr(self._options.debug_stderr, "flush"): self._options.debug_stderr.flush() else: sys.stderr.write(line_str + "\n") sys.stderr.flush()

This preserves the new stderr-capture behavior for error reporting while maintaining backward compatibility for terminal visibility.

qing-ant mentioned this pull request Mar 24, 2026

ProcessError loses real CLI error message during initialize() #641

Closed

claude bot reviewed Mar 31, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: capture stderr in ProcessError for better debugging (#641)#658

fix: capture stderr in ProcessError for better debugging (#641)#658
MaxwellCalkin wants to merge 1 commit intoanthropics:mainfrom
MaxwellCalkin:fix/capture-stderr-for-process-error

MaxwellCalkin commented Mar 8, 2026

Uh oh!

c3-rambandi commented Mar 10, 2026

Uh oh!

b657bm7n98-hub commented Mar 19, 2026

Uh oh!

jmehnle commented Mar 30, 2026

Uh oh!

km-anthropic commented Mar 31, 2026

Uh oh!

claude bot Mar 31, 2026

Uh oh!

claude bot Mar 31, 2026

Uh oh!

claude bot Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

		@@ -429,6 +425,9 @@ async def _handle_stderr(self) -> None:
		if not line_str:
		continue

Conversation

MaxwellCalkin commented Mar 8, 2026

Summary

Before

After

Test plan

Uh oh!

c3-rambandi commented Mar 10, 2026

Uh oh!

b657bm7n98-hub commented Mar 19, 2026

Uh oh!

jmehnle commented Mar 30, 2026

Uh oh!

km-anthropic commented Mar 31, 2026

Uh oh!

claude bot Mar 31, 2026

Choose a reason for hiding this comment

Bug Analysis

Code Path

Step-by-step proof

Impact

Suggested Fix

Uh oh!

claude bot Mar 31, 2026

Choose a reason for hiding this comment

What the bug is

Why the surface area expanded

Step-by-step proof

Practical impact

Recommended fix

Uh oh!

claude bot Mar 31, 2026

Choose a reason for hiding this comment

What the bug is

Code path that triggers the issue

Step-by-step proof with a concrete example

Impact

How to fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants