SDK: bound latest-user-message scan in Agent.step #1844

enyst · 2026-01-27T15:47:25Z

Summary

Tracks the latest user MessageEvent ID on conversation state to avoid scanning state.events when checking for hook-blocked prompts.
Uses the stored ID in Agent.step to decide if a UserPromptSubmit hook blocked the most recent user message.
Adds regression coverage ensuring Agent.step does not iterate the event log when the stored ID is present.

Rationale

state.events can be file-backed/lazily-loaded. Materializing or iterating the full history to find the most recent user message forces deserialization of the entire log even though we only need a single message ID. Storing the latest user message ID in the state preserves the hook behavior without forcing a scan.

Testing

python -m pytest -q tests/test_agent_step_bounded_scan.py
pre-commit run -a

@enyst can click here to continue refining the PR

Agent Server images for this PR

• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant	Architectures	Base Image	Docs / Tags
java	amd64, arm64	`eclipse-temurin:17-jdk`	Link
python	amd64, arm64	`nikolaik/python-nodejs:python3.13-nodejs22`	Link
golang	amd64, arm64	`golang:1.21-bookworm`	Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:a56defa-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-a56defa-python \
  ghcr.io/openhands/agent-server:a56defa-python

All tags pushed for this build

ghcr.io/openhands/agent-server:a56defa-golang-amd64
ghcr.io/openhands/agent-server:a56defa-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:a56defa-golang-arm64
ghcr.io/openhands/agent-server:a56defa-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:a56defa-java-amd64
ghcr.io/openhands/agent-server:a56defa-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:a56defa-java-arm64
ghcr.io/openhands/agent-server:a56defa-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:a56defa-python-amd64
ghcr.io/openhands/agent-server:a56defa-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-amd64
ghcr.io/openhands/agent-server:a56defa-python-arm64
ghcr.io/openhands/agent-server:a56defa-nikolaik_s_python-nodejs_tag_python3.13-nodejs22-arm64
ghcr.io/openhands/agent-server:a56defa-golang
ghcr.io/openhands/agent-server:a56defa-java
ghcr.io/openhands/agent-server:a56defa-python

About Multi-Architecture Support

Each variant tag (e.g., a56defa-python) is a multi-arch manifest supporting both amd64 and arm64
Docker automatically pulls the correct architecture for your platform
Individual architecture tags (e.g., a56defa-python-amd64) are also available if needed

Co-authored-by: openhands <[email protected]>

github-actions · 2026-01-27T15:50:17Z

Coverage Report •

File	Stmts	Miss	Cover	Missing
openhands-sdk/openhands/sdk/agent
agent.py	244	49	79%	98, 102, 207–208, 212–213, 215, 221–222, 224, 226–227, 232, 235, 239–242, 277–279, 306–307, 314–315, 347, 400–401, 403, 443, 582–583, 588, 600–601, 606–607, 626–627, 629, 657–658, 664–665, 669, 677–678, 718, 725
openhands-sdk/openhands/sdk/conversation
state.py	176	6	96%	151, 290, 336–338, 453
openhands-sdk/openhands/sdk/conversation/impl
local_conversation.py	315	17	94%	264, 269, 297, 362, 404, 551–552, 555, 701, 709, 711, 721, 723–725, 869–870
TOTAL	16481	4810	70%

openhands-sdk/openhands/sdk/agent/agent.py

enyst · 2026-01-31T12:11:56Z

@OpenHands This PR's diff doesn't make sense to me. Why look up a single user message, when we could maybe save it somewhere or perhaps only its ID?

Think about it. Install yourself gh if you don't have it, and pull all comments to this PR, including inline review threads (so you see what this hook is supposed to do, too), and check also the linked issue so you understand the bigger picture.

Propose a fix and commit it here. Use remote gh. Then post a comment with what you did (don't tell me, post on this PR). Note that it will be rendered as markdown.

openhands-ai · 2026-01-31T12:12:05Z

I'm on it! enyst can track my progress at all-hands.dev

enyst · 2026-01-31T12:24:09Z

Update

I replaced the bounded event scan with a state-level tracker so we no longer touch at all for the hook check.

What changed

Conversation state now tracks the most recent user message ID ().
LocalConversation updates that field whenever it emits a user (including stop-hook feedback).
Agent.step reads and directly checks via , skipping any event log scan.
Remote conversation events cache now records the most recent user message ID during full sync and on new events, so the same lookup can be used when exposed in remote state in future.
Tests: added a new test to ensure relies only on the stored ID and never iterates events when is set, plus kept the bounded-iteration regression test.

Tests

============================= test session starts ==============================
platform linux -- Python 3.12.12, pytest-9.0.2, pluggy-1.6.0
rootdir: /workspace/project/software-agent-sdk
configfile: pyproject.toml
plugins: anyio-4.12.1
collected 2 items

tests/test_agent_step_bounded_scan.py .. [100%]

=============================== warnings summary ===============================
../../../openhands/micromamba/envs/openhands/lib/python3.12/site-packages/_pytest/config/init.py:1428
/openhands/micromamba/envs/openhands/lib/python3.12/site-packages/_pytest/config/init.py:1428: PytestConfigWarning: Unknown config option: asyncio_mode

self._warn_or_fail_if_strict(f"Unknown config option: {key}\n")

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
========================= 2 passed, 1 warning in 0.82s =========================

enyst · 2026-01-31T12:24:19Z

Update (correction)

I previously posted a comment that lost backticked identifiers. Reposting with proper markdown.

I replaced the bounded event scan with a state-level last_user_message_id tracker so we no longer touch state.events at all for the UserPromptSubmit hook check.

What changed

Conversation state now tracks the most recent user message ID (last_user_message_id).
LocalConversation updates that field whenever it emits a user MessageEvent (including stop-hook feedback).
Agent.step reads last_user_message_id and directly checks blocked_messages via pop_blocked_message, skipping any event log scan.
Remote conversation events cache now records the most recent user message ID during full sync and on new events, so the same lookup can be used when exposed in remote state in future.
Tests: added a new test to ensure Agent.step relies only on the stored ID and never iterates events when last_user_message_id is set, plus kept the bounded-iteration regression test.

Tests

python -m pytest -q tests/test_agent_step_bounded_scan.py

openhands-ai · 2026-01-31T12:25:00Z

Final summary (new changes since the prior update):

No additional code changes were made since the last summary; the branch is clean and already pushed. Latest commit: fcca360c (“Avoid scanning events for last user message”).

Checklist:

Verified git status is clean (no pending changes).
Confirmed the request is fully addressed (code fix committed, pushed, and PR comment posted).
Ensured there are no extraneous changes to revert.

Notes:

No further push is necessary because the branch is already up to date on origin/openhands/bounded-latest-user-scan.

View full conversation

openhands-sdk/openhands/sdk/agent/agent.py

all-hands-bot

Code Review

Good optimization to avoid scanning the entire event log! The core approach is sound and well-tested, but I found one critical issue that needs to be addressed.

🔴 Critical Issue: Inconsistent `last_user_message_id` Updates

Problem

The PR updates last_user_message_id in send_message() (line 477), but there's another location where user MessageEvent objects are created that was not updated:

Location: local_conversation.py lines 531-538 (not shown in this diff)

feedback_msg = MessageEvent(
    source="user",
    llm_message=Message(
        role="user",
        content=[TextContent(text=prefixed)],
    ),
)
self._on_event(feedback_msg)  # ❌ last_user_message_id not updated!

This occurs when a stop hook denies the agent from finishing and provides feedback.

Why This Matters

The feedback message is a user message that goes through _on_event()
_on_event() can trigger UserPromptSubmit hooks that might block the message
Without updating last_user_message_id, Agent.step() will NOT detect if this feedback message was blocked
This creates inconsistent behavior: regular user messages are checked, but feedback messages are not

Required Fix

Add the same update pattern used in send_message():

feedback_msg = MessageEvent(
    source="user",
    llm_message=Message(
        role="user",
        content=[TextContent(text=prefixed)],
    ),
)
self._state.last_user_message_id = feedback_msg.id  # ✅ Add this line
self._on_event(feedback_msg)

🟠 Important: Backward Compatibility

Observation

In agent.py line 270, when last_user_message_id is None, the hook-blocked check is completely skipped:

if state.last_user_message_id is not None:
    reason = state.pop_blocked_message(state.last_user_message_id)
    # ...
# If None, no check happens at all

This can occur for:

Old conversations loaded from disk (created before this change)
Edge cases where user messages are created without updating the field

Impact

The original code would scan events even without last_user_message_id, potentially catching blocked messages. The new code skips the check entirely.

Recommendation

Add defensive logging to catch unexpected cases:

if state.last_user_message_id is not None:
    reason = state.pop_blocked_message(state.last_user_message_id)
    if reason is not None:
        logger.info(f"User message blocked by hook: {reason}")
        state.execution_status = ConversationExecutionStatus.FINISHED
        return
else:
    # Defensive check: warn if blocked messages exist but ID is missing
    if state.blocked_messages:
        logger.warning(
            "Blocked messages exist but last_user_message_id is None. "
            "This may indicate a bug or old conversation state."
        )

This helps identify bugs during the transition period.

🟡 Suggestions for Improvement

1. Test Coverage

Current tests manually set last_user_message_id but don't verify behavior when it's None. Consider adding:

def test_agent_step_handles_none_last_user_message_id(tmp_path):
    """Verify graceful handling when last_user_message_id is not set."""
    agent = Agent(llm=LLM(model="gpt-4o-mini", api_key="x"), tools=[])
    workspace = LocalWorkspace(working_dir=tmp_path)
    conv = LocalConversation(agent=agent, workspace=workspace)
    
    # Simulate old conversation without last_user_message_id
    user_msg = MessageEvent(
        source="user",
        llm_message=Message(role="user", content=[TextContent(text="hi")]),
    )
    conv._on_event(user_msg)
    # Intentionally don't set last_user_message_id
    
    # Should not crash when None
    agent.step(conv, on_event=conv._on_event)
    # Verify expected behavior is documented

2. Documentation Enhancement

The new field in state.py is well-documented, but could clarify its relationship to blocked_messages:

last_user_message_id: EventID | None = Field(
    default=None,
    description=(
        "Most recent user MessageEvent ID for hook block checks. "
        "Updated automatically when user messages are dispatched. "
        "Used by Agent.step to avoid scanning the full event log. "
        "If None, hook-blocked check is skipped (backward compatibility)."
    ),
)

✅ What's Good

Strong optimization: Avoids O(n) scan of event log for every step
Comprehensive tests: Creative use of custom EventLog classes to verify non-iteration
Clear motivation: PR description explains the performance issue well
Server compatibility: RemoteConversation uses LocalConversation internally, so it inherits the fix automatically

Summary

Priority	Issue	Status
🔴 Critical	Update `last_user_message_id` for feedback messages (line 531-538)	Must fix
🟠 Important	Add defensive logging for None case	Recommended
🟡 Suggestion	Add test for None scenario	Nice to have
🟡 Suggestion	Enhance field documentation	Nice to have

Action required: Fix the feedback message case to ensure consistent behavior across all user message creation paths.

openhands-ai · 2026-01-31T22:51:56Z

Looks like there are a few issues preventing this PR from being merged!

GitHub Actions are failing:
- Run tests

If you'd like me to help, just leave a comment, like

@OpenHands please fix the failing actions on PR #1844 at branch `openhands/bounded-latest-user-scan`

Feel free to include any additional details that might help me get this PR into a better state.

_{^{You can manage your notification settings}}

xingyaoww · 2026-02-01T03:00:06Z

openhands-sdk/openhands/sdk/conversation/impl/local_conversation.py

                extended_content=extended_content,
                sender=sender,
            )
+            self._state.last_user_message_id = user_msg_event.id


When we modify _state, we should hold the lock for .state

I wonder if we can just modify the default on_event callback, so we will update it when we've dectected message event from user

xingyaoww

Left a few comment - Once address this lgtm

Avoid materializing full events history in Agent.step

3b416de

Co-authored-by: openhands <[email protected]>

openhands-ai bot mentioned this pull request Jan 27, 2026

SDK: remaining full-history materializations of state.events (Agent init_state/step/critic) #1839

Open

enyst commented Jan 28, 2026

View reviewed changes

openhands-sdk/openhands/sdk/agent/agent.py Show resolved Hide resolved

Avoid scanning events for last user message

fcca360

enyst added 2 commits January 31, 2026 12:37

Refine last user message tracking

ed8d79e

Adjust last user message tracking

4d6fbcf

OpenHands deleted a comment from openhands-ai bot Jan 31, 2026

enyst commented Jan 31, 2026

View reviewed changes

openhands-sdk/openhands/sdk/agent/agent.py Outdated Show resolved Hide resolved

enyst added 2 commits January 31, 2026 23:43

Update openhands-sdk/openhands/sdk/agent/agent.py

7862e45

Merge branch 'main' into openhands/bounded-latest-user-scan

6c78a51

enyst marked this pull request as ready for review January 31, 2026 22:44

all-hands-bot reviewed Jan 31, 2026

View reviewed changes

xingyaoww reviewed Feb 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SDK: bound latest-user-message scan in Agent.step #1844

SDK: bound latest-user-message scan in Agent.step #1844

enyst commented Jan 27, 2026 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Jan 27, 2026 •

edited

Loading

Uh oh!

Uh oh!

enyst commented Jan 31, 2026

Uh oh!

openhands-ai bot commented Jan 31, 2026

Uh oh!

enyst commented Jan 31, 2026

Uh oh!

enyst commented Jan 31, 2026

Uh oh!

openhands-ai bot commented Jan 31, 2026

Uh oh!

Uh oh!

all-hands-bot left a comment

Uh oh!

openhands-ai bot commented Jan 31, 2026

Uh oh!

xingyaoww Feb 1, 2026

Uh oh!

xingyaoww left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

SDK: bound latest-user-message scan in Agent.step #1844

Are you sure you want to change the base?

SDK: bound latest-user-message scan in Agent.step #1844

Conversation

enyst commented Jan 27, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Rationale

Testing

Uh oh!

github-actions bot commented Jan 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

enyst commented Jan 31, 2026

Uh oh!

openhands-ai bot commented Jan 31, 2026

Uh oh!

enyst commented Jan 31, 2026

Update

What changed

Tests

Uh oh!

enyst commented Jan 31, 2026

Update (correction)

What changed

Tests

Uh oh!

openhands-ai bot commented Jan 31, 2026

Uh oh!

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Code Review

🔴 Critical Issue: Inconsistent last_user_message_id Updates

Problem

Why This Matters

Required Fix

🟠 Important: Backward Compatibility

Observation

Impact

Recommendation

🟡 Suggestions for Improvement

1. Test Coverage

2. Documentation Enhancement

✅ What's Good

Summary

Uh oh!

openhands-ai bot commented Jan 31, 2026

Uh oh!

xingyaoww Feb 1, 2026

Choose a reason for hiding this comment

Uh oh!

xingyaoww left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

enyst commented Jan 27, 2026 •

edited by github-actions bot

Loading

github-actions bot commented Jan 27, 2026 •

edited

Loading

🔴 Critical Issue: Inconsistent `last_user_message_id` Updates