@juanmichelini juanmichelini commented Jan 16, 2026

Summary

Removes gpt-5-mini-2025-08-07 from the model lists as requested in #1747.

This follows the same pattern as #1734, which removed other unsupported models.

Changes:

  • Removed gpt-5-mini-2025-08-07 from .github/run-eval/resolve_model_config.py MODELS dictionary
  • Removed gpt-5-mini-2025-08-07 from openhands-sdk/openhands/sdk/llm/utils/verified_models.py VERIFIED_OPENAI_MODELS list
  • Updated example files to use gpt-5.2 instead of gpt-5-mini-2025-08-07
  • Updated test files to use gpt-5.2 instead of gpt-5-mini-2025-08-07

Fixes #1747

Checklist

  • If the PR is changing/adding functionality, are there tests to reflect this?
  • If there is an example, have you run the example to make sure that it works?
  • If there are instructions on how to run the code, have you followed the instructions and made sure that it works?
  • If the feature is significant enough to require documentation, is there a PR open on the OpenHands/docs repository with the same branch name?
  • Is the GitHub CI passing?


Agent Server images for this PR

GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server

Variants & Base Images

Variant  Architectures  Base Image                                  Docs / Tags
java     amd64, arm64   eclipse-temurin:17-jdk                      Link
python   amd64, arm64   nikolaik/python-nodejs:python3.12-nodejs22  Link
golang   amd64, arm64   golang:1.21-bookworm                        Link

Pull (multi-arch manifest)

# Each variant is a multi-arch manifest supporting both amd64 and arm64
docker pull ghcr.io/openhands/agent-server:cdc1ba3-python

Run

docker run -it --rm \
  -p 8000:8000 \
  --name agent-server-cdc1ba3-python \
  ghcr.io/openhands/agent-server:cdc1ba3-python

All tags pushed for this build

ghcr.io/openhands/agent-server:cdc1ba3-golang-amd64
ghcr.io/openhands/agent-server:cdc1ba3-golang_tag_1.21-bookworm-amd64
ghcr.io/openhands/agent-server:cdc1ba3-golang-arm64
ghcr.io/openhands/agent-server:cdc1ba3-golang_tag_1.21-bookworm-arm64
ghcr.io/openhands/agent-server:cdc1ba3-java-amd64
ghcr.io/openhands/agent-server:cdc1ba3-eclipse-temurin_tag_17-jdk-amd64
ghcr.io/openhands/agent-server:cdc1ba3-java-arm64
ghcr.io/openhands/agent-server:cdc1ba3-eclipse-temurin_tag_17-jdk-arm64
ghcr.io/openhands/agent-server:cdc1ba3-python-amd64
ghcr.io/openhands/agent-server:cdc1ba3-nikolaik_s_python-nodejs_tag_python3.12-nodejs22-amd64
ghcr.io/openhands/agent-server:cdc1ba3-python-arm64
ghcr.io/openhands/agent-server:cdc1ba3-nikolaik_s_python-nodejs_tag_python3.12-nodejs22-arm64
ghcr.io/openhands/agent-server:cdc1ba3-golang
ghcr.io/openhands/agent-server:cdc1ba3-java
ghcr.io/openhands/agent-server:cdc1ba3-python

About Multi-Architecture Support

  • Each variant tag (e.g., cdc1ba3-python) is a multi-arch manifest supporting both amd64 and arm64
  • Docker automatically pulls the correct architecture for your platform
  • Individual architecture tags (e.g., cdc1ba3-python-amd64) are also available if needed

Removes gpt-5-mini-2025-08-07 from:
- .github/run-eval/resolve_model_config.py MODELS dictionary
- openhands-sdk/openhands/sdk/llm/utils/verified_models.py VERIFIED_OPENAI_MODELS list

Updates example files and tests to use gpt-5.2 instead.

Fixes #1747

Co-authored-by: openhands <[email protected]>

@all-hands-bot all-hands-bot left a comment

Review Summary

The PR successfully removes gpt-5-mini-2025-08-07 from the main model lists and updates examples and tests to use gpt-5.2 as a replacement. The replacement model is appropriate since gpt-5.2 supports the same features (Responses API, reasoning effort, etc.).

🟠 Important Issue Found

Incomplete Removal: One reference to gpt-5-mini-2025-08-07 was missed in the codebase:

  • File: tests/sdk/llm/test_model_features.py
  • Line: 266
  • Content: ("openai/gpt-5-mini-2025-08-07", False),

This test case is verifying that gpt-5-mini-2025-08-07 does NOT support prompt_cache_retention. Since this model is being removed from the codebase entirely, this test case should also be removed to maintain consistency.

Suggested fix: Remove the line at tests/sdk/llm/test_model_features.py:266
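A leftover reference like this one can be caught with a quick text search before merging. The following is a minimal sketch, not part of the PR: the helper name `find_references` and the set of file suffixes are assumptions made for illustration, and it only matches the literal model string.

```python
from pathlib import Path


def find_references(root, needle, suffixes=(".py", ".md", ".yml", ".yaml")):
    """Return (path, line_number, stripped_line) for each line under root containing needle."""
    hits = []
    for path in sorted(Path(root).rglob("*")):
        # Only scan regular files with one of the (assumed) text suffixes.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip unreadable or non-UTF-8 files
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line:
                hits.append((str(path), number, line.strip()))
    return hits


if __name__ == "__main__":
    # Run from the repository root; prints one "path:line: content" row per hit.
    for path, number, line in find_references(".", "gpt-5-mini-2025-08-07"):
        print(f"{path}:{number}: {line}")
```

An empty result would suggest the removal is complete for the scanned file types; references built dynamically (for example by string concatenation) would not be found this way.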

✅ What Looks Good

  1. Model properly removed from .github/run-eval/resolve_model_config.py
  2. Model properly removed from openhands-sdk/openhands/sdk/llm/utils/verified_models.py
  3. Examples updated to use valid replacement model gpt-5.2
  4. Tests updated appropriately
  5. Replacement model gpt-5.2 supports all the necessary features (Responses API, reasoning effort)

Once the missed reference is removed, this PR will be complete.

@github-actions

Coverage

Coverage Report
File   Stmts  Miss  Cover  Missing
TOTAL  15563  4548  70%
report-only-changed-files is enabled. No files were changed during this commit :)

assert api_key, "Set LLM_API_KEY or OPENAI_API_KEY in your environment."

- model = "openhands/gpt-5-mini-2025-08-07"  # Use a model that supports Responses API
+ model = "openhands/gpt-5.2"  # Use a model that supports Responses API

@enyst enyst Jan 16, 2026

I think it can stay here; this is not eval, it's just testing with a real LLM.

  title_gen_llm = LLM(
      usage_id="title-gen-llm",
-     model=os.getenv("LLM_MODEL", "openhands/gpt-5-mini-2025-08-07"),
+     model=os.getenv("LLM_MODEL", "openhands/gpt-5.2"),

Ditto

  "gpt-5.1-codex-mini",
  "gpt-5-codex",
  "gpt-5-2025-08-07",
- "gpt-5-mini-2025-08-07",

I'm not sure, but maybe this one makes some sense too? I think it appears in the app-server's web UI in a dropdown for OpenAI 🤔

WDYT, should we remove it?


@enyst enyst left a comment

If we remove this from examples, then we need to use a more expensive LLM when running test-examples. Is that intended?
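For context on the cost question: the title-generation example in this PR reads its model from the LLM_MODEL environment variable, so a runner could in principle select a cheaper model without changing the default in code. The sketch below illustrates that pattern only; `resolve_model` and `DEFAULT_MODEL` are hypothetical names, and whether the test-examples workflow sets LLM_MODEL at all is an assumption not confirmed in this thread.

```python
import os

# Default taken from the updated example in this PR.
DEFAULT_MODEL = "openhands/gpt-5.2"


def resolve_model(env=None):
    """Return LLM_MODEL if it is set and non-empty, otherwise the default.

    Unlike os.getenv("LLM_MODEL", default), an empty string is treated as unset.
    """
    env = os.environ if env is None else env
    return env.get("LLM_MODEL") or DEFAULT_MODEL


if __name__ == "__main__":
    print(resolve_model())
```

With this pattern, only examples that hardcode the model name (rather than reading LLM_MODEL) would be forced onto the more expensive default.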

Development

Successfully merging this pull request may close these issues.

Remove gpt-mini model from the model lists

5 participants