fix: fix simple_chat Responses tool schema + model discovery fallback #216
gyliu513 wants to merge 2 commits into llamastack:main
Conversation
examples/agents/simple_chat.py
Outdated
```python
agent_kwargs = {
    "model": model_id,
    "instructions": "",
    # OpenAI Responses tool schema requires a type discriminator.
    "tools": [{"type": "web_search"}],
    "input_shields": available_shields,
    "output_shields": available_shields,
    "enable_session_persistence": False,
}
allowed_params = set(inspect.signature(Agent.__init__).parameters)
filtered_kwargs = {k: v for k, v in agent_kwargs.items() if k in allowed_params}
```
It is not clear that any developer will write code like this when creating agents with the llama stack client. Can you make the code here something a new developer can just copy? We don't need any backward compatibility here either; we can just use the latest version and keep copies of the examples for older versions if needed.
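For example, something like this — a rough sketch assuming the current llama-stack-client `Agent` constructor kwargs; the base URL and model id are illustrative:

```python
from llama_stack_client import LlamaStackClient
from llama_stack_client.lib.agents.agent import Agent

client = LlamaStackClient(base_url="http://localhost:8321")

# One straightforward construction: no kwargs filtering, no version shims.
agent = Agent(
    client,
    model="ollama/llama3.2:3b",  # illustrative model id
    instructions="You are a helpful assistant.",
    tools=[{"type": "web_search"}],
)
```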
@raghotham can you help check if this can be merged? Thanks!
```python
    return available_models[0]


def can_model_chat(client: LlamaStackClient, model_id: str) -> bool:
```
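(For context: a probe like this presumably issues a throwaway chat completion and treats an API error as "no chat support" — a sketch assuming the OpenAI-compatible `chat.completions` surface, not the PR's exact code:)

```python
def can_model_chat(client: LlamaStackClient, model_id: str) -> bool:
    """Return True if the model accepts chat completions.

    Sketch only: send a one-token throwaway request and treat any API
    error (e.g. the 400 '"..." does not support chat') as False.
    """
    try:
        client.chat.completions.create(
            model=model_id,
            messages=[{"role": "user", "content": "ping"}],
            max_tokens=1,
        )
        return True
    except Exception:
        return False
```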
Do we have to run a chat completion to see if the model supports chat? We already have a model type: https://github.com/llamastack/llama-stack/blob/ffa98595e696c7ab3e0e933d0ed75375ee1d7b84/src/llama_stack_api/models/models.py#L23
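i.e., filter on the advertised type instead of probing each model — a sketch, assuming entries from `models.list()` expose a `model_type` attribute:

```python
# Sketch: pick chat candidates from the advertised model type rather
# than issuing a probe request per model.
llm_models = [m for m in client.models.list() if m.model_type == "llm"]
```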
@raghotham I can see there are some models with `llm` type that still do not support chat, like ollama/all-minilm:latest:
```
(llama-stack) (base) gualiu@gualiu-mac llama-stack % curl -s http://localhost:8321/v1/models \
  | jq '.data[] | select(.id=="ollama/all-minilm:latest")'
{
  "id": "ollama/all-minilm:latest",
  "object": "model",
  "created": 1769569923,
  "owned_by": "llama_stack",
  "custom_metadata": {
    "model_type": "llm",
    "provider_id": "ollama",
    "provider_resource_id": "all-minilm:latest"
  }
}
```

But this model does not support chat:
```
(stack) gualiu@gualiu-mac llama-stack-apps % python -m examples.agents.simple_chat --host localhost --port 8321 --model_id ollama/all-minilm:latest
INFO:httpx:HTTP Request: GET http://localhost:8321/v1/models "HTTP/1.1 200 OK"
Using model: ollama/all-minilm:latest
INFO:httpx:HTTP Request: POST http://localhost:8321/v1/conversations "HTTP/1.1 200 OK"
User> Hello
INFO:httpx:HTTP Request: POST http://localhost:8321/v1/responses "HTTP/1.1 200 OK"
🤔
❌ Turn failed: Error code: 400 - {'error': {'message': '"all-minilm:latest" does not support chat', 'type': 'api_error', 'param': None, 'code': None}}
User> Search web for which players played in the winning team of the NBA western conference semifinals of 2024
INFO:httpx:HTTP Request: POST http://localhost:8321/v1/responses "HTTP/1.1 200 OK"
🤔
❌ Turn failed: Error code: 400 - {'error': {'message': '"all-minilm:latest" does not support chat', 'type': 'api_error', 'param': None, 'code': None}}
```
I think that besides model_type, we may need to add a new capability field to the model; the capability could be chat, completion, tool_calling, etc. Comments?
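Roughly, a hypothetical shape for it (none of these fields exist today; this only illustrates the proposal):

```python
from enum import Enum

from pydantic import BaseModel


class ModelCapability(str, Enum):
    chat = "chat"
    completion = "completion"
    tool_calling = "tool_calling"


class Model(BaseModel):
    identifier: str
    model_type: str  # existing field: "llm" | "embedding"
    # Proposed addition: lets clients filter without a probe request.
    capabilities: list[ModelCapability] = []
```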