Running Qwen 3 with Reasoning Mode in OpenClaw via Ollama

CodeTips · Emma W.
February 18, 2026 · 3 min read

If you're running local models through Ollama, Qwen 3 is one of the most capable options for agentic workloads. With OpenClaw v2026.2.17, Qwen 3 reasoning mode now works properly. Here's how to set it up and why it matters.

Why Qwen 3 for Local Agents?

Qwen 3 brings several advantages for OpenClaw users:

  • Strong reasoning capabilities: Qwen 3 models include native reasoning/thinking support, similar to Claude's extended thinking or o1-style chain-of-thought
  • Multiple sizes: From Qwen3-0.6B for quick tasks up to Qwen3-32B (plus larger MoE variants) for complex reasoning
  • Completely local: No API costs, no rate limits, your data stays on your machine
  • Good tool use: Qwen 3 handles function calling well, critical for OpenClaw's agentic workflows

The Problem (Before v2026.2.17)

Qwen 3 models return reasoning content in a different format than OpenClaw expected. When you enabled reasoning mode, you'd hit errors or get malformed responses because the Ollama provider wasn't parsing Qwen's reasoning field structure correctly.

This was fixed in PR #18631 by @mr-sk, which handles the Qwen 3 reasoning field format properly in Ollama responses.
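To make the fix concrete, here is a minimal sketch of what a client has to do with a Qwen 3 response. In Ollama's chat API, thinking-capable models return their reasoning in a separate `thinking` field alongside `content` in the assistant message; the sample payload and `split_reasoning` helper below are illustrative, not OpenClaw's actual code.

```python
import json

# Example shape of an Ollama /api/chat response from a thinking-capable
# model: reasoning arrives in "thinking", the answer in "content".
raw = '''
{
  "model": "qwen3:8b",
  "message": {
    "role": "assistant",
    "thinking": "The second train closes a 60-mile gap at 20 mph...",
    "content": "The second train catches up at 6pm."
  },
  "done": true
}
'''

def split_reasoning(response_json: str) -> tuple[str, str]:
    """Separate the model's reasoning from its final answer."""
    msg = json.loads(response_json)["message"]
    return msg.get("thinking", ""), msg["content"]

thinking, answer = split_reasoning(raw)
print(answer)  # The second train catches up at 6pm.
```

A provider that expects reasoning inline in `content` (or under a different key) will either error out or pass the raw reasoning through to the user, which is exactly the malformed-response behavior described above.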

Setting Up Qwen 3 with OpenClaw

Step 1: Pull the model in Ollama

# Choose your size based on your hardware
ollama pull qwen3:8b      # Good balance for most setups
ollama pull qwen3:14b     # More capable, needs ~10GB+ VRAM
ollama pull qwen3:32b     # Strongest dense reasoning, needs ~20GB+ VRAM

Step 2: Configure OpenClaw

Add Ollama as a provider in your config.yaml:

providers:
  ollama:
    baseUrl: http://localhost:11434

agents:
  defaults:
    model:
      primary: ollama/qwen3:8b
      # Optional: set thinking defaults for reasoning-capable models
      params:
        thinking: medium  # or low/high

Step 3: Test reasoning mode

In chat, try a complex reasoning task:

Solve this step by step: If a train leaves at 2pm traveling 60mph, and another leaves at 3pm traveling 80mph from the same station in the same direction, when does the second train catch up?

With reasoning enabled, you'll see the model's thinking process before the final answer.
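For reference, here is the arithmetic the model should work through, so you can check its reasoning trace against a known-good answer:

```python
# By 3pm the first train has a 1-hour head start at 60 mph.
head_start_miles = 60 * 1                 # 60 miles
closing_speed = 80 - 60                   # second train gains 20 mph
hours_to_catch_up = head_start_miles / closing_speed
# The second train departs at 3pm, so it catches up 3 hours later, at 6pm.
print(hours_to_catch_up)  # 3.0
```

If the model's thinking block wanders but the final answer is still 6pm, that's normal; the trace is a scratchpad, not a proof.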

Configuring Reasoning Behavior

You can set thinking/reasoning at multiple levels:

Per-model default (new in v2026.2.17):

agents:
  defaults:
    model:
      primary: ollama/qwen3:8b
      thinkingDefault: medium  # Default reasoning level for this model

Per-session override:

Use /reasoning on|off or /reasoning high|medium|low in chat.

Tips for Qwen 3 + OpenClaw

  1. Start with qwen3:8b - It's surprisingly capable and fast on modest hardware (16GB RAM)

  2. Enable thinking for complex tasks - Reasoning mode significantly improves multi-step planning and code generation

  3. Watch context limits - Qwen 3 models have varying context windows; configure contextWindow in model params if you hit issues

  4. Use with tools carefully - Qwen 3 handles tools well, but complex tool chains might benefit from higher reasoning levels
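For tip 3, a context-window override might look like the following. This extends the config from Step 2; the `contextWindow` key follows the parameter name mentioned above, but treat the exact key and placement as a sketch to verify against your OpenClaw version's docs.

```yaml
agents:
  defaults:
    model:
      primary: ollama/qwen3:8b
      params:
        thinking: medium
        contextWindow: 32768  # cap context to what your hardware handles
```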

What's Next?

Running local models with OpenClaw keeps getting better. If you hit compatibility issues with Qwen 3 or other Ollama models, open an issue on GitHub; the community is quick to fix these.


Have you tried Qwen 3 with OpenClaw? Share your setup in the comments!
