Most language models are generalists. They're trained to chat, summarize, translate, and code—then adapted for agent use through prompt engineering, tool-calling wrappers, and orchestration frameworks. The results are impressive but inherently indirect: you're fitting a conversational model into an action-oriented role through scaffolding rather than design.
OpenRouter is testing a different hypothesis. In early March 2026, the platform quietly released two alpha models under its own namespace—openrouter/hunter-alpha and openrouter/healer-alpha—that are designed from the ground up for agent workloads. Not adapted. Not fine-tuned from a chat model. Purpose-built.
What "Purpose-Built for Agents" Actually Means
The distinction matters more than it might appear. General-purpose models optimize for conversational quality: helpful tone, balanced responses, hedging uncertainty. Agent models optimize for different properties—task completion fidelity, sustained instruction adherence over long execution chains, reliable tool invocation, and the ability to maintain state awareness across hundreds of turns without degrading.
Hunter and Healer are OpenRouter's answer to this gap. Both models carry the "alpha" designation, which means they're early-stage and carry important caveats: all prompts and completions are logged by the provider and may be used to improve the model. They're not production-ready for sensitive workloads. But they represent a clear directional bet—OpenRouter believes the agent use case deserves its own model class, not a repurposed one.
Hunter Alpha: 1T Parameters for Long-Horizon Planning
Hunter Alpha is the text-focused agent model. Its specifications are aggressive:
| Property | Value |
|---|---|
| Parameters | 1 trillion |
| Context window | 1,048,576 tokens (1M) |
| Max completion tokens | 32,000 |
| Modality | Text → Text |
| Pricing | Free (all token types) |
| Tool calling | Supported (tools + tool_choice) |
| Reasoning tokens | Supported (include_reasoning) |
| Structured output | Supported (response_format) |
| Default temperature | 1.0 |
| Default top_p | 0.95 |
The 1T parameter count is notable—this is among the largest dense models available through OpenRouter's platform. The 1M token context window matches what Google offers with Gemini and what NVIDIA claims with Nemotron 3 Super, putting Hunter in the same tier for long-context work. For agent frameworks like OpenClaw that depend on sustained context awareness across tool calls, file reads, and multi-step plans, this matters.
OpenRouter's own description is direct about the target: Hunter "excels at long-horizon planning, complex reasoning, and sustained multi-step task execution, with the reliability and instruction-following precision that frameworks like OpenClaw need." That's not marketing copy for a general audience—that's a signal to agent framework developers about what this model was optimized for.
Supported parameters include frequency_penalty, presence_penalty, stop sequences, and temperature/top_p controls. Notably missing from the supported set: logprobs, top_logprobs, and seed. This suggests the model is tuned for execution reliability over probabilistic analysis—a reasonable tradeoff for an agent model where deterministic behavior matters more than distributional introspection.
Healer Alpha: Omni-Modal Agent Perception
Healer Alpha is the more ambitious of the pair. Where Hunter focuses on deep text reasoning, Healer extends agent capabilities into the full sensory stack:
| Property | Value |
|---|---|
| Parameters | Not disclosed |
| Context window | 262,144 tokens (256K) |
| Max completion tokens | 32,000 |
| Modality | Text + Image + Audio + Video → Text |
| Pricing | Free (all token types) |
| Tool calling | Supported (tools + tool_choice) |
| Reasoning tokens | Supported (include_reasoning) |
| Structured output | Supported (response_format) |
| Default temperature | 1.0 |
| Default top_p | 0.95 |
Text, image, audio, and video as input modalities. That's a broader input range than most frontier models expose through OpenRouter. GPT-4o handles text and image; Claude handles text and image. Healer adds native audio and video understanding, positioned explicitly for "perceiving visual and audio inputs, reasoning across modalities, and executing complex multi-step tasks."
The 262K context window is smaller than Hunter's 1M, likely reflecting the computational cost of processing multi-modal inputs at scale. Still, 262K tokens is substantial—enough for an agent to ingest a 30-minute video transcript, a set of screenshots, and a full codebase in a single context window.
Parameter count is not disclosed for Healer, which is itself a signal. If it were similarly massive (1T+), OpenRouter would likely advertise it. The inference is that Healer may be a smaller, more efficient model optimized for the perception-action loop rather than raw reasoning depth—a reasonable architecture for a model whose primary job is "see, hear, understand, then act."
Comparative Architecture
The Hunter/Healer pairing suggests a deliberate architectural split rather than a single omni-capable model:
| Dimension | Hunter Alpha | Healer Alpha |
|---|---|---|
| Primary role | Deep reasoning & planning | Perception & cross-modal action |
| Input modalities | Text | Text, image, audio, video |
| Context depth | 1M tokens | 262K tokens |
| Parameter scale | 1T (disclosed) | Undisclosed |
| Sweet spot | Multi-hour planning chains | Real-world sensing + response |
| Shared capabilities | Tool calling, reasoning tokens, structured output, free pricing | |
This mirrors patterns we see in robotics and embodied AI: separate models for high-level planning (the "brain") and sensorimotor processing (the "perception layer"). An agent framework could route long-term strategy to Hunter and real-time environmental awareness to Healer, calling each where it's strongest.
The Free Pricing Question
Both models are free across all token types—prompt, completion, request, image, web search, and internal reasoning. This is unusual for frontier-scale models and worth examining.
The alpha designation provides the answer. OpenRouter is likely using these models to collect agent workload data. The explicit logging disclosure ("All prompts and completions for this model are logged by the provider and may be used to improve the model") confirms this. Free access in exchange for training data is a well-established pattern—OpenAI did it with ChatGPT's early free tier, and Google does it with Gemini's generous free API limits.
For developers building agent frameworks, the calculus is straightforward: these models are useful for prototyping, experimentation, and non-sensitive workloads today, with the understanding that pricing will arrive when the models exit alpha. Whether the free tier persists or converts to metered access will depend on how valuable the collected agent interaction data proves to be.
Supported Parameters Analysis
Both Hunter and Healer support an identical parameter set, which reveals the shared agent-oriented design philosophy:
- Tools + tool_choice: Native function calling, the backbone of agent architectures.
- Reasoning tokens (include_reasoning): Expose the model's chain-of-thought for transparency and debugging in agent loops.
- Response format: Force JSON or structured output for reliable agent-to-agent communication.
- Frequency/presence penalty: Control repetition in long generation chains where agents may loop.
- Stop sequences: Terminate generation at specific markers for precise agent handoff.
What's notably absent from both: logprobs, seed, structured_outputs (as a separate flag from response_format), and top_k. These omissions suggest the models are optimized for execution confidence rather than sampling exploration—again, consistent with agent use where you want the model to commit to an action rather than hedge across probability distributions.
Implications for Agent Frameworks
For projects like OpenClaw—which the Hunter description explicitly references—purpose-built agent models represent a potential inflection point. Current agent frameworks spend significant engineering effort on:
- Prompt scaffolding: System prompts that instruct general models to "think like an agent"
- Context management: Compression and summarization to fit agent histories into limited windows
- Tool call reliability: Retry logic and validation for models that weren't optimized for function calling
- Instruction drift: Models losing task context over long execution chains
If purpose-built models like Hunter and Healer genuinely reduce the need for this scaffolding—if they maintain instruction fidelity over longer chains, invoke tools more reliably, and sustain task awareness across 1M token contexts—then the engineering effort shifts from "how do we make a chat model act like an agent" to "how do we build on top of a model that already thinks like one."
That's a fundamentally different development paradigm. It's the difference between building an autopilot on top of a sedan versus designing a plane from scratch for autonomous flight.
Open Questions
Several questions remain unanswered in the alpha release:
- Benchmark performance: No published scores on SWE-Bench, AgentBench, or any standard agent evaluation suite. Without benchmarks, the "purpose-built" claim is asserted, not demonstrated.
- Training methodology: What does "purpose-built" mean at the training level? RLHF on agent trajectories? Multi-turn tool-use pretraining? Architecture changes? The models use "Other" as their tokenizer identifier, suggesting a non-standard training approach, but details are absent.
- Provider identity: Who actually trained these models? OpenRouter is a routing platform, not a model trainer. The models sit under the
openrouter/namespace, but the underlying provider isn't disclosed. - Latency and throughput: A 1T parameter model will be expensive to serve, even if free to the user. Real-world latency characteristics for agent loops—which need fast iteration—aren't documented.
- Alpha to production timeline: When does alpha end? What changes? Will pricing be competitive with existing agent-capable models?
Bottom Line
Hunter and Healer are alpha models with real specifications and zero public benchmarks. What they represent directionally is more interesting than what they prove empirically: OpenRouter is betting that the agent use case is large enough to warrant purpose-built models rather than adapted generalists.
The Hunter/Healer split—deep reasoning vs. omni-modal perception—suggests an architecture where agent frameworks compose specialized models rather than relying on a single generalist. If that pattern holds, the agent infrastructure layer becomes more interesting, not less.
For now, both models are free to experiment with. The logging caveat means they're not for production secrets or sensitive workflows. But for prototyping agent architectures, testing multi-modal perception pipelines, and exploring what purpose-built agent models feel like in practice—the barrier to entry is zero.
That's worth paying attention to. Even if the alpha designation means paying attention with appropriate skepticism.
Technical Appendix
Data source: OpenRouter API (/api/v1/models), queried March 14, 2026
Model IDs: openrouter/hunter-alpha, openrouter/healer-alpha
Created: ~March 9, 2026 (Unix timestamps 1773260671, 1773260668)
All pricing values verified at $0 across prompt, completion, request, image, web_search, and internal_reasoning token types.
References:
OpenRouter Models API: https://openrouter.ai/api/v1/models
Hunter Alpha: https://openrouter.ai/openrouter/hunter-alpha
Healer Alpha: https://openrouter.ai/openrouter/healer-alpha
OpenClaw: https://openclaw.ai