Hanzo

Hanzo Models via Hanzo AI

46 models available

Access all 46 Hanzo models through Hanzo's OpenAI-compatible API. Single API key, unified billing, no rate limit juggling.

claude-3-5-haiku

Zen model: claude-3-5-haiku

claude-3-7-sonnet

Zen model: claude-3-7-sonnet

claude-4-1-opus

Zen model: claude-4-1-opus

claude-haiku-4-5

Zen model: claude-haiku-4-5

claude-opus-4

Zen model: claude-opus-4

claude-opus-4-5

Zen model: claude-opus-4-5

claude-opus-4-6

Zen model: claude-opus-4-6

claude-sonnet-4

Zen model: claude-sonnet-4

claude-sonnet-4-5

Zen model: claude-sonnet-4-5

deepseek-r1-distill-70b

Zen model: deepseek-r1-distill-70b

gpt-4.1

Zen model: gpt-4.1

gpt-4o

Zen model: gpt-4o

gpt-4o-mini

Zen model: gpt-4o-mini

gpt-5

Zen model: gpt-5

gpt-5-mini

Zen model: gpt-5-mini

gpt-5-nano

Zen model: gpt-5-nano

gpt-5.1-codex-max

Zen model: gpt-5.1-codex-max

gpt-5.2

Zen model: gpt-5.2

gpt-5.2-pro

Zen model: gpt-5.2-pro

gpt-oss-120b

Zen model: gpt-oss-120b

gpt-oss-20b

Zen model: gpt-oss-20b

llama-3.1-8b

Zen model: llama-3.1-8b

llama-3.3-70b

Zen model: llama-3.3-70b

mistral-nemo

Zen model: mistral-nemo

o1

Zen model: o1

o3

Zen model: o3

o3-mini

Zen model: o3-mini

Zen3 Embedding
8K

High-quality text embeddings for RAG, search, and classification.

Zen3 Guard — Content Safety
65K

Content safety classifier for moderation and guardrails. 9 safety categories, 119 languages.

Zen3 Nano — Edge
128K

Ultra-lightweight model for edge deployment and low-latency tasks.

Zen3 Omni — Hypermodal
202K

Multimodal model supporting text, vision, audio, and structured output.

Zen3 VL — Vision-Language
262K

Vision-language model for image understanding and visual reasoning.

Zen4 — Flagship
202K

Flagship MoE model for complex reasoning and multi-domain tasks.

Zen4 Coder — Code Generation
163K

Code-specialized MoE model for generation, review, debugging, and agentic programming.

Zen4 Coder Flash — Fast Code
262K

Lightweight code model optimized for speed and inline completions.

Zen4 Coder Pro — Premium Code
131K

Full-precision BF16 code model for maximum accuracy on complex codebases.

Zen4 Max — Maximum Intelligence
1M

Most capable model for complex reasoning, analysis, and agentic tasks. 1M token context window.

Zen4 Mini — Fast & Efficient
128K

Ultra-fast lightweight model optimized for speed and cost efficiency.

Zen4 Pro — High Capability
131K

Efficient MoE model for demanding workloads with strong reasoning at production-grade cost.

Zen4 Thinking — Deep Reasoning
131K

Dedicated reasoning model with explicit chain-of-thought capabilities.

Zen4 Ultra — Maximum Reasoning
262K

Maximum reasoning capability with extended chain-of-thought on MoE architecture.

Zen5 — Next Generation
1M

Next-generation agentic frontier model with native chain-of-thought.

Zen5 Pro — Advanced
524K

High-throughput agentic model for demanding production workloads.

Zen5 Max — Extended
2M

Maximum context agentic model for document-scale analysis.

Zen5 Ultra — Deep Reasoning
1M

Deepest reasoning model. Multi-pass chain-of-thought with self-verification.

Zen5 Mini — Efficient
262K

Efficient agentic model delivering zen5-class intelligence at a fraction of the cost.

Use Hanzo models via Hanzo

One API key. Unified billing. OpenAI-compatible. Works with every existing SDK.