Meet your thinking partner.

Tackle any big, bold, bewildering challenge with Hanzo AI.

Preview

Some tasks just work better on macOS

With Hanzo Dev, AI can now work directly with your local files and tools. Available in the macOS app.

Complete AI Engineering Toolkit

Everything you need to build, deploy, and manage production-grade AI applications

AI Model Hub

Access a catalog of state-of-the-art foundation models from leading providers and Hanzo's specialized models.

AI Agents

Create autonomous agents that can reason, plan, and execute complex tasks with minimal human intervention.

Optimized Runtime

High-performance inference with automatic batching, caching, and efficient resource utilization.

Vector Database

Built-in vector storage for embeddings with automatic indexing and retrieval optimization.

Evaluation Suite

Comprehensive tools for testing, evaluating, and benchmarking AI models and applications.

AI Observability

Full visibility into AI system behavior with detailed metrics, logging, and performance analytics.

AI Safety & Guardrails

Advanced content filtering, privacy controls, and ethical guardrails for responsible AI deployment.

Enterprise Scale

Built for high-scale production workloads with auto-scaling, high availability, and global distribution.

Developer SDK

Intuitive libraries for Python, TypeScript, and other languages with comprehensive documentation.

Model Serving

Simplified deployment and management of custom models with automatic versioning and A/B testing.

Fine-tuning

User-friendly tools for customizing foundation models to your specific use cases and data.

Unified AI Platform

Everything you need to build powerful AI applications, accessible through a single, consistent API.

Model Hub Access

One API for thousands of models from top providers and the open-source community

Vector Database

Built-in vector storage with automatic indexing for semantic search and RAG applications

Document Processing

Process, chunk, and index documents in 30+ formats with automatic metadata extraction

Semantic Search

Natural language search across your knowledge base with advanced relevance tuning

Agent Framework

Build autonomous AI agents with reasoning, planning, and tool-use capabilities

Code Generation

Specialized models for code completion, refactoring, and documentation

Workflow Orchestration

Chain AI operations with built-in caching, observability, and error handling

Usage Analytics

Comprehensive analytics and cost tracking across all AI operations

Security & Compliance

Enterprise-grade security with data residency options and compliance features

Chat Interfaces

Pre-built chat components with memory management and streaming responses

Prompt Management

Version, test, and optimize prompts across different models and environments

Optimized Edge Serving

Global edge deployment for ultra-low latency AI inference and responses

AI Engineering Platform

Build, deploy, and scale AI applications with an integrated suite of tools designed for modern engineering teams.


AI Model Registry

Model              Type    Provider        Accuracy   Latency
GPT-4o             LLM     OpenAI          97%        120ms
Claude 3           LLM     Anthropic       96%        140ms
Gemini             LLM     Google          95%        110ms
Llama 3            LLM     Meta            94%        90ms
Mixtral            LLM     Mistral         93%        85ms
Stable Diffusion   Image   Stability AI    92%        200ms
Cohere Command     LLM     Cohere          91%        95ms

Model Integration

Seamlessly integrate with OpenAI, Hugging Face, and other machine learning platforms. Deploy and serve custom ML models with built-in scaling and monitoring.

  • One-click API connections to popular ML services
  • Simple deployment of custom models with containerization
  • Performance optimization for inference workloads

Vector Search

High-performance vector database capabilities for semantic search, RAG applications, and similarity matching across billions of vectors.

  • Advanced indexing for fast k-NN and ANN queries
  • Built-in embeddings generation from text and images
  • Hybrid search combining vector and traditional queries
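
To make the k-NN idea above concrete, here is a minimal sketch of exact (brute-force) nearest-neighbor search using cosine similarity. It is not Hanzo's implementation or API: the `Doc` type, the `knn` function, and the three-dimensional sample embeddings are all hypothetical, and a production system would typically use an ANN index rather than scoring every vector.

```typescript
type Vec = number[];

// Dot product of two equal-length vectors.
function dot(a: Vec, b: Vec): number {
  let sum = 0;
  for (let i = 0; i < a.length; i++) sum += a[i] * b[i];
  return sum;
}

// Cosine similarity in [-1, 1]; returns 0 if either vector is all zeros.
function cosineSimilarity(a: Vec, b: Vec): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  const denom = Math.sqrt(dot(a, a)) * Math.sqrt(dot(b, b));
  return denom === 0 ? 0 : dot(a, b) / denom;
}

// Hypothetical document record: an id plus a precomputed embedding.
interface Doc {
  id: string;
  embedding: Vec;
}

// Exact k-NN: score every document, sort by score descending, keep the top k.
function knn(query: Vec, docs: Doc[], k: number): { id: string; score: number }[] {
  return docs
    .map((d) => ({ id: d.id, score: cosineSimilarity(query, d.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}

// Toy embeddings, invented for illustration only.
const docs: Doc[] = [
  { id: "deploy", embedding: [0.9, 0.1, 0.0] },
  { id: "pricing", embedding: [0.0, 0.2, 0.9] },
  { id: "scaling", embedding: [0.7, 0.6, 0.1] },
];

const top2 = knn([1, 0, 0], docs, 2);
console.log(top2.map((r) => r.id)); // prints [ 'deploy', 'scaling' ]
```

Exact search like this costs one similarity computation per stored vector, which is why approximate (ANN) indexes are used once a collection grows large.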

Data Processing

Specialized data processing pipelines for cleaning, transforming, and enriching training data for machine learning models.

  • Automated ETL workflows for AI data preparation
  • Data versioning and lineage tracking
  • Scalable batch and stream processing

AI-Enhanced Features

Ready-to-use AI capabilities that can be integrated into applications with minimal configuration.

  • Content generation and summarization
  • Image and video analysis with computer vision
  • Real-time anomaly detection and predictive analytics

Unified Model Access

Access thousands of AI models through a single, unified API with consistent interfaces and predictable pricing

Hanzo Zen Models

14 foundation models across language, code, vision, multimodal, and specialized tasks

Zen4 -- Flagship

Flagship model optimized for complex reasoning and multi-domain tasks.

via Hanzo

  • 744B MoE (40B active)
  • 202K context window
  • Ultra Max tier
  • $3 / $9.60 per MTok
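
As a rough illustration of how per-MTok pricing translates into a per-request cost, the sketch below assumes that "$3 / $9.60 per MTok" means $3 per million input tokens and $9.60 per million output tokens. The page does not spell this out, so treat that reading, along with the `PricePerMTok` type and `estimateCostUSD` helper, as assumptions.

```typescript
// USD per one million tokens (MTok).
// Assumption: the first figure is the input price, the second the output price.
interface PricePerMTok {
  input: number;
  output: number;
}

const zen4Price: PricePerMTok = { input: 3, output: 9.6 };

// Estimated cost in USD for a single request, given its token counts.
function estimateCostUSD(
  inputTokens: number,
  outputTokens: number,
  price: PricePerMTok,
): number {
  const tokensPerMTok = 1_000_000;
  return (
    (inputTokens / tokensPerMTok) * price.input +
    (outputTokens / tokensPerMTok) * price.output
  );
}

// Example: a 20,000-token prompt with a 1,500-token reply.
const cost = estimateCostUSD(20_000, 1_500, zen4Price);
console.log(cost.toFixed(4)); // prints "0.0744"
```

Under this reading, output tokens cost a little over three times as much as input tokens, so long generations dominate the bill faster than long prompts do.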

Zen4 Ultra -- Maximum Reasoning

Maximum reasoning capability with extended thinking for complex problems.

via Hanzo

  • 744B MoE (40B active) + CoT
  • 202K context window
  • Ultra Max tier
  • Deep reasoning with chain-of-thought

Zen4 Pro -- High Capability

High-capability model with efficient MoE architecture for demanding workloads.

via Hanzo

  • 80B MoE (3B active)
  • 131K context window
  • Ultra tier
  • Efficient MoE architecture

Zen4 Max -- Extended Context

Extended context flagship model for large-scale analysis and document processing.

via Hanzo

  • 1.04T MoE (32B active)
  • 256K context window
  • Ultra Max tier
  • Largest Zen model

Zen4 Coder -- Code Generation

Code-specialized model for generation, review, and debugging.

via Hanzo

  • 480B MoE (35B active)
  • 262K context window
  • Ultra tier
  • Code generation and review

Zen4 Coder Pro -- Premium Code

Full-precision code model for maximum accuracy on complex codebases.

via Hanzo

  • 480B Dense BF16
  • 262K context window
  • Ultra Max tier
  • Full-precision code analysis

Zen4 Coder Flash -- Fast Code

Lightweight code model optimized for speed and inline completions.

via Hanzo

  • 30B MoE (3B active)
  • 262K context window
  • Pro Max tier
  • Fast inline completions

Zen4 Thinking -- Deep Reasoning

Dedicated reasoning model with explicit chain-of-thought capabilities.

via Hanzo

  • 80B MoE (3B active) + CoT
  • 131K context window
  • Pro Max tier
  • Chain-of-thought

Zen4 Mini -- Fast & Efficient

Lightweight model optimized for speed and cost efficiency.

via Hanzo

  • 8B Dense Transformer
  • 40K context window
  • Pro tier
  • Ultra-fast inference

Zen3 Omni -- Hypermodal

Multimodal model supporting text, vision, and structured output.

via Hanzo

  • ~200B Dense Multimodal
  • 202K context window
  • Pro Max tier
  • Text, vision, and audio

Zen3 VL -- Vision-Language

Vision-language model for image understanding and visual reasoning.

via Hanzo

  • 30B MoE (3B active)
  • 131K context window
  • Pro Max tier
  • Image understanding

Zen3 Nano -- Edge

Ultra-lightweight model for edge deployment and low-latency tasks.

via Hanzo

  • 4B Dense Transformer
  • 40K context window
  • Pro tier
  • Edge deployment

Zen3 Guard -- Content Safety

Content safety classifier for moderation and guardrails.

via Hanzo

  • 4B Dense Transformer
  • 40K context window
  • Pro tier
  • Content moderation

Zen3 Embedding -- Text Embeddings

High-quality text embeddings for search, clustering, and retrieval.

via Hanzo

  • High-dimensional embedding model
  • 8K context window
  • Pro Max tier
  • 3072 dimensions

Third-Party Models

100+ industry-leading models available through the Hanzo AI Cloud gateway

Claude Opus 4.6

Anthropic's most powerful model for the hardest tasks.

via Anthropic

  • 1M context window
  • Most capable model
  • Complex reasoning
  • Extended thinking

Claude Sonnet 4.6

Ideal balance of capability and speed for production workloads.

via Anthropic

  • 1M context window
  • Best balance of speed and intelligence
  • Strong coding
  • Fast inference

Claude Haiku 4.5

Fastest and most affordable Claude model for high-throughput tasks.

via Anthropic

  • 200K context window
  • Fastest Anthropic model
  • Cost efficient
  • Low latency

GPT-5

OpenAI's flagship model with advanced reasoning capabilities.

via OpenAI

  • 400K context window
  • Multimodal
  • Advanced reasoning
  • Tool use

GPT-5 Mini

Cost-efficient OpenAI model for everyday tasks.

via OpenAI

  • 400K context window
  • Fast and affordable
  • Good quality
  • Low latency

DeepSeek R1

Specialized reasoning model excelling at math and code.

via DeepSeek

  • 64K context window
  • Reasoning model
  • Chain-of-thought
  • Math and code

DeepSeek V3

Fast general-purpose model with strong benchmarks.

via DeepSeek

  • 164K context window
  • Fast and efficient
  • Strong general performance
  • Open-weight

Gemini 2.5 Pro

Google's flagship model with a 1M-token context window.

via Google

  • 1M context window
  • Multimodal
  • Long-context reasoning
  • Code generation

Custom Models

Deploy and customize models to meet your specific needs

Fine-tuned Models

via Custom

  • Domain adaptation
  • Company knowledge base
  • Specialized tasks
  • Improved performance

Hugging Face Models

via Custom

  • Community models
  • Thousands of options
  • Specialized capabilities
  • Open source

Custom Embedding Models

via Custom

  • Domain-specific embeddings
  • Custom similarity metrics
  • Enhanced search
  • Optimized retrieval

Single API for Everything

Our unified API provides seamless access to all AI capabilities through a consistent, developer-friendly interface

Model Routing

Smart routing to optimal models based on task, cost, and performance requirements
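
One simple way to think about routing on cost and performance is "pick the cheapest model that meets the requirements". The sketch below shows that policy only; it is not how Hanzo's router works. The catalog entries (`small`, `medium`, `large`) and every number in them are invented for illustration.

```typescript
// Hypothetical catalog entry; all values are illustrative, not real measurements.
interface ModelInfo {
  name: string;
  costPerMTok: number; // USD per million tokens
  quality: number; // score in 0..1
  latencyMs: number; // typical response latency
}

interface Requirements {
  minQuality: number;
  maxLatencyMs: number;
}

const catalog: ModelInfo[] = [
  { name: "small", costPerMTok: 0.2, quality: 0.7, latencyMs: 80 },
  { name: "medium", costPerMTok: 1.0, quality: 0.85, latencyMs: 150 },
  { name: "large", costPerMTok: 5.0, quality: 0.95, latencyMs: 400 },
];

// Return the cheapest model that satisfies both constraints,
// or undefined when no model qualifies.
function routeModel(models: ModelInfo[], req: Requirements): ModelInfo | undefined {
  let best: ModelInfo | undefined;
  for (const m of models) {
    const eligible = m.quality >= req.minQuality && m.latencyMs <= req.maxLatencyMs;
    if (eligible && (best === undefined || m.costPerMTok < best.costPerMTok)) {
      best = m;
    }
  }
  return best;
}

const chosen = routeModel(catalog, { minQuality: 0.8, maxLatencyMs: 200 });
console.log(chosen?.name); // prints "medium"
```

Returning `undefined` instead of silently falling back keeps the "no model qualifies" case visible to the caller, who can then relax a constraint explicitly.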

Document Processing

Built-in document parsing, chunking, and semantic analysis capabilities
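
Chunking is usually the step that sits between parsing and indexing. As a minimal sketch, assuming fixed-size character windows with overlap (real pipelines often split on tokens, sentences, or headings instead), a hypothetical `chunkText` helper could look like this:

```typescript
// Split text into fixed-size character chunks, where consecutive chunks
// share `overlap` characters so context is not lost at the boundaries.
function chunkText(text: string, chunkSize: number, overlap: number): string[] {
  if (chunkSize <= 0) throw new Error("chunkSize must be positive");
  if (overlap < 0 || overlap >= chunkSize) {
    throw new Error("overlap must be in [0, chunkSize)");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    // Stop once this chunk has reached the end of the text.
    if (start + chunkSize >= text.length) break;
  }
  return chunks;
}

const chunks = chunkText("abcdefghij", 4, 1);
console.log(chunks); // prints [ 'abcd', 'defg', 'ghij' ]
```

The overlap trades a little extra storage for better recall: a sentence that straddles a boundary still appears whole in at least one chunk when the overlap is large enough.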

Vector Search

Integrated vector database for semantic search and retrieval-augmented generation

Knowledge Base

Create, manage, and query custom knowledge bases for your AI applications

Versatile AI Use Cases

Hanzo's AI platform supports a wide range of intelligent applications across industries

Conversational AI

Build intelligent chatbots, virtual assistants, and customer support agents with natural language understanding.

Generative Content

Create text, images, code, and other content with AI-powered generation and customization.

Knowledge Retrieval

Implement semantic search, question answering, and information extraction from your data.

Autonomous Agents

Deploy AI agents that can perform complex tasks, make decisions, and execute workflows autonomously.

Developer Tooling

Enhance your development workflow with AI-powered code generation, debugging, and documentation.

Voice & Speech

Convert speech to text, text to speech, and analyze voice interactions with advanced AI models.

Simple Implementation

Build powerful AI applications with just a few lines of code using our intuitive SDK

import { Hanzo } from '@hanzo/ai';

// Initialize the Hanzo AI client
const hanzo = new Hanzo({
  apiKey: process.env.HANZO_API_KEY
});

// Create a conversation with memory
const conversation = hanzo.conversation({
  model: 'gpt-4o',
  memory: true,
  system: 'You are a helpful assistant'
});

// Send a message and get a response
const response = await conversation.send('Tell me about AI engineering');

console.log(response);

Documentation Example

Vector Search

// Create a vector store
const vectorStore = hanzo.vectorStore('my-store');

// Add documents to the store
await vectorStore.addDocuments([
  { text: 'AI engineering best practices...' },
  { text: 'Deploying models to production...' }
]);

// Search for similar documents
const results = await vectorStore.search(
  'How to deploy AI models?', 
  { limit: 3 }
);

AI Agents

// Create an agent with tools
const agent = hanzo.agent({
  model: 'claude-3-opus',
  tools: [
    hanzo.tools.webSearch(),
    hanzo.tools.codeInterpreter(),
    vectorStore.asTool('knowledge')
  ]
});

// Run the agent with a task
const result = await agent.run(
  'Analyze our production metrics and suggest optimizations'
);

Trusted by Industry Leaders

Powering AI innovation at organizations of all sizes, from startups to Fortune 500 companies

Microsoft
Airbnb
Netflix
Stripe
Shopify
Spotify
Slack
Amazon

Rating: 5.0

"Hanzo's AI platform has transformed our ability to ship AI features quickly. What used to take months now takes days."

Jane Doe
CTO, TechInnovate

Rating: 5.0

"The observability features are game-changing. We finally have full visibility into our AI systems in production."

Michael Smith
AI Lead, EnterpriseAI

Rating: 5.0

"Our team went from prototype to production in just days. The SDK is intuitive and the documentation is excellent."

Emma Johnson
VP Engineering, StartupX

Experiences from Our Community

Hear from engineering teams who are building the next generation of AI-powered applications

"Hanzo's AI Engineering Platform has completely transformed how we build and deploy AI applications. What used to take us weeks now takes days."

Sarah Chen
CTO, Quantum Innovations

"The unified approach to model deployment, monitoring, and scaling has allowed our team to focus on building great AI experiences instead of wrestling with infrastructure."

James Wilson
Head of AI, GlobalTech

"We evaluated several AI platforms, but Hanzo stood out for its developer experience and comprehensive observability features. It's made monitoring our AI in production so much easier."

Maria Rodriguez
Principal Engineer, TechFlow

The AI Engineering Community

Join thousands of AI engineers and developers building the future of intelligent applications. Share experiences, get support, and collaborate on best practices.

  • 4,500+ community members
  • Weekly office hours
  • Dedicated support team

Start Building the Future of AI

Join thousands of developers and companies who are building intelligent, scalable applications with Hanzo's AI Engineering Platform

Documentation

Comprehensive guides, tutorials, and API references to help you build with Hanzo AI.

Explore Docs

Quickstart

Get up and running quickly with our step-by-step quickstart guides and example projects.

Try Quickstart

Community

Join our growing community of AI engineers, get support, and share your experiences.

Join Community

Ready to get started?

Sign up for free and start building with Hanzo AI today.