Meet your thinking partner.
Tackle any big, bold, bewildering challenge with Hanzo AI.
Complete AI Engineering Toolkit
Everything you need to build, deploy, and manage production-grade AI applications
AI Model Hub
Access a catalog of state-of-the-art foundation models from leading providers and Hanzo's specialized models.
AI Agents
Create autonomous agents that can reason, plan, and execute complex tasks with minimal human intervention.
Optimized Runtime
High-performance inference with automatic batching, caching, and efficient resource utilization.
Vector Database
Built-in vector storage for embeddings with automatic indexing and retrieval optimization.
Evaluation Suite
Comprehensive tools for testing, evaluating, and benchmarking AI models and applications.
AI Observability
Full visibility into AI system behavior with detailed metrics, logging, and performance analytics.
AI Safety & Guardrails
Advanced content filtering, privacy controls, and ethical guardrails for responsible AI deployment.
Enterprise Scale
Built for high-scale production workloads with auto-scaling, high availability, and global distribution.
Developer SDK
Intuitive libraries for Python, TypeScript, and other languages with comprehensive documentation.
Model Serving
Simplified deployment and management of custom models with automatic versioning and A/B testing.
Fine-tuning
User-friendly tools for customizing foundation models to your specific use cases and data.
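The Model Serving item above mentions A/B testing between model versions. One common way to split traffic is deterministic, hash-based assignment, so a given user always sees the same version. The sketch below is illustrative only: `hashString`, `assignVariant`, the `Variant` type, and the version names are hypothetical and are not part of the Hanzo SDK.

```typescript
// A model version and its share of traffic (weights are relative, not percentages).
type Variant = { version: string; weight: number };

// Simple 32-bit polynomial string hash; stable for the same input.
function hashString(input: string): number {
  let hash = 0;
  for (let i = 0; i < input.length; i++) {
    hash = (hash * 31 + input.charCodeAt(i)) >>> 0;
  }
  return hash;
}

// Pick a variant for a user. Weights are assumed to be non-negative.
function assignVariant(userId: string, variants: Variant[]): string {
  const total = variants.reduce((sum, v) => sum + v.weight, 0);
  if (total <= 0) throw new Error("weights must sum to a positive number");
  // Map the hash onto [0, total) and walk the cumulative weights.
  const point = (hashString(userId) / 4294967296) * total;
  let cumulative = 0;
  for (const v of variants) {
    cumulative += v.weight;
    if (point < cumulative) return v.version;
  }
  return variants[variants.length - 1].version; // guard against rounding
}

// Hypothetical 90/10 rollout between two versions of a custom model.
const rollout: Variant[] = [
  { version: "model-v1", weight: 90 },
  { version: "model-v2", weight: 10 }
];
const chosen = assignVariant("user-123", rollout);
console.log(chosen);
```

Because the assignment depends only on the user ID and the weights, no per-user state has to be stored to keep the experience consistent across requests.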
Unified AI Platform
A complete suite of AI capabilities, accessible through a single, consistent API, with everything you need to build powerful AI applications
Model Hub Access
One API for thousands of models from top providers and the open-source community
Vector Database
Built-in vector storage with automatic indexing for semantic search and RAG applications
Document Processing
Process, chunk, and index documents in 30+ formats with automatic metadata extraction
Semantic Search
Natural language search across your knowledge base with advanced relevance tuning
Agent Framework
Build autonomous AI agents with reasoning, planning, and tool-use capabilities
Code Generation
Specialized models for code completion, refactoring, and documentation
Workflow Orchestration
Chain AI operations with built-in caching, observability, and error handling
Usage Analytics
Comprehensive analytics and cost tracking across all AI operations
Security & Compliance
Enterprise-grade security with data residency options and compliance features
Chat Interfaces
Pre-built chat components with memory management and streaming responses
Prompt Management
Version, test, and optimize prompts across different models and environments
Optimized Edge Serving
Global edge deployment for ultra-low latency AI inference and responses
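The Document Processing item above mentions chunking documents before indexing. As a rough illustration of what chunking involves, the sketch below splits text into fixed-size character windows with overlap. The `chunkText` function and its parameters are hypothetical and say nothing about how Hanzo actually chunks documents.

```typescript
// Split text into fixed-size character windows that overlap, so content cut
// at one boundary still appears whole in a neighboring chunk.
// chunkSize and overlap are illustrative parameters, not Hanzo settings.
function chunkText(text: string, chunkSize: number, overlap: number): string[] {
  if (chunkSize <= 0) throw new Error("chunkSize must be positive");
  if (overlap < 0 || overlap >= chunkSize) {
    throw new Error("overlap must be in [0, chunkSize)");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // last window reached the end
  }
  return chunks;
}

const chunks = chunkText("abcdefghij", 4, 2);
console.log(chunks); // ["abcd", "cdef", "efgh", "ghij"]
```

Production pipelines usually split on sentence or token boundaries rather than raw characters, but the overlap idea is the same.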
AI Engineering Platform
Build, deploy, and scale AI applications with an integrated suite of tools designed for modern engineering teams.
AI Model Registry
Model Integration
Seamlessly integrate with OpenAI, Hugging Face, and other machine learning platforms. Deploy and serve custom ML models with built-in scaling and monitoring.
- One-click API connections to popular ML services
- Simple deployment of custom models with containerization
- Performance optimization for inference workloads
Vector Search
High-performance vector database capabilities for semantic search, RAG applications, and similarity matching across billions of vectors.
- Advanced indexing for fast k-NN and ANN queries
- Built-in embeddings generation from text and images
- Hybrid search combining vector and traditional queries
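To make the k-NN query mentioned above concrete, here is a minimal sketch of exact (brute-force) nearest-neighbor search by cosine similarity. It is not Hanzo's index: the `StoredVector` type and both function names are hypothetical, and a real vector database would use an approximate (ANN) index instead of scanning every vector.

```typescript
// An embedding stored with an identifier (hypothetical shape).
type StoredVector = { id: string; vector: number[] };

// Cosine similarity: dot product divided by the product of the vector norms.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // zero vectors match nothing
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Exact k-NN: score every stored vector, sort by similarity, keep the top k.
function nearestNeighbors(
  query: number[],
  docs: StoredVector[],
  k: number
): { id: string; score: number }[] {
  return docs
    .map((d) => ({ id: d.id, score: cosineSimilarity(query, d.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}

const stored: StoredVector[] = [
  { id: "a", vector: [1, 0] },
  { id: "b", vector: [0, 1] },
  { id: "c", vector: [1, 1] }
];
const top2 = nearestNeighbors([1, 0.1], stored, 2);
console.log(top2.map((r) => r.id)); // ["a", "c"]
```

The brute-force scan costs one similarity computation per stored vector, which is why indexes that trade a little accuracy for speed matter at large scale.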
Data Processing
Specialized data processing pipelines for cleaning, transforming, and enriching training data for machine learning models.
- Automated ETL workflows for AI data preparation
- Data versioning and lineage tracking
- Scalable batch and stream processing
AI-Enhanced Features
Ready-to-use AI capabilities that can be integrated into applications with minimal configuration.
- Content generation and summarization
- Image and video analysis with computer vision
- Real-time anomaly detection and predictive analytics
Unified Model Access
Access thousands of AI models through a single, unified API with consistent interfaces and predictable pricing
Hanzo Zen Models
14 foundation models across language, code, vision, multimodal, and specialized tasks
Zen4 -- Flagship
Flagship model optimized for complex reasoning and multi-domain tasks.
via Hanzo
- 744B MoE (40B active)
- 202K context window
- Ultra Max tier
- $3 / $9.60 per MTok
Zen4 Ultra -- Maximum Reasoning
Maximum reasoning capability with extended thinking for complex problems.
via Hanzo
- 744B MoE (40B active) + CoT
- 202K context window
- Ultra Max tier
- Deep reasoning with chain-of-thought
Zen4 Pro -- High Capability
High-capability model with efficient MoE architecture for demanding workloads.
via Hanzo
- 80B MoE (3B active)
- 131K context window
- Ultra tier
- Efficient MoE architecture
Zen4 Max -- Extended Context
Extended context flagship model for large-scale analysis and document processing.
via Hanzo
- 1.04T MoE (32B active)
- 256K context window
- Ultra Max tier
- Largest Zen model
Zen4 Coder -- Code Generation
Code-specialized model for generation, review, and debugging.
via Hanzo
- 480B MoE (35B active)
- 262K context window
- Ultra tier
- Code generation and review
Zen4 Coder Pro -- Premium Code
Full-precision code model for maximum accuracy on complex codebases.
via Hanzo
- 480B Dense BF16
- 262K context window
- Ultra Max tier
- Full-precision code analysis
Zen4 Coder Flash -- Fast Code
Lightweight code model optimized for speed and inline completions.
via Hanzo
- 30B MoE (3B active)
- 262K context window
- Pro Max tier
- Fast inline completions
Zen4 Thinking -- Deep Reasoning
Dedicated reasoning model with explicit chain-of-thought capabilities.
via Hanzo
- 80B MoE (3B active) + CoT
- 131K context window
- Pro Max tier
- Chain-of-thought
Zen4 Mini -- Fast & Efficient
Lightweight model optimized for speed and cost efficiency.
via Hanzo
- 8B Dense Transformer
- 40K context window
- Pro tier
- Ultra-fast inference
Zen3 Omni -- Hypermodal
Multimodal model supporting text, vision, audio, and structured output.
via Hanzo
- ~200B Dense Multimodal
- 202K context window
- Pro Max tier
- Text, vision, and audio
Zen3 VL -- Vision-Language
Vision-language model for image understanding and visual reasoning.
via Hanzo
- 30B MoE (3B active)
- 131K context window
- Pro Max tier
- Image understanding
Zen3 Nano -- Edge
Ultra-lightweight model for edge deployment and low-latency tasks.
via Hanzo
- 4B Dense Transformer
- 40K context window
- Pro tier
- Edge deployment
Zen3 Guard -- Content Safety
Content safety classifier for moderation and guardrails.
via Hanzo
- 4B Dense Transformer
- 40K context window
- Pro tier
- Content moderation
Zen3 Embedding -- Text Embeddings
High-quality text embeddings for search, clustering, and retrieval.
via Hanzo
- High-dimensional embedding model
- 8K context window
- Pro Max tier
- 3072 dimensions
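Per-token pricing such as the "$3 / $9.60 per MTok" listed for Zen4 above can be turned into a per-request estimate with simple arithmetic. The sketch below assumes the two figures are input and output prices respectively, which the listing does not state. The `Pricing` type, the `estimateCostUSD` function, and the token counts are hypothetical.

```typescript
// Prices in USD per million tokens (MTok).
type Pricing = { inputPerMTok: number; outputPerMTok: number };

// Cost = input tokens at the input rate + output tokens at the output rate.
function estimateCostUSD(
  pricing: Pricing,
  inputTokens: number,
  outputTokens: number
): number {
  return (
    (inputTokens / 1e6) * pricing.inputPerMTok +
    (outputTokens / 1e6) * pricing.outputPerMTok
  );
}

// Assumption: "$3 / $9.60" is read as input / output.
const zen4Pricing: Pricing = { inputPerMTok: 3, outputPerMTok: 9.6 };

// Example request: 200,000 input tokens and 50,000 output tokens.
// 0.2 * 3 + 0.05 * 9.6 = 0.60 + 0.48 = 1.08 (up to floating-point rounding)
const estimate = estimateCostUSD(zen4Pricing, 200000, 50000);
console.log(estimate.toFixed(2)); // "1.08"
```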
Third-Party Models
100+ industry-leading models available through the Hanzo AI Cloud gateway
Claude Opus 4.6
Anthropic's most powerful model for the hardest tasks.
via Anthropic
- 1M context window
- Most capable model
- Complex reasoning
- Extended thinking
Claude Sonnet 4.6
Ideal balance of capability and speed for production workloads.
via Anthropic
- 1M context window
- Best balance of speed and intelligence
- Strong coding
- Fast inference
Claude Haiku 4.5
Fastest and most affordable Claude model for high-throughput tasks.
via Anthropic
- 200K context window
- Fastest Anthropic model
- Cost efficient
- Low latency
GPT-5
OpenAI's flagship model with advanced reasoning capabilities.
via OpenAI
- 400K context window
- Multimodal
- Advanced reasoning
- Tool use
GPT-5 Mini
Cost-efficient OpenAI model for everyday tasks.
via OpenAI
- 400K context window
- Fast and affordable
- Good quality
- Low latency
DeepSeek R1
Specialized reasoning model excelling at math and code.
via DeepSeek
- 64K context window
- Reasoning model
- Chain-of-thought
- Math and code
DeepSeek V3
Fast general-purpose model with strong benchmarks.
via DeepSeek
- 164K context window
- Fast and efficient
- Strong general performance
- Open-weight
Gemini 2.5 Pro
Google's flagship model with a 1M-token context window.
via Google
- 1M context window
- Multimodal
- Long-context reasoning
- Code generation
Custom Models
Deploy and customize models to meet your specific needs
Fine-tuned Models
via Custom
- Domain adaptation
- Company knowledge base
- Specialized tasks
- Improved performance
Hugging Face Models
via Custom
- Community models
- Thousands of options
- Specialized capabilities
- Open source
Custom Embedding Models
via Custom
- Domain-specific embeddings
- Custom similarity metrics
- Enhanced search
- Optimized retrieval
Single API for Everything
Our unified API provides seamless access to all AI capabilities through a consistent, developer-friendly interface
Model Routing
Smart routing to optimal models based on task, cost, and performance requirements
Document Processing
Built-in document parsing, chunking, and semantic analysis capabilities
Vector Search
Integrated vector database for semantic search and retrieval-augmented generation
Knowledge Base
Create, manage, and query custom knowledge bases for your AI applications
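The Model Routing item above describes choosing a model by task, cost, and performance. One simple way to express that idea is to filter candidates by hard constraints and then pick the best remaining option. The sketch below is an assumption about how such routing could work, not a description of Hanzo's router. All model names and numbers in it are made-up placeholders.

```typescript
type Task = "chat" | "code" | "embedding";

// A routable model. Every value here is a placeholder, not real catalog data.
type ModelOption = {
  name: string;
  tasks: Task[];
  costPerMTok: number;  // USD per million tokens
  p50LatencyMs: number; // typical latency
  quality: number;      // relative score in [0, 1]
};

type Requirements = { task: Task; maxCostPerMTok: number; maxLatencyMs: number };

// Keep models that support the task and fit the budgets, then prefer the
// highest quality, breaking ties by lower cost.
function routeModel(
  options: ModelOption[],
  req: Requirements
): ModelOption | undefined {
  const eligible = options.filter(
    (m) =>
      m.tasks.indexOf(req.task) !== -1 &&
      m.costPerMTok <= req.maxCostPerMTok &&
      m.p50LatencyMs <= req.maxLatencyMs
  );
  eligible.sort(
    (a, b) => b.quality - a.quality || a.costPerMTok - b.costPerMTok
  );
  return eligible[0];
}

const candidates: ModelOption[] = [
  { name: "large-general", tasks: ["chat", "code"], costPerMTok: 10, p50LatencyMs: 900, quality: 0.9 },
  { name: "small-fast", tasks: ["chat"], costPerMTok: 1, p50LatencyMs: 150, quality: 0.6 },
  { name: "code-specialist", tasks: ["code"], costPerMTok: 4, p50LatencyMs: 400, quality: 0.8 }
];

const picked = routeModel(candidates, { task: "code", maxCostPerMTok: 5, maxLatencyMs: 1000 });
console.log(picked ? picked.name : "no model fits"); // "code-specialist"
```

Returning `undefined` when nothing fits lets the caller decide whether to relax a constraint or fail the request.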
Versatile AI Use Cases
Hanzo's AI platform supports a wide range of intelligent applications across industries
Conversational AI
Build intelligent chatbots, virtual assistants, and customer support agents with natural language understanding.
Generative Content
Create text, images, code, and other content with AI-powered generation and customization.
Knowledge Retrieval
Implement semantic search, question answering, and information extraction from your data.
Autonomous Agents
Deploy AI agents that can perform complex tasks, make decisions, and execute workflows autonomously.
Developer Tooling
Enhance your development workflow with AI-powered code generation, debugging, and documentation.
Voice & Speech
Convert speech to text, text to speech, and analyze voice interactions with advanced AI models.
Simple Implementation
Build powerful AI applications with just a few lines of code using our intuitive SDK
import { Hanzo } from '@hanzo/ai';

// Initialize the Hanzo AI client
const hanzo = new Hanzo({
  apiKey: process.env.HANZO_API_KEY
});

// Create a conversation with memory
const conversation = hanzo.conversation({
  model: 'gpt-4o',
  memory: true,
  system: 'You are a helpful assistant'
});

// Send a message and get a response
const response = await conversation.send('Tell me about AI engineering');
console.log(response);
Documentation Example
Vector Search
// Create a vector store
const vectorStore = hanzo.vectorStore('my-store');

// Add documents to the store
await vectorStore.addDocuments([
  { text: 'AI engineering best practices...' },
  { text: 'Deploying models to production...' }
]);

// Search for similar documents
const results = await vectorStore.search(
  'How to deploy AI models?',
  { limit: 3 }
);
AI Agents
// Create an agent with tools
const agent = hanzo.agent({
  model: 'claude-3-opus',
  tools: [
    hanzo.tools.webSearch(),
    hanzo.tools.codeInterpreter(),
    vectorStore.asTool('knowledge')
  ]
});

// Run the agent with a task
const result = await agent.run(
  'Analyze our production metrics and suggest optimizations'
);
Trusted by Industry Leaders
Powering AI innovation at organizations of all sizes, from startups to Fortune 500 companies
"Hanzo's AI platform has transformed our ability to ship AI features quickly. What used to take months now takes days."
"The observability features are game-changing. We finally have full visibility into our AI systems in production."
"Our team went from prototype to production in just days. The SDK is intuitive and the documentation is excellent."
Experiences from Our Community
Hear from engineering teams who are building the next generation of AI-powered applications
"Hanzo's AI Engineering Platform has completely transformed how we build and deploy AI applications. What used to take us weeks now takes days."
"The unified approach to model deployment, monitoring, and scaling has allowed our team to focus on building great AI experiences instead of wrestling with infrastructure."
"We evaluated several AI platforms, but Hanzo stood out for its developer experience and comprehensive observability features. It's made monitoring our AI in production so much easier."
The AI Engineering Community
Join thousands of AI engineers and developers building the future of intelligent applications. Share experiences, get support, and collaborate on best practices.
Start Building the Future of AI
Join thousands of developers and companies who are building intelligent, scalable applications with Hanzo's AI Engineering Platform
Documentation
Comprehensive guides, tutorials, and API references to help you build with Hanzo AI.
Explore Docs
Quickstart
Get up and running quickly with our step-by-step quickstart guides and example projects.
Try Quickstart
Community
Join our growing community of AI engineers, get support, and share your experiences.
Join Community
Ready to get started?
Sign up for free and start building with Hanzo AI today.