Retrieval Infrastructure for AI
From demos to large-scale products, from chatbots to AI Agents, ZeroEntropy provides the retrieval backbone you need.
/ Models
zerank-2
Our flagship reranker model
$0.025 / MM Tokens
Rate limit: 2,500,000 UTF-8 bytes per minute
Weights available on HuggingFace
Self-serve with Slack community support
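
Because the rate limit above is stated in UTF-8 bytes rather than characters or tokens, non-ASCII text uses the budget faster than its character count suggests. The snippet below is a minimal sketch of how a client might size a rerank request against that budget. It assumes the query and every document both count toward the limit, which the listing does not specify; `request_size` and `max_requests_per_minute` are hypothetical helper names, not part of any ZeroEntropy SDK.

```python
# Figure taken from the zerank-2 listing above.
RATE_LIMIT_BYTES_PER_MINUTE = 2_500_000


def utf8_size(text: str) -> int:
    """Number of bytes the text occupies once encoded as UTF-8."""
    return len(text.encode("utf-8"))


def request_size(query: str, documents: list[str]) -> int:
    """Bytes in one request. Assumption: query and all documents are counted."""
    return utf8_size(query) + sum(utf8_size(doc) for doc in documents)


def max_requests_per_minute(query: str, documents: list[str]) -> int:
    """How many identical requests fit in one minute's byte budget."""
    size = request_size(query, documents)
    if size == 0:
        return 0  # an empty request carries nothing to rerank
    return RATE_LIMIT_BYTES_PER_MINUTE // size


if __name__ == "__main__":
    # "café" is 4 characters but 5 bytes, because "é" takes 2 bytes in UTF-8.
    print(utf8_size("café"))
    print(max_requests_per_minute("q", ["café", "abc"]))
```

Measuring the encoded size rather than `len(text)` matters most for multilingual corpora, where a single character can occupy up to four bytes.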
zembed-1
Our flagship embedding model
$0.025 / MM Tokens
$0.050
State-of-the-art embedding model
Weights available upon request
White-glove fine-tuning and evaluation
ze on-prem
Our SOTA models on your infra
Contact Us
Model licensing and deployment
White-glove and priority support
Optional evaluations and fine-tuning
/ Search API
Pay-As-You-Go
Pay only for what you use. No subscriptions.
Usage-Based
OCR: $1.75 / 1,000 pages
Indexing: $0.50 / MB
Storage: $0.10 / MB / month
Queries: $1.50 / TB queried
Reranking: $0.025 / MM tokens
Embeddings: $0.05 / MM tokens
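
As a rough illustration of how the usage-based rates above combine into a monthly bill, the sketch below multiplies each metered quantity by its listed rate. It assumes linear pricing with no minimums, free tier, or rounding per line item, none of which the price list confirms; `MonthlyUsage` and `estimate_monthly_cost` are hypothetical names used only for this example.

```python
from dataclasses import dataclass

# Rates in USD, copied from the usage-based price list above.
OCR_PER_1000_PAGES = 1.75
INDEXING_PER_MB = 0.50
STORAGE_PER_MB_MONTH = 0.10
QUERIES_PER_TB = 1.50
RERANK_PER_MM_TOKENS = 0.025
EMBED_PER_MM_TOKENS = 0.05


@dataclass
class MonthlyUsage:
    """Metered quantities for one month (hypothetical container)."""
    ocr_pages: int = 0
    indexed_mb: float = 0.0
    stored_mb: float = 0.0
    queried_tb: float = 0.0
    rerank_tokens: int = 0
    embed_tokens: int = 0


def estimate_monthly_cost(usage: MonthlyUsage) -> dict[str, float]:
    """Per-item and total cost, assuming each rate scales linearly."""
    items = {
        "ocr": usage.ocr_pages / 1000 * OCR_PER_1000_PAGES,
        "indexing": usage.indexed_mb * INDEXING_PER_MB,
        "storage": usage.stored_mb * STORAGE_PER_MB_MONTH,
        "queries": usage.queried_tb * QUERIES_PER_TB,
        "reranking": usage.rerank_tokens / 1_000_000 * RERANK_PER_MM_TOKENS,
        "embeddings": usage.embed_tokens / 1_000_000 * EMBED_PER_MM_TOKENS,
    }
    items["total"] = sum(items.values())
    # Round only at the end so small float errors do not accumulate.
    return {name: round(cost, 2) for name, cost in items.items()}


if __name__ == "__main__":
    usage = MonthlyUsage(
        ocr_pages=10_000,
        indexed_mb=200,
        stored_mb=200,
        queried_tb=2,
        rerank_tokens=40_000_000,
        embed_tokens=10_000_000,
    )
    for name, cost in estimate_monthly_cost(usage).items():
        print(f"{name:>10}: ${cost:.2f}")
```

With these example quantities the estimate comes to $142.00, of which indexing ($100.00) and storage ($20.00) are the largest items. Indexing is a one-time charge per MB while storage recurs monthly, so later months for the same corpus would be lower under these assumptions.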
Enterprise
Critical performance & security.
Contact Us
Volume discounts
On-premises deployment within your VPC
99.99% SLA
White-glove onboarding
Priority access to new features and models
Custom integrations and DevOps support
From security to scale, ZeroEntropy is built for the demands of production-ready AI.

SOC 2 Type II
Audited controls for data security, availability, and confidentiality — verified annually.

HIPAA Compliant
BAA-ready infrastructure with encryption at rest and in transit for protected health data.

GDPR Compliant
Full data residency controls, right-to-deletion, and DPA agreements for EU customers.

CCPA Compliant
Consumer data rights honored with full transparency on collection, use, and deletion.
