Milvus

Milvus is an open-source vector database originally created by Zilliz in 2019 and now a graduated project of the LF AI & Data Foundation, designed for trillion-scale vector search across distributed nodes. The platform supports multiple index types including HNSW, IVF, IVF-PQ, IVF-SQ, DiskANN, and the GPU-accelerated CAGRA index, letting operators pick the right balance of recall, latency, and cost for each workload. Milvus separates compute and storage so individual components can scale independently, and it integrates with cloud object stores like S3 and MinIO for durable persistence. The managed cloud version, Zilliz Cloud, runs on AWS, GCP, and Azure with serverless billing. Enterprises favor Milvus for very large-scale RAG and recommendation workloads where billion-vector indexes are routine, and for its strict open-source license under Apache 2.0 satisfying AI governance vendor-neutrality requirements. Milvus 2.x added strong consistency, multi-tenancy via partitions, and improved write throughput for streaming ingestion of embeddings.

Milvus + Centralpoint at scale: Centralpoint supports Milvus and Zilliz Cloud as one of several vector backends, letting you operate billion-vector indexes inside your environment while routing generation to OpenAI, Anthropic, Gemini, or LLAMA. Token use is metered centrally, prompts stay local, and a fleet of chatbots backed by Milvus can deploy across portals with one line of JavaScript.


Related Keywords:
Milvus,,