The AI-Native Database for LLM Applications
Providing incredibly fast hybrid search of dense embedding, sparse embedding, tensor, and full-text
Core Features
Infinity offers top performance, flexibility, easy usability, and advanced features for future AI application challenges.
Incredibly fast
- Achieves 0.1 milliseconds query latency on million-scale vector datasets.
- Up to 15K QPS on million-scale vector datasets.
Powerful search
- Supports a hybrid search of dense embedding, sparse embedding, tensor, and full text, in addition to filtering.
- Supports several types of rerankers including RRF, weighted sum, and ColBERT.
Rich data types
- Supports a wide range of data types including strings, numerics, vectors, and more.
Ease-of-use
- Intuitive Python API.
- A single-binary architecture with no dependencies, making deployment a breeze.