The AI-Native Database for LLM Applications

Provides incredibly fast hybrid search across dense embeddings, sparse embeddings, tensors, and full text.

Core Features

Infinity offers high performance, flexibility, ease of use, and advanced features designed to meet the challenges of future AI applications.

Incredibly fast

Achieves 0.1-millisecond query latency on million-scale vector datasets.
Up to 15K QPS on million-scale vector datasets.

Powerful search

Supports hybrid search across dense embeddings, sparse embeddings, tensors, and full text, combined with filtering.
Supports several types of rerankers including RRF, weighted sum, and ColBERT.
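
To illustrate what fusion-style reranking does, here is a minimal, library-independent Python sketch of two of the strategies named above: reciprocal rank fusion (RRF) and weighted sum. It is not Infinity's implementation and does not use Infinity's API. The function names `rrf_fuse` and `weighted_sum_fuse`, the sample document IDs, and the constant `k=60` (a commonly used default for RRF) are assumptions made for illustration only. ColBERT reranking is model-based and is not covered here.

```python
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Fuse several ranked lists of document IDs with reciprocal rank fusion.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    where rank starts at 1. k=60 is an assumed, commonly used default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first.
    ordered = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    return [doc_id for doc_id, _ in ordered]


def weighted_sum_fuse(score_maps, weights):
    """Fuse per-retriever score dictionaries with a weighted sum.

    A document missing from a retriever's results contributes 0 for that
    retriever. Scores are assumed to be on comparable scales already.
    """
    if len(score_maps) != len(weights):
        raise ValueError("need exactly one weight per score map")
    fused = defaultdict(float)
    for score_map, weight in zip(score_maps, weights):
        for doc_id, score in score_map.items():
            fused[doc_id] += weight * score
    ordered = sorted(fused.items(), key=lambda kv: kv[1], reverse=True)
    return [doc_id for doc_id, _ in ordered]


# Hypothetical result lists from three retrieval paths.
dense_hits = ["a", "b", "c"]
sparse_hits = ["b", "c", "d"]
fulltext_hits = ["b", "a", "d"]
rrf_order = rrf_fuse([dense_hits, sparse_hits, fulltext_hits])

# Hypothetical relevance scores from two retrieval paths.
dense_scores = {"a": 0.9, "b": 0.8}
text_scores = {"b": 0.5, "c": 1.0}
weighted_order = weighted_sum_fuse([dense_scores, text_scores], [0.7, 0.3])
```

RRF needs only rank positions, so it works even when the retrievers produce scores on incompatible scales. A weighted sum lets you favor one retriever over another, but it assumes the scores have been normalized to a comparable range first.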

Rich data types

Supports a wide range of data types including strings, numerics, vectors, and more.

Ease of use

Intuitive Python API.
A single-binary architecture with no dependencies, making deployment a breeze.