A serverless vector database

built from first principles on object storage: 10-100x cheaper, usage-based pricing, and massive scalability

Apply for access

         ╔════════════╗          
         ║            ║░         
      ┌──║   client   ║░         
      │  ║            ║░         
     API ╚════════════╝░         
      │   ░░░░░░░░░░░░░░         
      └─────────┐                
                │                
                ▼                
╔═turbopuffer══════════════════╗ 
║                              ║░
║  ┏━━━━━━━━━━━━━━━━━━━━━━━━┓  ║░
║  ┃        Memory/         ┃  ║░
║  ┃       SSD Cache        ┃  ║░
║  ┗━━━━━━━━━━━━━━━━━━━━━━━━┛  ║░
║               │              ║░
║               ▼              ║░
║     ┏━━━━━━━━━━━━━━━━━━━┓    ║░
║     ┃Object storage (S3)┃    ║░
║     ┗━━━━━━━━━━━━━━━━━━━┛    ║░
║                              ║░
╚══════════════════════════════╝░
 ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Query latency

Warm query: 1m vectors

Percentile

Latency

P50
28ms
P90
37ms
P99
63ms
MAX
98ms

Warm queries have all their data in cache.

Search for 100 random vectors from the dataset with top k = 10, when dataset is fully in cache. Warming the cache for an index takes about ~10s (for 1m vectors) after the first cold query, and typically stays in cache for a few hours.

Hear from our customers

Justin Watts

Distinguished Engineer

Photo of Justin Watts

Moving to turbopuffer felt less like an upgrade and more like discovering a new paradigm. We didn’t just save costs; we turned once-prohibitive features into standard tools in our arsenal. Vectorize all the things and say goodbye to sharding

MTL 🇨🇦

Telus logo
Limits
MetricMax seen in productionProduction limits (current)Production limits (soon)
Max documents (global)8B+
Max documents (per namespace)100M+
Number of namespaces3.5M+Unlimited
Max dimensions10,752
Max inactive time in cacheContact us if you need custom
Write rate (global)10,000
Write rate (per namespace)10,000 10,000 doc/s100,000 doc/s
QPS (global)>200 QPS
Max QPS (per namespace)~20 QPS100+ QPS10,000 QPS
~90-95%~90-95%Configurable
See full list and workloads that are a great fit for turbopuffer 🐡

FAQ

© 2024 turbopuffer Inc.
Privacy PolicyTerms of service