A serverless vector database

built from first principles on object storage: 10-100x cheaper, usage-based pricing, massive scalability

         ╔════════════╗          
         ║            ║░         
      ┌──║   client   ║░         
      │  ║            ║░         
     API ╚════════════╝░         
      │   ░░░░░░░░░░░░░░         
      └─────────┐                
                │                
                ▼                
╔═turbopuffer══════════════════╗ 
║                              ║░
║  ┏━━━━━━━━━━━━━━━━━━━━━━━━┓  ║░
║  ┃        Memory/         ┃  ║░
║  ┃       SSD Cache        ┃  ║░
║  ┗━━━━━━━━━━━━━━━━━━━━━━━━┛  ║░
║               │              ║░
║               ▼              ║░
║     ┏━━━━━━━━━━━━━━━━━━━┓    ║░
║     ┃Object storage (S3)┃    ║░
║     ┗━━━━━━━━━━━━━━━━━━━┛    ║░
║                              ║░
╚══════════════════════════════╝░
 ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
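
The read path in the diagram (a memory/SSD cache in front of object storage) resembles a read-through cache. The sketch below is a minimal illustration under that assumption, not turbopuffer's implementation: `ReadThroughCache`, the `fetch_from_object_storage` callable, and the dictionary standing in for S3 are hypothetical names chosen for this example.

```python
from typing import Callable, Dict


class ReadThroughCache:
    """Serve reads from a local cache, falling back to object storage on a miss."""

    def __init__(self, fetch_from_object_storage: Callable[[str], bytes]) -> None:
        self._fetch = fetch_from_object_storage
        self._cache: Dict[str, bytes] = {}
        self.hits = 0
        self.misses = 0

    def get(self, key: str) -> bytes:
        if key in self._cache:
            # Warm path: the data is already cached locally.
            self.hits += 1
            return self._cache[key]
        # Cold path: fetch from object storage, then keep a local copy.
        self.misses += 1
        value = self._fetch(key)
        self._cache[key] = value
        return value


# A plain dict stands in for an object store such as S3 (illustration only).
fake_object_store = {"namespace-a/index": b"vector index bytes"}
cache = ReadThroughCache(fake_object_store.__getitem__)

first = cache.get("namespace-a/index")   # cold: fetched from object storage
second = cache.get("namespace-a/index")  # warm: served from the cache
```

The first read pays the object-storage round trip and later reads are served locally, which is the distinction the page draws between cold and warm queries.
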
Cost calculator
768
Items | Unit cost | Quantity | Total
Storage | | | $1.00
Writes | | | $1.00
Queries | | | $0.00

Estimated cost: $2.00 per month

Query latency

Warm query: 1M vectors

Percentile | Latency
P50 | 28ms
P90 | 37ms
P99 | 63ms
MAX | 98ms

Warm queries have all their data in cache.

The benchmark searches for 100 random vectors from the dataset with top k = 10 while the dataset is fully in cache. Warming the cache for an index takes about 10s (for 1M vectors) after the first cold query, and the index typically stays in cache for a few hours.
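
Percentile figures like those reported here can be derived from a list of per-query latencies. The following is a rough sketch using the nearest-rank method. The page does not say which percentile method was used, so that choice is an assumption, and the sample latencies are made up for illustration rather than measured.

```python
import math
from typing import Sequence


def percentile(samples: Sequence[float], p: float) -> float:
    """Nearest-rank percentile: the smallest sample with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("samples must be non-empty")
    ordered = sorted(samples)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical latencies in milliseconds (1.0, 2.0, ..., 100.0), not real measurements.
latencies_ms = [float(v) for v in range(1, 101)]

summary = {
    "P50": percentile(latencies_ms, 50),
    "P90": percentile(latencies_ms, 90),
    "P99": percentile(latencies_ms, 99),
    "MAX": max(latencies_ms),
}
```

With only 100 samples, P99 and MAX are set by one or two queries each, so they vary more between runs than P50 does.
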

Hear from our customers

"Moving to turbopuffer felt less like an upgrade and more like discovering a new paradigm. We didn’t just save costs; we turned once-prohibitive features into standard tools in our arsenal. Vectorize all the things and say goodbye to sharding."

Justin Watts, Distinguished Engineer, Telus

Limits

Metric | In production | Limits (current) | Limits (soon)
Max vectors (global) | 3B+ | Unlimited
Max vectors (per namespace) | 100M+
Number of namespaces | 1.5M | Unlimited
Max dimensions | 10,752 | 10,752 | 10,752
Max inactive time in cache | Configurable | Configurable
Write rate (global) | 10,000 | Unlimited
Write rate (per namespace) | 10,000 | 10,000 vec/s | 10,000 vec/s
Max write batch rate (per namespace) | 1 batch/s
Max QPS (global) | ~40 QPS | Unlimited
Max QPS (per namespace) | ~20 QPS | 100+ QPS | 10,000 QPS
 | ~90-95% | ~90-95% | Configurable
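
A client that wants to stay within the per-namespace write limits might split its writes into batches before sending them. The sketch below assumes one reading of the table: at most one batch per second, with each batch holding up to 10,000 vectors. Both the interpretation and the `plan_batches` helper are illustrative assumptions, not part of any turbopuffer client.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def plan_batches(items: Sequence[T], max_batch_size: int = 10_000) -> Iterator[List[T]]:
    """Yield consecutive batches of at most max_batch_size items."""
    if max_batch_size <= 0:
        raise ValueError("max_batch_size must be positive")
    for start in range(0, len(items), max_batch_size):
        yield list(items[start:start + max_batch_size])


# Integers stand in for vectors here; a real client would send embeddings.
vectors = list(range(25_000))
batches = list(plan_batches(vectors))
batch_sizes = [len(batch) for batch in batches]

# Under the assumed limit of one batch per second, sending takes at least this long.
min_seconds = len(batches)
```

A caller would send one batch, wait until a second has elapsed, and then send the next. The pacing loop is omitted to keep the sketch short.
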

© 2024 turbopuffer Inc.

All rights reserved.
