A serverless vector database

built from first principles on object storage: 10-100x cheaper, usage-based pricing, massive scalability

Apply for access

         ╔════════════╗          
         ║            ║░         
      ┌──║   client   ║░         
      │  ║            ║░         
     API ╚════════════╝░         
      │   ░░░░░░░░░░░░░░         
      └─────────┐                
                │                
                ▼                
╔═turbopuffer══════════════════╗ 
║                              ║░
║  ┏━━━━━━━━━━━━━━━━━━━━━━━━┓  ║░
║  ┃        Memory/         ┃  ║░
║  ┃       SSD Cache        ┃  ║░
║  ┗━━━━━━━━━━━━━━━━━━━━━━━━┛  ║░
║               │              ║░
║               ▼              ║░
║     ┏━━━━━━━━━━━━━━━━━━━┓    ║░
║     ┃Object storage (S3)┃    ║░
║     ┗━━━━━━━━━━━━━━━━━━━┛    ║░
║                              ║░
╚══════════════════════════════╝░
 ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Cost calculator
768
ItemsUnit costQuantityTotal
Storage
$1.00
Writes
$1.00
Queries
$0.00
Estimated cost

$2.00

per month

Query latency

Warm query: 1m vectors

Percentile

Latency

P50
28ms
P90
37ms
P99
63ms
MAX
98ms

Warm queries have all their data in cache.

Search for 100 random vectors from the dataset with top k = 10, when dataset is fully in cache. Warming the cache for an index takes about ~10s (for 1m vectors) after the first cold query, and typically stays in cache for a few hours.

Hear from our customers

Moving to turbopuffer felt less like an upgrade and more like discovering a new paradigm. We didn’t just save costs; we turned once-prohibitive features into standard tools in our arsenal. Vectorize all the things and say goodbye to sharding
Profile photo of Justin Watts

Justin Watts

Distinguished Engineer

Telus logo
Limits
MetricMax seen in productionProduction limits (current)Production limits (soon)
Max vectors (global)5B+
Max vectors (per namespace)100M+
Number of namespaces2M+Unlimited
Max dimensions10,752
Max inactive time in cacheContact us if you need custom
Write rate (global)10,000
Write rate (per namespace)10,000 10,000 vec/s100,000 vec/s
Max write batch rate (per namespace)1 batch/s
QPS (global)~60 QPS
Max QPS (per namespace)~20 QPS100+ QPS10,000 QPS
~90-95%~90-95%Configurable
See full list

FAQ

Contact
Email us
© 2024 turbopuffer Inc.
Privacy PolicyTerms of service
SOC2 Type 1 certified