A serverless vector database

built from first principles on object storage: 10-100x cheaper, usage-based pricing, massive scalability

Apply for access

         ╔════════════╗          
         ║            ║░         
      ┌──║   client   ║░         
      │  ║            ║░         
     API ╚════════════╝░         
      │   ░░░░░░░░░░░░░░         
      └─────────┐                
                │                
                ▼                
╔═turbopuffer══════════════════╗ 
║                              ║░
║  ┏━━━━━━━━━━━━━━━━━━━━━━━━┓  ║░
║  ┃        Memory/         ┃  ║░
║  ┃       SSD Cache        ┃  ║░
║  ┗━━━━━━━━━━━━━━━━━━━━━━━━┛  ║░
║               │              ║░
║               ▼              ║░
║     ┏━━━━━━━━━━━━━━━━━━━┓    ║░
║     ┃Object storage (S3)┃    ║░
║     ┗━━━━━━━━━━━━━━━━━━━┛    ║░
║                              ║░
╚══════════════════════════════╝░
 ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
Cost calculator
768
ItemsUnit costQuantityTotal
Storage
$1.00
Writes
$1.00
Queries
$0.00
Estimated cost

$2.00

per month

Query latency

Warm query: 1m vectors

Percentile

Latency

P50
28ms
P90
37ms
P99
63ms
MAX
98ms

Warm queries have all their data in cache.

Search for 100 random vectors from the dataset with top k = 10, when dataset is fully in cache. Warming the cache for an index takes about ~10s (for 1m vectors) after the first cold query, and typically stays in cache for a few hours.

Hear from our customers

Justin Watts

Distinguished Engineer

Photo of Justin Watts

Moving to turbopuffer felt less like an upgrade and more like discovering a new paradigm. We didn’t just save costs; we turned once-prohibitive features into standard tools in our arsenal. Vectorize all the things and say goodbye to sharding

MTL 🇨🇦

Telus logo
Limits
MetricMax seen in productionProduction limits (current)Production limits (soon)
Max vectors (global)5.5B+
Max vectors (per namespace)100M+
Number of namespaces2M+Unlimited
Max dimensions10,752
Max inactive time in cacheContact us if you need custom
Write rate (global)10,000
Write rate (per namespace)10,000 10,000 vec/s100,000 vec/s
QPS (global)>100 QPS
Max QPS (per namespace)~20 QPS100+ QPS10,000 QPS
~90-95%~90-95%Configurable
See full list

FAQ

Contact
Email us
© 2024 turbopuffer Inc.
Privacy PolicyTerms of service
SOC2 Type 1 certified