Logo

Search every byte

serverless vector and full-text search built from first principles on object storage: fast, 10x cheaper, and extremely scalable

Apply for access

         ╔════════════╗          
         ║            ║░         
      ┌──║   client   ║░         
      │  ║            ║░         
     API ╚════════════╝░         
      │   ░░░░░░░░░░░░░░         
      └─────────┐                
                │                
                ▼                
╔═ turbopuffer ════════════════╗ 
║                              ║░
║  ┏━━━━━━━━━━━━━━━━━━━━━━━━┓  ║░
║  ┃        Memory/         ┃  ║░
║  ┃       SSD Cache        ┃  ║░
║  ┗━━━━━━━━━━━━━━━━━━━━━━━━┛  ║░
║               │              ║░
║               ▼              ║░
║     ┏━━━━━━━━━━━━━━━━━━━┓    ║░
║     ┃Object storage (S3)┃    ║░
║     ┗━━━━━━━━━━━━━━━━━━━┛    ║░
║                              ║░
╚══════════════════════════════╝░
 ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░

125B+ documents & 6K+ queries/s in prod

cursor logo

Cost calculator

Query latency

Warm namespace
p50
16ms
p90
21ms
p99
33ms
Cold namespace
p50
402ms
p90
524ms
p99
677ms

Approach: 3 QPS with topk=10 for 10 minutes. Reproduce with turbopuffer/tpuf-benchmark

"

Justin Watts

Distinguished Engineer

Photo of Justin Watts

Moving to turbopuffer felt less like an upgrade and more like discovering a new paradigm. We didn't just save costs; we turned once-prohibitive features into standard tools in our arsenal. Vectorize all the things and say goodbye to sharding

MTL 🇨🇦

single-tenant cluster

Telus logo
Limits
MetricMax seen in productionProduction limits (current)Production limits (soon)
Max documents (global)125B+Unlimited
Max documents (per namespace)200M100M1B+
Number of namespaces35M+Unlimited
Max dimensions10,752
Max inactive time in cache~3 daysContact us for custom
Write rate (global)200,000 doc/sUnlimited
Write rate (per namespace)5,000 doc/s5,000 doc/s
Max upsert batch request size256 MB256 MB
Queries (global)6K+ queries/sUnlimited
Max queries/second (per namespace)100+ queries/s100+ queries/s10,000 QPS
Vector search recall@1090-100%90-100%Configurable
Max attribute value8 MiB8 MiB
See full list and workloads that are a great fit for turbopuffer 🐡

FAQ

Yes, turbopuffer is production quality. We have powered production applications since November '23 at 99.99% uptime. We host billions of production vectors at thousands of writes per second.

We maintain SOC2 Type 2 certification and HIPAA compliance.

turbopuffer accounts are not yet generally available. We are currently focused on providing high bandwidth support to make our select early customers successful, while removing limitations. See if your workload is a good fit for turbopuffer here.