serverless vector and full-text search built from first principles on object storage: fast, 10x cheaper, and extremely scalable
                        ╔═ turbopuffer ════════════════════════════╗
╔════════════╗          ║                                          ║░
║            ║░         ║  ┏━━━━━━━━━━━━━━━┓     ┏━━━━━━━━━━━━━━┓  ║░
║   client   ║░───API──▶║  ┃    Memory/    ┃────▶┃    Object    ┃  ║░
║            ║░         ║  ┃   SSD Cache   ┃     ┃ Storage (S3) ┃  ║░
╚════════════╝░         ║  ┗━━━━━━━━━━━━━━━┛     ┗━━━━━━━━━━━━━━┛  ║░
 ░░░░░░░░░░░░░░         ║                                          ║░
                        ╚══════════════════════════════════════════╝░
                         ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
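The diagram shows requests reaching a memory/SSD cache that sits in front of object storage. As a minimal sketch of that read path, assuming a plain read-through cache with LRU eviction (the eviction policy, the class names `InMemoryObjectStore` and `ReadThroughCache`, and the dict-backed store are all hypothetical illustrations, not turbopuffer's implementation):

```python
from collections import OrderedDict


class InMemoryObjectStore:
    """Hypothetical stand-in for an object store such as S3 (dict-backed)."""

    def __init__(self, objects):
        self._objects = dict(objects)
        self.reads = 0  # counts fetches that reached the backing store

    def get(self, key):
        self.reads += 1
        return self._objects[key]


class ReadThroughCache:
    """Small cache in front of the store; the LRU policy is an assumption."""

    def __init__(self, store, capacity):
        self._store = store
        self._capacity = capacity
        self._entries = OrderedDict()

    def get(self, key):
        if key in self._entries:
            self._entries.move_to_end(key)  # mark as most recently used
            return self._entries[key]
        value = self._store.get(key)  # cache miss: fall through to object storage
        self._entries[key] = value
        if len(self._entries) > self._capacity:
            self._entries.popitem(last=False)  # evict the least recently used entry
        return value


store = InMemoryObjectStore({"ns/a": b"alpha", "ns/b": b"beta", "ns/c": b"gamma"})
cache = ReadThroughCache(store, capacity=2)
cache.get("ns/a")  # cold read: served from the object store
cache.get("ns/a")  # warm read: served from the cache
```

In this sketch only the first read of a key touches the backing store; repeated reads are served from the cache until the entry is evicted.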
150B+ documents & 6K+ queries/s in prod
Metric | Observed in production | Production limit (current) | Production limit (upcoming) |
---|---|---|---|
Max documents (global) | 150B+ | Unlimited | |
Max documents (per namespace) | 200M | 100M | 1B+ |
Max number of namespaces | 40M+ | Unlimited | |
Max write rate (global) | 1M docs/s | Unlimited | |
Max write rate (per namespace) | 10K docs/s | 10K docs/s | |
Max queries (global) | 6K+ queries/s | Unlimited | |
Max queries (per namespace) | 1K+ queries/s | 1K+ queries/s | 10K queries/s |
Vector search recall@10 | 90-100% | 90-100% | Configurable |
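For readers unfamiliar with the recall@10 row: recall@k is commonly defined as the fraction of the true k nearest neighbours that an approximate search returns. The sketch below uses that standard definition; how turbopuffer measures recall internally is not stated here, and the function name `recall_at_k` and the sample IDs are illustrative assumptions.

```python
def recall_at_k(approx_ids, exact_ids, k=10):
    """Fraction of the exact top-k neighbours that the approximate search returned."""
    exact_top = set(exact_ids[:k])
    if not exact_top:
        return 0.0
    found = exact_top.intersection(approx_ids[:k])
    return len(found) / len(exact_top)


# Hypothetical result lists: the approximate search misses one true neighbour.
exact = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
approx = [1, 2, 3, 4, 5, 6, 7, 8, 9, 42]
print(recall_at_k(approx, exact))  # 0.9
```

A recall@10 of 90% therefore means that, on average, 9 of the 10 true nearest neighbours appear in the returned top 10.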