serverless vector and full-text search built from first principles on object storage: fast, 10x cheaper, and extremely scalable
╔═ turbopuffer ════════════════════════════╗
╔════════════╗ ║ ║░
║ ║░ ║ ┏━━━━━━━━━━━━━━━┓ ┏━━━━━━━━━━━━━━┓ ║░
║ client ║░───API──▶║ ┃ Memory/ ┃────▶┃ Object ┃ ║░
║ ║░ ║ ┃ SSD Cache ┃ ┃ Storage (S3) ┃ ║░
╚════════════╝░ ║ ┗━━━━━━━━━━━━━━━┛ ┗━━━━━━━━━━━━━━┛ ║░
░░░░░░░░░░░░░░ ║ ║░
╚══════════════════════════════════════════╝░
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
╔════════════╗
║ client ║░
╚════════════╝░
░░░░░║░░░░░░░░
▼
╔═ turbopuffer ═════════════╗
║ ┏━━━━━━━━━━━━━━━━━━━━┓ ║░
║ ┃ Memory/SSD ┃ ║░
║ ┃ Cache ┃ ║░
║ ┗━━━━━━━━┳━━━━━━━━━━━┛ ║░
║ ▼ ║░
║ ┏━━━━━━━━━━━━━━━━━━━━┓ ║░
║ ┃ Object Storage ┃ ║░
║ ┃ (S3) ┃ ║░
║ ┗━━━━━━━━━━━━━━━━━━━━┛ ║░
╚══════════════════════════╝░
░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
150B+ documents & 6K+ queries/s in prod
Approach (3 QPS with topk=10)
Metric | Observed in production | Production limits (current) |
---|---|---|
Max documents (global) | 150B+ | Unlimited |
Max documents (per namespace) | 200M | 100M |
Max number of namespaces | 40M+ | Unlimited |
Max write rate (global) | 1M+ writes/s | Unlimited |
Max write rate (per namespace) | 10K writes/s | 10K writes/s |
Max queries (global) | 6K+ queries/s | Unlimited |
Max queries (per namespace) | 1K+ queries/s | 1K+ queries/s |
Vector search recall@10 | 90-100% | 90-100% |
View all |