1.5µs cache, AI-powered prefetching, and zero migration — from free tier to 2.5 trillion requests/month. Pay for what you use.
Not add-ons. Not premium tiers. Every single plan.
1.5µs cache hits on warm working sets. Validated on AWS production.
AI prefetching loads data into L1 before it's requested. Zero cache misses on warm sets.
Overlay on your existing Redis, Cloudflare KV, or database. No data migration. No downtime.
ML models learn your access patterns in 10 seconds and continuously optimize eviction and prefetch.
What Cachee costs vs. what it saves. Every plan delivers positive ROI from month one.
| Plan | Monthly Ops | Cachee Cost | Est. DB/Infra Savings | Net ROI |
|---|---|---|---|---|
| Free Trial | 1M | $0 | ~$200 | ∞ |
| Pay-As-You-Go | 5M | $75 | ~$1,000 | 13× |
| Starter | 20M | $199 | ~$2,000 | 10× |
| Scale ★ | 200M | $999 | ~$20,000 | 20× |
| Institutional | 10B | $9,999 | ~$100,000 | 10× |
| Enterprise Elite | 2.5T | $250K | $0.10/1M — lowest unit | Revenue-driven |
Every feature across every tier.
| Feature | Free | PAYG | Starter | Scale | Inst. | Ent. |
|---|---|---|---|---|---|---|
| Cache Engine | ||||||
| L1 Cache (1.5µs) | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| 100% L1 Hit Rate | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Zero Migration Overlay | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| AI & Intelligence | ||||||
| Pattern Learning | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Predictive Prefetching | — | — | ✓ | ✓ | ✓ | ✓ |
| Advanced AI (6 modes) | — | — | — | ✓ | ✓ | ✓ |
| Custom ML Models | — | — | — | — | ✓ | ✓ |
| Infrastructure | ||||||
| Edge Regions | 1 | 1 | 1 | 5 | All (450+) | All + custom |
| Multi-Region Failover | — | — | — | — | ✓ | ✓ |
| Dedicated Infrastructure | — | — | — | — | — | ✓ |
| On-Prem / Hybrid | — | — | — | — | — | ✓ |
| Support & SLA | ||||||
| SLA | — | 99.9% | 99.9% | 99.95% | 99.99% | Custom |
| Support | Community | Priority + Slack | 24/7 + TAM | Executive | ||
| Security | ||||||
| AES-256 encryption | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Tenant isolation | — | — | ✓ | ✓ | ✓ | ✓ |
| SSO / SAML | — | — | — | — | ✓ | ✓ |
| SOC 2 Type II | In progress — expected Q2 2026 | |||||
Validated March 2026. Head-to-head benchmarks on AWS.
| Metric | Cachee | Redis Enterprise | Cloudflare KV | AWS CloudFront |
|---|---|---|---|---|
| Cache Hit Rate | 99.05% | 60–70% | 48% | 50–60% |
| Response Time (P99) | 0.004ms | 1–3ms | 15–20ms | 10–15ms |
| AI Predictions | ✓ | ✗ | ✗ | ✗ |
| Setup Time | < 1 hour | 3–5 days | 1–2 weeks | 2–3 weeks |
| Zero Migration | ✓ Overlay | ✗ | ✓ Edge | ✗ |
| Manual Tuning | None | Extensive | Extensive | Moderate |
✅ Verified Performance Data — Mar 2026
Trusted by Infrastructure Teams
"We dropped our avg latency from 47ms to 0.12ms in under an hour. No migration, no downtime. The ROI was immediate."
— Infrastructure Lead, Series B Fintech
You won't be cut off. Overages are billed at your plan's per-1M rate. For example, on Starter ($9.95/1M), requests beyond 20M are billed at $9.95 per additional million. You can set spending alerts in the dashboard to avoid surprises.
Yes, upgrades take effect immediately and you're prorated for the remaining billing period. Downgrades take effect at the start of your next billing cycle. No penalties either way.
Request buckets never expire on PAYG. For fixed plans (Starter through Enterprise), your monthly allocation resets each billing cycle — unused requests don't roll over. This keeps pricing simple and predictable.
Monthly plans are month-to-month with no contract. Annual and 2-year plans offer 15% and 25% savings respectively and are billed upfront or monthly (your choice). All plans include a 30-day money-back guarantee.
You get 1M requests with full access to the L1 cache engine, AI pattern learning, and the performance dashboard. No credit card required. When your trial requests are used, you can upgrade to any paid plan or let it expire — we won't charge you.
Cachee sits as an overlay between your application and your existing cache (Redis, Cloudflare KV, etc.). You don't move data, change schemas, or rewrite queries. Point Cachee at your existing setup and it starts learning and serving from L1 within 10 seconds.
All plans include AES-256 encryption at rest and in transit. Enterprise plans add SSO/SAML, tenant isolation, audit logging, and self-hosted deployment options for full data sovereignty. Contact our sales team for specific security requirements.
Absolutely. For workloads that don't fit neatly into a tier, or for specialized requirements (on-prem, hybrid, custom ML models, specific SLA targets), use the chat in the bottom-right corner and we'll build a plan around your infrastructure.
1M requests. Full L1 engine. No credit card. See results in under 60 seconds.
* Performance figures (1.5µs L1 latency, 99.05% hit rate, 396× P99 improvement) represent results measured on optimized AWS infrastructure under controlled conditions. Actual performance varies based on workload characteristics, data size, access patterns, network topology, hardware configuration, geographic distribution, and other environmental factors. Results shown reflect warm cache on production-representative traffic. Your results may differ.