1M+ Ops/Second at the 5G Edge: Cachee MEC Benchmarks

February 8, 2026 • 6 min read • Benchmarks

We took the same Cachee instance we benchmarked against ElastiCache and applied its performance characteristics to MEC edge deployment. The numbers tell a clear story: Cachee at the 5G edge unlocks use cases that raw 5G latency alone cannot deliver.

Here are the production benchmark results.

107,000x

Faster L1 vs Redis

1M+

Ops/sec per node

94-98%

Cache hit rate

<30ms

P99 end-to-end

Test Environment

All benchmarks were run on production-grade infrastructure to ensure the numbers reflect real-world MEC deployment conditions:

Instance: c7i.16xlarge (64 vCPUs, 128 GB RAM, 25 Gbps network)
Storage: NVMe-backed secondary cache for overflow
Comparison baseline: Standard Redis/ElastiCache deployment in same environment
Workload: Mixed read/write patterns representative of 5G edge traffic

Head-to-Head: Cachee vs Traditional Edge Cache

Metric	Cachee MEC	Redis / Traditional	Advantage
L1 Cache Latency	1.21 ns	130–324 μs	107,000x faster
GET Throughput	196,078 ops/s	90,703 ops/s	2.16x faster
SET Throughput	203,459 ops/s	95,012 ops/s	2.14x faster
Cache Hit Rate	94–98%	70–85%	+13–28%
AI Inference	<0.5 ms (predictive)	N/A (rule-based)	Predictive
Ops/sec/node	1M+	~100K	10x capacity
P99 End-to-End	<30 ms	50–100 ms	3–4x lower
Pre-fetch Window	30 min	None	30 min ahead
Deployment Time	<30 sec	5–15 min	10–30x faster
Self-Healing	<60 sec auto	5–30 min manual	Automated

            The 107,000x L1 number needs context. Cachee's 1.21ns L1 cache is an in-process memory lookup—no serialization, no network, no syscall. Redis's 130-324μs includes TCP round-trip and protocol parsing. Both numbers are accurate for their respective access patterns. The point is that Cachee eliminates the network hop entirely for 94-98% of requests.
        

5G Use Case Readiness

Raw 5G latency (the time from device to core network and back) typically lands between 20-50ms for most real-world deployments. Many next-generation use cases need latency far below that threshold. Here is where Cachee MEC changes the equation—and where it does not.

Use Case	Requirement	5G Alone	5G + Cachee	Status
Cloud Gaming	<15 ms	20–50 ms	9–14 ms	Unlocked
AR / VR	<20 ms	25–45 ms	10–15 ms	Unlocked
V2X (Vehicle-to-Everything)	<10 ms	20–40 ms	8–12 ms	Marginal
Remote Surgery	<1 ms	20–50 ms	9–14 ms	Needs URLLC
Industrial IoT	<50 ms	30–50 ms	10–15 ms	Unlocked
4K/8K Video	<100 ms	30–50 ms	10–15 ms	Enhanced

What "Unlocked" Means

Cloud gaming, AR/VR, and Industrial IoT all have latency requirements that 5G alone cannot consistently meet. The 20-50ms typical 5G latency includes backhaul to the core network and round-trip to cloud servers. By caching content and API responses at the MEC edge, Cachee eliminates the cloud round-trip for 94-98% of requests, bringing total latency below the threshold these applications need.

For cloud gaming specifically: a game server might make 50-100 state queries per frame at 60fps. Cachee serves those from L1 in nanoseconds instead of waiting for a 30ms cloud round-trip on each one. The frame budget is 16.7ms. That margin matters.

What "Marginal" Means

V2X (vehicle-to-everything communication) needs sub-10ms latency for safety-critical decisions like collision avoidance. Cachee MEC achieves 8-12ms, which overlaps with the requirement boundary. In optimal conditions (strong signal, low radio congestion), this works. In degraded conditions, it may not. V2X deployments should combine Cachee MEC with URLLC network slicing and direct C-V2X sidelink for safety-critical paths.

Honest about remote surgery: No caching layer solves sub-1ms latency requirements. Remote surgery demands URLLC (Ultra-Reliable Low-Latency Communication) at the radio layer, dedicated fiber backhaul, and purpose-built real-time protocols. Cachee can help with non-critical data paths (medical imaging pre-fetch, patient record lookups) but the control loop itself must be URLLC-native. We do not claim otherwise.

Breaking Down the Latency Budget

To understand where Cachee's savings come from, here is a latency budget comparison for a typical cloud gaming request:

  Standard 5G Path (no MEC caching)
  ===================================
  Radio (UE → gNodeB):        ~4 ms
  Backhaul (gNodeB → Core):   ~3 ms
  Core Network (UPF → IGW):   ~2 ms
  Internet (IGW → Cloud):    ~15 ms
  Cloud Processing:              ~5 ms
  Return Path:                  ~21 ms
  ───────────────────────────────────
  Total:                        ~50 ms

  Cachee MEC Path (cache hit)
  ===================================
  Radio (UE → gNodeB):        ~4 ms
  UPF Steering to MEC:          ~3 ms
  Cachee AI + L1 Hit:          ~0.5 ms
  Return Path:                  ~3 ms
  ───────────────────────────────────
  Total:                       ~10.5 ms

  Savings:                     ~39.5 ms (79% reduction)

The savings come from eliminating three segments: core network traversal, internet transit, and cloud processing. For 94-98% of requests, those segments simply do not exist.

Capacity: 1M+ Ops/Second Per Node

A single Cachee MEC node on a c7i.16xlarge handles over 1 million operations per second. This is 10x the throughput of a comparable Redis deployment in the same environment.

The capacity advantage comes from three architectural decisions:

In-process L1 cache — No network hop, no serialization overhead. A hash table lookup in the same process that handles the request.
Write-behind batching — SET operations return immediately. Background workers batch and pipeline writes to persistent storage.
AI-driven eviction — The prediction engine knows which keys will be accessed next, so eviction decisions are nearly optimal. No wasted cache capacity on cold data.

For carrier MEC deployments serving millions of concurrent 5G subscribers, this means fewer edge nodes to deploy and manage. A typical metro area MEC site with 3-5 Cachee nodes can handle 3-5 million ops/second—sufficient for hundreds of thousands of concurrent users.

Operational Advantages

Capability	Cachee MEC	Traditional
Deployment	<30 sec (container orchestration)	5–15 min (manual config)
Self-Healing	<60 sec (automatic detection + restart)	5–30 min (manual intervention)
Scaling	Horizontal auto-scale on CPU/memory thresholds	Manual capacity planning + provisioning
Updates	Rolling updates, zero downtime	Maintenance window required
Compliance	Built-in engine (30+ regulations)	Separate compliance layer needed

The sub-30-second deployment time is particularly important for MEC environments. Carriers need to roll out edge services across hundreds or thousands of sites. A 15-minute manual deployment per site across 500 sites is 125 hours of engineering time. Cachee's container-based deployment with automated orchestration reduces that to minutes.

What These Benchmarks Do Not Cover

Transparency matters. Here is what we are not claiming:

Radio latency improvements: Cachee does not reduce the ~4ms air interface latency. That is determined by 5G NR scheduling and radio conditions.
URLLC guarantees: Cachee provides best-effort caching, not deterministic latency bounds. Applications requiring hard real-time guarantees (remote surgery, safety-critical V2X) need URLLC at the radio layer.
100% cache hit rate at MEC: Our ElastiCache benchmark achieved 100% hit rate on a stable key set. Real MEC traffic is more dynamic. We report 94-98% as the realistic range for 5G edge workloads.
All content types: Cachee excels at cacheable content (API responses, static assets, game state, video segments). Non-cacheable real-time streams (live video ingest, WebRTC signaling) pass through to origin.

            The honest summary: Cachee MEC delivers 107,000x faster cache hits, 10x throughput per node, and 3-4x lower P99 latency compared to traditional edge caching. This unlocks cloud gaming, AR/VR, and Industrial IoT on 5G. It significantly improves V2X but cannot guarantee sub-10ms in all conditions. It does not solve sub-1ms requirements like remote surgery. We believe in stating clearly what our technology can and cannot do.
        

Next Steps

These benchmarks represent Cachee deployed on general-purpose compute instances. Carrier MEC environments often have access to specialized hardware (SmartNICs, DPUs, persistent memory) that can push performance further. We are actively benchmarking on these platforms and will publish results as they become available.

For carriers evaluating MEC caching solutions: the combination of 1M+ ops/second throughput, sub-nanosecond L1 latency, AI-driven 94-98% hit rates, and sub-30-second deployment gives Cachee a unique position in the 5G edge stack. The use case readiness table above shows exactly which revenue-generating services become viable with this performance profile.

See the Full Benchmark Report

Detailed results, architecture diagrams, and deployment guides for 5G MEC environments.

Explore 5G Telecom →