Metadata Basics
Every vector stored with VADD can carry arbitrary metadata as key-value string pairs. Internally, Cachee stores this as a Rust HashMap<String, String> alongside the vector in the HNSW index. There is no enforced size limit on keys, values, or the total metadata per entry.
Metadata is supplied after the METADATA keyword in the VADD command.
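A minimal sketch of the shape (the index name, vector values, and exact argument order are illustrative assumptions, not a verbatim reference):

```
VADD myindex item:42 0.12 0.34 0.56 ... METADATA status verified circuit_id transfer_v2
```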
Each metadata field is a string key mapped to a string value. You can store serialized JSON, base64-encoded binary, UUIDs, timestamps, or any other string-representable data. The only constraint is system memory.
Metadata lives in the same process memory as the HNSW graph but is structurally separate from the vector data. The graph traversal algorithm never reads metadata during neighbor selection, so metadata size has zero impact on search latency.
Performance Impact by Size
The table below summarizes behavior at common metadata sizes. The critical takeaway: VSEARCH latency is constant regardless of metadata payload.
| Metadata Size | Works? | Memory Impact | VSEARCH Impact | Filter Impact |
|---|---|---|---|---|
| UUID (36 bytes) | Yes | Negligible | Zero | Zero |
| Small JSON (1 KB) | Yes | Negligible | Zero | Zero |
| Medium payload (12 KB) | Yes | Plan for it | Zero | Zero |
| ZK-STARK proof (45-200 KB) | Yes, but use tiered approach | Significant | Zero | Zero |
VSEARCH impact remains at zero across all sizes because the HNSW graph traversal only operates on Vec<f32> vector data. Metadata is never touched during neighbor selection or distance computation.
Why Metadata Doesn't Affect Search Speed
Cachee's HNSW implementation keeps a strict separation between the graph structure (vector coordinates, neighbor lists) and attached metadata. During a VSEARCH query, the engine performs three distinct phases:
- Graph traversal — navigates HNSW layers using only Vec<f32> vectors. Distance calculations (cosine, L2, dot product) touch raw float arrays only. Metadata is not loaded, not dereferenced, and not part of the hot path.
- Filter evaluation — if a FILTER clause is present, the engine does a single HashMap lookup on the specified field and a string comparison. This is O(1) per candidate vector, regardless of how many other metadata fields exist on that entry.
- Result construction — once the top-K results are identified, metadata is cloned into the response. This is a post-search operation that does not affect the search time reported in benchmarks.
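The three phases above can be sketched in Python. This is an illustrative model of the flow, not Cachee's actual Rust implementation; the entry layout and names are invented for the example:

```python
import math

# Toy index: each entry keeps its vector separate from its metadata dict,
# mirroring the structural separation described above.
entries = [
    {"id": "a", "vector": [1.0, 0.0], "metadata": {"status": "verified", "blob": "x" * 50_000}},
    {"id": "b", "vector": [0.9, 0.1], "metadata": {"status": "pending"}},
    {"id": "c", "vector": [0.0, 1.0], "metadata": {"status": "verified"}},
]

def cosine(a, b):
    # Phase 1: distance math touches only the raw float lists, never metadata.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def vsearch(query, k, filter_field=None, filter_value=None):
    results = []
    for e in sorted(entries, key=lambda e: -cosine(query, e["vector"])):
        # Phase 2: one O(1) dict lookup per candidate, only on the filtered field.
        if filter_field is not None and e["metadata"].get(filter_field) != filter_value:
            continue
        # Phase 3: metadata is copied into the response only for final results.
        results.append({"id": e["id"], "metadata": dict(e["metadata"])})
        if len(results) == k:
            break
    return results

hits = vsearch([1.0, 0.0], k=2, filter_field="status", filter_value="verified")
print([h["id"] for h in hits])  # the 50 KB blob on entry "a" never enters the distance math
```

Note that the oversized metadata on entry "a" changes nothing about how it ranks: ranking is decided entirely in phase 1.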
The 0.0015 ms search latency you see in benchmarks is the same whether metadata per vector is 36 bytes or 200 KB. Metadata cost manifests only as memory consumption and, for very large payloads, as response serialization time after the search completes.
Memory Planning
While metadata does not affect search performance, it does consume heap memory. For capacity planning, calculate the total memory footprint as the sum of vector storage and metadata storage.
100K Vectors (768 dimensions)
1M Vectors (768 dimensions)
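A back-of-the-envelope calculation for both scenarios above, assuming f32 vectors at 4 bytes per dimension and ignoring HNSW neighbor-list overhead; the 1 KB average metadata size is an illustrative figure, not a recommendation:

```python
DIM = 768
BYTES_PER_FLOAT = 4  # Vec<f32>

def footprint_mb(num_vectors, avg_metadata_bytes):
    # Total footprint = vector storage + metadata storage.
    vector_bytes = num_vectors * DIM * BYTES_PER_FLOAT
    metadata_bytes = num_vectors * avg_metadata_bytes
    return (vector_bytes / 2**20, metadata_bytes / 2**20)

for n in (100_000, 1_000_000):
    vec_mb, meta_mb = footprint_mb(n, avg_metadata_bytes=1024)
    print(f"{n:>9} vectors: {vec_mb:8.1f} MB vectors + {meta_mb:8.1f} MB metadata")
```

At 100K vectors this works out to roughly 293 MB of vector data plus about 98 MB of metadata; at 1M vectors, roughly 2.9 GB plus about 1 GB. Repeat the calculation with your real average metadata size to see when metadata starts to dominate.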
When total metadata exceeds vector storage, you are likely storing data that belongs in a separate cache key rather than inline metadata. See the two-tier pattern below.
Best Practice: Two-Tier Pattern for Large Payloads
For payloads exceeding 10 KB — ZK-STARK proofs, serialized ML model weights, large JSON documents, binary blobs — the recommended approach is to store lightweight metadata in VADD and the full payload as a separate cache key.
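A sketch of the two-tier pattern (command syntax, key naming, and field names are assumptions for illustration):

```
# Tier 1: lightweight, filterable metadata on the vector
VADD proofs proof:9f2c 0.12 0.34 ... METADATA proof_hash 9f2c status verified

# Tier 2: the full payload under its own key, referenced by proof_hash
SET payload:9f2c <full serialized blob>
```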
This pattern keeps the HNSW index lean — more vectors fit in memory, neighbor lists stay cache-friendly — while the full payloads still live in-process at sub-microsecond read latency via standard GET. Both operations share the same Cachee process. No network hops, no external storage.
ZK-STARK Specific Guidance
ZK-STARK proofs are a common large-payload use case. A typical STARK proof is 45-200 KB depending on the circuit complexity and number of trace columns. Here is the recommended storage strategy:
- VADD metadata (~500 bytes): Store the proof hash, verification status, circuit ID, prover identity, and timestamp as metadata fields. These are the fields you will filter and search on.
- Cache key (full proof): Store the complete serialized proof as a separate key using SET. Reference it from metadata via the proof hash.
- Search pattern: Use VSEARCH to find semantically related proofs (same circuit family, similar constraint structure), then GET the full proof for verification or aggregation.
- Cache hit rates: STARK proofs are immutable once generated. Expect 90%+ cache hit rates for proof lookups, making this pattern extremely efficient for verification pipelines and recursive proof composition.
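Putting the strategy together, the lookup side might look like this (hypothetical syntax, key names, and filter field, shown for illustration):

```
# 1. Find related proofs by vector similarity, constrained to one circuit family
VSEARCH proofs 0.12 0.34 ... K 5 FILTER circuit_id transfer_v2

# 2. Fetch the full proof for a returned proof_hash
GET payload:9f2c
```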
For deeper coverage of Cachee's ZK integration capabilities, see the ZKP feature page.
Metadata Filtering
The FILTER clause in VSEARCH lets you constrain results by any metadata field. Because metadata is stored as a HashMap, field lookup is O(1) — constant time regardless of how many metadata fields exist on a given vector entry.
Adding more metadata fields to a vector entry does not slow down filtering. Only the field specified in the FILTER clause is evaluated. An entry with 3 metadata fields and an entry with 50 metadata fields have identical filter performance.
Filters are evaluated during HNSW traversal, not as a post-processing step. This means filtered results respect the top-K constraint correctly — you always get K results that match both the vector similarity and the filter condition, not K results filtered down to fewer matches.
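The difference between in-traversal filtering and naive post-filtering can be shown with a toy Python model (not Cachee's implementation; the candidate stream and status field are invented for the example):

```python
# Pretend these arrive in similarity order from HNSW traversal.
candidates = [
    {"id": i, "status": "verified" if i % 3 == 0 else "pending"}
    for i in range(30)
]

def search_filter_during_traversal(k):
    # Cachee-style: keep traversing until K matching results are found.
    hits = []
    for c in candidates:
        if c["status"] == "verified":
            hits.append(c["id"])
        if len(hits) == k:
            break
    return hits

def search_post_filter(k):
    # Naive alternative: take top-K first, then filter. May return fewer than K.
    return [c["id"] for c in candidates[:k] if c["status"] == "verified"]

print(search_filter_during_traversal(5))
print(search_post_filter(5))
```

With only a third of the candidates matching, the post-filter version returns two results where five were requested; filtering during traversal always fills the requested K when enough matches exist.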