perf(index): cache small-index posting, doc, and vector-probe reads by hamersaw · Pull Request #7258 · lance-format/lance

hamersaw · 2026-06-12T20:07:32Z

Summary

Querying a small full-text index (e.g. a mem_wal flushed-generation index) re-paid object-store IO on every query for metadata that was never cached. This PR fixes three uncached read paths for small indexes while leaving large-index behavior unchanged:

Posting metadata: posting_len_for_token / posting_metadata_for_token issued one tiny single-row read_range against the posting file per term per partition, and the result was never cached. For a small index (≤256Ki tokens), bulk-load the whole posting metadata once into the existing OnceCell (small_index_bulk_metadata); large indexes keep the O(1) single-row path.
Doc row-ids: DeferredDocSet::resolve_row_ids did targeted (uncached) row-id reads on every query. For a small partition (≤256Ki docs), load and cache the whole ROW_ID column instead.
Vector-index probe: the "is this a vector index?" file-existence check for indexes without files metadata issued a HEAD per generic open. Memoize the result per uuid in the session index cache (IsVectorIndexProbeKey).
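
As a rough illustration of the posting-metadata change, the sketch below shows the small-index branch: one bulk read fills a once-initialized cell, and later lookups are served from memory. It is not the code from this PR. `PostingFile`, `Partition`, `read_one`, `read_all`, and `SMALL_INDEX_MAX_TOKENS` are hypothetical names, the read counter is only there to make the IO pattern visible, and `std::sync::OnceLock` stands in for the OnceCell mentioned above.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::OnceLock;

/// Cutoff taken from the description above (256Ki tokens); the name is hypothetical.
const SMALL_INDEX_MAX_TOKENS: usize = 256 * 1024;

/// Hypothetical stand-in for a posting file on object storage.
/// It counts reads so the IO pattern can be observed.
struct PostingFile {
    posting_lens: Vec<u32>,
    reads: AtomicUsize,
}

impl PostingFile {
    /// Single-row read: one IO per call.
    fn read_one(&self, token_id: usize) -> Option<u32> {
        self.reads.fetch_add(1, Ordering::Relaxed);
        self.posting_lens.get(token_id).copied()
    }

    /// Bulk read: one IO that returns the metadata of every token.
    fn read_all(&self) -> Vec<u32> {
        self.reads.fetch_add(1, Ordering::Relaxed);
        self.posting_lens.clone()
    }
}

struct Partition {
    file: PostingFile,
    /// Token-count cutoff below which the bulk path is used.
    small_threshold: usize,
    /// Filled at most once, on the first lookup against a small index.
    bulk_metadata: OnceLock<Vec<u32>>,
}

impl Partition {
    fn new(posting_lens: Vec<u32>, small_threshold: usize) -> Self {
        Self {
            file: PostingFile { posting_lens, reads: AtomicUsize::new(0) },
            small_threshold,
            bulk_metadata: OnceLock::new(),
        }
    }

    fn read_count(&self) -> usize {
        self.file.reads.load(Ordering::Relaxed)
    }

    fn posting_len_for_token(&self, token_id: usize) -> Option<u32> {
        if self.file.posting_lens.len() <= self.small_threshold {
            // Small index: pay one bulk read, then answer from the cached copy.
            let all = self.bulk_metadata.get_or_init(|| self.file.read_all());
            all.get(token_id).copied()
        } else {
            // Large index: keep the per-token single-row read.
            self.file.read_one(token_id)
        }
    }
}
```

With `Partition::new(lens, SMALL_INDEX_MAX_TOKENS)`, any number of `posting_len_for_token` calls leaves `read_count()` at 1; with a threshold below the token count, every call adds one read.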

Changes

lance-index/.../inverted/index.rs: small_index_bulk_metadata + updated test_bm25_stats_for_terms_is_lazy.
lance-index/.../inverted/lazy_docset.rs: small-partition row-id column caching in resolve_row_ids.
lance/src/index.rs: memoized is_vector_index existence probe.
lance/src/session/index_caches.rs: IsVectorIndexProbe cache key.
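
The memoized existence probe can be pictured as a map from index uuid to the probe result. The sketch below is an assumption-laden simplification, not the session index cache itself: `ProbeCache` and its method are hypothetical, the probe is passed in as a plain closure rather than an async object-store HEAD, and the lock is held while the probe runs, which a real async implementation would likely avoid.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

/// Hypothetical per-session cache of "is this a vector index?" answers,
/// keyed by index uuid.
struct ProbeCache {
    results: Mutex<HashMap<String, bool>>,
}

impl ProbeCache {
    fn new() -> Self {
        Self { results: Mutex::new(HashMap::new()) }
    }

    /// Returns the cached answer for `uuid`, running `probe` only on a miss.
    /// `probe` stands in for the file-existence HEAD request.
    fn is_vector_index(&self, uuid: &str, probe: impl FnOnce() -> bool) -> bool {
        let mut results = self.results.lock().unwrap();
        // `or_insert_with` calls `probe` only when the uuid is absent.
        *results.entry(uuid.to_string()).or_insert_with(probe)
    }
}
```

Repeated opens of the same uuid then cost one probe in total; a different uuid triggers its own probe. Note that this caches negative answers too, which is only safe if an index's files do not change after creation (an assumption of this sketch).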

Validation

cargo test -p lance-index test_bm25_stats_for_terms_is_lazy passes (it asserts that the first stats lookup issues exactly one posting read and that subsequent lookups issue none). Validated end-to-end against a WAL FTS benchmark: a warm query previously re-read per-term posting offsets, doc row-ids, and the vector probe on every query; it now issues zero such reads (all served from cache).
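
The "one read on the first lookup, none afterward" property can be illustrated for the row-id case with a counting reader. This is a minimal sketch under stated assumptions, not the actual `DeferredDocSet`: `RowIdColumn`, `DocSet`, `take`, and `scan` are hypothetical names, and the in-memory vector stands in for the ROW_ID column on object storage.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::OnceLock;

/// Hypothetical stand-in for the ROW_ID column; counts reads.
struct RowIdColumn {
    row_ids: Vec<u64>,
    reads: AtomicUsize,
}

impl RowIdColumn {
    /// Targeted read of selected rows: one IO per call.
    fn take(&self, doc_ids: &[usize]) -> Vec<u64> {
        self.reads.fetch_add(1, Ordering::Relaxed);
        doc_ids.iter().filter_map(|&d| self.row_ids.get(d).copied()).collect()
    }

    /// Full-column read: one IO for every row id.
    fn scan(&self) -> Vec<u64> {
        self.reads.fetch_add(1, Ordering::Relaxed);
        self.row_ids.clone()
    }
}

struct DocSet {
    column: RowIdColumn,
    /// Doc-count cutoff below which the whole column is cached.
    small_threshold: usize,
    cached: OnceLock<Vec<u64>>,
}

impl DocSet {
    fn new(row_ids: Vec<u64>, small_threshold: usize) -> Self {
        Self {
            column: RowIdColumn { row_ids, reads: AtomicUsize::new(0) },
            small_threshold,
            cached: OnceLock::new(),
        }
    }

    fn read_count(&self) -> usize {
        self.column.reads.load(Ordering::Relaxed)
    }

    fn resolve_row_ids(&self, doc_ids: &[usize]) -> Vec<u64> {
        if self.column.row_ids.len() <= self.small_threshold {
            // Small partition: load the column once, then index into it.
            let all = self.cached.get_or_init(|| self.column.scan());
            doc_ids.iter().filter_map(|&d| all.get(d).copied()).collect()
        } else {
            // Large partition: keep the targeted read.
            self.column.take(doc_ids)
        }
    }
}
```

A test in the style described above would call `resolve_row_ids` twice on a small partition and assert that `read_count()` is 1 after both calls.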

🤖 Generated with Claude Code

Querying a small FTS index (e.g. a mem_wal flushed generation) re-paid object-store IO on every query for metadata that is never cached:

1. `posting_len_for_token` / `posting_metadata_for_token` issued one tiny single-row `read_range` per term per partition. For a small index, bulk-load the whole posting metadata once into the cached `OnceCell` instead (`small_index_bulk_metadata`, ≤256Ki tokens); large indexes keep the uncached single-row path.
2. `DeferredDocSet::resolve_row_ids` did targeted (uncached) row-id reads every query. For a small partition (≤256Ki docs) load and cache the whole ROW_ID column instead.
3. The "is this a vector index?" file-existence probe for indexes without `files` metadata issued a HEAD per generic open. Memoize it per uuid in the session index cache (`IsVectorIndexProbeKey`).

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

hamersaw · 2026-06-15T15:27:25Z

This PR was fixing a performance issue on a path we should not have been on; closing accordingly.

github-actions bot added the performance and A-index (Vector index, linalg, tokenizer) labels on Jun 12, 2026

hamersaw closed this Jun 15, 2026

hamersaw wants to merge 1 commit into lance-format:main from hamersaw:perf/wal-cache-indicies
