diff --git a/docs/indexing/index.mdx b/docs/indexing/index.mdx index a6e72a8..815cf5f 100644 --- a/docs/indexing/index.mdx +++ b/docs/indexing/index.mdx @@ -33,7 +33,7 @@ LanceDB provides a comprehensive suite of indexing strategies for different data | `FTS` (Full-text search) | String columns (e.g., title, description, content) requiring keyword-based search with BM25 ranking. | Full-text search index using BM25 ranking algorithm. Tokenizes text with configurable tokenization, stemming, stop word removal, and language-specific processing. | | `BTree` (Scalar) | Numeric, temporal, and string columns with mostly distinct values. Best for highly selective queries on columns with many unique values. | Sorted index storing sorted copies of scalar columns with block headers in a btree cache. Header entries map to blocks of rows (4096 rows per block) for efficient disk reads. | | `Bitmap` (Scalar) | Low-cardinality columns with few thousand or fewer distinct values. Accelerates equality and range filters. | Stores a bitmap for each distinct value in the column, with one bit per row indicating presence. Memory-efficient for low-cardinality data. | -| `LabelList` (Scalar) | List columns (e.g., tags, categories, keywords) requiring array containment queries. | Scalar index for `List` columns using an underlying bitmap index structure to enable fast array membership lookups. | +| `LabelList` (Scalar) | List columns (e.g., tags, categories, keywords) requiring array containment queries. | Scalar index for `List` and `LargeList` columns of primitive values, using an underlying bitmap index structure to enable fast array membership lookups. | TypeScript currently doesn't support `IvfSq` (IVF with Scalar Quantization). diff --git a/docs/indexing/scalar-index.mdx b/docs/indexing/scalar-index.mdx index a835c4c..7e71459 100644 --- a/docs/indexing/scalar-index.mdx +++ b/docs/indexing/scalar-index.mdx @@ -24,7 +24,7 @@ LanceDB supports four types of scalar indexes: - `BTREE`: Stores column data in sorted order for binary search. Best for columns with many unique values. - `BITMAP`: Uses bitmaps to track value presence. Ideal for columns with few unique values (e.g., categories, tags). -- `LABEL_LIST`: Special index for `List` columns supporting `array_contains_all` and `array_contains_any` queries. +- `LABEL_LIST`: Special index for `List` and `LargeList` columns of primitive values supporting `array_contains_all` and `array_contains_any` queries. - `FM`: FM-Index over string or binary columns that accelerates substring search via `contains(col, 'needle')`. ## Choosing the Right Index Type