Inverted Index

Inverted Index is Lakehouse's full-text search acceleration index. It tokenizes text content in columns and builds a mapping from words to rows, enabling keyword matching and full-text search queries.

Comparison with Bloomfilter Index

AspectInverted IndexBloomfilter Index
Applicable QueriesFull-text search, keyword matching (MATCH)Equality filtering (=, IN)
Applicable Column TypesText columns (VARCHAR, STRING)Any type, high-cardinality columns
Typical ScenariosLog search, product name searchPoint lookup by ID

Applicable Scenarios

  • Log analysis: search for keywords in the message column
  • Product search: full-text search in product names and descriptions
  • User behavior: match specific behaviors in event description fields

Create Example

-- Create Inverted Index at table creation CREATE TABLE logs ( log_id BIGINT, message STRING, INDEX idx_message (message) USING INVERTED ); -- Query using Inverted Index SELECT * FROM logs WHERE MATCH(message, 'error timeout');