Table
A Table is the basic unit for storing data in the Lakehouse, using columnar storage (Parquet format) suitable for analytical queries.
Unlike Dynamic Tables and Materialized Views, a table's data must be maintained manually through INSERT, UPDATE, DELETE, or COPY INTO — the system will not auto-refresh the data.
Type Selection Reference
| Table Type | Data Maintenance Method | Suitable Scenarios |
|---|---|---|
| Table | Manual writes | ODS raw data, scenarios requiring precise control over write timing |
| Dynamic Table | Automatic incremental refresh | DWD/DWS layers, automatically compute results based on queries |
| Materialized View | Automatic refresh | Pre-computed query results, transparent query acceleration |
| View | No data storage | Logic encapsulation, simplified queries |
Core Features
Columnar Storage: Uses Parquet format by default, reading only the necessary columns during queries, suitable for large-scale analysis.
Primary Key Support: After defining a primary key, the system automatically deduplicates by primary key during real-time writes, suitable for CDC sync scenarios.
Time Travel: Retains historical version data, supporting queries of data states at historical points in time.
Partitioning and Clustering: Optimizes query performance through partition pruning and clustered joins.
Quick Example
Related Documentation
- CREATE TABLE — Table creation syntax
- ALTER TABLE — Modify table structure
- Partitioning — Partition design guide
- Clustering — Clustering design guide
- Time Travel — Historical data queries
