Unique Rows¶
Check: unique-rows-check
Purpose: Validates that all rows in the dataset are unique, either across all columns or a defined subset. The check fails on any duplicate row within the evaluated scope.
Note
If no subset_columns are provided, uniqueness is evaluated across all columns.
Typical Use Cases¶
- Enforce uniqueness on primary key–like columns such as
trip_idoruser_id. - Detect duplicate records introduced by faulty joins, reprocessing, or double ingestion.
- Validate the correctness of deduplication logic before writing to transactional stores.