Column Presence¶
Check: column-presence-check
Purpose: Validates that all required columns are present in the DataFrame, regardless of data type. Use this as an early schema guard to prevent downstream failures caused by missing columns.
Typical Use Cases¶
- Detect schema regressions where key columns are accidentally dropped by upstream producers.
- Validate that technical metadata columns such as
event_timestamporpartition_keyare present. - Prevent silent failures in transformation logic that depends on specific column names.