Data Cleaner
Automates data cleaning tasks including deduplication, missing value handling, date format normalization, and text standardization. Provides before/after comparison and reusable scripts.
/cleanOperations manager? Run /clean to unify different date formats across vendors in one pass
Data engineer? Auto-generate reusable scripts for repetitive preprocessing logic
How It Works
Skill Code
Copy and paste into your CLAUDE.md to start using immediately.
How Data Cleaner Works
Data Cleaner scans your dataset for inconsistencies — duplicate rows, missing values, formatting variations, type mismatches — then generates a cleaning pipeline that standardizes, deduplicates, and validates the output.
When to Use Data Cleaner
Essential in any data pipeline where raw input is messy — especially when combining data from multiple sources with different formatting conventions, date formats, and naming standards that need harmonization.
Key Strengths
- Detects duplicates, missing values, and format inconsistencies
- Generates reproducible cleaning pipelines
- Standardizes formats across heterogeneous data sources
- Validates output data quality after cleaning