Python script to clean email lists: deduplicate, validate format, remove role addresses, and check for disposable domains. Feed it a dirty list, get back a clean one.
Case-insensitive deduplication with original case preservation
RFC 5322 email format validation
Role-based address detection and removal (info@, admin@, etc.)
Disposable domain detection (built-in blocklist of 2000+ domains)
Typo domain suggestion (gmial.com → gmail.com)
Empty and whitespace line cleanup
Before/after statistics report
Supports TXT, CSV, and one-email-per-line input formats
# Clean an email list
python list_hygiene.py --input dirty_list.txt --output clean_list.txt
# Run specific checks only
python list_hygiene.py --input list.txt --checks dedup,format,disposable
# Get statistics without modifying
python list_hygiene.py --input list.txt --dry-run