Unicode Normalization Explained: NFC, NFD, NFKC, NFKD

4 min read

The same text can have multiple Unicode representations. Normalization converts text to a standard form for reliable comparison.

The Four Forms

FormNameUse Case
NFCComposedMost common — recommended for web and storage
NFDDecomposedUsed by macOS file system
NFKCCompatibility ComposedSearch and matching
NFKDCompatibility DecomposedStripping accents

Why It Matters

"café" can be encoded as 4 characters (NFC: é as single code point) or 5 characters (NFD: e + combining accent). Without normalization, string comparison fails.

Remove Accents

Use our plain text converter with "Remove accents/diacritics" to normalize text.