Codepoints · invisibles · NFC/NFD
🔎 Unicode Inspector
See exactly what's inside a string: every codepoint with UTF-8/16 bytes, hidden zero-width characters flagged in red, and NFC/NFD normalization — the classic Vietnamese text gotcha.
Normalization
Codepoints (click one for details)
NFC composes (“ế” = 1 codepoint U+1EBF), NFD decomposes (“e” + circumflex + acute = 3 codepoints) — both LOOK identical but compare unequal, breaking search & dedupe. Old Vietnamese input methods produced NFD; the web standard is NFC. Zero-width and bidi-control characters are invisible but real — common in text copied from chats or used in phishing.