Codepoints · invisibles · NFC/NFD

🔎 Unicode Inspector

See exactly what's inside a string: every codepoint with its UTF-8 and UTF-16 bytes, hidden zero-width characters flagged in red, and NFC/NFD normalization, the classic Vietnamese text gotcha.

Normalization

Codepoints
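
The per-codepoint view can be approximated in a few lines of Python. This is only a minimal sketch and not the inspector's actual implementation: the function name `describe` and the row layout are hypothetical, and it assumes Python 3.9 or newer.

```python
import unicodedata


def describe(text: str) -> list[dict]:
    """Return one row per codepoint with its name and encoded bytes.

    `describe` and the row keys are illustrative names, not part of any tool.
    """
    rows = []
    for ch in text:
        rows.append({
            "char": ch,
            "codepoint": f"U+{ord(ch):04X}",
            # Some codepoints (e.g. control characters) have no name.
            "name": unicodedata.name(ch, "<unnamed>"),
            # Hex bytes, space-separated, for each encoding.
            "utf8": ch.encode("utf-8").hex(" ").upper(),
            "utf16": ch.encode("utf-16-be").hex(" ").upper(),
        })
    return rows


if __name__ == "__main__":
    for row in describe("ế🔎"):
        print(row["codepoint"], row["utf8"], "|", row["utf16"], "|", row["name"])
```

UTF-16 is shown big-endian here so that the bytes read in the same order as the code units; a codepoint above U+FFFF appears as a surrogate pair (four bytes).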

NFC composes (“ế” is 1 codepoint, U+1EBF); NFD decomposes (“e” + combining circumflex U+0302 + combining acute U+0301, 3 codepoints). Both forms look identical but compare unequal codepoint by codepoint, which breaks search and deduplication unless the text is normalized first. Some older Vietnamese input methods produced decomposed text; the W3C recommends NFC for web content. Zero-width and bidi-control characters are invisible but real, and are common in text copied from chat apps or used in phishing.