In Unicode, a character (arranged) can have either a unique code point or a sequence of code points consisting of a base character and its accents.
Wikipedia lists, by way of example, Vietnamese B (U + 1EBF) and its decomposed sequence U + 0065 (e) U + 0302 (circumflex accent) U + 0301 (acute accent).
string.Normalize () converts between 4 normal forms, a string can be encoded in Unicode.
devio Jul 20 '10 at 8:33 2010-07-20 08:33
source share