As for UTF-8, this is a multi-bit string of characters, and therefore you have problems, and this is a bad idea / Instead of using regular Unicode.
So, in my opinion, it is best to use plain ASCII char text with some set of encodings. You must use Unicode if you are using more than two sets of different characters (languages) in one.
This is a rather rare case. In most cases, 2 character sets are sufficient. For this general case, use ASCII characters, not Unicode.
The effect of using multiple characters, such as UTF-8, you get only traditional Chinese, Arabic or hieroglyphic text. This is a very rare case !!!
I do not think that many people need this. Therefore, never use UTF-8 !!! This avoids severe headaches when manipulating such lines.
Anatoly Jul 27 '13 at 20:13 2013-07-27 20:13
source share