I would like to split a string at every whitespace character (' ', '\n', '\r', '\t', '\v', '\f'). The string is stored in UTF-8 encoding in a byte array (char *, e.g. a vector or string).
Is it safe to simply split the byte array at each of these characters? In other words, can I be sure that the byte values corresponding to these characters never occur inside a multibyte character? Looking at the UTF-8 specification, it seems that every byte of a multibyte character has a value of 128 or above.
Thanks.
Yes, you can.
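A multibyte sequence always consists of one lead byte (two MSBs equal to 11) followed by one or more continuation bytes (two MSBs equal to 10). The total length of the sequence (lead byte + continuation bytes) equals the number of MSBs set to 1 in the lead byte before the first 0 bit (for example: a lead byte 110xxxxx is followed by exactly one continuation byte, and 11110xxx by exactly three).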
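The lead-byte rule can be sketched in C++ as follows. This is only an illustration of the bit patterns: the function names are hypothetical, and the check looks at the bit pattern alone, so it does not reject lead bytes that are otherwise invalid in UTF-8 (such as 0xC0, 0xC1, or values above 0xF4).

```cpp
#include <cstddef>

// Hypothetical helper: total length in bytes of the UTF-8 sequence that
// starts with `lead`, judged only from the lead byte's high bits.
// Returns 0 if `lead` cannot start a sequence.
std::size_t utf8_sequence_length(unsigned char lead) {
    if ((lead & 0x80) == 0x00) return 1;  // 0xxxxxxx: plain ASCII
    if ((lead & 0xE0) == 0xC0) return 2;  // 110xxxxx: one continuation byte
    if ((lead & 0xF0) == 0xE0) return 3;  // 1110xxxx: two continuation bytes
    if ((lead & 0xF8) == 0xF0) return 4;  // 11110xxx: three continuation bytes
    return 0;                             // 10xxxxxx or 11111xxx: not a lead byte
}

// Hypothetical helper: true if `b` has the continuation pattern 10xxxxxx.
bool is_utf8_continuation(unsigned char b) {
    return (b & 0xC0) == 0x80;
}
```

Note that every byte accepted as part of a multibyte sequence here has its top bit set, which is why none of them can collide with an ASCII whitespace byte.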
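So every byte of a multibyte (MB) sequence has its most significant bit set and can never be equal to one of the ASCII whitespace bytes. If you do run into a truncated MB sequence or a stray continuation byte without a lead byte, the string was already invalid, and splitting it on whitespace will not make it any worse than it already was.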
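A minimal sketch of the byte-level split asked about in the question, assuming C++17 and input that is valid UTF-8. The names is_ascii_space and split_utf8_on_ascii_space are made up for this example, and skipping empty tokens is a choice of the sketch, not something the question specifies.

```cpp
#include <cstddef>
#include <string>
#include <string_view>
#include <vector>

// Hypothetical helper: true for the six ASCII whitespace bytes from the question.
bool is_ascii_space(unsigned char b) {
    return b == ' ' || b == '\n' || b == '\r' ||
           b == '\t' || b == '\v' || b == '\f';
}

// Splits UTF-8 text on ASCII whitespace by scanning raw bytes.
// Multibyte sequences are never cut, because all of their bytes are >= 0x80.
// Runs of whitespace produce no empty tokens.
std::vector<std::string> split_utf8_on_ascii_space(std::string_view text) {
    std::vector<std::string> tokens;
    std::size_t start = 0;
    for (std::size_t i = 0; i <= text.size(); ++i) {
        const bool at_end = (i == text.size());
        if (at_end || is_ascii_space(static_cast<unsigned char>(text[i]))) {
            if (i > start) tokens.emplace_back(text.substr(start, i - start));
            start = i + 1;
        }
    }
    return tokens;
}
```

The cast to unsigned char matters on platforms where char is signed: bytes of multibyte sequences would otherwise show up as negative values.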
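One caveat, though: Unicode defines additional "blank" characters outside the ASCII range. Depending on your application, you may want to treat them as whitespace too.
If you do, they will not be caught by a split on the six ASCII "whitespace" bytes.
In that case, you have to match them as complete multibyte sequences, because that is how they are encoded in UTF-8...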
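If non-ASCII whitespace should also act as a separator, one possible approach is to compare the bytes at a given position against the UTF-8 encodings of the characters you care about. The sketch below assumes C++17. The table kUnicodeSpaces and the function unicode_space_length_at are hypothetical names, and the four entries are only an illustrative subset, not a complete list of Unicode whitespace.

```cpp
#include <array>
#include <cstddef>
#include <string_view>

// Illustrative, incomplete subset of non-ASCII whitespace, as UTF-8 bytes.
constexpr std::array<std::string_view, 4> kUnicodeSpaces = {
    "\xC2\xA0",      // U+00A0 NO-BREAK SPACE
    "\xE2\x80\x83",  // U+2003 EM SPACE
    "\xE2\x80\xA8",  // U+2028 LINE SEPARATOR
    "\xE3\x80\x80",  // U+3000 IDEOGRAPHIC SPACE
};

// Hypothetical helper: if one of the sequences above starts at `pos`,
// returns its length in bytes; otherwise returns 0.
std::size_t unicode_space_length_at(std::string_view text, std::size_t pos) {
    if (pos >= text.size()) return 0;
    for (std::string_view ws : kUnicodeSpaces) {
        if (text.substr(pos, ws.size()) == ws) return ws.size();
    }
    return 0;
}
```

A splitter could call this at each position where the byte is 0x80 or above and skip the returned number of bytes when it is non-zero.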