What is the current modern term "multibyte character set",

Question

What is the current modern term "multibyte character set",

I quite often confuse:

After reading the comments by all participants, plus:

Looking at an old article (2001): http://www.hastingsresearch.com/net/04-unicode-limitations.shtml , which talks about unicode:

- This is a 16-bit character definition that allows theoretically 65,000 characters. However, complete world character sets add up to over 170,000 characters.

and looking at the current “modern” article: http://en.wikipedia.org/wiki/Unicode

The most commonly used UTF-8 encodings (which uses 1 byte for all ASCII characters that have the same code values, as in standard ASCII encoding and up to 4 bytes for other characters), now deprecated UCS-2 (which uses 2 bytes for all characters but does not include a character in the Unicode standard), and UTF-16 (which extends UCS-2, using 4 bytes to encode missing characters from UCS-2).

It seems that in the compilation options in VC2008 , the Unicode options in the Character Sets section really mean "Unicode encoded in UCS-2" (or UTF-16? I'm not sure)

I am trying to verify this by running the following code under VC2008

#include <iostream>

int main()
{
    // Use unicode encoded in UCS-2?
    std::cout << sizeof(L"我爱你") << std::endl;
    // Use unicode encoded in UCS-2?
    std::cout << sizeof(L"abc") << std::endl;
    getchar();

    // Compiled using options Character Set : Use Unicode Character Set.
    // print out 8, 8

    // Compiled using options Character Set : Multi-byte Character Set.
    // print out 8, 8
}

It seems that at compile time with Unicode character set parameters, the result matched my guess.

? "" ?:)

+5

c++ visual-c++ unicode internationalization

Cheok Yan Cheng 10 . '10 3:12

5

multi-byte , .

wikipedia utf8:

UTF-8 ( ) 1 - 4 (8- ), 128 US-ASCII.

, utf8 - : -).

0

Phil Rykoff 10 . '10 3:15

, 1 .

, , , "IBM EBCDIC cp1390 - ".

UNICODE MBCS, , "UNICODE" utf-8, utf-16 utf-32.

"" , MBCS, Microsoft Office. "Windows MBCS". utf-16, . - Microsofts , "Windows MBCS".

0

James Anderson 10 . '10 8:37

MSVC "Unicode" " " , _T("X") L"X". MBCS, _T("X") "X".

, Win32 MessageBox() MessageBoxW() MessageBoxA, Win32, A/W.

0

MSalters 10 . '10 8:53

, VC2008 "Unicode" "Unicode, UCS-2" ( UTF-16? )

Unicode, Utf-16 LE. , , .

? "" ?:)

MBCS is mainly used in the MSDN documentation to refer to DBCS . This is explained in more detail in this blog post . If you want to avoid confusion, you can say "MBCS Code Page".

0

cdiggins Oct 21 '11 at 18:49

source share

MSN · Accepted Answer · 2010-03-10T07:02:01+0000

http://en.wikipedia.org/wiki/Multi-byte_character_set

MBCS - , , , , , . , , , . ISO/IEC 8859 MBCS, , , ISO 2004 , , Unicode.

, , MBCS " Unicode".

What is the current modern term "multibyte character set",

More articles: