Japanese ASCII Code

Where can I get a list of ASCII codes corresponding to Japanese characters, hiragana and katakana games. I am making a java function and Javascript that determines if this is a Japanese character. What is its range in ASCII code?

+4
source share
7 answers

ASCII stands for American Standard Code for Information Interchange, includes only 128 characters (not all of them are even printable) and is based on the needs of American use around 1960. It contains nothing to do with Japanese characters.

I believe you want Unicode code points for some characters, which you can find in the charts provided by unicode.org.

+9
source

See my similar question about Kanji / Kan characters . As @coobird mentions, it can be difficult to determine which range you want to test, since many kanji overlap with Chinese characters.

In short, Unicode ranges for hiragana and katakana:

  • Hiragana: Unicode: 3040-309F
  • Katakana: Unicode: 30A0-30FF

If you find this answer helpful, upvote @coobird answer my question .

が ん ば っ て!

+5
source

Japanese characters will not be in the ASCII range, they will be in Unicode. What do you need, just a char value for each character?

+2
source

I will not rephrase the ASCII part. Just check out the Unicode Code Charts .

0
source

Kanji will have the Unicode Hani "Script" property, hiragana will have the property

Of course, most kanji are characters that are also used in Chinese; given a character like 猫, it is impossible to determine if it is used as a Chinese or Japanese character.

0
source

Well, it has been a while, but there is a link to tables of hiragana, katakana, kanji, etc. and their unicode ...

http://www.rikai.com/library/kanjitables/kanji_codes.unicode.shtml

BUT, since you probably know that Unicodes are hexadecimal. You can convert them to decimal numbers using Windows Calc in programmer mode, and then enter that number as ASCII code, and it will generate the character you need, depending on which one you put it in. It will be in MS Wordpad and Word (Not Notepad).

For example, hiragana ぁ is 3041 in Unicode. 3041 is a hexadecimal and is converted to 12353 in decimal value. If you enter 12353 as an ASCII code in Wordpad or Word, then hold Alt, enter 12353 on the keyboard, then release Alt, it will print ぁ. The range of Japanese characters seems to be Hiragana: 3040-309f (12352-12447 in ASCII), Katakana: 30a0-30ff (12448-12543 in ASCII), Kanji: 4e00-4DB5 (19968-19893 ASCII), so there are several ranges. This map also has a range of width sliders.

0
source

I think what you mean by ASCII code for Japanese is the equivalent of SBCS (Single Byte Character Set) in Japanese. For the Japanese, you only have MBCS (Multi-Byte Character Sets), which has a combination of single-byte characters and multi-byte characters. Thus, for a Japanese text file stored in MBCS, you have non-Japanese characters (English letters and numbers, as well as regular non-alphanumeric characters) stored as one byte, and Japanese characters saved as two bytes.

Assuming you don't mean UNICODE , which is a uniform DBCS (character set with two bytes), where each character has exactly two bytes. In fact, to be more correct, recently UNICODE also has several DBCS, because the character set can no longer accommodate another character. Some UNICODE characters consist of 4 bytes, already having the first two bytes as the leading character.

If you mean the first (MBCS), which, and not UNICODE, then there are many Japanese characters, such as Shift-JIS (more popular). Therefore, I suggest you look for the Shift-JIS character map. Although there are other maps of the Japanese character set besides Shift-JIS.

-2
source

All Articles