Below are mapping (conversion) tables between the JIS X 0213 encodings and Unicode (ISO/IEC 10646 UCS). Feel free to copy them. Note that these tables were made by volunteers and are not authorized by any official standards organization, even though they are intended to be identical to the standard mapping.
These tables are based on the tables for JIS X 0213:2000 available at "JIS-UCS変換表" (JIS-UCS conversion tables) and were modified according to JIS X 0213:2004. The table for Shift_JIS-2004 above is equivalent to the table available at "JIS X 0213:2004対応による変更点" (Changes for JIS X 0213:2004 support).
If you find an error in these tables, please contact the author via the "JIS X 0213 Mailing List" or the 相談所 (help desk) in the JIS X 0213 Wiki.
JIS X 0213 defines two double-byte, ISO/IEC 2022-compliant coded character sets. Kanji Set 1 (or Plane 1) is a superset of JIS X 0208 containing kanji, hiragana, katakana, Latin, Greek, symbols, and so on. Kanji Set 2 (or Plane 2) is a newly designed coded character set containing kanji characters only. JIS X 0213 defines several encoding schemes for these character sets. Some of them are encoding methods that use JIS X 0213 together with ASCII (or JIS X 0201).
Shift_JIS-2004 is the JIS X 0213 version of the Shift_JIS encoding. It is a superset of Shift_JIS. In this encoding, the JIS X 0213 double-byte code is transformed by a formula.
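The text above does not spell the formula out. The following Python sketch shows the commonly documented transformation from a JIS X 0213 position (plane, row, cell, each row and cell in 1..94) to two Shift_JIS-2004 bytes. The function name `sjis2004_bytes` is hypothetical, and the constants are an assumption taken from general descriptions of Shift_JIS-2004 rather than from this page, so check them against the mapping tables before relying on them.

```python
def sjis2004_bytes(plane: int, row: int, cell: int) -> bytes:
    """Sketch: map a JIS X 0213 plane/row/cell position to Shift_JIS-2004 bytes.

    The constants below are assumptions (not taken from this page).
    """
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must be in 1..94")

    # Lead byte: two consecutive rows share one lead byte.
    if plane == 1:
        lead = (row + 0x101) // 2 if row <= 62 else (row + 0x181) // 2
    elif plane == 2:
        if row in (1, 3, 4, 5, 8, 12, 13, 14, 15):
            lead = (row + 0x1DF) // 2 - (row // 8) * 3
        elif 78 <= row <= 94:
            lead = (row + 0x19B) // 2
        else:
            raise ValueError("row is not assigned in plane 2")
    else:
        raise ValueError("plane must be 1 or 2")

    # Trail byte: odd rows use the low half (skipping 0x7F),
    # even rows use the high half.
    if row % 2 == 1:
        trail = cell + 0x3F if cell <= 63 else cell + 0x40
    else:
        trail = cell + 0x9E

    return bytes([lead, trail])


if __name__ == "__main__":
    # Plane 1, row 16, cell 1 is the first kanji of JIS X 0208 (U+4E9C).
    print(sjis2004_bytes(1, 16, 1).hex())
    # Compare with Python's built-in codec for the same character.
    print("\u4e9c".encode("shift_jis_2004").hex())
```

Comparing the result with Python's built-in `shift_jis_2004` codec, as in the last two lines, is a quick way to sanity-check the sketch for individual characters.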
Shift_JISX0213 is the name of the previous version of this encoding. iconv uses the name Shift_JISX0213 for Shift_JIS-2004.
EUC-JIS-2004 is the JIS X 0213 version of the EUC-JP encoding. This encoding is compliant with ISO/IEC 2022, but no escape sequences are used. It is a superset of EUC-JP except for the JIS X 0212 part of EUC-JP (which is rarely used).
0x8E (Single Shift 2) and 0x8F (Single Shift 3) are "single shift" codes; i.e., they affect only the one character that follows. When representing a sequence of plane 2 kanji, 0x8F must appear before every kanji.
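The per-character nature of the single shift can be illustrated with a small Python sketch. It assumes the usual EUC convention of adding 0xA0 to the row and cell numbers (this offset is not stated on this page), and the function names `euc_jis_2004_bytes` and `encode_positions` are hypothetical.

```python
SS2 = 0x8E  # Single Shift 2 (not used in this sketch)
SS3 = 0x8F  # Single Shift 3: the next character comes from plane 2


def euc_jis_2004_bytes(plane: int, row: int, cell: int) -> bytes:
    """Sketch: encode one JIS X 0213 position as EUC-JIS-2004 bytes.

    Assumes row/cell are mapped to bytes by adding 0xA0.
    """
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must be in 1..94")
    body = bytes([row + 0xA0, cell + 0xA0])
    if plane == 1:
        return body
    if plane == 2:
        # The shift applies to this one character only.
        return bytes([SS3]) + body
    raise ValueError("plane must be 1 or 2")


def encode_positions(positions) -> bytes:
    """Encode a sequence of (plane, row, cell) tuples."""
    return b"".join(euc_jis_2004_bytes(*p) for p in positions)


if __name__ == "__main__":
    # Two consecutive plane 2 characters: 0x8F appears before each one.
    print(encode_positions([(2, 1, 1), (2, 1, 2)]).hex(" "))
```

Because 0x8F never changes a persistent state, a decoder can resynchronize at any character boundary, at the cost of three bytes for every plane 2 kanji.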
EUC-JISX0213 is the name of the previous version of this encoding. iconv uses the name EUC-JISX0213 for EUC-JIS-2004.
ISO-2022-JP-2004 is the JIS X 0213 version of the ISO-2022-JP encoding. It is a 7-bit stateful encoding that uses escape sequences to switch coded character sets.
The escape sequences used in this encoding are as follows:
For compatibility reasons, "1B 24 28 4F" (the escape sequence for plane 1 of JIS X 0213:2000) can be used with minor restrictions. Also, "1B 24 42" (the escape sequence for JIS X 0208:1983) can be used with some restrictions.
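To see the escape sequences in practice, the following sketch uses the `iso2022_jp_2004` codec from Python's standard library. The helper name `roundtrip` is hypothetical. Which escape sequence the codec chooses for a given character is an implementation detail of Python, so the sketch only prints the bytes and does not assume a particular sequence.

```python
ESC = b"\x1b"


def roundtrip(text: str):
    """Encode text as ISO-2022-JP-2004 and decode it again.

    Returns (encoded_bytes, decoded_text).
    """
    data = text.encode("iso2022_jp_2004")
    return data, data.decode("iso2022_jp_2004")


if __name__ == "__main__":
    sample = "JIS \u4e9c"  # ASCII followed by one kanji (U+4E9C)
    data, decoded = roundtrip(sample)
    # Every byte stays in the 7-bit range; ESC (0x1B) marks each switch
    # between coded character sets.
    print(data.hex(" "))
    print("escape sequences:", data.count(ESC))
    print("round trip ok:", decoded == sample)
```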
ISO-2022-JP-3 is the name of the previous version of this encoding.
JIS X 0213 7-bit code is a pure double-byte code; i.e., no single-byte code such as ASCII is used with it. It is a superset of the 7-bit code of JIS X 0208.
Planes 1 and 2 of JIS X 0213 are switched by the Shift Out (SO) and Shift In (SI) control characters. The initial state is plane 1. SO switches to plane 2. When SI appears after that, the state returns to plane 1. That is, this encoding is stateful.
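The SO/SI state switching can be sketched as a small decoder that turns a byte stream into (plane, row, cell) positions. The function name `decode_7bit_positions` is hypothetical, and the sketch assumes the usual ISO/IEC 2022 convention that row and cell are the byte values minus 0x20, which this page does not state.

```python
SO = 0x0E  # Shift Out: switch to plane 2
SI = 0x0F  # Shift In: switch back to plane 1


def decode_7bit_positions(data: bytes):
    """Sketch: split a JIS X 0213 7-bit byte stream into positions.

    Returns a list of (plane, row, cell) tuples.
    Assumes row = byte1 - 0x20 and cell = byte2 - 0x20.
    """
    plane = 1  # the initial state is plane 1
    positions = []
    i = 0
    while i < len(data):
        byte = data[i]
        if byte == SO:
            plane = 2
            i += 1
        elif byte == SI:
            plane = 1
            i += 1
        else:
            if i + 1 >= len(data):
                raise ValueError("truncated double-byte character")
            first, second = data[i], data[i + 1]
            if not (0x21 <= first <= 0x7E and 0x21 <= second <= 0x7E):
                raise ValueError("byte outside the 0x21..0x7E range")
            positions.append((plane, first - 0x20, second - 0x20))
            i += 2
    return positions


if __name__ == "__main__":
    stream = bytes([0x30, 0x21, SO, 0x21, 0x21, SI, 0x30, 0x22])
    print(decode_7bit_positions(stream))
```

Unlike the single shifts of EUC-JIS-2004, SO stays in effect until SI appears, so a decoder has to read the stream from the start (or from a known state) to interpret a byte pair correctly.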
JIS X 0213 8-bit code is a pure double-byte code; i.e., no single-byte code such as ASCII is used with it. It is a superset of the 7-bit code of JIS X 0208.