Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

GB 12345

From Wikipedia, the free encyclopedia
Traditional Chinese character set
This articleprovides insufficient context for those unfamiliar with the subject. Please helpimprove the article byproviding more context for the reader.(January 2023) (Learn how and when to remove this message)

GB 12345,[1] entitledCode of Chinese ideogram set for information interchange supplementary set (Chinese:信息交換用漢字編碼字符集 輔助集), is aTraditional Chinesecharacter set standard established byChina, and can be thought as the traditional counterpart ofGB 2312. It is used as an encoding of traditional Chinese characters, although it is not as commonly used asBig5. It has 6,866 characters, and has no relationship nor compatibility withBig5 andCNS 11643.

Characters

[edit]

Characters in GB 12345 are arranged in a 94×94 grid (as inISO/IEC 2022), and the two-byte code point of each character is expressed in thequ-wei form, which specifies a row (qu 区) and the position of the character within the row (cell,wei 位).

The rows (numbered from 1 to 94) contain characters as follows:[2]

  • 01–09: identical toGB 2312, except in row 06 position 57–85, added 29 vertical punctuation forms, and in row 08 position 27–32, added 6pinyin characters from GB 5007.1–85, the correction of GB 2312.
  • 16–87: arranged the traditional character forms which replaced their simplified forms from GB 2312.
  • 88–89: 103Chinese characters which is merged due to the simplification of Chinese characters.

The rows 10–15 and 90–94 are unassigned.

Encodings

[edit]

The specification for theISO-2022-CN-EXT encoding states that the sequenceESC $ ) followed by a yet-undetermined byte (shown by the placeholder<X12345>) can be used to indicateGB 12345 characters, similarly to the sequenceESC $ ) A (also with theESC $ ) prefix) indicatingGB 2312, but only after it receives a registration in theISO-IR registry specifying what the final byte of the sequence is.[3] As of 2023[update], no such registration exists.[4] However, the sameRequest for Comments also defines the encoding labelCN-GB-12345 forGB 12345 used with ASCII in a manner analogous toEUC-CN.[3]

Inclusion of non-standard Traditional Chinese characters

[edit]

GB/T 12345 includes a few traditional characters which are different from the table of correspondences between Simplified Chinese characters and Traditional Chinese characters in the standardGeneral List of Simplified Chinese Characters.

  • 隷 (33-05): The traditional counterpart of 隶 is 隸. 隷 is listed as a variant form in theFirst List of Processed Variant Chinese Characters.
  • 𨻶 (47-22): 隙 has no traditional correspondence in the standard.
  • 鳧 (57-76): The traditional counterpart of 凫 is 鳬. 鳧 is not in theFirst List of Processed Variant Chinese Characters either.

GB 12345 and Unicode

[edit]

The characters in GB 12345 were taken as one of the sources for theHan unification which led to the unified set ofCJK characters in the initialISO 10646/Unicode standard. All the 6,866 Chinese characters were incorporated.

See also

[edit]

References

[edit]
  1. ^"GB/T 12345-1990: Code of Chinese ideogram set for information interchange--Supplementary set".Standardization Administration of the People's Republic of China. Retrieved2022-10-01.
  2. ^Lunde, Ken (2009).CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.).Sebastopol, CA:O'Reilly. pp. 99–103.ISBN 978-0-596-51447-1.
  3. ^abZhu, HF.; Hu, DY.; Wang, ZG.; Kao, TC.; Chang, WCH.; Crispin, M. (1996).Chinese Character Encoding for Internet Messages.IETF.doi:10.17487/RFC1922.RFC1922.Note: Currently, there are some GB sets that have not been registered in ISO. Here <X7589>, <X7590>, <X12345>, <X13131> and <X13132> represent the final character that will be assigned by ISO for those sets. These GB sets shall only be used once these final characters are assigned.
  4. ^ISO-IR: ISO/IEC International Register of Coded Character Sets To Be Used With Escape Sequences(PDF) (Registry Index). ITSCJ/IPSJ. Archived fromthe original(PDF) on 2023-05-12. Retrieved2023-05-13.

External links

[edit]
Chinese, Japanese and Korean computing
Encodings
Chinese
Japanese
Korean
International
Input methods
Fonts
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=GB_12345&oldid=1301082649"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp