Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Thai Industrial Standard 620-2533

From Wikipedia, the free encyclopedia
Thai language character set and encoding
icon
This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Thai Industrial Standard 620-2533" – news ·newspapers ·books ·scholar ·JSTOR
(January 2017) (Learn how and when to remove this message)

Thai Industrial Standard 620-2533, commonly referred to asTIS-620, is the most common single-bytecharacter encoding for theThai language.[citation needed] The standard is published by theThai Industrial Standards Institute (TISI), an organ of the Ministry of Industry under the Royal Thai Government, and is the sole official standard for encoding Thai inThailand.

The descriptive name of the standard is "Standard for Thai Character Codes for Computers" (Thai: รหัสสำหรับอักขระไทยที่ใช้กับคอมพิวเตอร์). "2533" refers to year 2533 of theBuddhist Era (1990), the year the present version of the standard was published; a previous revision, TIS 620-2529 (1986), is now obsolete. The code page layout is the same between the two editions.[1]

TIS-620 is theIANA preferred charset name for TIS-620, and that charset name is used also forISO/IEC 8859-11 (which adds a no-break space character at 0xA0, which is unassigned in TIS-620). When the IANA name is used the codes are supplemented with theC0 and C1 control codes fromISO/IEC 6429.

Structure

[edit]

TIS-620 is a conventionally structuredExtended ASCII national character set that retains full compatibility with 7-bitASCII and uses the 8-bit range hex A1 to FB for encoding theThai alphabet. Due to the complex combining nature of Thai vowels and diacritics, TIS-620 is intended for information interchange only, and an additional display engine is required to compose characters correctly.

Variants

[edit]

A nearly identical version of TIS-620 has been adopted asISO/IEC 8859-11 in 2001, the sole difference being that ISO/IEC 8859-11 defines hex A0 as anon-breaking space, while TIS-620 leaves it undefined but reserved. (In practice, this small distinction is usually ignored.)

The ISO/IEC 8859-11 set has also been registered asISO-IR-166 byEcma International, but this variation adds explicit escape codes for signaling the beginning and end of Thai character sequences.

The TIS-620 character set ordering has been used essentially as is withinUnicode (ISO/IEC 10646) as well. Unicode'sThai block is U+0E01 through U+0E7F, and TIS-620 Thai characters can be converted toUTF-16 simply by prefixing each byte with 0E and subtracting hex A0 from the value.

Character set

[edit]
TIS-620[2]
0123456789ABCDEF
0x
1x
2x SP !"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~
8x
9x
Ax
Bx
Cx
Dxัำิีึืฺุู฿
Ex็่้๊๋์ํ๎
Fx

In the table above, 20 is the regular SPACE character. Code values 00-1F, 7F, 80-9F, A0, DB-DE and FC-FF are not assigned to characters by TIS-620.

Code values D1, D4-DA, E7-EE arecombining characters.

Further reading

[edit]

References

[edit]
  1. ^Meru, Ibrahim (1996-12-03)."Re: Thai encoding standards".Unicode Mail List Archive.
  2. ^Leisher, Mark (1998-03-06),TCCII 2533 1009 / TIS 620 Thai, TIS620.TXT

External links

[edit]
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Thai_Industrial_Standard_620-2533&oldid=1282873418"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp