NAME
Encode::TW - Taiwan-based Chinese Encodings
SYNOPSIS
use Encode qw/encode decode/; $big5 = encode("big5", $utf8); # loads Encode::TW implicitly$utf8 = decode("big5", $big5); # ditto
DESCRIPTION
This module implements tradition Chinese charset encodings as used in Taiwan and Hong Kong. Encodings supported are as follows.
Canonical AliasDescription--------------------------------------------------------------------big5-eten /\bbig-?5$/iBig5 encoding (with ETen extensions) /\bbig5-?et(en)?$/i /\btca-?big5$/ibig5-hkscs /\bbig5-?hk(scs)?$/i /\bhk(scs)?-?big5$/i Big5 + Cantonese characters in Hong KongMacChineseTradBig5 + Apple Vendor Mappingscp950 Code Page 950 = Big5 + Microsoft vendor mappings--------------------------------------------------------------------
To find out how to use this module in detail, seeEncode.
NOTES
Due to size concerns,EUC-TW
(Extended Unix Character),CCCII
(Chinese Character Code for Information Interchange),BIG5PLUS
(CMEX's Big5+) andBIG5EXT
(CMEX's Big5e) are distributed separately on CPAN, under the nameEncode::HanExtra. That module also contains extra China-based encodings.
BUGS
Since the originalbig5
encoding (1984) is not supported anywhere (glibc and DOS-based systems usesbig5
to meanbig5-eten
; Microsoft usesbig5
to meancp950
), a conscious decision was made to aliasbig5
tobig5-eten
, which is the de facto superset of the original big5.
TheCNS11643
encoding files are not complete. For commonCNS11643
manipulation, please useEUC-TW
inEncode::HanExtra, which contains planes 1-7.
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.
SEE ALSO
Module Install Instructions
To install Encode, copy and paste the appropriate command in to your terminal.
cpanm Encode
perl -MCPAN -e shellinstall Encode
For more information on module installation, please visitthe detailed CPAN module installation guide.