Movatterモバイル変換


[0]ホーム

URL:


Encode::JP
(source,CPAN)
version 2.04
You are viewing the version of this documentation from Perl 5.24.0.View the latest version

CONTENTS

#NAME

Encode::JP - Japanese Encodings

#SYNOPSIS

use Encode qw/encode decode/; $euc_jp = encode("euc-jp", $utf8);   # loads Encode::JP implicitly$utf8   = decode("euc-jp", $euc_jp); # ditto

#ABSTRACT

This module implements Japanese charset encodings. Encodings supported are as follows.

Canonical   AliasDescription--------------------------------------------------------------------euc-jp      /\beuc.*jp$/iEUC (Extended Unix Character)            /\bjp.*euc/i           /\bujis$/ishiftjis    /\bshift.*jis$/iShift JIS (aka MS Kanji)        /\bsjis$/i7bit-jis    /\bjis$/i7bit JISiso-2022-jpISO-2022-JP                  [RFC1468]              = 7bit JIS with all Halfwidth Kana                 converted to Fullwidthiso-2022-jp-1ISO-2022-JP-1                [RFC2237]                              = ISO-2022-JP with JIS X 0212-1990                support.  See belowMacJapanese                Shift JIS + Apple vendor mappingscp932       /\bwindows-31j$/i Code Page 932                              = Shift JIS + MS/IBM vendor mappingsjis0201-raw                   JIS0201, raw formatjis0208-raw                   JIS0201, raw formatjis0212-raw                   JIS0201, raw format--------------------------------------------------------------------

#DESCRIPTION

To find out how to use this module in detail, seeEncode.

#Note on ISO-2022-JP(-1)?

ISO-2022-JP-1 (RFC2237) is a superset of ISO-2022-JP (RFC1468) which adds support for JIS X 0212-1990. That means you can use the same code to decode to utf8 but not vice versa.

$utf8 = decode('iso-2022-jp-1', $stream);

and

$utf8 = decode('iso-2022-jp',   $stream);

yield the same result but

$with_0212 = encode('iso-2022-jp-1', $utf8);

is now different from

$without_0212 = encode('iso-2022-jp', $utf8 );

In the latter case, characters that map to 0212 are first converted to U+3013 (0xA2AE in EUC-JP; a white square also known as 'Tofu' or 'geta mark') then fed to the decoding engine. U+FFFD is not used, in order to preserve text layout as much as possible.

#BUGS

The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.

#SEE ALSO

Encode

Perldoc Browser is maintained by Dan Book (DBOOK). Please contact him via theGitHub issue tracker oremail regarding any issues with the site itself, search, or rendering of documentation.

The Perl documentation is maintained by the Perl 5 Porters in the development of Perl. Please contact them via thePerl issue tracker, themailing list, orIRC to report any issues with the contents or format of the documentation.


[8]ページ先頭

©2009-2026 Movatter.jp