1- <!-- $Header: /cvsroot/pgsql/doc/src/sgml/charset.sgml,v 2.24 2002/04/03 05:39:27 petere Exp $ -->
1+ <!-- $Header: /cvsroot/pgsql/doc/src/sgml/charset.sgml,v 2.25 2002/07/24 05:51:56 ishii Exp $ -->
22
33<chapter id="charset">
44 <title>Localization</>
@@ -326,7 +326,7 @@ perl: warning: Falling back to the standard locale ("C").
326326
327327 <para>
328328 Tatsuo Ishii (<email>ishii@postgresql.org</email>),
329- last updated2000-03-22 .
329+ last updated2002-07-24 .
330330 Check <ulink
331331 url="http://www.sra.co.jp/people/t-ishii/PostgreSQL/">Tatsuo's
332332 web site</ulink> for more information.
@@ -346,21 +346,19 @@ perl: warning: Falling back to the standard locale ("C").
346346 overridden when you create a database using
347347 <application>createdb</application> or by using the SQL command
348348 <command>CREATE DATABASE</>. So you can have multiple databases each with
349- a different encoding system.
349+ a different encoding system. Note that <acronym>MB</acronym> can
350+ handle single byte characters sets such as ISO-8859-1.
350351 </para>
351352
352- <sect2>
353- <title>Enabling Multibyte Support</title>
354-
355353 <para>
356- Run configure with the multibyte option:
354+ Multibyte support is enabled by default since PostgreSQL version 7.3.
355+ </para>
357356
358- <synopsis>
359- ./configure --enable-multibyte<optional>=<replaceable>encoding_system</replaceable></optional>
360- </synopsis>
357+ <sect2>
358+ <title>Supported character set encodings</title>
361359
362- where <replaceable>encoding_system</replaceable> can be one of the
363- values in the following table:
360+ <para>
361+ Following encoding can be used as database encoding.
364362
365363 <table tocentry="1">
366364 <title>Character Set Encodings</title>
@@ -508,23 +506,8 @@ perl: warning: Falling back to the standard locale ("C").
508506 <literal>LATIN8</>, and <literal>LATIN10</>.
509507 </para>
510508 </important>
509+ </sect2>
511510
512- <para>
513- Here is an example of configuring
514- <productname>PostgreSQL</productname> to use a Japanese encoding by
515- default:
516-
517- <screen>
518- $ <userinput>./configure --enable-multibyte=EUC_JP</userinput>
519- </screen>
520- </para>
521-
522- <para>
523- If the encoding system is omitted (<literal>./configure --enable-multibyte</literal>),
524- <literal>SQL_ASCII</> is assumed.
525- </para>
526- </sect2>
527-
528511 <sect2>
529512 <title>Setting the Encoding</title>
530513
@@ -539,8 +522,8 @@ $ <userinput>initdb -E EUC_JP</>
539522 sets the default encoding to <literal>EUC_JP</literal> (Extended Unix Code for Japanese).
540523 Note that you can use <option>--encoding</option> instead of <option>-E</option> if you prefer
541524 to type longer option strings.
542- If no <option>-E</> or <option>--encoding</option> option is given, the encoding
543- specified at configure time is used.
525+ If no <option>-E</> or <option>--encoding</option> option is
526+ given, SQL_ASCII is used.
544527 </para>
545528
546529 <para>
@@ -583,14 +566,17 @@ $ <userinput>psql -l</userinput>
583566 </sect2>
584567
585568 <sect2>
586- <title>Automatic encodingtranslation between server and
569+ <title>Automatic encodingconversion between server and
587570 client</title>
588571
589572 <para>
590573 <productname>PostgreSQL</productname> supports an automatic
591- encoding translation between server
592- and client for some encodings. The available combinations are
593- listed in <xref linkend="multibyte-translation-table">.
574+ encoding conversion between server and client for some
575+ encodings. The conversion info is stored in pg_converson system
576+ catalog. You can create a new conversion by using <command>CREATE
577+ CONVERSION</command>. PostgreSQL comes with some predefined
578+ conversions. They are listed in <xref
579+ linkend="multibyte-translation-table">.
594580 </para>
595581
596582 <table tocentry="1" id="multibyte-translation-table">
@@ -887,6 +873,18 @@ RESET CLIENT_ENCODING;
887873be overridden using any of the other methods mentioned above.)
888874 </para>
889875 </listitem>
876+
877+ <listitem>
878+ <para>
879+ Using client_encoding variable.
880+
881+ If client_encoding variable in postgresql.conf is set, that
882+ client encoding is automatically selected when a connection to the
883+ server is made. (This can subsequently be overridden using any of the
884+ other methods mentioned above.)
885+ </para>
886+ </listitem>
887+
890888 </itemizedlist>
891889 </para>
892890 </sect2>
@@ -909,6 +907,10 @@ RESET CLIENT_ENCODING;
909907 The Unicode conversion functionality is automatically enabled
910908 if <option>--enable-multibyte</option> is specified.
911909 </para>
910+ <para>
911+ For 7.3, <option>--enable-unicode-conversion</option> nor
912+ <option>--enable-multibyte</option> is needed.
913+ </para>
912914 </sect2>
913915
914916 <sect2>