Movatterモバイル変換


[0]ホーム

URL:


Quick links:help overview ·quick reference ·user manual toc ·reference manual toc·faq
Go to keyword (shortcut:k)
Site search (shortcut:s)
mbyte.txt     ForVim version 9.1.  Last change: 2024 Jul 17VIM REFERENCE MANUAL  by Bram Moolenaar et al.Multi-byte supportmultibytemulti-byteChineseJapaneseKoreanThisis about editing text in languages which have many characters that cannot be represented using one byte (one octet).  Examples are Chinese,Japaneseand Korean.Unicodeis also covered here.For an introduction to the most common features, seeusr_45.txt in the usermanual.Forchanging the language ofmessages andmenus seemlang.txt.1.  Getting startedmbyte-first2.  Localembyte-locale3.  Encodingmbyte-encoding4.  Usingaterminalmbyte-terminal5.  Fonts onX11mbyte-fonts-X116.  Fonts onMS-Windowsmbyte-fonts-MSwin7.  Input onX11mbyte-XIM8.  Input onMS-Windowsmbyte-IME9.  Input witha keymapmbyte-keymap10. Input with imactivatefunc()mbyte-func11. UsingUTF-8mbyte-utf812. Overview ofoptionsmbyte-optionsNOTE: This file containsUTF-8 characters.  These may show upas strangecharacters or boxes when using another encoding.==============================================================================1. Getting startedmbyte-firstThisisa summary of themultibyte features in Vim.  If you are luckyit worksas described and you can start using Vim without much trouble.  If somethingdoesn't work you will have to read the rest.  Don't be surprised ifit takesquitea bit of work and experimenting to make Vim use all themultibytefeatures.  Unfortunately, every system has its own way to deal withmultibytelanguages anditis quite complicated.LOCALEFirst of all, youmust make sure your currentlocaleis set correctly.  Ifyour system has been installed to use the language,it probably works rightaway.  If not, you can often makeit work by setting the $LANG environmentvariable in your shell:setenv LANG ja_JP.EUCUnfortunately, the name of thelocale depends on your system.Japanese mightalso be called "ja_JP.EUCjp" or just "ja".  To see whatis currently used::languageTo change thelocale inside Vim use::language ja_JP.EUCVim will give an error message if this doesn't work.  Thisisa good way toexperiment and find thelocale name you want to use.  But it's always betterto set thelocale in the shell, so thatitis used right from the start.Seembyte-locale for details.ENCODINGIf yourlocale works properly, Vim will try to set the'encoding' optionaccordingly.  If this doesn't work you can overrule its value::set encoding=utf-8Seeencoding-values foralist of acceptable values.The resultis that all the text thatis used inside Vim will be in thisencoding.  Not only the text in the buffers, but also in registers, variables,etc.  This also means thatchanging the value of'encoding' makes the existingtext invalid!  The text doesn't change, butit will be displayed wrong.You can edit files in another encoding than what'encoding'is set to.  Vimwill convert the file when you readit and convertit back when you write it.See'fileencoding','fileencodings' and++enc.DISPLAY AND FONTSIf you are working inaterminal (emulator) youmust make sureit accepts thesame encodingas which Vimis working with.  If thisis not the case, you canuse the'termencoding' option to make Vim convert text automatically.For theGUI youmust select fonts that work with the current'encoding'.  Thisis the difficult part.  It depends on the system you are using, thelocale anda few other things.  See the chapters on fonts:mbyte-fonts-X11 forX-Windows andmbyte-fonts-MSwin for MS-Windows.ForGTK+ 2, you can skip most of this section.  The option'guifontset' doesno longer exist.  You only need to set'guifont' and everything should "justwork".  If your system comes with Xft2 and fontconfig and the current fontdoes not containa certain glyph,a different font will be used automaticallyif available.  The'guifontwide' optionis still supported but usually youdonot need to set it.  Itis only necessary if the automatic font selection doesnot suit your needs.ForX11 you can set the'guifontset' option toalist of fonts that togethercover the characters that are used.  Example for Korean::set guifontset=k12,r12Alternatively, you can set'guifont' and'guifontwide'.'guifont'is used forthe single-width characters,'guifontwide' for the double-width characters.Thus the'guifontwide' fontmust be exactly twiceas wideas'guifont'.Example for UTF-8::set guifont=-misc-fixed-medium-r-normal-*-18-120-100-100-c-90-iso10646-1:set guifontwide=-misc-fixed-medium-r-normal-*-18-120-100-100-c-180-iso10646-1You can also set'guifont' alone, Vim will try to finda matching'guifontwide' for you.INPUTThere are several ways to entermultibyte characters:- ForX11XIM can be used.  SeeXIM.- ForMS-WindowsIME can be used.  SeeIME.- For all systems keymaps can be used.  Seembyte-keymap.Theoptions'iminsert','imsearch' and'imcmdline' can be used to choosethe different input methods or disable them temporarily.==============================================================================2.  Localembyte-localeThe easiest setupis when your whole system uses thelocale you want to workin.  But it's also possible to set thelocale for one shell you are workingin, or just usea certainlocale inside Vim.WHAT IS A LOCALE?localeThere are many languages in the world.  And there are different cultures andenvironmentsat leastas manyas the number of languages.A linguisticenvironment corresponding to an areais called "locale".  This includesinformation about the used language, the charset, collating order for sorting,date format, currency format and so on.  For Vim only the language andcharsetreally matter.You can only usealocale if your system has support for it.  Some systemshave onlya few locales, especially in the USA.  The language which you wantto use may not be on your system.  In thatcase you might be able toinstallitas an extra package.  Check your system documentation for how todo that.The location in which the locales are installed varies from system to system.For example, "/usr/share/locale" or "/usr/lib/locale".  See your system'ssetlocale() man page.Looking in these directories will show you the exact name of each locale.Mostly upper/lowercase matters, thus "ja_JP.EUC" and "ja_jp.euc" aredifferent.  Some systems havea locale.alias file, which allows translationfroma short name like "nl" to the full name "nl_NL.ISO_8859-1".Note that X-windows has its ownlocale stuff.  And unfortunately useslocalenames different from whatis used elsewhere.  Thisis confusing!  For Vimitmatters what the setlocale() function uses, whichis generally NOT theX-windows stuff.  You might have todo some experiments to find out whatreally works.locale-nameThe (simplified) format oflocale name is:languageorlanguage_territoryorlanguage_territory.codesetTerritory means the country (or part of it),codeset means thecharset.  Forexample, thelocale name "ja_JP.eucJP" means:jathe languageisJapaneseJPthe countryis JapaneucJPthecodesetis EUC-JPButit also could be "ja", "ja_JP.EUC", "ja_JP.ujis", etc.  And unfortunately,thelocale name fora specific language, territory andcodesetis not unifiedand depends on your system.Examples oflocale name:    charset    language  locale name    GB2312Chinese (simplified)  zh_CN.EUC, zh_CN.GB2312    Big5Chinese (traditional) zh_TW.BIG5, zh_TW.Big5    CNS-11643Chinese (traditional) zh_TW    EUC-JPJapanese  ja, ja_JP.EUC, ja_JP.ujis, ja_JP.eucJP    Shift_JISJapanese  ja_JP.SJIS, ja_JP.Shift_JIS    EUC-KRKorean  ko, ko_KR.EUCUSING A LOCALETo start usingalocale for the whole system, see the documentation of yoursystem.  Mostly you need to setit ina configuration file in "/etc".To usealocale ina shell, set the $LANG environment value.  When you want touseKorean and thelocale nameis "ko",do this:    sh:    export LANG=ko    csh:   setenv LANG koYou canput this in your ~/.profile or ~/.cshrc file to always use it.To usealocale in Vim only, use the:language command::language koPut this in your ~/.vimrc file to useit always.Or specify $LANG whenstarting Vim:   sh:    LANG=ko vim{vim-arguments}   csh:  env LANG=ko vim{vim-arguments}You could makea small shellscript for this.==============================================================================3.  Encodingmbyte-encodingVim uses the'encoding' option to specify how characters are identified andencoded when they are used inside Vim.  This applies to all the places wheretextis used, includingbuffers (files loaded into memory),registers andvariables.charsetcodesetCharsetis another name for encoding.  There are subtle differences, but thesedon't matter when using Vim.  "codeset"is another similar name.Each characteris encodedas one or more bytes.  When all characters areencoded with one byte, we call thisa single-byte encoding.  The most oftenused oneis called "latin1".  Thislimits the number of characters to 256.Some of these arecontrol characters, thus even fewer can be used for text.When some characters use two or more bytes, we call thisamultibyteencoding.  This allows using much more than 256 characters, whichis requiredfor most East Asian languages.Mostmultibyte encodings use one byte for the first 127 characters.  Theseare equal to ASCII, which makesiteasy to exchange plain-ASCII text, nomatter what languageis used.  Thus you might see the right text even when theencoding was set wrong.encoding-namesVim can use many different character encodings.  There are three major groups:1   8bitSingle-byte encodings, 256 different characters.  Mostly usedin USA and Europe.  Example: ISO-8859-1 (Latin1).  Allcharacters occupy one screen cell only.2   2byteDouble-byte encodings, over 10000 different characters.Mostly used in Asian countries.  Example: euc-kr (Korean)The number of screen cellsis equal to the number of bytes(except for euc-jp when the first byteis 0x8e).uUnicodeUniversal encoding, can replace all others.  ISO 10646.Millions of different characters.  Example: UTF-8.  Therelation between bytes and screen cellsis complex.Other encodings cannot be used by Vim internally.  But files in otherencodings can be edited by using conversion, see'fileencoding'.Note that all encodingsmust use ASCII for the characters up to 128 (exceptwhen compiled for EBCDIC).Supported'encoding' values are:encoding-values1   latin18-bit characters (ISO 8859-1, also used for cp1252)1   iso-8859-nISO_8859 variant (n= 2 to 15)1   koi8-rRussian1   koi8-uUkrainian1   macroman    MacRoman (Macintosh encoding)1   8bit-{name} any 8-bit encoding (Vim specific name)1   cp437similar to iso-8859-11   cp737similar to iso-8859-71   cp775Baltic1   cp850similar to iso-8859-41   cp852similar to iso-8859-11   cp855similar to iso-8859-21   cp857similar to iso-8859-51   cp860similar to iso-8859-91   cp861similar to iso-8859-11   cp862similar to iso-8859-11   cp863similar to iso-8859-81   cp865similar to iso-8859-11   cp866similar to iso-8859-51   cp869similar to iso-8859-71   cp874Thai1   cp1250Czech, Polish, etc.1   cp1251Cyrillic1   cp1253Greek1   cp1254Turkish1   cp1255Hebrew1   cp1256Arabic1   cp1257Baltic1   cp1258Vietnamese1   cp{number}MS-Windows: any installed single-byte codepage2   cp932Japanese (Windows only)2   euc-jpJapanese (Unix only)2   sjisJapanese (Unix only)2   cp949Korean (Unix and Windows)2   euc-krKorean (Unix only)2   cp936simplifiedChinese (Windows only)2   euc-cnsimplifiedChinese (Unix only)2   cp950traditionalChinese (onUnix alias for big5)2   big5traditionalChinese (on Windows alias for cp950)2   euc-twtraditionalChinese (Unix only)2   2byte-{name} Unix: any double-byte encoding (Vim specific name)2   cp{number}MS-Windows: any installed double-byte codepageuutf-832 bitUTF-8 encodedUnicode (ISO/IEC 10646-1)u   ucs-216 bit UCS-2 encodedUnicode (ISO/IEC 10646-1)u   ucs-2lelike ucs-2, little endianu   utf-16ucs-2 extended with double-words for more charactersu   utf-16lelike utf-16, little endianu   ucs-432 bit UCS-4 encodedUnicode (ISO/IEC 10646-1)u   ucs-4lelike ucs-4, little endianThe{name} can be any encoding name that your system supports.  Itis passedtoiconv() to convert between the encoding of the file and the current locale.ForMS-Windows "cp{number}" means using codepage{number}.Examples::set encoding=8bit-cp1252:set encoding=2byte-cp932TheMS-Windows codepage 1252is very similar to latin1.  For practical reasonsthe same encodingis used and it's called latin1.'isprint' can be used todisplay the characters 0x80- 0xA0 or not.Several aliases can be used, they are translated to one of the names above.An incomplete list:1   ansisameas latin1 (obsolete, for backward compatibility)2   japanJapanese: onUnix "euc-jp", onMS-Windows cp9322   koreaKorean: onUnix "euc-kr", onMS-Windows cp9492   prcsimplified Chinese: onUnix "euc-cn", onMS-Windows cp9362   chinese     sameas "prc"2   taiwantraditional Chinese: onUnix "euc-tw", onMS-Windows cp950uutf8sameasutf-8uunicodesameas ucs-2u   ucs2besameas ucs-2 (big endian)u   ucs-2besameas ucs-2 (big endian)u   ucs-4besameas ucs-4 (big endian)u   utf-32sameas ucs-4u   utf-32lesameas ucs-4le    default     stands for the default value of'encoding', depends on theenvironmentFor the UCS codes the byte order matters.  Thisis tricky, useUTF-8 wheneveryou can.  The defaultis to use big-endian (most significant byte comesfirst):    namebyteschar    ucs-2      11 22    1122    ucs-2le      22 11    1122    ucs-411 22 33 4411223344    ucs-4le44 33 22 1111223344OnMS-Windows systems you often want to use "ucs-2le", becauseit uses littleendian UCS-2.There area few encodings which are similar, but not exactly the same.  Vimtreats themas if they were different encodings, so that conversion will bedone when needed.  You might want to use the similar name to avoid conversionor when conversionis not possible:cp932, shift-jis, sjiscp936, euc-cnencoding-tableNormally'encoding'is equal to your currentlocale and'termencoding'isempty.  This means that your keyboard and display work with characters encodedin your current locale, and Vim uses the same characters internally.You can make Vim use characters ina different encoding by setting the'encoding' option toa different value.  Since the keyboard and display stilluse the current locale, conversion needs to be done.  The'termencoding' thentakes over the value of the current locale, so Vim converts between'encoding'and'termencoding'.  Example::let &termencoding = &encoding:set encoding=utf-8However, not all combinations of values are possible.  The table below tellsyou how each of the nine combinations works.  Thisis further restricted bynot all conversions being possible,iconv() being present, etc.  Since thisdepends on the system used, no detailedlist can be given.('tenc'is the short name for'termencoding' and'enc' short for'encoding')'tenc'    'enc'remark 8bit    8bitWorks.  When'termencoding'is different from'encoding' typing and displaying may be wrong for somecharacters, Vim does NOT perform conversion (set'encoding' to "utf-8" to get this). 8bit      2byteMS-Windows: works for all codepages installed on yoursystem; you can only type 8bit characters;Other systems: does NOT work. 8bitUnicodeWorks, but only 8bit characters can be typed directly(others through digraphs, keymaps, etc.); inaterminal you can only see 8bit characters; theGUI canshow all characters that the'guifont' supports. 2byte    8bitWorks, but typing non-ASCII characters mightbea problem. 2byte   2byteMS-Windows: works for all codepages installed on yoursystem; typing characters might bea problem whenlocaleis different from'encoding'.Other systems: Only works when'termencoding'is equalto'encoding', you mightas well leaveit empty. 2byteUnicodeworks, Vim will translate typed characters.Unicode    8bitworks (unusual)Unicode    2bytedoes NOT workUnicodeUnicodeworks very well (leaving'termencoding' empty worksthe same way, because allUnicodeis handledinternallyas UTF-8)CONVERSIONcharset-conversionVim will automatically convert from one to another encoding in several places:- When readinga file and'fileencoding'is different from'encoding'- Whenwritinga file and'fileencoding'is different from'encoding'- When displaying characters and'termencoding'is different from'encoding'- When reading input and'termencoding'is different from'encoding'- When displayingmessages and the encoding used for LC_MESSAGES differs from'encoding' (requiresa gettext version that supports this).- When readinga Vimscript where:scriptencodingis different from'encoding'.- When reading orwritingaviminfo file.Most of these require the+iconv feature.  Conversion for reading andwriting files may also be specified with the'charconvert' option.Useful utilities for converting the charset:    All:    iconvGNU iconv can convert most encodings.Unicodeis usedas theintermediate encoding, which allows conversion from and to all otherencodings.  Seehttp://www.gnu.org/directory/libiconv.html.    Japanese:    nkfNkfis "Network Kanji code conversion Filter".  One of the most uniquefacility of nkfis the guess of the input Kanji code.  So, you don'tneed to know what the inputting file'scharset is.  When convert toEUC-JP from ISO-2022-JP or Shift_JIS, simplydo the following commandin Vim:    :%!nkf-eNkf can be found at:http://www.sfc.wide.ad.jp/~max/FreeBSD/ports/distfiles/nkf-1.62.tar.gz    Chinese:    hcHcis "Hanzi Converter".  Hc converta GB file toa Big5 file, or Big5file to GB file.  Hc can be found at:https://www.freshports.org/chinese/hcftp://ftp.cuhk.hk/pub/chinese/ifcss/software/unix/convert/hc-30.tar.gz    Korean:    hmconvHmconvisKorean code conversion utility especially for E-mail.  It canconvert between EUC-KR and ISO-2022-KR.  Hmconv can be found at:https://www.freshports.org/korean/hmconv/    Multilingual:   lvLvisa Powerful Multilingual File Viewer.  Andit can be workedascharset converter.  Supportedcharset: ISO-2022-CN, ISO-2022-JP,ISO-2022-KR, EUC-CN, EUC-JP, EUC-KR, EUC-TW, UTF-7, UTF-8, ISO-8859series, Shift_JIS, Big5 and HZ.  Lv can be foundat (link seems dead):http://www.ff.iij4u.or.jp/~nrt/lv/index.htmlmbyte-conversionWhen reading andwriting files in an encoding different from'encoding',conversion needs to be done.  These conversions are supported:- All conversions between Latin-1 (ISO-8859-1), UTF-8, UCS-2 and UCS-4 are  handled internally.- For MS-Windows, when'encoding'isaUnicode encoding, conversion from and  to any codepage should work.- Conversion specified with'charconvert'- Conversion with the iconv library, ifitis available.Old versions of GNUiconv() may cause the conversion to fail (theyrequesta very large buffer, more than Vimis willing to provide).Try getting anothericonv() implementation.iconv-dynamicOnMS-Windows Vim can be compiled with the+iconv/dyn feature.  This meansVim will search for the "iconv.dll" and "libiconv.dll" libraries.  Whenneither of them can be found Vim will still work but some conversions won't bepossible.==============================================================================4. Usingaterminalmbyte-terminalTheGUI fully supportsmultibyte characters.  Itis also possible inaterminal, if theterminal supports the same encoding that Vim uses.  Thus thisisless flexible.For example, you can run Vim in an xterm with addedmultibyte support and/orXIM.  Examples are kterm (Kanji term) and hanterm (for Korean),Eterm(Enlightened terminal) and rxvt.If yourterminal does not support the right encoding, you can set the'termencoding' option.  Vim will then convert the typed characters from'termencoding' to'encoding'.  And displayed text will be converted from'encoding' to'termencoding'.  If the encoding supported by theterminaldoesn't include all the characters that Vim uses, this leads to lostcharacters.  This may mess up the display.  If you useaterminal thatsupports Unicode, suchas the xterm mentioned below,it should work just fine,since nearly every character set can be converted toUnicode without loss ofinformation.UTF-8 IN XFREE86 XTERMUTF8-xtermThisisa short explanation of how to useUTF-8 character encoding in thexterm that comes with XFree86 by Thomas Dickey (text by Markus Kuhn).Get the latest xterm version which has nowUTF-8 support:http://invisible-island.net/xterm/xterm.htmlCompileit with "./configure --enable-wide-chars; make"Also get the ISO 10646-1 version ofvarious fonts, whichis available onhttp://www.cl.cam.ac.uk/~mgk25/download/ucs-fonts.tar.gzandinstall the fontas described in the README file.Now start xterm with  xterm -u8 -fn -misc-fixed-medium-r-semicondensed--13-120-75-75-c-60-iso10646-1or, for bigger character:  xterm -u8 -fn -misc-fixed-medium-r-normal--15-140-75-75-c-90-iso10646-1and you will havea workingUTF-8terminal emulator.  Try both   cat utf-8-demo.txt   vim utf-8-demo.txtwith the demo text that comes with ucs-fonts.tar.gz in order to seewhether there are any problems withUTF-8 in your xterm.For Vim you may need to set'encoding' to "utf-8".==============================================================================5.  Fonts onX11mbyte-fonts-X11Unfortunately, using fonts inX11is complicated.  The name ofa single-bytefontisa long string.  Formultibyte fonts we need several of these...Note: Most of thisis no longer relevant forGTK+ 2.  Selectinga font viaitsXLFDis not supported; see'guifont' for an example of how toset the font.  Do yourselfa favor and ignore theXLFD andxfontsetsections below.First of all, Vim only accepts fixed-width fonts for displaying text.  Youcannot use proportionally spaced fonts.  This excludes many of the available(and nicer looking) fonts.  However, formenus andtooltips any font can beused.Note that Display and Input are independent.  Itis possible to see yourlanguage even though you have no inputmethod for it.You should geta default font formenus andtooltips that works, butit mightbe ugly.  Read the following to find out how to selecta better font.X LOGICAL FONT DESCRIPTION (XLFD)XLFDXLFDis theX font name and contains the information about the font size,charset, etc.  The nameis in this format:FOUNDRY-FAMILY-WEIGHT-SLANT-WIDTH-STYLE-PIXEL-POINT-X-Y-SPACE-AVE-CR-CEEach field means:- FOUNDRY:  FOUNDRY field.  The company that created the font.- FAMILY:   FAMILY_NAME field.  Basic font family name.  (helvetica, gothic,    times, etc)- WEIGHT:   WEIGHT_NAME field.  How thick the letters are.  (light, medium,    bold, etc)- SLANT:    SLANT field.r:  Roman (no slant)i:  Italico:  Obliqueri: Reverse Italicro: Reverse Obliqueot: Othernumber:Scaled font- WIDTH:    SETWIDTH_NAME field.  Width of characters.  (normal, condensed,    narrow, double wide)- STYLE:    ADD_STYLE_NAME field.  Extra info to describe font.  (Serif, Sans    Serif, Informal, Decorated, etc)- PIXEL:    PIXEL_SIZE field.  Height, in pixels, of characters.- POINT:    POINT_SIZE field.  Ten times height of characters in points.- X:    RESOLUTION_X field.X resolution (dots per inch).- Y:    RESOLUTION_Y field.Y resolution (dots per inch).- SPACE:    SPACING field.p:  Proportionalm:  Monospacedc:  CharCell- AVE:    AVERAGE_WIDTH field.  Ten times average width in pixels.- CR:    CHARSET_REGISTRY field.  The name of thecharset group.- CE:    CHARSET_ENCODING field.  The rest of thecharset name.  For some    charsets, suchas JISX 0208, if this fieldis 0, code points has    the same valueas GL, and GR if 1.For example, incase ofa 16 dots font corresponding to JISX 0208,itiswritten like:    -misc-fixed-medium-r-normal--16-110-100-100-c-160-jisx0208.1990-0X FONTSETfontsetxfontsetA single-bytecharsetis typically associated with one font.  Formultibytecharsetsa combination of fontsis often used.  This means that one group ofcharacters are used from one font and another group from another font (whichmight be double wide).  This collection of fontsis calleda fontset.Which fonts are required inafontset depends on the current locale.Xwindows maintainsa table of which groups of characters are required foralocale.  You have to specify all the fonts thatalocale requires in the'guifontset' option.Setting the'guifontset' option also means that all font names will be handledasafontset name.  Also the ones used for the "font" argument of the:highlight command.Note the difference between'guifont' and'guifontset': In'guifont'the comma-separated names are alternative names, one of which will beused.  In'guifontset' the wholestringis onefontset name,including the commas.  Itis not possible to specify alternativefontset names.This example works on manyX11 systems::set guifontset=-*-*-medium-r-normal--16-*-*-*-c-*-*-*The fontsmust match with the current locale.  If fonts for the character setsthat the currentlocale uses are not included, setting'guifontset' will fail.NOTE: Thefontset always uses the current locale, even though'encoding' maybe set to usea different charset.  In that situation you might want to use'guifont' and'guifontwide' instead of'guifontset'.Example:    |charset| language    "groups of characters"    GB2312Chinese (simplified)  ISO-8859-1 and GB 2312    Big5Chinese (traditional) ISO-8859-1 and Big5    CNS-11643Chinese (traditional) ISO-8859-1, CNS 11643-1 and CNS 11643-2    EUC-JPJapanese    JISX 0201 and JISX 0208    EUC-KRKorean    ISO-8859-1 and KSC 5601 (KSX 1001)You can search for fonts using the xlsfonts command.  For example, when you'researching fora font for KSC 5601:    xlsfonts | grep ksc5601Thisis complicated and confusing.  You might want to consult the X-Windowsdocumentation if thereis something you don't understand.base_font_name_listWhen you have found the names of the fonts you want to use, you need to setthe'guifontset' option.  You specify thelist by concatenating the font namesand puttinga comma in between them.For example, when you use the ja_JP.eucJP locale, this requires JISX 0201and JISX 0208.  You could supplyalist of fonts that explicitlyspecifiesthe charsets, like: :set guifontset=-misc-fixed-medium-r-normal--14-130-75-75-c-140-jisx0208.1983-0,\-misc-fixed-medium-r-normal--14-130-75-75-c-70-jisx0201.1976-0Alternatively, you can supplya base font namelist that omits thecharsetname, letting X-Windows select font characters required for the locale.  Forexample: :set guifontset=-misc-fixed-medium-r-normal--14-130-75-75-c-140,\-misc-fixed-medium-r-normal--14-130-75-75-c-70Alternatively, you can supplya single base font name that allows X-Windows toselect from all available fonts.  For example: :set guifontset=-misc-fixed-medium-r-normal--14-*Alternatively, you can specify alias names.  See the fonts.alias file in thefonts directory (e.g., /usr/X11R6/lib/X11/fonts/).  For example: :set guifontset=k14,r14E253Note that in East Asian fonts, the standard character cellis square.  Whenmixinga Latin font and an East Asian font, the East Asian font width shouldbe twice the Latin font width.If'guifontset'is not empty, the "font" argument of the:highlight commandis also interpretedasa fontset.  For example, you should use forhighlighting::hi Comment font=english_font,your_fontIf you usea wrong "font" argument you will get an error message.Also make sure that you set'guifontset' before setting fonts for highlightgroups.USING RESOURCE FILESInstead of specifying'guifontset', you can setX11 resources and Vim willpick them up.  Thisis only for people who know howX resource files work.ForMotifinsert these three lines in your $HOME/.Xdefaults file:Vim.font:base_font_name_listVim*fontSet:base_font_name_listVim*fontList: your_language_fontNote: Vim.fontis for text area.      Vim*fontSetis for menu.      Vim*fontListis for menu (forMotif GUI)For example, when you are usingJapanese anda 14 dots font,Vim.font: -misc-fixed-medium-r-normal--14-*Vim*fontSet: -misc-fixed-medium-r-normal--14-*Vim*fontList: -misc-fixed-medium-r-normal--14-*or:Vim*font: k14,r14Vim*fontSet: k14,r14Vim*fontList: k14,r14To have them take effect immediately you will have todoxrdb -merge ~/.XdefaultsOtherwise you will have to stop and restart theX server before the changestake effect.TheGTK+ version ofGUI Vim does not use .Xdefaults, use ~/.gtkrc instead.The default mostly works OK.  But for themenus you might have to changeit.  Example:style "default"{fontset="-*-*-medium-r-normal--14-*-*-*-c-*-*-*"}widget_class "*" style "default"==============================================================================6.  Fonts onMS-Windowsmbyte-fonts-MSwinThe simplestis to use the fontdialog to select fonts and try them out.  Youcan find thisat the "Edit/Select Font..." menu.  Once you finda font namethat works well you can use this command to see its name::set guifontThen adda command to yourgvimrc file to set'guifont'::set guifont=courier_new:h12==============================================================================7.  Input onX11mbyte-XIMX INPUT METHOD (XIM) BACKGROUNDXIMximx-input-methodXIMis an international input module for X.  There are two kinds of structures,Xlib unit type andIM-server (Input-Method server) type.IM-server typeis suitable for complex input, suchas CJK.-IM-serverIM-server  InIM-server type input structures, the input eventis handled by either  of the two ways: FrontEnd system and BackEnd system.  In the FrontEnd  system, input events are snatched by theIM-server first, thenIM-server  give the application the result of input.  On the other hand, the BackEnd  system works reverse order.MS-Windows adopt BackEnd system.  In X, most ofIM-servers adopt FrontEnd system.  The demerit of BackEnd systemis the  large overhead in communication, butit provides safe synchronization with  no restrictions on applications.  For example, there are xwnmo and kinput2JapaneseIM-server, both are  FrontEnd system.  Xwnmois distributed with Wnn (see below), kinput2 can be  foundat (link seems dead):ftp://ftp.sra.co.jp/pub/x11/kinput2/  For Chinese, there'sa greatXIM server named "xcin", you can input both  Traditional and SimplifiedChinese characters.  Andit can accept otherlocale if you makea correct input table.  Xcin can be foundat (link seems  dead):http://cle.linux.org.tw/xcin/  Others are scim:https://www.freedesktop.org/wiki/Software/scim/  and fcitx:http://www.fcitx.org/- Conversion Serverconversion-server  Some system needs additional server: conversion server.  Most ofJapaneseIM-servers need it, Kana-Kanji conversion server.  ForChinese inputting,it depends on themethod of inputting, in some methods, PinYin or ZhuYin to  HanZi conversion serveris needed.  ForKorean inputting, if you want to  input Hanja, Hangul-Hanja conversion serveris needed.  For example, theJapanese inputting processis divided into 2 steps.  First  we pre-input Hira-gana, second Kana-Kanji conversion.  There are so many  Kanji characters (6349 Kanji characters are defined in JISX 0208) and the  number of Hira-gana characters are 76.  So, first, we pre-input textas  pronounced in Hira-gana, second, we convert Hira-gana to Kanji or Kata-Kana,  if needed.  There are some Kana-Kanji conversion server: jserver  (distributed with Wnn, see below) and canna.  Canna can be found at:https://osdn.net/projects/canna/Thereisa good input system: Wnn4.2.  Wnn 4.2 contains,    xwnmo(IM-server)    jserver (Japanese Kana-Kanji conversion server)    cserver (Chinese PinYin or ZhuYin to simplified HanZi conversion server)    tserver (Chinese PinYin or ZhuYin to traditional HanZi conversion server)    kserver (Hangul-Hanja conversion server)Wnn 4.2 for several systems can be foundatvarious places on the internet.Use the RPM or port for your system.- Input Stylexim-input-style  When inputting CJK, there are four areas:      1. The area to display of the input whileitis being composed      2. The area to display the currently active input mode.      3. The area to display the next candidate for the selection.      4. The area to display other tools.  The third areais needed when converting.  For example, inJapanese  inputting, multiple Kanji characters could have the same pronunciation, soa sequence of Hira-gana characters could map toa distinct sequence of Kanji  characters.  The first and second areas are defined in international input ofX with the  names of "Preedit Area", "Status Area" respectively.  The third and fourth  areas are not defined and are left to be managed by theIM-server.  In the  international input, four input styles have been defined using combinations  of Preedit Area and Status Area:OnTheSpot,OffTheSpot,OverTheSpot  andRoot.  Currently,GUI Vim supports three styles,OverTheSpot,OffTheSpot andRoot.  When compiled with+GUI_GTK feature,GUI Vim supports two styles,OnTheSpot andOverTheSpot.  You can select the style with the'imstyle'  option.*.  on-the-spotOnTheSpot    Preedit Area and Status Area are performed by the client application in    the area of application.  The client applicationis directed by theIM-server to display all pre-edit dataat the location of text    insertion.  The clientregisters callbacks invoked by the inputmethod    during pre-editing.*.  over-the-spotOverTheSpot    Status Areais created ina fixed position within the area of application,    incase of Vim, the positionis the additional status line.  Preedit Areais madeat present input position of application.  The inputmethod    displays pre-edit data inawindow whichit brings up directly over the    text insertion position.*.  off-the-spotOffTheSpot    Preedit Area and Status Area are performed in the area of application, incase of Vim, the areais additional status line.  The client application    provides displaywindows for the pre-edit data to the inputmethod which    displays into them directly.*.  root-windowRoot    Preedit Area and Status Area are outside of the application.  The inputmethod displays all pre-edit data ina separate area of the screen inawindow specific to the input method.USING XIMmultibyte-inputE284E285E286E287E288E289Note that Display and Input are independent.  Itis possible to see yourlanguage even though you have no inputmethod for it.  But when your Displaymethod doesn't match your Input method, the text will be displayed wrong.Note: You can not use IM unless you specify'guifontset'.      Therefore, Latin users, you have to also use'guifontset'      if you use IM.To input your language you should run theIM-server which supports yourlanguage andconversion-server if needed.The next 3 lines should beput in your ~/.Xdefaults file.  They are common forallX applications which usesXIM.  If you already useXIM, you can skipthis.*international: True*.inputMethod: your_input_server_name*.preeditType: your_input_styleinput_server_nameis yourIM-server name (check yourIM-servermanual).your_input_styleis one ofOverTheSpot,OffTheSpot,Root.  Seealsoxim-input-style.*international may not be necessary if you use X11R6.*.inputMethod and *.preeditType are optional if you use X11R6.For example, when you are using kinput2asIM-server,*international: True*.inputMethod: kinput2*.preeditType: OverTheSpotWhen usingOverTheSpot,GUI Vim always connects to the IM Server even inNormal mode, so you can input your language with commands like "f" and "r".But when using one of the other two methods,GUI Vim connects to the IM Serveronly ifitis not inNormal mode.If your IM Server does not supportOverTheSpot, and if you want to use yourlanguage with someNormal mode command like "f" or "r", then you should usealocalized xterm  or an xterm which supportsXIMIf needed, you can set the XMODIFIERS environment variable:sh:  export XMODIFIERS="@im=input_server_name"csh: setenv XMODIFIERS "@im=input_server_name"For example, when you are using kinput2asIM-server and sh,export XMODIFIERS="@im=kinput2"FULLY CONTROLLED XIMYou can fullycontrol XIM, like withIME ofMS-Windows (seemultibyte-ime).Thisis currently only available for theGTK GUI.Before using fully controlled XIM, one settingis required.  Set the'imactivatekey' option to the key thatis used for the activation of the inputmethod.  For example, when you are using kinput2+ cannaas IM Server, theactivation keyis probably Shift+Space::set imactivatekey=S-spaceSee'imactivatekey' for the format.==============================================================================8.  Input onMS-Windowsmbyte-IME(WindowsIME support)multibyte-imeIME{only works Windows GUI and compiled with the |+multi_byte_ime| feature}To inputmultibyte characters on Windows, you can use an Input Method Editor(IME).  In process of your editing text, youmust switch status (on/off) ofIME many many many times.  BecauseIME with status onis hooking all of yourkey inputs, you cannot input 'j', 'k', or almost all of keys to Vim directly.The+multi_byte_ime feature helps for this.  It reduces the number of timestheIME status has to be switched manually.  InNormal mode, thereis almostno need to use IME, even when editingmultibyte text.  So whenexitingInsertmode, Vim memorizes the last status ofIME and turns off IME.  Whenre-enteringInsert mode, Vim sets theIME status to that memorized statusautomatically.This works on not only insert-normal mode, but also search-command input andreplace mode.Theoptions'iminsert','imsearch' and'imcmdline' can be used to choosethe different input methods or disable them temporarily.On Windows 9x and Windows NT 4.0 there wasglobal-ime, but thisis nolonger supported.  You can still find documentation for Active Input MethodManager (Global IME) here:http://msdn.microsoft.com/en-us/library/aa741221(v=VS.85).aspxNOTE: ForIME to work youmust make sure the input locales of your languageare added to your system.  The exact location of this depends on the versionof Windows you use.  For example, on my Windows 2000 box:1. Control Panel2. Regional Options3. Input LocalesTab4. Add Installed input locales-> Chinese(PRC)   The defaultis still English (United Stated)Cursor color whenIME orXIMis onCursorIM    Thereisa little cute feature for IME.  Cursor can indicate status ofIME    bychanging its color.  Usually status ofIME was indicated by little iconata corner of desktop (or taskbar).  Itis noteasy to verify status of    IME.  But this featurehelp this.    This works in the same way when using XIM.    You can select cursor color when statusis on by using highlight group    CursorIM.  For example, add these lines to yourgvimrc:if has('multi_byte_ime')    highlight Cursor guifg=NONE guibg=Green    highlight CursorIM guifg=NONE guibg=Purpleendif    Cursor color with offIMEis green.  And purple cursor indicates that    statusis on.==============================================================================9. Input witha keymapmbyte-keymapWhen the keyboard doesn't produce the characters you want to enter in yourtext, you can use the'keymap' option.  This will translate one or more(English) characters to another (non-English) character.  This only happenswhen typing text, not when typing Vim commands.  This avoids having to switchbetween two keyboard settings.{only available when compiled with the |+keymap| feature}The value of the'keymap' optionspecifiesa keymap file to use.  The name ofthis fileis one of these two:keymap/{keymap}_{encoding}.vimkeymap/{keymap}.vimHere{keymap}is the value of the'keymap' option and{encoding} of the'encoding' option.  The file name with the{encoding} includedis tried first.'runtimepath'is used to find these files.  To see an overview of allavailable keymap files, use this::echo globpath(&rtp, "keymap/*.vim")InInsert andCommand-line mode you can useCTRL-^ totoggle between using thekeyboard map or not.i_CTRL-^c_CTRL-^This flagis remembered forInsert mode with the'iminsert' option.  Whenleaving and enteringInsert mode the previous valueis used.  The same valueis also used for commands that takea single character argument, likef andr.ForCommand-line mode the flagis NOT remembered.  You are expected to type anEx command first, whichis ASCII.For typing search patterns the'imsearch' optionis used.  It can be set touse the same valueas for'iminsert'.lCursorItis possible to give theGUI cursor another color when the language mappingsare being used.  Thisis disabled by default, to avoid that the cursor becomesinvisible when you usea non-standard background color.  Hereis an example tousea brightly colored cursor::highlight Cursor guifg=NONE guibg=Green:highlight lCursor guifg=NONE guibg=Cyankeymap-file-format:loadk:loadkeymapE105E791The keymap file looks something like this:" Maintainer:name <email@address>" Last Changed:2001 Jan 1let b:keymap_name = "short"loadkeymapaAbBcommentThe linesstarting witha " are comments and will be ignored.  Blank lines arealso ignored.  The lines with the mappings may havea comment after the usefultext.The "b:keymap_name" can be set toa short name, which will be shown in thestatus line.  The ideais that this takesless room than the value of'keymap', which might be long to distinguish between different languages,keyboards and encodings.The actual mappings are in the lines below "loadkeymap".  In the example "a"is mapped to "A" and "b" to "B".  Thus the first itemis mapped to the seconditem.  Thisis done for each line, until theend of the file.These items are exactly the sameas what can be used ina:lnoremap command,using "<buffer>" to make the mappings local to the buffer.You can check the result with this command::lmapThe two itemsmust be separated by white space.  You cannot include whitespace inside an item, use the special names "<Tab>" and "<Space>" instead.The length of the two items togethermust not exceed 200 bytes.It's possible to have more than one character in the first column.  This workslikea dead key.  Example:'aáSince Vim doesn't know if the next character afteraquoteis really an "a",it will wait for the next character.  To be able toinserta single quote,also add this line:'''Since themappingis defined with:lnoremap the resultingquote will not beused for the start of another character.The "accents" keymap uses this.keymap-accentsThe first column can also be in<> form:<C-c>Ctrl-C<A-c>Alt-c<A-C>Alt-CNote that the Alt mappings may not work, depending on your keyboard andterminal.Although it's possible to have more than one character in the second column,thisis unusual.  But you can usevarious ways to specify the character:Aaliteral characterA<char-97>decimal valueA<char-0x61>hexadecimal valueA<char-0141>octal valuex<Space>special key nameThe characters are assumed to be encoded for the current value of'encoding'.It's possible to use ":scriptencoding" when all characters are givenliterally.  That doesn't work when using the<char-> construct, because theconversionis done on the keymap file, not on the resulting character.The lines after "loadkeymap" are interpreted with'cpoptions' set to "C".This means that continuation lines are not used andabackslash hasa specialmeaning in the mappings.  Examples:" a comment line\"xmaps " to x\\ymaps \ to yIf you writea keymap file that will be useful for others, consider submittingit to the Vim maintainer forinclusion in the distribution:<maintainer@vim.org>HEBREW KEYMAPkeymap-hebrewThis file explains what characters are available inUTF-8 and CP1255 encodings,and what the keymaps are to get those characters:glyph   encoding   keymapChar UTF-8 cp1255  hebrew  hebrewp  nameא    0x5d0  0xe0ta     'alefב    0x5d1  0xe1cb     betג    0x5d2  0xe2dg     gimelד    0x5d3  0xe3sd     daletה    0x5d4  0xe4vh     heו    0x5d5  0xe5uv     vavז    0x5d6  0xe6zz     zayinח    0x5d7  0xe7jj     hetט    0x5d8  0xe8yT     tetי    0x5d9  0xe9hy     yodך    0x5da  0xealK     kaf sofitכ    0x5db  0xebfk     kafל    0x5dc  0xeckl     lamedם    0x5dd  0xedoM     mem sofitמ    0x5de  0xeenm     memן    0x5df  0xefiN     nun sofitנ    0x5e0  0xf0bn     nunס    0x5e1  0xf1xs     samechע    0x5e2  0xf2gu     `ayinף    0x5e3  0xf3;P     pe sofitפ    0x5e4  0xf4pp     peץ    0x5e5  0xf5.X     tsadi sofitצ    0x5e6  0xf6mx     tsadiק    0x5e7  0xf7eq     qofר    0x5e8  0xf8rr     reshש    0x5e9  0xf9aw     shinת    0x5ea  0xfa,t     tavVowel marks and special punctuation:הְ    0x5b0  0xc0     A:      A:   shevaהֱ    0x5b1  0xc1     HE      HE   hataf segolהֲ    0x5b2  0xc2     HA      HA   hataf patahהֳ    0x5b3  0xc3     HO      HO   hataf qamatsהִ    0x5b4  0xc4II    hiriqהֵ    0x5b5  0xc5     AY      AY   tsereהֶ    0x5b6  0xc6EE    segolהַ    0x5b7  0xc7     AA      AA   patahהָ    0x5b8  0xc8     AO      AO   qamatsהֹ    0x5b9  0xc9OO    holamהֻ    0x5bb  0xcbUU    qubutsכּ    0x5bc  0xccDD    dageshהֽ    0x5bd  0xcd     ]T      ]T   metegה־   0x5be  0xce     ]Q      ]Q   maqafבֿ    0x5bf  0xcf     ]R      ]R   rafeב׀   0x5c0  0xd0]p]p   paseqשׁ    0x5c1  0xd1     SR      SR   shin-dotשׂ    0x5c2  0xd2     SL      SL   sin-dot׃    0x5c3  0xd3]P]P   sof-pasuqװ    0x5f0  0xd4     VV      VV   double-vavױ    0x5f1  0xd5     VY      VY   vav-yodײ    0x5f2  0xd6     YY      YY   yod-yodThe following are only available inUTF-8Cantillation marks:glyphCharUTF-8hebrew nameב֑    0x591   C:   etnahtaב֒    0x592   Cs   segolב֓    0x593   CS   shalsheletב֔    0x594   Cz   zaqef qatanב֕    0x595   CZ   zaqef gadolב֖    0x596   Ct   tipehaב֗    0x597   Cr   reviaב֘    0x598   Cq   zarqaב֙    0x599   Cp   pashtaב֚    0x59a   C!   yetivב֛    0x59b   Cv   tevirב֜    0x59c   Cg   gereshב֝    0x59dC*   geresh qadimב֞    0x59e   CG   gershayimב֟    0x59f   CP   qarnei-parahב֪    0x5aa   Cy   yerach-ben-yomoב֫    0x5ab   Co   oleב֬    0x5ac   Ci   iluyב֭    0x5ad   Cd   dehiב֮    0x5ae   Cn   zinorב֯    0x5af   CC   masora circleCombining forms:ﬠ    0xfb20  X`   Alternative `ayinﬡ    0xfb21  X'   Alternative 'alefﬢ    0xfb22  X-d  Alternative daletﬣ    0xfb23  X-h  Alternative heﬤ    0xfb24  X-k  Alternative kafﬥ    0xfb25  X-l  Alternative lamedﬦ    0xfb26  X-m  Alternative mem-sofitﬧ    0xfb27  X-r  Alternative reshﬨ    0xfb28  X-t  Alternative tav﬩    0xfb29  X-+  Alternative plusשׁ    0xfb2a  XW   shin+shin-dotשׂ    0xfb2b  Xw   shin+sin-dotשּׁ    0xfb2c  X..W  shin+shin-dot+dageshשּׂ    0xfb2d  X..w  shin+sin-dot+dageshאַ    0xfb2e  XA   alef+patahאָ    0xfb2f  XO   alef+qamatsאּ    0xfb30  XI   alef+hiriq (mapiq)בּ    0xfb31  X.b  bet+dageshגּ    0xfb32  X.g  gimel+dageshדּ    0xfb33  X.d  dalet+dageshהּ    0xfb34  X.h  he+dageshוּ    0xfb35  Xu  vav+dageshזּ    0xfb36  X.z  zayin+dageshטּ    0xfb38  X.T  tet+dageshיּ    0xfb39  X.y  yud+dageshךּ    0xfb3a  X.K  kaf sofit+dageshכּ    0xfb3b  X.k  kaf+dageshלּ    0xfb3c  X.l  lamed+dageshמּ    0xfb3e  X.m  mem+dageshנּ    0xfb40  X.n  nun+dageshסּ    0xfb41  X.s  samech+dageshףּ    0xfb43  X.P  pe sofit+dageshפּ    0xfb44  X.p  pe+dageshצּ    0xfb46  X.x  tsadi+dageshקּ    0xfb47  X.q  qof+dageshרּ    0xfb48  X.r  resh+dageshשּ    0xfb49  X.w  shin+dageshתּ    0xfb4a  X.t  tav+dageshוֹ    0xfb4b  Xo   vav+holamבֿ    0xfb4c  XRb  bet+rafeכֿ    0xfb4d  XRk  kaf+rafeפֿ    0xfb4e  XRp  pe+rafeﭏ    0xfb4f  Xal  alef-lamed==============================================================================10.  Input with imactivatefunc()mbyte-funcVim has the'imactivatefunc' and'imstatusfunc' options. These are useful toactivate/deactivate the inputmethod from Vim in any way, also with an externalcommand. For example, fcitx provide fcitx-remote command:set iminsert=2set imsearch=2set imcmdlineset imactivatefunc=ImActivatefunction! ImActivate(active)  if a:active    call system('fcitx-remote -o')  else    call system('fcitx-remote -c')  endifendfunctionset imstatusfunc=ImStatusfunction! ImStatus()  return system('fcitx-remote')[0] is# '2'endfunctionUsing this script, you can activate/deactivateXIM via Vim even whenitis notcompiled with+xim.==============================================================================11. UsingUTF-8mbyte-utf8UTF-8utf-8utf8UnicodeunicodeTheUnicode character set was designed to include all characters from othercharacter sets.  Thereforeitis possible to write text in any language usingUnicode (witha few rarely used languages excluded).  And it's mostly possibleto mix these languages in one file, whichis impossible with other encodings.Unicode can be encoded in several ways.  The most popular oneis UTF-8, whichuses one or more bytes for each character andis backwards compatible withASCII.   OnMS-Windows UTF-16is also used (previously UCS-2), which uses16-bit words.  Vim can support all of these encodings, but always usesUTF-8internally.Vim has comprehensiveUTF-8 support.  It works well in:- xterm withUTF-8 support enabled-Motif andGTKGUI-MS-WindowsGUI- several other platformsDouble-width characters are supported.  This works best with'guifontwide' or'guifontset'.  When using only'guifont' the wide characters are drawn in thenormal width andaspace to fill the gap.Note that the'guifontset' optionis no longer relevant in theGTK+ 2 GUI.bom-bytesWhen readinga filea BOM (Byte Order Mark) can be used to recognize theUnicode encoding:EF BB BFUTF-8FE FF        UTF-16 big endianFF FE        UTF-16 little endian00 00 FE FF  UTF-32 big endianFF FE 00 00  UTF-32 little endianUTF-8is the recommended encoding.Note that it's difficult to tell utf-16and utf-32 apart.  Utf-16is often used on MS-Windows, utf-32is notwidespreadas file format.mbyte-combiningmbyte-composingA composing or combining characteris used to change the meaning of thecharacter before it.  The combining characters are drawn on top of thepreceding character.Up to two combining characters can be used by default.  This can be changedwith the'maxcombine' option.When editing texta composing characteris mostly considered part of thepreceding character.  For example "x" will deletea character and itsfollowing composing characters by default.If the'delcombine' optionis on, then pressing 'x' will delete the combiningcharacters, oneata time, then the base character.  But when inserting, youtype the first character and the following composing characters separately,after which they will be joined.  The "r" command will not allow you to typeacombining character, becauseit doesn't know oneis coming.  Use "R" instead.Bytes which are not part ofa validUTF-8 byte sequence are handled likeasingle character and displayedas<xx>, where "xx"is the hex value of thebyte.Overlong sequences are not handled specially and displayed likea validcharacter.  However, search patterns may not match on an overlong sequence.(an overlong sequenceis where more bytes are used than required for thecharacter.)  An exceptionis NUL (zero) whichis displayedas "<00>".In the file and buffer the full range ofUnicode characters can be used (31bits).  However, displaying only works for the characters present in theselected font.Useful commands:- "ga" shows the decimal, hexadecimal andoctal value of the character under  the cursor.  If there are composing characters these are shown too.  (If the  messageis truncated, use ":messages").- "g8" shows the bytes used inaUTF-8 character, also the composing  characters,as hex numbers.- ":set encoding=utf-8 fileencodings=" forces usingUTF-8 for all files.  The  defaultis to use the currentlocale for'encoding' and set'fileencodings'  to automatically detect the encoding ofa file.STARTING VIMIf your currentlocaleis in anUTF-8 encoding, Vim will automatically startinUTF-8 mode.If you are using another locale:set encoding=utf-8You might also want to select the font used for the menus.  Unfortunately thisdoesn't always work.  See the system specific remarks below, and'langmenu'.USING UTF-8 IN X-WINDOWSutf-8-in-xwindowsNote: Thissection does not apply to theGTK+ 2 GUI.You need to specifya font to be used.  For double-wide characters anotherfontis required, whichis exactly twiceas wide.  There are three ways todothis:1. Set'guifont' and let Vim finda matching'guifontwide'2. Set'guifont' and'guifontwide'3. Set'guifontset'See the documentation for each option for details.  Example:   :set guifont=-misc-fixed-medium-r-normal--15-140-75-75-c-90-iso10646-1You might also want to set the font used for the menus.  This only works forMotif.  Use the ":hi Menu font={fontname}" command for this.:highlightTYPING UTF-8utf-8-typingIf you are using X-Windows, you should find an inputmethod that supportstheUTF-8 encoding.If your system does not provide support for typing UTF-8, you can use the'keymap' feature.  This allowswritinga keymap file, which definesaUTF-8characterasa sequence of ASCII characters.  Seembyte-keymap.Anothermethodis to set the currentlocale to the language you want to useand for which you haveaXIM available.  Then set'termencoding' to thatlanguage and Vim will convert the typed characters to'encoding' for you.If everything else fails, you can type any characteras four hex bytes:CTRL-V u 1234"1234"is interpretedasa hex number.  Youmust type four characters, prependa zero if necessary.COMMAND ARGUMENTSutf-8-char-argCommands likef,F,t andr take an argument of one character.  ForUTF-8 this argument may include one or two composing characters.  These needto be produced together with the base character, Vim doesn't wait for the nextcharacter to be typed to find out ifitisa composing character or not.Using'keymap' or:lmapisa nice way to type these characters.The commands that search fora character ina line handle composing charactersas follows.  When searching fora character withouta composing character,this will find matches in the text with or without composing characters.  Whensearching fora character witha composing character, this will only findmatches with that composing character.  It was implemented this way, becausenot everybodyis able to typea composing character.==============================================================================12. Overview ofoptionsmbyte-optionsTheseoptions are relevant for editingmultibyte files.  Check thehelp inoptions.txt for detailed information.'encoding'Encoding used for the keyboard and display.  Itis also thedefault encoding for files.'fileencoding'Encoding ofa file.  When it's different from'encoding'conversionis done when reading orwriting the file.'fileencodings'List of possible encodings ofa file.  When openinga filethese will be tried and the first one that doesn't cause anerroris used for'fileencoding'.'charconvert'Expression used to convert files from one encoding to another.'formatoptions' The 'm' flag can be included to haveformatting breaka lineatamultibyte character of 256 or higher.  Thusis useful forlanguages wherea sequence of characters can be brokenanywhere.'guifontset'Thelist of font names used foramultibyte encoding.  Whenthis optionis not empty,it replaces'guifont'.'keymap'Specify the name ofa keyboard mapping.==============================================================================Contributions specifically for themultibyte features by:Chi-Deok Hwang <hwang@mizi.co.kr>SungHyun Nam <goweol@gmail.com>K.Nagano <nagano@atese.advantest.co.jp>Taro Muraoka  <koron@tka.att.ne.jp>Yasuhiro Matsumoto <mattn@mail.goo.ne.jp> vim:tw=78:ts=8:noet:ft=help:norl:

Quick links:help overview ·quick reference ·user manual toc ·reference manual toc·faq


[8]ページ先頭

©2009-2025 Movatter.jp