MIK (character set)

From Wikipedia, the free encyclopedia
Bulgarian character code set used with DOS

MIK (МИК) is an 8-bit Cyrillic code page used with DOS. It is based on the character set used in the Bulgarian Pravetz 16[1] IBM PC compatible system. Kermit calls this character set "BULGARIA-PC" / "bulgaria-pc".[2][3][4] In Bulgaria, it was sometimes incorrectly referred to as code page 856 (which clashes with IBM's definition for a Hebrew code page). Star printers and FreeDOS know this code page as code page 3021. FreeDOS earlier called it code page 30033 (a number now used for a code page 857 variant that contains the Crimean Tatar hryvnia sign), but renumbered it to match the Star printer code page.

This is the most widespread DOS/OEM code page used in Bulgaria, rather than CP 808, CP 855, CP 866 or CP 872.

Almost every DOS program created in Bulgaria that contains Bulgarian strings used MIK as its encoding, and many such programs are still in use.

Character set

Each character is shown with its equivalent Unicode code point and its decimal code point. Only the second half of the table (code points 128–255) is shown, the first half (code points 0–127) being the same as ASCII.

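MIK[5][6][4]

     0  1  2  3  4  5  6  7  8  9  A  B  C  D  E  F
8x   А  Б  В  Г  Д  Е  Ж  З  И  Й  К  Л  М  Н  О  П
9x   Р  С  Т  У  Ф  Х  Ц  Ч  Ш  Щ  Ъ  Ы  Ь  Э  Ю  Я
Ax   а  б  в  г  д  е  ж  з  и  й  к  л  м  н  о  п
Bx   р  с  т  у  ф  х  ц  ч  ш  щ  ъ  ы  ь  э  ю  я
Cx   └  ┴  ┬  ├  ─  ┼  ╣  ║  ╚  ╔  ╩  ╦  ╠  ═  ╬  ┐
Dx   ░  ▒  ▓  │  ┤  №  §  ╗  ╝  ┘  ┌  █  ▄  ▌  ▐  ▀
Ex   α  ß  Γ  π  Σ  σ  µ  τ  Φ  Θ  Ω  δ  ∞  φ  ε  ∩
Fx   ≡  ±  ≥  ≤  ⌠  ⌡  ÷  ≈  °  ∙  ·  √  ⁿ  ²  ■  NBSP

Footnoted cells: 0xE1 ß[nb 1], 0xE4 Σ[nb 2], 0xE6 µ[nb 3], 0xEA Ω[nb 4], 0xEE ε[nb 5].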
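
The four Cyrillic rows (0x80–0xBF) run in the same order as the Unicode range U+0410–U+044F (А to я), so decoding them can be reduced to a fixed offset. The following is a minimal sketch in Python, not a complete codec: the function name `decode_mik_letters` is hypothetical, and bytes from 0xC0 upward are deliberately left out and replaced with U+FFFD.

```python
def decode_mik_letters(data: bytes) -> str:
    """Decode ASCII and the Cyrillic letter rows (0x80-0xBF) of MIK.

    Assumption of this sketch: bytes 0xC0-0xFF (box drawing, Greek and
    mathematical symbols) are not handled and become U+FFFD.
    """
    out = []
    for b in data:
        if b < 0x80:
            # First half of the code page is identical to ASCII.
            out.append(chr(b))
        elif b <= 0xBF:
            # Rows 8x-Bx: А..я, contiguous in Unicode from U+0410.
            out.append(chr(0x0410 + (b - 0x80)))
        else:
            out.append("\ufffd")
    return "".join(out)


if __name__ == "__main__":
    print(decode_mik_letters(bytes([0x80, 0xA0, 0x20, 0x9F, 0xBF])))
```

A full decoder would replace the final branch with a 64-entry lookup table for rows Cx to Fx.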

Notes for implementors of mapping tables to Unicode

Implementors of mapping tables to Unicode should note that the MIK code page unifies some characters:

  1. 0xE1 is both the German sharp S (U+00DF, ß) and the Greek lowercase beta (U+03B2, β);
  2. 0xE4 is both the n-ary summation sign (U+2211, ∑) and the Greek uppercase sigma (U+03A3, Σ);
  3. 0xE6 is both the micro sign (U+00B5, µ) and the Greek lowercase mu (U+03BC, μ);
  4. 0xEA is both the Ohm sign (U+2126, Ω) and the Greek uppercase omega (U+03A9, Ω);
  5. 0xEE is both the element-of sign (U+2208, ∈) and the Greek lowercase epsilon (U+03B5, ε).
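
One way a mapping table could handle these unified code points is to decode each byte to a single chosen Unicode character while accepting both candidates when encoding. The sketch below illustrates that idea in Python. The names `UNIFIED`, `decode_unified` and `encode_unified` are hypothetical, and which candidate to prefer on decoding is a policy choice left to the caller, not something the code page prescribes.

```python
# Both Unicode candidates for each unified MIK byte, in the order
# (symbol or Latin letter, Greek letter), taken from the list above.
UNIFIED = {
    0xE1: ("\u00df", "\u03b2"),  # sharp S / beta
    0xE4: ("\u2211", "\u03a3"),  # n-ary summation / Sigma
    0xE6: ("\u00b5", "\u03bc"),  # micro sign / mu
    0xEA: ("\u2126", "\u03a9"),  # Ohm sign / Omega
    0xEE: ("\u2208", "\u03b5"),  # element-of / epsilon
}


def decode_unified(byte: int, prefer_greek: bool) -> str:
    """Pick one Unicode character for a unified MIK byte."""
    symbol, greek = UNIFIED[byte]
    return greek if prefer_greek else symbol


def encode_unified(ch: str) -> int:
    """Map either Unicode candidate back to the same MIK byte."""
    for byte, candidates in UNIFIED.items():
        if ch in candidates:
            return byte
    raise ValueError(f"{ch!r} is not one of the unified characters")


if __name__ == "__main__":
    print(decode_unified(0xE6, prefer_greek=False))
    print(hex(encode_unified("\u03bc")))
```

Accepting both candidates on encoding means a round trip through MIK may return a different character from the one that went in, which is the practical consequence of the unification.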

Binary character manipulations

The MIK code page keeps all Cyrillic letters in alphabetical order, which makes character manipulation in binary form very easy:

10xx xxxx - is a Cyrillic Letter

100x xxxx - is an Upper-case Cyrillic Letter

101x xxxx - is a Lower-case Cyrillic Letter

In that case, character testing and manipulation functions such as IsAlpha(), IsUpper(), IsLower(), ToUpper() and ToLower() reduce to bit operations, and sorting is a simple comparison of character values.
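
The bit patterns above translate directly into mask tests. The following Python sketch operates on single byte values (integers 0–255). The function names are hypothetical, and the functions deliberately cover only the Cyrillic letters, so ASCII Latin letters are reported as non-letters and left unchanged.

```python
def is_alpha(c: int) -> bool:
    """10xx xxxx: a Cyrillic letter."""
    return (c & 0xC0) == 0x80


def is_upper(c: int) -> bool:
    """100x xxxx: an upper-case Cyrillic letter."""
    return (c & 0xE0) == 0x80


def is_lower(c: int) -> bool:
    """101x xxxx: a lower-case Cyrillic letter."""
    return (c & 0xE0) == 0xA0


def to_upper(c: int) -> int:
    """Clear bit 5 of a lower-case letter; leave other bytes unchanged."""
    return c & 0xDF if is_lower(c) else c


def to_lower(c: int) -> int:
    """Set bit 5 of an upper-case letter; leave other bytes unchanged."""
    return c | 0x20 if is_upper(c) else c


if __name__ == "__main__":
    # Sorting MIK bytes by value orders the letters alphabetically,
    # with all upper-case letters before all lower-case ones.
    print([hex(b) for b in sorted(bytes([0xA0, 0x9F, 0x81]))])
```

Case-insensitive ordering can be obtained by sorting with `to_lower` (or `to_upper`) as the key.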

References

  1. "Pravetz 16". Archived from the original on 2016-12-06. Retrieved 2016-12-06.
  2. da Cruz, Frank (2010-04-02). "Kermit and MIME Character-Set Names". The Kermit Project. Columbia University, New York, USA. Archived from the original on 2016-12-03. Retrieved 2016-12-02.
  3. "Kermit 95 - Cyrillic Character Sets".
  4. "The Bulgarian PC Code Page Improperly Known as CP856". Archived from the original on 2007-08-07.
  5. Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03.
  6. Hohlov, Yu. E. "Cyrillic Information Representation in Electronic Form - Character Set (Code Page) Tables". Archived from the original on 2016-12-05. Retrieved 2016-12-05.

Retrieved from "https://en.wikipedia.org/w/index.php?title=MIK_(character_set)&oldid=1299329763"