Movatterモバイル変換


[0]ホーム

URL:


  1. Glossary
  2. Unicode

Unicode

Unicode is a standardcharacter set that numbers and definescharacters from the world's different languages, writing systems, and symbols.

By assigning each character a number, programmers can createcharacter encodings, to let computers store, process, and transmit any combination of languages in the same file or program.

Before Unicode, it was difficult and error-prone to mix languages in the same data. For example, one character set would store Japanese characters, and another would store the Arabic alphabet. If it was not clearly marked which parts of the data were in which character set, other programs and computers would display the text incorrectly, or damage it during processing. If you've ever seen text where characters like curly quotes (“”) were replaced with gibberish like£, then you've seen this problem, known asMojibake.

The most common Unicode character encoding on the Web isUTF-8. Other encodings exist, likeUTF-16 or the obsolete UCS-2, but UTF-8 is recommended.

See also

Help improve MDN

Learn how to contribute

This page was last modified on byMDN contributors.


[8]ページ先頭

©2009-2025 Movatter.jp