Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

ISO basic Latin alphabet

From Wikipedia, the free encyclopedia

26 letters in two cases broadly used in international communication

TheISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for aLatin-script alphabet that consists of two sets (uppercase and lowercase) of 26 letters, codified in[1] various national andinternational standards and used widely ininternational communication. They are the same letters that comprise the currentEnglish alphabet. Since medieval times, they are also the same letters of the modernLatin alphabet. The order is also important for sorting words intoalphabetical order.

The two sets contain the following 26 letters each:[1]

ISO basic Latin alphabet
Uppercase letter setABCDEFGHIJKLMNOPQRSTUVWXYZ
Lowercase letter setabcdefghijklmnopqrstuvwxyz

History

[edit]

By the 1960s it became apparent to thecomputer andtelecommunications industries in theFirst World that a non-proprietary method of encoding characters was needed. TheInternational Organization for Standardization (ISO) encapsulated theLatin script in their (ISO/IEC 646) 7-bit character-encoding standard. To achieve widespread acceptance, this encapsulation was based on popular usage. The standard was based on the already publishedAmerican Standard Code for Information Interchange, better known asASCII, which included in thecharacter set the 26 × 2 letters of theEnglish alphabet. Later standards issued by the ISO, for exampleISO/IEC 8859 (8-bit character encoding) andISO/IEC 10646 (Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin script with extensions to handle other letters in other languages.[1]

Terminology

[edit]
Main article:Basic Latin (Unicode block)

TheUnicode block that contains the alphabet is called "C0 Controls and Basic Latin". Two subheadings exist:[2]

  • "Uppercase Latin alphabet": the letters start at U+0041 and contain the string LATIN CAPITAL LETTER in their descriptions
  • "Lowercase Latin alphabet": the letters start at U+0061 and contain the string LATIN SMALL LETTER in their descriptions

There are also another two sets in theHalfwidth and Fullwidth Forms block:[3]

  • Uppercase: the letters start at U+FF21 and contain the string FULLWIDTH LATIN CAPITAL LETTER in their descriptions
  • Lowercase: the letters start at U+FF41 and contain the string FULLWIDTH LATIN SMALL LETTER in their descriptions

Timeline for encoding standards

[edit]
  • 1865 International Morse Code was standardized at the International Telegraphy Congress in Paris, and was later made the standard by the International Telecommunication Union (ITU)
  • 1950sRadiotelephony Spelling Alphabet by ICAO[4]

Timeline for widely used computer codes supporting the alphabet

[edit]
  • 1963:ASCII (7-bit character-encoding standard from theAmerican Standards Association, which became theAmerican National Standards Institute in 1969)
  • 1963/1964:EBCDIC (developed byIBM and supporting the same alphabetic characters as ASCII, but with different code values)
  • 1965-04-30: Ratified byECMA asECMA-6[5] based on work the ECMA's Technical Committee TC1 had carried out since December 1960.[5]
  • 1972:ISO 646 (ISO 7-bit character-encoding standard, using the same alphabetic code values as ASCII, revised in second edition ISO 646:1983 and third edition ISO/IEC 646:1991 as a jointISO/IEC standard)
  • 1983: ITU-T Rec. T.51 |ISO/IEC 6937 (a multi-byte extension of ASCII)
  • 1987:ISO/IEC 8859-1:1987 (8-bit character encoding)
    • Subsequently, other versions and parts of ISO/IEC 8859 have been published.
  • Mid-to-late 1980s:Windows-1250,Windows-1252, and other encodings used inMicrosoft Windows (some roughly similar to ISO/IEC 8859-1)
  • 1990:Unicode 1.0 (developed by theUnicode Consortium),[6][7] contained in the block "C0 Controls and Basic Latin" using the same alphabetic code values as ASCII and ISO/IEC 646
    • Subsequently, other versions of Unicode have been published and it later became a jointISO/IEC standard as well, as identified below.
  • 1993:ISO/IEC 10646-1:1993, ISO/IEC standard for characters in Unicode 1.1
    • Subsequently, other versions of ISO/IEC 10646-1 and one of ISO/IEC 10646-2 have been published. Since 2003, the standards have been published under the name "ISO/IEC 10646" without the separation into two parts.
  • 1997:Windows Glyph List 4

Representation

[edit]
The uppercase letters of the ISO basic Latin alphabet on a16-segment display (plus theArabic numerals).

In ASCII the letters belong to theprintable characters and in Unicode since version 1.0 they belong to the block "C0 Controls and Basic Latin". In both cases, as well as inISO/IEC 646,ISO/IEC 8859 andISO/IEC 10646 they are occupying the positions in hexadecimal notation 41 to 5A for uppercase and 61 to 7A for lowercase.

Not case sensitive, all letters have code words in theICAO spelling alphabet and can be represented withMorse code.

Usage

[edit]

All of the lowercase letters are used in theInternational Phonetic Alphabet (IPA). InX-SAMPA andSAMPA these letters have the same sound value as in IPA.

Alphabets containing the same set of letters

[edit]
See also:List of Latin-script alphabets

The list below only includes alphabets that include all the 26 letters but exclude:

Notable omissions due to these rules includeSpanish,Esperanto,Filipino andGerman. The German alphabet is sometimes considered by tradition to contain only 26 letters (with⟨ä⟩,⟨ö⟩,⟨ü⟩ considered variants and⟨ß⟩ considered a ligature of⟨ſ⟩ (long s) and⟨s⟩), but the current German orthographic rules include⟨ä⟩,⟨ö⟩,⟨ü⟩,⟨ß⟩ in the alphabet placed after⟨Z⟩. In Spanish orthography, the letters⟨n⟩ and⟨ñ⟩ are distinct; thetilde is not considered a diacritic in this case.

AlphabetDiacriticMultigraphs (not constituting distinct letters)Ligatures
Afrikaans alphabetá,ä,é,è,ê,ë,í,î,ï,ó,ô,ö,ú,û,ü,ýDigraphs: ⟨aa⟩, ⟨ai⟩, ⟨ch⟩, ⟨ee⟩, ⟨ei⟩, ⟨eu⟩, ⟨gh⟩, ⟨ie⟩, ⟨nj⟩, ⟨ng⟩ ⟨oe⟩, ⟨oi⟩, ⟨oo⟩, ⟨ou⟩, ⟨sj⟩, ⟨tj⟩, ⟨ts⟩, ⟨ui⟩, ⟨uu

Trigraphs: ⟨aai⟩, ⟨eeu⟩, ⟨oei⟩, ⟨ooi⟩

ʼn (N‑apostrophe)
Aragonese alphabet (Academia de l'Aragonés orthography)á,é,í,ó,ú,ü,lꞏlch⟩, ⟨gu⟩, ⟨ll⟩, ⟨ny⟩, ⟨qu⟩, ⟨rr⟩, ⟨tz
Catalan alphabetà,é,è,í,ï,ó,ò,ú,ü,ç,lꞏlgu⟩, ⟨ig⟩, ⟨ix⟩, ⟨ll⟩, ⟨ny⟩, ⟨qu⟩, ⟨rr⟩, ⟨ss
Dutch alphabetä,é,è,ë,ï,ö,üThedigraphij⟩ is sometimes considered to be a separate letter. When that is the case, it usually replaces or is intermixed with ⟨y⟩. Other digraphs: ⟨aa⟩, ⟨ae⟩, ⟨ai⟩, ⟨au⟩, ⟨ch⟩, ⟨ee⟩, ⟨ei⟩, ⟨eu⟩, ⟨ie⟩, ⟨oe⟩, ⟨oi⟩, ⟨oo⟩, ⟨ou⟩, ⟨ui⟩, ⟨uu
English alphabetonly in loanwords (see below)‹ThetemplateSmallsup is beingconsidered for deletion.› 1sh⟩, ⟨ch⟩, ⟨ea⟩, ⟨ou⟩, ⟨th⟩, ⟨ph⟩, ⟨ngæ,œ (both archaic)
French alphabetà,â,ç,é,è,ê,ë,î,ï,ô,ù,û,ü,ÿai⟩, ⟨au⟩, ⟨ei⟩, ⟨eu⟩, ⟨oi⟩, ⟨ou⟩, ⟨eau⟩, ⟨ch⟩, ⟨ph⟩, ⟨gn⟩, ⟨an⟩, ⟨am⟩, ⟨en⟩, ⟨em⟩, ⟨in⟩, ⟨im⟩, ⟨on⟩, ⟨om⟩, ⟨un⟩, ⟨um⟩, ⟨yn⟩, ⟨ym⟩, ⟨ain⟩, ⟨aim⟩, ⟨ein⟩, ⟨oin⟩, ⟨⟩, ⟨æ (rare),œ (mandatory)
Hmong Latin alphabetnonebh⟩, ⟨bl⟩, ⟨ch⟩, ⟨dh⟩, ⟨dl⟩, ⟨gh⟩, ⟨hl⟩, ⟨hm⟩, ⟨hn⟩, ⟨jh⟩, ⟨kh⟩, ⟨ml⟩, ⟨nc⟩, ⟨nq⟩, ⟨nr⟩, ⟨nt⟩, ⟨nx⟩, ⟨ny⟩, ⟨ph⟩, ⟨pl⟩, ⟨qh⟩, ⟨rh⟩, ⟨th⟩, ⟨ts⟩, ⟨tx⟩, ⟨xy⟩, ⟨bhl⟩, ⟨dhl⟩, ⟨hml⟩, ⟨hny⟩, ⟨nch⟩, ⟨ndl⟩, ⟨ngh⟩, ⟨nrh⟩, ⟨nth⟩, ⟨nxh⟩, ⟨phl⟩, ⟨tsh⟩, ⟨txh⟩, ⟨ndhl
Italian alphabet (extended)[a]à,è,é,ì,î (formal),ò,ó,ùch⟩, ⟨ci⟩, ⟨gh⟩, ⟨gi⟩, ⟨gl⟩, ⟨gli⟩, ⟨gn⟩, ⟨sc⟩, ⟨sci
Ido alphabet*nonequ⟩, ⟨ch⟩, ⟨sh
Indonesian alphabetonly in learning materials (see below)‹ThetemplateSmallsup is beingconsidered for deletion.› 4kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩,diphthongs: ⟨ai⟩, ⟨au⟩, ⟨ei⟩, ⟨oi⟩
Interlingua alphabet*only in unassimilated loanwords (see below)‹ThetemplateSmallsup is beingconsidered for deletion.› 2ch⟩, ⟨ph⟩, ⟨qu⟩, ⟨rh⟩, ⟨sh
Javanese Latin alphabeté,èdh⟩, ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩, ⟨th
Latino sine flexione alphabet*only an optional accent for unusual stress (see below)‹ThetemplateSmallsup is beingconsidered for deletion.› 3ae⟩, ⟨ch⟩, ⟨oe⟩, ⟨ph⟩, ⟨qu⟩, ⟨rh⟩, ⟨th[8]
Luxembourgish alphabetä,é,ëaa⟩, ⟨ch⟩, ⟨ck⟩, ⟨ee⟩, ⟨ei⟩, ⟨ie⟩, ⟨ii⟩, ⟨ng⟩, ⟨oo⟩, ⟨ou⟩, ⟨qu⟩, ⟨ue⟩, ⟨uu⟩, ⟨sch
Malay alphabetonly in learning materials (see below)‹ThetemplateSmallsup is beingconsidered for deletion.› 4gh⟩, ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy
Portuguese alphabet[b]ã,õ,á,é,í,ó,ú,â,ê,ô,à,çch⟩, ⟨lh⟩, ⟨nh⟩, ⟨rr⟩, ⟨ss⟩, ⟨am⟩, ⟨em⟩, ⟨im⟩, ⟨om⟩, ⟨um⟩, ⟨ãe⟩, ⟨ão⟩, ⟨õe
Sundanese Latin alphabetéeu⟩, ⟨ng⟩, ⟨ny
Xhosa alphabetonly in learning materials (see below)‹ThetemplateSmallsup is beingconsidered for deletion.› 5bh⟩, ⟨ch⟩, ⟨dl⟩, ⟨dy⟩, ⟨dz⟩, ⟨gc⟩, ⟨gq⟩, ⟨gr⟩, ⟨gx⟩, ⟨hh⟩, ⟨hl⟩, ⟨kh⟩, ⟨kr⟩, ⟨krh⟩, ⟨lh⟩, ⟨mh⟩, ⟨nc⟩, ⟨ng⟩, ⟨ngʼ⟩, ⟨ngc⟩, ⟨ngh⟩, ⟨ngq⟩, ⟨ngx⟩, ⟨nh⟩, ⟨nkc⟩, ⟨nkq⟩, ⟨nkx⟩, ⟨nq⟩, ⟨nx⟩, ⟨ny⟩, ⟨nyh⟩, ⟨ph⟩, ⟨qh⟩, ⟨rh⟩, ⟨sh⟩, ⟨th⟩, ⟨ths⟩, ⟨thsh⟩, ⟨ts⟩, ⟨tsh⟩, ⟨ty⟩, ⟨tyh⟩, ⟨wh⟩, ⟨xh⟩, ⟨yh⟩, ⟨zh
Zulu alphabetnonebh⟩, ⟨ch⟩, ⟨dl⟩, ⟨dy⟩, ⟨gc⟩, ⟨gq⟩, ⟨gx⟩, ⟨hh⟩, ⟨hl⟩, ⟨kh⟩, ⟨kl⟩, ⟨mb⟩, ⟨nc⟩, ⟨ng⟩, ⟨ngc⟩, ⟨ngq⟩, ⟨ngx⟩, ⟨nj⟩, ⟨nk⟩, ⟨nq⟩, ⟨ntsh⟩, ⟨nx⟩, ⟨ny⟩, ⟨ph⟩, ⟨qh⟩, ⟨sh⟩, ⟨th⟩, ⟨ts⟩, ⟨tsh⟩, ⟨xh

* Constructed languages

  1. English is one of the few modern European languages requiring no diacritics for native words (although adiaeresis is used by some American publishers in words such as "coöperation").[c][9]
  2. Interlingua, a constructed language, never uses diacritics except in unassimilated loanwords. However, they can be removed if they are not used to modify the vowel (e.g.cafe, fromFrench:café).[10]
  3. Latino sine flexione, a.k.a. "Peano's Interlingua",allows but does not require the placement of an accent for unusual stress. (It antedates the other "Interlingua" by roughly four decades.)
  4. Malay and Indonesian (based on Malay) use all the Latin alphabet and require no diacritics and ligatures. However, Malay and Indonesian learning materials may use ⟨é⟩ (E with acute) to clarify the pronunciation of the letter E; in that case, ⟨e⟩ is pronounced /ə/ while ⟨é⟩ is pronounced /e/ and (è) is pronounced /ɛ/. Many of the700+ languages of Indonesia also use the Indonesian alphabet to write their languages, some—such asJavanese—adding diacritics é and è, and some omitting q, x, and z.
  5. Xhosa is usually written without diacritics, but may optionally use diacritics over⟨a, e, i, o, u⟩ for tones:⟨à, á, â, ä⟩.

Column numbering

[edit]
This sectionneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources in this section. Unsourced material may be challenged and removed.(October 2023) (Learn how and when to remove this message)

The Roman (Latin) alphabet is commonly used for column numbering in a table or chart. This avoids confusion with row numbers usingArabic numerals. For example, a 3-by-3 table would contain columns A, B, and C, set against rows 1, 2, and 3. If more columns are needed beyond Z (normally the final letter of the alphabet), the column immediately after Z is AA, followed by AB, and so on[11] (seebijective base-26 system). This can be seen by scrolling far to the right in a spreadsheet program such asMicrosoft Excel orLibreOffice Calc.

The letters are often used for indexing nested bullet points. In this case after the 26th it is more common to use AA, BB, CC, ... instead of base-26 numbers.

See also

[edit]

Notes

[edit]
  1. ^The Italian alphabet is traditionally considered to have only 21 letters, excludingj,k,w,x,y. However, in practice these letters occur in a number of loanwords.J also occurs in some native Italian proper names as a variant of writing semivocalici.
  2. ^Note for Portuguese:k andy (but notw) were part of the alphabet until several spelling reforms during the 20th century, the aim of which was to change the etymological Portuguese spelling into an easier phonetic spelling. These letters were replaced by other letters having the same sound: thuspsychologia becamepsicologia,kioske becamequiosque,martyr becamemártir, etc. Nowadaysk,w, andy are only found in foreign words and their derived terms and in scientific abbreviations (e.g.km,byronismo). These letters are considered part of the alphabet again following the1990 Portuguese Language Orthographic Agreement, which came into effect on January 1, 2009, in Brazil. SeeReforms of Portuguese orthography.
  3. ^As an example of an article containing adiaeresis in "coöperate", as well as accents on loan words in English, such as acedilla in "façades" and acircumflex in the word "crêpe", seeGrafton, Anthony (October 23, 2006)."Books: The Nutty Professors, The history of academic charisma".The New Yorker.

References

[edit]
  1. ^abc"Internationalisation standardization of 7-bit codes, ISO 646". Trans-European Research and Education Networking Association (TERENA). RetrievedOctober 3, 2010.
  2. ^"C0 Controls and Basic Latin"(PDF).Unicode.org. RetrievedAugust 8, 2016.
  3. ^"Halfwidth and Fullwidth Forms"(PDF).Unicode.org. RetrievedAugust 8, 2016.
  4. ^"The Postal History of ICAO".www.icao.int. Archived fromthe original on February 12, 2019. RetrievedFebruary 17, 2019.
  5. ^abStandard ECMA-6: 7-Bit Coded Character Set(PDF) (5th ed.). Geneva, Switzerland:European Computer Manufacturers Association (Ecma). March 1985. Archived fromthe original(PDF) on May 29, 2016. RetrievedMay 29, 2016.The Technical Committee TC1 ofECMA met for the first time in December 1960 to prepare standard codes for Input/Output purposes. On April 30, 1965, Standard ECMA-6 was adopted by the General Assembly of ECMA.
  6. ^"Unicode character database".The Unicode Standard. RetrievedMarch 22, 2013.
  7. ^The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1990.ISBN 0-201-56788-1.
  8. ^Not "letters", per:Ager, Simon."Latino sine Flexione".Omniglot. Latino sine Flexione alphabet. RetrievedApril 14, 2023.
  9. ^"The New Yorker's odd mark — the diaeresis". December 16, 2010. Archived fromthe original on December 16, 2010.
  10. ^"Introduction al IED (in anglese)".www.interlingua.com. RetrievedSeptember 21, 2020.
  11. ^"How To Switch From Letters to Numbers for Columns in Excel".Indeed. RetrievedNovember 21, 2024.
Alphabets (list)
Letters (list)
Multigraphs
Digraphs
Trigraphs
Tetragraphs
Pentagraphs
Keyboard layouts (list)
Historical standards
Current standards
Lists
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=ISO_basic_Latin_alphabet&oldid=1311552460"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp