Over a thousand characters from theLatin script are encoded in theUnicode Standard, grouped in several basic and extended Latinblocks. The extended ranges contain mainlyprecomposed letters plus diacritics that are equivalently encoded withcombining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages (includingclick symbols in Latin Extended-B) and theVietnamese alphabet (Latin Extended Additional). Latin Extended-C contains additions forUighur and theClaudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology (Teuthonista).[1] Latin Extended-F and -G contain characters forphonetic transcription.
As of version 17.0 of the Unicode Standard, 1,492 characters in the following 19 blocks are classified as belonging to the Latin script.[2]
In addition, a number of Latin-like characters are encoded in theCurrency Symbols,Control Pictures,CJK Compatibility,Enclosed Alphanumerics,Enclosed CJK Letters and Months,Mathematical Alphanumeric Symbols, andEnclosed Alphanumeric Supplement blocks, but, although they are Latin letters graphically, they have the script propertycommon, and, so, do not belong to the Latin script in Unicode terms.Lisu also consists almost entirely of Latin forms, but uses its own script property.
In this table those characters with theUnicode script property of Latin are highlighted in colour, indicating the version of Unicode they were introduced in. Reserved code points (which may be assigned as characters at a future date) have a grey background. All characters that do not belong to the Latin script have a white background (and the version of Unicode they were introduced in is therefore not indicated).