Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WiktionaryThe Free Dictionary
Search

Wiktionary:Language treatment

From Wiktionary, the free dictionary
This is aWiktionary policy, guideline or common practices page. This is adraft proposal. It is unofficial, and it is unknown whether it is widely accepted by Wiktionary editors.
Policies – Entries:CFI -EL -NORM -NPOV -QUOTE -REDIR -DELETE. Languages:LT -AXX. Others:BLOCK -BOTS -VOTES.
Shortcuts:
WT:LANGTREAT
WT:LT

The distinction between languages and dialects is notclear-cut. Alect that some regard as adialect of a certain language may be regarded as a full, separatelanguage by others. This page contains a list of languages and their (ISO-code-having) dialects, with notes on whether or not the dialects are treated as separate languages on Wiktionary. If there is no note about the status of a particular language+dialect group, the situation is not yet regulated. If multiple dialects are treated as a single language on Wiktionary, but there is no ISO code that represents all of them, the code of one of the dialects is used as the code for the whole language, or an exceptional code is created (for more, seeWiktionary:Languages).

For the most part, this page documents cases where Wiktionary’s treatment of lects deviates from that of the ISO/SIL, e.g. cases where we have merged lects that they have not. Cases where an ISO code has been excluded from Wiktionary altogether (typically because it was too vague to be meaningful) are also documented. Cases where the ISO/SIL itself has merged lects which they formerly granted separate codes, and we have followed suit, are not necessarily documented here.

Discussions about splitting, merging, deleting, adding or renaming lects may be conducted atWiktionary:Language treatment requests (WT:LTR).

List of languages and dialects

Colour coding:

  • gcf: This code is treated as a language. Entries in this language are allowed.
  • acf: This code is not treated as a separate language. See the "Treatment" column to understand which code should be used instead. In certain cases, these codes may be permitted in etymologies (so-called"etymology-only" languages).
  • ber: This code represents a language family. It is not itself a language, although it may have aproto-language (ber-pro).
  • cmn: The disposition of this code is complex or unclear. See the "Treatment" column.
CodeMacrolanguageSubdivisionsTreatment
akAkanabr (Abron / Brong),fat (Fanti),tw (Twi),wss (Wasa)Only the macrolanguage is treated as a (full) language. (The dialects are etymology-only.) (discussion 1,discussion 2)
sqAlbanianaae (Arbëreshë Albanian),aat (Arvanitika Albanian),aln (Gheg Albanian),als (Tosk Albanian)Only the macrolanguage is treated as a language. (discussion 1,discussion 2)
gswAlemannicwae (Walser)Only the macrolanguage is treated as a language. (discussion)
gcfAntillean Creolegcf,acf,scfAntillean Creole is treated as a single language with the codegcf. The ISO had coded two dialects separately, usingacf for “Saint Lucian Creole French” andgcf for “Guadeloupean Creole French”. (discussion) San Miguel Creole French, which is also often considered part of Antillean Creole, is treated as a separate language with the codescf.
arArabicabh (Tajiki Arabic),abv (Baharna Arabic),acm (Iraqi Arabic),acw (Hijazi Arabic),acx (Omani Arabic),acy (Cypriot Arabic),adf (Dhofari Arabic),aeb (Tunisian Arabic),afb (Gulf Arabic),ajp (South Levantine Arabic),apc (North Levantine Arabic),apd (Sudanese Arabic),arb (Standard Arabic),arq (Algerian Arabic),aao Algerian Saharan Arabicars (Najdi Arabic),ary (Moroccan Arabic),arz (Egyptian Arabic),aec (Saidi Arabic),avl (Eastern Egyptian Bedawi Arabic),auz (Uzbeki Arabic),ayl (Libyan Arabic),ayn (Yemeni Arabic),acq (Ta'izzi-Adeni Arabic),ayh (Hadrami Arabic),ayp (North Mesopotamian Arabic),pga (Juba Arabic),shu (Chadian Arabic),ssh (Shihhi Arabic)Both the macrolanguage and many of its subdivisions are treated as languages, but not every subdivision with an ISO code or Ethnologue entry is deemed distinct enough for lexicographic division, and the macrolanguage code is used in place of the code (arb) which the ISO gave the standard variety of the language.
Jewish varieties of Arabic, for which there have been introduced some spotty ISO codes (ajt,aju,jye,yhd,yud,jrb), are included within the general regional dialects, or the standard variety, of which the bulk of medieval texts falling under the “Judeo-Arabic” label turn out Hebrew-script forms—as also other sectarian or tribal language forms.
NOTE: Per 2023, ISO has mergedapc (North Levantine Arabic) andajp (South Levantine Arabic), and it has beenrequested to do the same here at Wiktionary. There is no conceptual objection to this merger, but it is awaiting resources on the part of the proposers of the merge, as it will take significant effort.
(discussion of aao,discussion of merging ayn, acq, ayh as Yemeni Arabic,discussion of aec and arz,discussion of avl)
sem-araAramaicarc (Aramaic),oar (Old Aramaic),aii (Assyrian Neo-Aramaic),aij (Lishanid Noshan),amw (Western Neo-Aramaic),bhn (Bohtan Neo-Aramaic),bjf (Barzani Jewish Neo-Aramaic),cld (Chaldean Neo-Aramaic),hrt (Hértevin),huy (Hulaulá),jpa (Jewish-Palestinian Aramaic),kqd (Koy Sanjaq Surat),lhs (Mlahsô),lsd (Lishana Deni),mid (Modern Mandaic),myz (Classical Mandaic),sam (Samaritan Aramaic),syc (Syriac; Classical Syriac),syn (Senaya),tmr (Jewish Babylonian Aramaic),trg (Lishán Didán),tru (Turoyo),xrm (Armazic)Some varieties are treated as languages, others are not:
The codeoar for “Old Aramaic” (up to 700B.C.E.) is not used; it has been superseded byarc andsyc.
“Jewish Babylonian Aramaic” (circa 200–1200C.E.) is also not allowed L2s, as it has been superseded byarc, but its codetmr is allowed in etymologies.
Assyrian Neo-Aramaic (aii) and Chaldean Neo-Aramaic (cld) are currently treated as languages, as areaij,amw,bhn,bjf,hrt,huy,kqd,lhs,lsd,mid,myz,sam,syn,trg,tru,xrm, andarc andsyc.jpa andsyr (the latter, “Syriac”, being a macrolanguage coveringaii andcld; seediscussion) are not currently treated as languages.
hyArmenianhyw (Western Armenian)Only the macrolanguage is treated as a language. (discussion)
ayAymaraayc (Southern Aymara),ayr (Central Aymara)Only the macrolanguage is treated as a language. (discussion)
azAzeriazb (South Azerbaijani),azj (North Azerbaijani),qxq (Qashqai, Kashkay)Only the macrolanguage is treated as a language; dialects, including Afshar and Sonqor, are not. (discussion 1,discussion 2)
sit-baiBaibca (Central Bai),bfc (Panyi Bai),bfs (Southern Bai),lay (Lama Bai)Not yet discussed.
balBaluchibcc (Southern Baluchi/Balochi),bgp (Eastern Baluchi/Balochi),bgn (Western Baluchi/Balochi)Only the macrolanguage is treated as a language. (discussion)
lodBerawanzbc (Central Berawan),zbe (East Berawan),zbw (West Berawan)Only the macrolanguage is treated as a language. (discussion)
berBerberauj (Awjilah),swn (Sawknah),siz (Siwi),cnu (Chenoua),jbe (Judeo-Berber),shi (Tashelhit),tzm (Central Atlas Tamazight),zgh (Standard Moroccan Tamazight),kab (Kabyle),gha (Ghadamès),jbn (Nafusi),sds (Sened),gho (Ghomara),oua (Tagargrent),tjo (Temacine Tamazight),grr (Taznatit),mzb (Tumzabt),sjs (Senhaja Berber),rif (Tarifit),shy (Tachawit),tia (Tidikelt Tamazight),thv (Tahaggart Tamahaq),ttq (Tawallammat Tamajaq),thz (Tayart Tamajeq),taq (Tamasheq),zen (Zenaga)Only some of the subdivisions are treated as languages. Judeo-Berber is treated as a variety of Tashelhit. (discussion 1 anddiscussion 2,discussion 3,discussion 4,discussion 5)
bikBikolagk (Isarog Agta),agz (Mount Iriga Agta),atl (Mount Iraya Agta),bcl (Central Bikol),bln (Southern Catanduanes Bikol),bto (Rinconada Bikol),cts (Northern Catanduanes Bikol),fbl (West Albay Bikol),lbl (Libon Bikol),rbl (Miraya Bikol),ubl (Buhi'non Bikol)Only the subdivisions are treated as languages, the macrolanguage is not. (discussion)
bncBontocrbk (Northern Bontoc),vbk (Southwestern Bontoc),lbk (Central Bontoc),ebk (Eastern Bontoc),obk (Southern Bontoc)Only the subdivisions are treated as languages, the macrolanguage is not. (discussion)
buaBuryatbxm (Mongolian Buriat),bxr (Russian Buriat),bxu (Chinese Buriat)Only the macrolanguage is treated as a language. (discussion 1,discussion 2,discussion 3)
zhChinesecdo (Eastern Min),cjy (Jin),cmn (Mandarin),cnp (Northern Pinghua),cpx (Puxian Min),csp (Southern Pinghua),czh (Huizhou),czo (Central Min),dng (Dungan),gan (Gan),hak (Hakka),hnm (Hainanese),hsn (Xiang),ltc (Middle Chinese),luh (Leizhou Min),lzh (Classical Chinese),mnp (Northern Min),nan (Southern Min),nan-hbl (Hokkien),nan-tws (Teochew),och (Old Chinese),wuu (Wu),wxa (Waxiang),yue (Cantonese),zho (inclusive code),zhx-sht (Shaozhou Tuhua),zhx-sic (Sichuanese),zhx-tai (Taishanese)Unique treatment: for the purpose of language headersonly, only the macrolanguage is treated as a language (superseded discussion;vote), but the romanizations of Standard Mandarin (Hanyu Pinyin) and Standard Cantonese (Jyutping) are allowed as non-lemmas, while the romanization of Hokkien (Pe̍h-ōe-jī) and Dungan terms in Cyrillic script and Xiao'erjing are allowed as lemmas (seethis guideline). Forall other purposes, the constituent codes are treated as full languages.

Southern Min is treated as a language family, and its members are given exceptional codes.

crCreeatj (Atikamekw),crj (Southern East Cree),crk (Plains Cree),crl (Northern East Cree),crm (Moose Cree),csw (Swampy Cree),cwd (Woods Cree),moe (Montagnais),nsk (Mansaka)Not yet discussed.
dihDiegueñonai-ipa (Ipai),nai-kum (Kumeyaay),nai-tip (Tipai)Only the subdivisions are treated as languages. (discussion)
difDieridit (Dirari)Only the macrolanguage is treated as a language.
dinDinkadib (South Central Dinka),dik (Southwestern Dinka),dip (Northeastern Dinka),diw (Northwestern Dinka),dks (Southeastern Dinka)Not yet discussed.
doiDogridgo (Hindi Dogri),xnr (Kangri)Not yet discussed.
kzhDongolawi
(= Kenuzi-Dongola)
dgl (Andaandi / Dongolawi),xnz (Kenzi / Mattoki)Only the macrolanguage is treated as a language. (discussion)
ddrDhudhuroaxjt (Yaitmathang)Onlyddr is treated as a language. (discussion)
gmw-ecgEast Central Germansli (Silesian),sxu (Upper Saxon)Only the macrolanguage is treated as a language. (It also includes Thuringian, Lusatian German, Erzgebirgisch, and High Prussian.) (discussion 1,discussion 2)
enEnglishpld (Polari)Only the macrolanguage is treated as a language. (discussion ofpld)
etEstonianvro (Võro)Both the macrolanguage and its subdivisionvro are treated as languages, but the macrolanguage code is used in place of the code (ekk) which the ISO gave the standard variety of the language.
evnEvenkievn (Evenki),tuw-sol (Solon)Both Evenki and Solon are treated as languages. (discussion)
ffFulaffm,fub,fuc,fue,fuf,fuh,fui,fuq,fuvOnly the macrolanguage is treated as a language. (discussion)
frFrenchroa-gal (Gallo),nrf (Norman – see separate entry in this table),frc (Cajun French / Louisiana French),fr-aca (Acadian French)Both the macrolanguagefr and its subdivisionsroa-gal andnrf are treated as languages.frc andfr-aca are etymology-only languages (and the former is not to be confused with Louisiana Creole French, which is treated as a full language). (discussion of Acadian French)
gmw-friFrisianfrr (North Frisian),stq (Saterland Frisian),fy (West Frisian)The Frisian languages are treated as separate languages.
kmcGamdoc (Northern Dong),kmc (Southern Dong)Gam is treated as a single language with the codekmc. The ISO had coded two dialects separately, usingdoc for “Northern Dong” andkmc for “Southern Dong”. “Cao Miao” (cov) is tentatively still treated as a separate language, pending further discussion. (discussion)
cel-gauGaulishxtg (Transalpine Gaulish),xcg (Cisalpine Gaulish)The two varieties of Gaulish have been merged under the codecel-gau, though they may still be separated in etymologies. (discussion)
gbaGbayabdt,gbp,gbq,gmm,gso,gyaNot yet discussed.
gioGelaoaou (A'ou),giq (Hagei, Green Gelao),gir (Vandu, Red Gelao),giw (Telue, White Gelao, Duoluo),giu (Mulao),gqu (Qau)Not yet discussed.
kaGeorgianjge (Judeo-Georgian)Only the macrolanguage is treated as a language. (discussion)
gonGondigno (Northern Gondi),ggo (Southern Gondi, retired),esg (Aheri Gondi),wsg (Adilabad Gondi)Only the macrolanguage is treated as a language. (discussion)
grbGrebogbo,gec,grj,grv,gry,ktj,oub,pye,tedNot yet discussed.
grcGreek, Ancientgkm (Byzantine Greek),gmy (Mycenaean Greek),xmk (Ancient Macedonian)Among the ancient languages,gmy andxmk are treated as separate languages.gkm is an etymology-only language, as are the various ancient dialects (see the family tree table atCategory:Ancient Greek language). (discussions ofgkm:discussion 1,discussion 2)
elGreek, Moderncpg (Cappadocian Greek),grk-cal (Calabrian Greek),grk-ita (Italiot Greek),grk-mar (Mariupol Greek),tsd (Tsakonian)Among the modern languages, both the macrolanguage and its subdivisions are treated as languages.
gnGuaranígnw,gug,gui,gun,nhdNot yet discussed.
haiHaidahax (Southern Haida),hdn (Northern Haida)Not yet discussed.
heHebrewhbo (Biblical Hebrew)Biblical Hebrew does not have an L2 separate from Hebrew; the codehe is used for both. In etymologies, however, the two may be distinguished. (SeeWT:AHE.)
hitHittiteoht (Old Hittite),htx (Middle Hittite),nei (Neo-Hittite / New Hittite)In practice, onlyhit is treated as a language.
hmnHmongcqd (Chuanqiandian-cluster Miao),hea (Northern Qiandong Miao),hma (Southern Mashan Hmong),hmc (Central Huishui Hmong),hmd (A-Hmao / Large Flowery Miao),hme (Eastern Huishui Hmong),hmf (Hmong Don),hmg (Southwestern Guiyang Hmong),hmh (Southwestern Huishui Hmong),hmi (Northern Huishui Hmong),hmj (Ge),hml (Luopohe Hmong),hmm (Central Mashan Hmong),hmp (Northern Mashan Hmong),hmq (Eastern Qiandong Miao),hms (Southern Qiandong Miao),hmv (Hmong Do),hmw (Western Mashan Hmong),hmy (Southern Guiyang Hmong),hmz (Hmong Shua),hnj (Hmong Njua),hrm (Horned Miao),huj (Northern Guiyang Hmong),mmr (Western Xiangxi Miao),muq (Eastern Xiangxi Miao),mww (White Hmong),sfm (Small Flowery Miao)Tentatively, all subvarieties are accepted.
The old macrolanguage codeblu and the newer macrolanguage codehmn are not used.
cqd (Chuanqiandian-cluster Miao) is an umbrella term for various varieties of Hmong in China. (discussion 1,discussion 2)
eroHorpaero-gsz (Geshiza),ero-nya (Nyagrong Minyag),ero-tau (Stau),txg (Tangut)In practice, only the subvarieties are treated as languages; the ISO language codeero is used for the family.
huvHuavehve (San Dionisio del Mar Huave),hvv (Santa María del Mar Huave),hue (San Francisco del Mar Huave),huv (San Mateo del Mar Huave)Only the macrolanguage is treated as a language, using the codehuv. (discussion)
iuInuktitutike (Eastern Canadian Inuktitut),ikt (Inuvialuktun),esx-inq (Inuinnaqtun)Although it was decided in 2018 that only the macrolanguage would be treated as a language (seediscussion), that decision was never implemented. In fact, all four codes are treated as languages.
ikInupiakesi (North Alaskan Inupiatun),esk (Northwest Alaskan Inupiatun)Only the macrolanguage is treated as a language.
illIranunilm,ilpIn 2015, the ISO splitill intoilm (the Iranun of Malaysia) andilp (the Iranun of the Philippines). Wiktionary has not made this split at this time.
kdvKadozkd (Kadu proper),zkn (Kanan)In 2012, the ISO splitkdv intozkd andzkn. Wiktionary has not made this split at this time. (discussion)
klnKalenjinenb,eyo,niq,oki,pko,sgc,spy,tec,tuyOnly the macrolanguage is treated as a language.
kneKankanaeyxnn (Northern Kankanay)Onlykne is treated as a language. ([[Wiktionary:Language treatment requests#Merge Northern Kankanay [xnn] to just Kankanaey [kne]|discussion 1]], [[Wiktionary:Language treatment requests#Northern Kankanay [xnn] -> delete|discussion 2]])
krKanuribms (Bilma Kanuri),kau,kbl (Kanembu),kby,knc,krtOnly the macrolanguage is treated as a language. (discussion)
kzkKazukurudrr (Dororo),gli (Guliguli)Onlykzk is treated as a language. (discussion)
kelKela-Yelakel,yelKela-Yela is treated as a single language with the codekel. The ISO had coded two dialects separately, usingkel for “Kela” andyel for “Yela”.
kcaKhantykca-eas (Eastern Khanty),kca-nor (Northern Khanty),kca-sou (Southern Khanty)Eastern (kca-eas), Northern (kca-nor) and Southern Khanty (kca-sou) are treated as separate languages with exceptional language codes. (discussion)
kmKhmerkhm,kxmNot yet discussed.
cnkKhumi Chincek (Eastern Khumi)Onlycnk is treated as a language.
ktuKitubamkwKituba is treated as a single language with the codektu. The ISO had coded two dialects separately, usingktu for the variety spoken in the Democratic Republic of the Congo andmkw for the variety spoken in the Republic of the Congo. (discussion)
kvKomikoi,kpvOnly the subdivisions are treated as languages. (discussion)
kgKongokng (Koongo),kwy (San Salvador Kongo),ldi (Laari),yom (Yombe)Only the macrolanguage is treated as a language. (discussion)
kokKonkanigom,knnOnly the macrolanguage is treated as a language.
kpeKpellegkp,xpeNot yet discussed.
khi-kunǃKungmwj (Sekele / Maligo),knw (Ekoka ǃKung),oun (ǃOǃKung),gfx (Mangetti Dune ǃXung)Only the macrolanguage is treated as a language. (discussion 1,discussion 2,discussion 3)
kjnKunjenolkKunjen is treated as a single language with the codekjn. The ISO had coded two dialects separately, usingkjn for “Uw Oykangand” andolk for “Uw Olkola”, and had not given codes to the varieties Ogh-Undjan, Kawarrangg, or Athima.
kuKurdishckb (Central Kurdish),kmr (Northern Kurdish),sdh (Southern Kurdish)Central (ckb), Northern (kmr) and Southern (sdh) are treated as separate languages. (discussion,follow-on discussion.) It has beenproposed in 2023 to rename Northern Kurdish to Kurmanji and Central Kurdish to Sorani, but the discussion ended in no consensus.
unnKurnaiihw (Bidhawal, Birrdhawal)Only the macrolanguage is treated as an individual language.
lahLahndahnd,hno,jat,phr,pmu,pnb,skr,xheNot yet discussed.
lvLatvianltg (Latgalian)Both the macrolanguage and its subdivisionltg are treated as languages, but the macrolanguage code is used in place of the code (lvs) which the ISO gave the standard variety of the language.
delLenape
(= Delaware)
umu (Munsee),unm (Unami)Only the subdivisions are treated as languages. (discussion)
ltLithuaniansgs (Samogitian)Both the macrolanguage and its subdivisionsgs are treated as languages.
lrkLoarkigda (Gade Lohar)Only the macrolanguage is treated as an individual language. (discussion)
ndsLow German
  • nds-de (German Low German) –wep (Westphalian),frs (East Frisian)
  • nds-nl (Dutch Low Saxon) –act (Achterhoeks),drt (Drents),gos (Gronings),sdz (Sallands),stl (Stellingwerfs),twd (Twents),vel (Veluws)
  • pdt (Plautdietsch)
The codends is deprecated.nds-de is used for German Low German varieties.nds-nl is used for Dutch Low Saxon varieties. Plautdietsch (pdt) is a separate lect. (discussion of language name,discussion ofdrt,gos,twd,general discussion 1,general discussion 2,discussion ofnds-de,general discussion 3(permalink))
luyLuhyabxk,ida,lkb,lko,lks,lri,lrm,lsm,lto,lts,lwg,nle,nyd,ragNot yet discussed.
xluLuwianhluLuwian is treated as a single language with the codexlu. Luwian is written in two scripts, and the ISO had coded each separately, usingxlu for “Cuneiform Luwian” andhlu for “Hieroglyphic Luwian”. (discussion,permalink)
mgMalagasybhr (Bara Malagasy),bjq,bmm (Northern Betsimisaraka Malagasy),buc (Bushi),bzc (Southern Betsimisaraka Malagasy),mlg,msh,plt,skg,tdx,tkg,txy,xmv,xmwOnly the macrolanguage is treated as an individual language. (discussion 1,discussion 2)
msMalaybjn,btj,bve,bvu,coa,dup,hji,id,jak,jax,kvb,kvr,kxd,lce,lcf,liw,max,meo,mfa,mfb,mhp,min,mqg,msi,mui,orn,ors,pel,pse,tmw,urk,vkk,vkt,xmm,zlm,zmi,zsm,omyThe codeszsm,zlm andomy are not used;ms is used instead. The status of the remaining lects remain unclear. (discussion 1,discussion 2,discussion 3,vote 1,vote 2(Failed votes on unification of Malay and Indonesian),discussion 4,discussion 5)
manMandingoemk,mku,mlq,mnk,msc,mwk,myqNot yet discussed.
mnsMansimns-cen (Central Mansi),mns-nor (Northern Mansi),mns-sou (Southern Mansi)Central (mns-cen, including Western and Eastern Mansi), Northern (mns-nor) and Southern Mansi (mns-sou) are treated as separate languages with exceptional language codes. (discussion)
Manthartadjl (Jiwarli (macrolanguage code)),dze (Jiwarli (proper)),iin (Thiin),dhr (Tharrgari),wri (Warriyangga)Jiwarli is treated as a single language with the codedjl; the ISO’s split of that code intodze for Jiwarli proper andiin for Thiin has not been followed. However,dhr (Tharrgari) andwri (Warriyangga) have tentatively been retained as languages, rather than being merged, with Jiwarli, into the single language Mantharta. (discussion)
chmMarimhr (Eastern Mari),mrj (Western Mari)Only the subdivisions are treated as languages, the macrolanguage is not. (discussion 1,discussion 2,discussion 3)
mwrMarwaridhd,mve,rwr,swv,wryOnly the macrolanguage Marwari (mwr) is treated as a language. (discussion).mtr (Mewari) is a separate Rajasthani language and not a variety of Marwari at all (discussion)
mntMaykulanwnn (Wunumara),xyj (Mayi-Yapi),xyk (Mayi-Kulan),xyt (Mayi-Thakurti)Only the macrolanguage is treated as a language. (discussion)
mnMongoliankhk (Khalkha Mongolian),mvf (Peripheral Mongolian)Only the codemn is used for Mongolian;khk is redundant to it andmvf is not usable. Note that Kalmyk (xal) and Buryat (see its entry in this table), which some scholars consider dialects of Mongolian, are treated as independent languages on Wiktionary. (discussion)
mjgMonguormjg-mgr (Mangghuer),mjg-mgl (Mongghul)Only the subdivisions are treated as languages, the macrolanguage is not. (discussion)
nbfNanru (Narua),nxq (Naxi)Only the subdivisions are treated as languages, the macrolanguage is not.
nahNahuatlazd (Eastern Durango Nahuatl),azn (Western Durango Nahuatl),azz (Highland Puebla Nahuatl),naz (Coatepec Nahuatl),nch (Central Huasteca Nahuatl),nci (Classical Nahuatl),ncj (Northern Puebla Nahuatl),ncl (Michoacán Nahuatl),ncx (Central Puebla Nahuatl),ngu (Guerrero Nahuatl),nhc (Tabasco Nahuatl),nhe (Eastern Huasteca Nahuatl),nhg (Tetelcingo Nahuatl),nhi (Zacatlán-Ahuacatlán-Tepetzintla Nahuatl),nhk (Cosoleacaque Nahuatl),nhm (Morelos Nahuatl),nhn (Central Nahuatl),nhp (Pajapan Nahuatl),nhq (Huaxcaleca Nahuatl),nht (Ometepec Nahuatl),nhv (Temascaltepec Nahuatl),nhw (Western Huasteca Nahuatl),nhx (Mecayapan Nahuatl),nhy (Northern Oaxaca Nahuatl),nhz (Santa María La Alta Nahuatl),nlv (Orizaba Nahuatl),nuz (Tlamacazapa Nahuatl),ppl (Pipil),xpo (Pochutec)The macrolanguage codenah is no longer used for entries; its subdivisions are treated as languages. A number ofnah translations remain and require to be assigned to the appropriate Nahuatl language. (discussion)
mffNakijms (Mashi),buz (Bukwen)Only the macrolanguage is treated as a language. (discussion)
Nambunmx (Nama),nkm (Namat),ncm (Nambo),mxw (Namo, Dorro),nex (Neme),nqn (Nen)Not yet discussed
Nenetsyrk-tun (Tundra Nenets),yrk-for (Forest Nenets)Only the subdivisions are treated as languages, the macrolanguage is not. (The code the ISO gave the macrolanguage,yrk, is used for the Nenets family.) (discussion) “Yurats” (rts), a barely-attested extinct lect which was probably just a variety of Enets, or a transitional variety between Enets and the Nenets varieties, is excluded for now. (discussion)
neNepalinpiOnly the codene is used.
Ngbandingb,deq, ...deq is not included.
nrfNormanroa-nor (Norman),nrf-grn (Guernésiais),nrf-jer (Jèrriais)Only the macrolanguage Norman is treated as a language, and the codenrf (which the ISO assigned to Guernésiais and Jèrriais) is used for all varieties of it, which had previously been granted separate exceptional codes. Comparefr. (discussion)
noNorwegiannb,nnIn practice, both the macrolanguage and its subdivisions are treated as languages. There has been discussion of either treating only the macrolanguage as a language, or of only treating the subdivisions as languages, but there is no consensus about which of these to do. (discussion 1,discussion 2,discussion 3,discussion 4,discussion 5,discussion 6,stalemated vote) Nonetheless, over time, the subdivisions have become favoured; as of May 2025, Bokmål and Nynorsk had more than 20,000 lemmas each, whereas Norwegian (no) had around 1,550 lemmas, predominantly proper nouns.
nktNyikankt,nkvNyika is treated as a single language with the codenkt. The ISO had coded two regional varieties separately, usingnkt for the Nyika of Tanzania andnkv for the Nyika of Malawi and Zambia.
ocOccitanprv (Provençal),sdt (Shuadit)Only the macrolanguage is treated as a language.prv is an etymology-only language.
orOdiaori,ory,ort (Adivasi Odia),dso (Desiya),spv (Sambalpuri, Kosli, Kosali)Neither of the three-letter codes the ISO assigned to Odia proper (ori andory) is used; instead, only the two-letter codeor is used. Of the three other codes the ISO assigned to Odia varieties,spv is currently not used, whiledso andort are.
ojOjibweciw,ojb (Northwestern Ojibwa),ojc (Central Ojibwa),ojg (Eastern Ojibwa),ojs (Severn Ojibwa),ojw (Western Ojibwa),otw (Ottawa)Both the macrolanguage and its subdivisions are treated as languages, except that the codeciw is not used, having been merged intooj.
froOld Frenchxno (Anglo-Norman),zrp (Judeo-French)Only the macrolanguage is treated as a language.xno is an etymology-only language. (discussion)
mgxOmati-Minijbk (Barikewa),jmw (Mouwase)Only the subdivisions are treated as languages, the macrolanguage is not.
omOromogax,gaz,hae (Eastern / Ittu / Qottu / Harar Oromo),orcOnly the macrolanguage is treated as a language. (discussion)
osOssetianosd (Digor),oss (Iron)Only the macrolanguage is treated as a language, using the codeos. (discussion 1,discussion 2)
kppPaku Karenjkp (Paku Karen),jkm (Mobwa Karen)In 2012, the ISO splitkpp intojkp andjkm. Wiktionary has not made this split at this time. (discussion)
psPashtopbt (Southern Pashto),pbu (Northern Pashto),pst (Central Pashto),wne (Waneci)Only the macrolanguageps and the varietywne (Waneci) are treated as languages. (discussion ofpbt,pbu,pst)
faPersian
(= Farsi)
aiq (Aimaq),bhh (Bukhari),deh (Dehwari),haz (Hazaragi),jdt (Judeo-Tat),jpr (Judeo-Persian),pes (Western Persian),phv (Pahlavani),prd,prp,prs (Dari),tg (Tajik),ttt (Tat)Persian (fa), Tajik (tg), Judeo-Persian (jpr), Bukhari (bhh), Judeo-Tat (jdt), Tat (ttt) and Hazaragi (haz) are treated as separate languages. Western Persian (pes), Eastern Persian / Dari (prs) and Aimaq (aiq) are subsumed intofa. “Parsi” (prp) and “Parsi-Dari” (prd) are spurious and are excluded. The status ofdeh andphv is unresolved. (discussion of Tajik,of Judeo-Persian and Bukhari, andof Tat,discussion of pes, prs, aiq, haz, deh and phv)
pihPitcairn-Norfolkcpe-pit (Pitcairn),cpe-nor (Norfolk)For a time, Wiktionary split Pitcairn-Norfolk into two varieties, granting each an exceptional code: Pitcairn wascpe-pit, Norfolk wascpe-nor. That split has been undone; onlypih is now treated as a language. (discussion 1,discussion 2)
plPolishcsb (Kashubian),zlw-pom (Pomeranian),zlw-slv (Slovincian)In practice, both the macrolanguage and its subdivisions are treated as languages, although this has not been discussed.
praPrakritpka (Ardhamagadhi Prakrit),pmh (Maharastri Prakrit),psu (Sauraseni Prakrit)Only the macrolanguage is treated as a language. Besides the ISO-coded varieties to the left, Elu Prakrit, Khasa Prakrit, Magadhi Prakrit, and Paisaci Prakrit are also merged intopra. (vote)
puaPurepechatszPurepecha is treated as a single language with the codepua. The ISO had coded two dialects separately, usingpua for “Western Purepecha” andtsz for “Eastern Purepecha”. (discussion)
quQuechuacqu,inb,inj,qub,qud,quf,qug,quh,quk,qul,qup,qur,qus,quw,qux,quy,quz,qva,qvc,qve,qvh,qvi,qvj,qvl,qvm,qvn,qvo,qvp,qvs,qvw,qvz,qwa,qwc,qwh,qws,qxa,qxc,qxh,qxl,qxn,qxo,qxp,qxr,qxt,qxu,qxw,qwe-kchOnly the macrolanguage Quechua (qu), Inga (inb), Jungle Inga (inj), Classical Quechua (qwc) and the standardized variety Kichwa (qwe-kch) are treated as languages. (discussion of Kichwa)NOTE: This is subject to change. There was arecent 2023 discussion about splitting Quechua that ended in a general consensus to split somehow, but no consensus on how to split.
rajRajasthanibgq,gda,gju,hoj,mup,wbrNot yet discussed.
romRomanirmc,rmf,rml,rmn,rmo (Sinte Romani),rmw,rmyOnly the macrolanguagerom and the subdivisionrmo have L2 headers, but all of the subdivisions can have nested lines in translations tables. (discussion)
roRomanianmo (Moldavian)Only the macrolanguage (ro) is treated as a language. (vote)
jyarGyalrongsit-sit (Situ),sit-jap (Japhug),sit-tsh (Tshobdun),sit-zbu (Zbu)Only the subdivisions are treated as languages, the macrolanguage is not. (discussion)
rueRusynrsk (Pannonian Rusyn)Rusyn is treated as two separate languages: Carpathian Rusyn (rue) and Pannonian Rusyn (rsk). The ISO originally only had the former code, which was understood to comprise both Rusyn languages. The latter code was approved on 2022. The official designations for these codes are somewhat ambiguous, but are usually understood asrue for Carpathian Rusyn andrsk for Pannonian Rusyn. The English Wiktionary follows this interpretation. (discussion)
rwRwanda-Rundirn (Rundi proper),haq (Ha, Giha),suj (Shubi),han (Hangaza),vin (Vinza)Rwanda-Rundi is treated as a single language with the coderw. The ISO had coded the dialects separately, usingrw for “(Kinya)rwanda” andrn for “(Ki)rundi”, etc. (discussion)
nai-sptSahaptinuma,waa,yak,tqnBoth the macrolanguage and its subdivisions are treated as languages.
scSardiniansdc (Sassarese),sdn (Gallurese),src (Logudorese),sro (Campidanese)Sassarese (sdc) and Gallurese (sdn) are treated as separate languages; Logudorese (src) and Campidanese (sro) are not. (discussion 1,discussion 2)
sehSenabwg (Barwe),swk (Malawi Sena)Only the macrolanguage and the Barwe variety are treated as languages, while Malawi Sena is not.
selSelkupsel-nor (Northern Selkup),sel-sou (Southern Selkup, including Central Selkup)Northern (sel-nor) and Southern Selkup (sel-sou, including Central Selkup) are treated as separate languages with exceptional language codes. (discussion)
shSerbo-Croatianbs (Bosnian),hr (Croatian),sr (Serbian),cnr (Montenegrin),kjv (Kajkavian)Only the macrolanguage is treated as a language. (Seediscussion 1,discussion 2,discussion 3,discussion 4,discussion 5,this discussion from late 2023,more discussion,this old vote,this discussion ofkjv, and many other discussions.)
kquSeroagkuIn 2014, the ISO restrictedkqu to ǁKuǁe and split offgku ǂUngkue / ǂKunkwe. Wiktionary has not made this split at this time. (discussion)
denSlaveyscs,xslNot yet discussed.
swSwahilista,swc,swh,bnt-cmw (Chimwiini)Only the macrolanguage and Chimwiinibnt-cmw are treated as languages. (discussion ofsta)
syrSyriacSee the entry for “Aramaic”.
tlTagalogfil (Filipino)Only the macrolanguage (tl) is treated as a language. (vote)
tmhTamashektaq,thv,thz,ttqNot yet discussed.
xtzTasmanianxpb (Pyemmairre),xpd (Paredarerme),xpf (Southeast Tasmanian),xph (Tyerrernotepanner),xpl (Port Sorell),xpv (Tommeginne),xpw (Peerapper),xpx (Toogee),xpz (Bruny Island),aus-lsw (Little Swanport)There is no single "Tasmanian language"; it is a geographic grouping of an unknown number of possibly unrelated families. Seethis 2024 discussion.
boTibetanbod (Lhasa, Ü, Dbus),adx (Amdo, Panang),khg (Khams),kbg (Khamba),tsk (Tseku),dre (Dolpo),hut (Humla, Limi),lhm (Lhomi, Shing Saapa),muk (Mugom, Mugu),kte (Nubri),ola (Walungge, Gola),thw (Thudam),loy (Lowa, Loke, Mustang),tcn (Tichurong)Only the macrolanguage is treated as a language, as well asxct (Classical Tibetan) andotb (Old Tibetan). Other Tibetic languages, such as Dzongkha (dz), are separate.
tidTidongitd,ntdOnly the subdivisions are treated as languages, the macrolanguage is not.
uzUzbekuzn,uzsOnly the macrolanguage is treated as a language.
xwkWangkumaraxpt (Punthamara),eaa (Karenggapa)Only the macrolanguage Wangkumara (xwk) is treated as a language, and itis treated as a language, rather than as a dialect of Ngura (which the ISO used to consider a single language with the codenbx).
xwwWemba-Wembarbp (Baraba-Baraba),rnr (Nari-Nari),weg (Wergaia),xwt (Wotjobaluk)Only the macrolanguage is treated as a language. (discussion)
wnwWintunol (Nomlaki),pwi (Patwin),wnw (Wintu)Both the macrolanguage and its subdivisions are treated as languages. (discussion)
woWolofwof (Gambian Wolof)Onlywo is treated as a language. (discussion)
yafYakalnz (Lonzo),ppp (Pelende)Onlyyaf is treated as a language. (discussion 1,discussion 2)
yxlYarliwdk,ygaOnlyyxl is treated as a language.
yenYendangynq (Yendang proper),yot (Yotti)Only the macrolanguage is treated as a language. (discussion)
yiYiddishydd,yihOnly the macrolanguage is treated as a language. (discussion)
yiyYir-Yorontyyr (Yir-Yoront),yrm (Yirrk-Mel / Yirrk-Thangalkl)Only the macrolanguage is treated as a language. (discussion)
yokYokutsyok-ply (Palewyami),yok-bvy (Buena Vista Yokuts),yok-tky (Tule-Kaweah Yokuts),yok-kry (Kings River Yokuts),yok-gsy (Gashowu),yok-svy (Southern Valley Yokuts, including Yawelmani),yok-nvy (Northern Valley Yokuts, including Chukchansi),yok-dly (Delta Yokuts)Only the subdivisions are treated as languages, the macrolanguage is not.
zapZapoteczaa,zab,zac,zad,zae,zaf,zai,zam,zao,zaq,zar,zas,zat,zav,zaw,zax,zca,zoo,zpa,zpb,zpc,zpd,zpe,zpf,zpg,zph,zpi,zpj,zpk,zpl,zpm,zpn,zpo,zpp,zpq,zpr,zps,zpt,zpu,zpv,zpw,zpx,zpy,zpz,zsr,zte,ztg,ztl,ztm,ztn,ztp,ztq,zts,ztt,ztu,ztx,ztyNot yet discussed.
zzaZazakidiq,kiuOnly the macrolanguage is treated as a language.
zaZhuangzch (Central Hongshuihe Zhuang),zeh (Eastern Hongshuihe Zhuang),zgb (Guibei Zhuang),zgm (Minz Zhuang),zgn (Guibian Zhuang),zhd (Dai Zhuang),zhn (Nong Zhuang),zlj (Liujiang Zhuang),zln (Lianshan Zhuang),zlq (Liuqian Zhuang),zqe (Qiubei Zhuang),zyb (Yongbei Zhuang),zyg (Yang Zhuang),zyj (Youjiang Zhuang),zyn (Yongnan Zhuang),zzj (Zuojiang Zhuang)Only the macrolanguage is treated as a language. (discussion)

Excluded codes

The following ISO 639-3 codes have been excluded without being subsumed into a single other code:

  • bpw (“Bo”, “Po”, “Sorimi”) is excluded for now because its existence as a distinct language is unconfirmed and undocumented, and if it were included, a naming conflict would exist withbgl (discussion 1,discussion 2).
  • bzt (“Brithenig”), a minor constructed language (seeWT:CFI § Constructed languages).
  • dgu (“Degaru/Dhekaru”), spurious according toEthnologue (seew:Dhekaru); it’s a caste, not a language.
  • dws (“Dutton World Speedwords”), a minor constructed language (seeWT:CFI § Constructed languages).
  • dyg (“Villa Viciosa Agta”) is excluded for now because its existence as a distinct language is unconfirmed and undocumented (discussion).
  • ekc, called “Eastern Karnic” by the ISO, is excluded from Wiktionary because it is not clear that there is any data for this “language”, or even that it is a single real language.
  • ghc, called “Hiberno-Scottish Gaelic” by the ISO, is treated as Irish (ga) or Scottish (gd) according to whether the word in question is from an Irish text, a Scottish one, or both (discussion).
  • gok (“Gowli”), spurious according toGlottolog (seew:Spurious languages); it’s a caste, not a language.
  • jrt (“Jorto”), spurious (discussion).
  • luw (“Luo”), an unclassified extinct language, is excluded for now because no includable content in it has been put forth, and if it were included, a naming conflict would exist withluo.
  • myi (“Mina (India)”) is excluded because it is spurious. Furthermore, if it were included, a naming conflict would exist withhna.
  • vms (“Moksela”) is extinct and unattested (seeMoksela language).
  • wai (“Wares”), spurious (discussion).
  • wma (“Mawa (Nigeria)”) is excluded because there is no evidence it ever existed.
  • wwb (“Wakabunga”), an unclassified extinct language, is excluded for now because no includable content in it exists: the only wordlist which was labelled Wakabunga turned out to be Kalkatungu.
  • yrs (“Yarsun”), spurious (discussion).
  • zba ("Balaibalan"), a minor constructed language (brief discussion).
  • zbl (“Blissymbols”, “Blissymbolics”, “Semantography”), a minor constructed language / script (seeWT:CFI § Constructed languages).
  • zxx, the code for “No linguistic content”.

Other notes:

  • xsc, assigned by the ISO to the "Scythian" language and identified by Wikipedia as "Pontic Scythian", is in practice treated here as afamily code instead
  • xme, assigned by the ISO to the "Median" language, is in practice treated here as afamily code instead (sincecirca March 2019)
  • seb (Shempire Senoufo) is subsumed intospp (Supyire) (discussion)
  • gic (Gayle, Gail), a gay argot used in English and Afrikaans, is treated as those languages (discussion), cf. Polari
Retrieved from "https://en.wiktionary.org/w/index.php?title=Wiktionary:Language_treatment&oldid=89499232"
Category:
Hidden category:

[8]ページ先頭

©2009-2026 Movatter.jp