Linguistic theories generally regard human languages as consisting of two parts: a lexicon, essentially a catalogue of a language's words (its wordstock); and agrammar, a system of rules which allow for the combination of those words into meaningful sentences. The lexicon is also thought to includebound morphemes, which cannot stand alone as words (such as mostaffixes).[2] In some analyses,compound words and certain classes ofidiomatic expressions,collocations and otherphrasemes are also considered to be part of the lexicon.Dictionaries are lists of the lexicon, in alphabetical order, of a given language; usually, however, bound morphemes are not included.
Items in the lexicon are called lexemes,lexical items, or word forms. Lexemes are not atomic elements but contain both phonological and morphological components. When describing the lexicon, a reductionist approach is used, trying to remain general while using a minimal description. To describe the size of a lexicon,lexemes are grouped into lemmas. Alemma is a group of lexemes generated byinflectional morphology. Lemmas are represented in dictionaries by headwords that list thecitation forms and anyirregular forms, since these must be learned to use the words correctly. Lexemes derived from a word by derivational morphology are considered new lemmas. The lexicon is also organized according to open and closed categories.Closed categories, such asdeterminers orpronouns, are rarely given new lexemes; their function is primarilysyntactic. Open categories, such asnouns andverbs, have highly active generation mechanisms and their lexemes are moresemantic in nature.
Lexicalization and other mechanisms in the lexicon
A central role of the lexicon is documenting establishedlexical norms and conventions.Lexicalization is the process by which new words, having gained widespread usage, enter the lexicon. Since lexicalization[3] may modify lexemes phonologically and morphologically, it is possible that a single etymological source may be inserted into a single lexicon in two or more forms. These pairs, called adoublet, are often close semantically. Two examples areaptitude versusattitude andemploy versusimply.[4]
Neologisms are new lexeme candidates which, if they gain wide usage over time, become part of a language's lexicon. Neologisms are often introduced by children who produce erroneous forms by mistake.[7] Other common sources are slang and advertising.
Neologisms that maintain the sound of their external source
There are two types of borrowings (neologisms based on external sources) that retain the sound of thesource language material:
Borrowing using the source language lexical item as the basic material for the neologization: guestwords, foreignisms and loanwords
Borrowing using a target language lexical items as the basic material for the neologization: phono-semantic matching, semanticized phonetic matching and phonetic matching.
The following are examples of external lexical expansion using thesource language lexical item as the basic material for the neologization, listed in decreasing order of phonetic resemblance to the original lexical item (in the source language):[8]
Guestword (in German:Gastwort): unassimilated borrowing.
Foreignism (in German:Fremdwort): foreign word, e.g. phonetic adaptation.
Loanword (in German:Lehnwort): totally assimilated borrowing, e.g. morphemic adaptation.
Phono-semantic matches, semanticized phonetic matches and phonetic matches
The following are examples of simultaneous external and internal lexical expansion usingtarget language lexical items as the basic material for the neologization but still resembling the sound of the lexical item in the source language:[9]
Phono-semantic matching (PSM): the target language material is originally similar to the source language lexical item both phonetically and semantically.
Semanticized phonetic matching (SPM): the target language material is originally similar to the source language lexical item phonetically, and only in a loose way semantically.
Phonetic matching (PM): the target language material is originally similar to the source language lexical item phonetically but not semantically.
Another mechanism involvesgenerative devices that combine morphemes according to a language's rules. For example, thesuffix "-able" is usually only added totransitive verbs, as in "readable" but not "cryable".
A compound word is a lexeme composed of several established lexemes, whose semantics is not the sum of that of their constituents. They can be interpreted throughanalogy,common sense and, most commonly, context.[3] Compound words can have simple or complex morphological structures. Usually, only the head requires inflection for agreement. Compounding may result in lexemes of unwieldy proportion. This is compensated by mechanisms that reduce the length of words. A similar phenomenon has been recently shown to feature in social media also where hashtags compound to form longer-sized hashtags that are at times more popular than the individual constituent hashtags forming the compound.[10] Compounding is the most common of word formation strategies cross-linguistically.
Comparative historical linguistics studies the evolution of languages and takes adiachronic view of the lexicon. The evolution of lexicons in different languages occurs through a parallel mechanism. Over time historical forces work to shape the lexicon,[11] making it simpler to acquire and often creating an illusion of great regularity in language.
Phonological assimilation, the modification of loanwords to fit a new language's sound structure more effectively. If, however, a loanword sounds too "foreign", inflection or derivation rules may not be able to transform it.
Analogy, where new words undergo inflection and derivation analogous to that of words with a similar sound structure.
Emphasis, the modification of words' stress or accent.
The term "lexicon" is generally used in the context of a single language. Therefore, multi-lingual speakers are generally thought to have multiple lexicons. Speakers of language variants (Brazilian Portuguese andEuropean Portuguese, for example) may be considered to possess a single lexicon. Thus acash dispenser (British English) as well as anautomatic teller machine orATM in American English would be understood by both American and British speakers, despite each group using different dialects.
When linguists study a lexicon, they consider such things as what constitutes a word; the word/concept relationship;lexical access and lexical access failure; how a word'sphonology,syntax, and meaning intersect; themorphology-word relationship; vocabulary structure within a given language; language use (pragmatics);language acquisition; the history and evolution of words (etymology); and the relationships between words, often studied withinphilosophy of language.
^λεξικόςArchived 2021-05-14 at theWayback Machine in Henry George Liddell, Robert Scott,A Greek–English Lexicon (Perseus Digital Library). Sc.βιβλίονbiblios 'book'.
^abGeert, Booij (2005).The grammar of words : an introduction to linguistic morphology. Oxford textbooks in linguistics. Oxford University Press.ISBN0-19-928042-8.
^Page 8 inLanguage Contact and Lexical Enrichment in Israeli Hebrew, by Ghil'ad Zuckermann, Palgrave Macmillan, 2003.
^Suman Kalyan Maity, Ritvik Saraf and Animesh Mukherjee (2016). #Bieber + #Blast = #BieberBlast: Early Prediction of Popular Hashtag Compounds. InACM CSCW, San Francisco, CA.