Movatterモバイル変換


[0]ホーム

URL:


Jump to content
Wikidata
Search

Wikidata:Lexicographical data/Glossary

Shortcut: WD:Gloss
From Wikidata
<Wikidata:Lexicographical data
Translate this page

Wikidata:Lexicographical data

Other languages:


This is a glossary of terms that are specific tolexicographical data on Wikidata. See also the generalWikidata Glossary.

  • Form: The grammatical realization of alexeme. This includes inflections and declensions. For example, the English verb "run" becomes "running" as a present participle and "runs" in 3rd person singular. The role of each form is denoted bygrammatical features. A form is asubentity. It includes an ID, arepresentation, an unordered (possibly empty) list of grammatical features and a list ofform statements. Forms have IDs starting with the ID of the lexeme they belong to, followed by a hyphen ("-") and an "F", followed by a natural number in decimal notation: e.g. L123-F7. This prevents confusion of different forms in different lexemes.
  • Form statement: Astatement describing theform or its relations to other forms oritems (e.g. IPA, pronunciation audio).
  • Gloss: A natural-language definition of asense (seegloss(Q1132324)). A gloss is a non-empty unordered set of pairs, each of which contains a string andgloss language. Both the whole pair and its string part may be called agloss. It provides definition in various languages. For example, a French word may have a gloss in French, in additional to a gloss in German.
  • Gloss language orLanguage of Gloss is represented as alanguage code.
  • Grammatical feature: A specification of the conditions or the syntactic roles in which theform is used. Grammatical features are represented as an unordered (possibly empty) set ofitems.
  • Language: seegloss language andlexeme language.
  • Language code: A code that assigns letters or numbers as identifiers or classifiers for languages (seelanguage identifier(Q2092812)). The valid language codes in lexicographical data are mostly a subset of valid (BCP 47)IETF language tag(Q1059900) and a superset oflabel languages. It may have an optional private-use subtag in the format-x-Q######.
  • Lemma: The canonical, dictionary, or citation form of a word (seelemma(Q18514)). For example, "run", "runs", "ran", and "running" are forms of the samelexeme, with "run" as the lemma. Lemmas are not simple strings, but a non-empty unordered set of pairs that include a string and aspelling variant. Each of the pairs or the string parts thereof may also be called alemma. Therefore, a lemma can include words in multiple orthographies. For example, the lemma for the English noun "color" would include "colour" for British English as well as "color" for American English.
  • Lexeme: A lexical element of a language, such as a word, a phrase, or a affix (seelexeme(Q111352)). It consists of a set offorms taken by a single word in one language, and words in any variant orthographies thereof. For example, the English nouns "color" and "colour" and their plural forms "colors" and "colours" are considered as one lexeme, while English noun "color" and verb "color" are two lexemes. Every lexeme is anentity and is stored in apage in theLexeme: namespace. A lexeme includes a prefixed id (like L123), alemma, alexeme language, alexical category, a list oflexeme statements, a list offorms, and a list ofsenses.
  • Lexeme language orLanguage of Lexeme: The language to which a lexeme belongs. Denoted by anitem.
  • Lexeme statement: Astatement that is not specific to a Form or a Sense, but applies to the whole lexeme, for example, derived from grammatical gender or syntactic function.
  • Lexical category: A classification oflexemes into categories which have similar grammatical properties (seepart of speech(Q82042)), also known as part of speech in grammar. It is denoted by anitem, e.g.noun(Q1084) andverb(Q24905).
  • Representation: The text of aform. Likelemmas, representations are not simple strings, but a non-empty unordered set of string/spelling variant pairs. Each of the pairs or the string part thereof may also be called arepresentation.
  • Sense: A meaning of alexeme which it may represent in a text. Any number of senses may be defined, although every lexeme should have at least one. A sense is asubentity. It includes an ID, agloss and a list ofsense statements. Senses have IDs starting with the ID of the lexeme they belong to, followed by a hyphen ("-") and an "S", followed by a natural number in decimal notation: e.g. L123-S4.
  • Sense statement: Astatement describing thesense and its relations to other entities, such asitems. For example: translation, synonym, antonym, connotation, etc.
  • Spelling variant: The specific orthography of alemma orrepresentation, for example British English (en-gb) and American English (en-us) are different spelling variants. A spelling variant is represented as alanguage code.
  • Subentity: Aform or asense. They are a special kind ofentity and may be used in statements, but they do not have their ownpages. Instead, all the information of a subentity is stored in the page of thelexeme that a subentity belongs to. Each subentity functionally depends on a singlelexeme.
Retrieved from "https://www.wikidata.org/w/index.php?title=Wikidata:Lexicographical_data/Glossary&oldid=2335146493"
Categories:

[8]ページ先頭

©2009-2025 Movatter.jp