Collocation extraction

From Wikipedia, the free encyclopedia
Computational technique to find word sequences

Collocation extraction is the task of using a computer to extract collocations automatically from a corpus.

The traditional approach to collocation extraction is to score every word pair with a formula based on statistical quantities of the words involved. Proposed formulas include mutual information, the t-test, the z-test, the chi-squared test and the likelihood ratio.[1]
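As an illustration of the scoring approach, the following minimal sketch ranks adjacent word pairs by pointwise mutual information, one common form of the mutual information measure. The function name `pmi_scores`, the `min_count` cutoff, and the toy sentence are assumptions made for this example and do not come from the cited source; real systems typically work on much larger corpora and often apply smoothing or a different measure.

```python
import math
from collections import Counter


def pmi_scores(tokens, min_count=2):
    """Score adjacent word pairs by pointwise mutual information (PMI).

    PMI(w1, w2) = log2( P(w1, w2) / (P(w1) * P(w2)) ), with probabilities
    estimated from raw counts. Pairs seen fewer than `min_count` times are
    skipped, because PMI is unreliable for rare events.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_tokens = len(tokens)
    n_bigrams = max(n_tokens - 1, 1)  # number of adjacent positions

    scores = {}
    for (w1, w2), joint in bigrams.items():
        if joint < min_count:
            continue
        p_joint = joint / n_bigrams
        p_w1 = unigrams[w1] / n_tokens
        p_w2 = unigrams[w2] / n_tokens
        scores[(w1, w2)] = math.log2(p_joint / (p_w1 * p_w2))
    return scores


# Toy corpus (hypothetical): 'crystal clear' occurs twice.
tokens = ("the water was crystal clear and the sky was crystal clear "
          "but the answer was not clear").split()

ranked = sorted(pmi_scores(tokens).items(), key=lambda kv: kv[1], reverse=True)
for pair, score in ranked:
    print(pair, round(score, 2))
```

A positive score means the pair occurs more often than its individual word frequencies would predict. The frequency cutoff matters in practice: without it, pairs that occur only once can receive high scores purely by accident.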

Within the area of corpus linguistics, a collocation is defined as a sequence of words or terms which co-occur more often than would be expected by chance. 'Crystal clear', 'middle management', 'nuclear family', and 'cosmetic surgery' are examples of collocated pairs of words. Some words are often found together because they make up a compound noun, for example 'riding boots', 'motor cyclist', or 'collocation extraction' itself.
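One way to make "expected by chance" concrete is to compare the observed count of a pair with the count that would be expected if the two words occurred independently. The notation here is an assumption for illustration: c(·) denotes a count in the corpus and N the corpus size in tokens, with the number of adjacent positions approximated by N.

```latex
E\bigl[c(w_1 w_2)\bigr]
  = N \cdot \frac{c(w_1)}{N} \cdot \frac{c(w_2)}{N}
  = \frac{c(w_1)\, c(w_2)}{N},
\qquad
w_1 w_2 \text{ is a collocation candidate if } c(w_1 w_2) \gg E\bigl[c(w_1 w_2)\bigr].
```

How large the gap must be before a pair is accepted is what the individual statistical tests decide.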

External links

Look up collocation in Wiktionary, the free dictionary.

References

  1. Manning, C. D.; Schütze, H. (1999). Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press. ISBN 978-0-262-13360-9.
