Jump to content

Collocation extraction

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 张开旭 (talk | contribs) at 08:56, 9 November 2008 (Created page with ''''Collocation extraction''' is a task that extracting collocations automaticly from a corpus using computer. Traditional method to do collocation extraction i...'). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Collocation extraction is a task that extracting collocations automaticly from a corpus using computer.

Traditional method to do collocation extraction is to find a formula based on the statistical quantities of those words to caculate a score associated to every word pairs. Proposed formulas are Mutual information, t-test, z test, chi-square test and likelihood ratio.[1]

references

  1. ^ Manning, C. D. (1999). Foundations of statistical natural language processing. MIT Press.