Jump to content

Collocation extraction

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by UnCatBot (talk | contribs) at 11:46, 26 November 2008 ((Bot) tagging, added uncategorised tag). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Collocation extraction is a task that extracting collocations automaticly from a corpus using computer.

Traditional method to do collocation extraction is to find a formula based on the statistical quantities of those words to caculate a score associated to every word pairs. Proposed formulas are Mutual information, t-test, z test, chi-square test and likelihood ratio.[1]

References

  1. ^ Manning, C. D. (1999). Foundations of statistical natural language processing. MIT Press.