Jump to content

Data-oriented parsing

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Leatherbear (talk | contribs) at 13:07, 12 February 2008 (link to Remko Scha added). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Data-oriented parsing (DOP, also data-oriented processing) is a probabilistic grammar formalism in computational linguistics. DOP was conceived by Remko Scha in 1990 with the aim of developing a performance-oriented grammar framework. Unlike other probabilistic formalisms, DOP takes into account all subtrees contained in a treebank rather than being restricted to, for example, 2-level subtrees (like PCFGs).

Several variants of DOP have been developed. The initial version was based on tree-substitution grammar, while more recently, DOP has been combined with lexical-functional grammar (LFG). The resulting DOP-LFG finds an application in machine translation.[1]

References

  1. ^ Andy Way (1999). A hybrid architecture for robust MT using LFG-DOP. Journal of Experimental & Theoretical Artificial Intelligence 11(3):441โ€“471.