Jump to content

Generalized vector space model

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Babe12345654321 (talk | contribs) at 14:13, 20 September 2013 (Semantic information on GVSM). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Generalized vector space model is a generalization of the vector space model used in information retrieval. Wong et al.[1] presented an analysis of the problems that the pairwise orthogonality assumption of the vector space model (VSM) creates. From here they extended the VSM to the generalized vector space model (GVSM).

Definitions

GVSM introduces term to term correlations, which deprecate the pairwise orthogonality assumption. More specifically, they considered a new space, where each term vector ti was expressed as a linear combination of 2n vectors mr where r = 1...2n.

For a document dk and a query q the similarity function now becomes:

where ti and tj are now vectors of a 2n dimensional space.

Term correlation can be implemented in several ways. As an example Wong et al. use as input to their algorithm the term occurrence frequency matrix obtained from automatic indexing and the output is term correlation between any pair of index terms.

this does not exist. youre a babe

References

  1. ^ Wong, S. K. M. (1985), Generalized vector spaces model in information retrieval, SIGIR ACM {{citation}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)