Jump to content

SMART Information Retrieval System

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Qwertyus (talk | contribs) at 12:47, 5 April 2010 (+SMART notation for tf-idf, reference, changed cats). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The SMART (System for the Mechanical Analysis and Retrieval of Text) Information Retrieval System is an information retrieval system developed at Cornell University in the 1960s. Many important concepts in information retrieval were developed as part of research on the SMART system, including the vector space model, relevance feedback, and Rocchio Classification.

Gerard Salton led the group that developed SMART. Other contributors included Mike Lesk.

The SMART system also provides a set a corpora, queries and reference rankings, taken from different subjects, notably

  • ADI: publications from information science reviews [1]
  • CACM: computer science [2]
  • Cranfield collection : publications from aeronautic reviews [3]
  • CISI: library science [4]
  • Medlars collection : publications from medical reviews [5]
  • Time magazine collection : archives of the generalist review Time in 1963 [6]

To the legacy of the SMART system belongs the so-called SMART notation, a mnemonic scheme for denoting tf-idf weighting variants in the vector space model.[1]

References

  1. ^ Manning, Christopher D.; Raghavan, Prabhakar; Schütze, Hinrich (2008), Introduction to Information Retrieval, Cambridge University Press