Automatic indexing

Automatic indexing is the ability for a computer to scan large volumes of documents against a controlled vocabulary, taxonomy, thesaurus or ontology and use those controlled terms to quickly and effectively index large document depositories. As the number of documents exponentially increases with the proliferation of the Internet, automatic indexing will become essential to maintaining the ability to find relevant information in a sea of ir-relevant information. Automatic Indexing is the process of analyzing an item to extract the information to be permanently kept in an index.

History

There are scholars who cite that the subject of automatic indexing attracted attention as early as he 1950s, particularly with the demand for faster and more comprehensive access to scientific and engineering literature.^[1] The data and information explosion predicted in the 1960s^[2] and fulfilled by the emergence of information technology and the World Wide Web also drove the development of advanced system that can facilitate information access.^[3] The technology is partly driven by the emergence of the field called computational linguistics, which steered research that eventually contribute to the development of automatic indexing.

The automated process can encounter problems and these are primarily caused by two factors: 1) the complexity of the language; and, 2) the lack intuitiveness and the difficulty in extrapolating concepts out of statements on the part of the computing technology.^[4] These are primarily linguistic challenges and specific problems involve semantic and syntactic aspects of language.^[4]

References

^ Riaz, Muhammad (1989). Advanced Indexing and Abstracting Practies. Delhi: Atlantic Publishers & Distributors. p. 263.
^ Torres-Moreno, Juan-Manuel (2014). Automatic Text Summarization. Hoboken, NJ: John Wiley & Sons. pp. xii. ISBN 9781848216686.
^ Kapetanios, Epaminondas; Sugumaran, Vijayan; Natural Language and Information Systems: 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008 London, UK, June 24-27, 2008, Proceedings, Myra (2008). Natural Language and Information Systems: 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008 London, UK, June 24-27, 2008, Proceedings. Berlin: Springer Science & Business Media. p. 350. ISBN 3540698574.{{cite book}}: CS1 maint: multiple names: authors list (link) CS1 maint: numeric names: authors list (link)
^ ^a ^b Cleveland, Ana; Cleveland, Donald (2013). Introduction to Indexing and Abstracting: Fourth Edition. Santa Barbara, CA: ABC-CLIO. p. 289. ISBN 9781598849769.

This computing article is a stub. You can help Wikipedia by expanding it.

[1] Riaz, Muhammad (1989). Advanced Indexing and Abstracting Practies. Delhi: Atlantic Publishers & Distributors. p. 263.

[2] Torres-Moreno, Juan-Manuel (2014). Automatic Text Summarization. Hoboken, NJ: John Wiley & Sons. pp. xii. ISBN 9781848216686.

[3] Kapetanios, Epaminondas; Sugumaran, Vijayan; Natural Language and Information Systems: 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008 London, UK, June 24-27, 2008, Proceedings, Myra (2008). Natural Language and Information Systems: 13th International Conference on Applications of Natural Language to Information Systems, NLDB 2008 London, UK, June 24-27, 2008, Proceedings. Berlin: Springer Science & Business Media. p. 350. ISBN 3540698574.{{cite book}}: CS1 maint: multiple names: authors list (link) CS1 maint: numeric names: authors list (link)

[:0-4] Cleveland, Ana; Cleveland, Donald (2013). Introduction to Indexing and Abstracting: Fourth Edition. Santa Barbara, CA: ABC-CLIO. p. 289. ISBN 9781598849769.

[1]

[2]

[3]

[4]

History

See also

References