Semantic analysis (machine learning)

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by ManuelRodriguez (talk | contribs) at 08:31, 26 August 2020 (a metalanguage can parse human speech). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In machine learning, semantic analysis of a corpus is the task of building structures that approximate concepts from a large set of documents. It generally does not involve prior semantic understanding of the documents. A metalanguage based on predicate logic can analyze human speech.[1]: 93- 

Latent semantic analysis (sometimes called latent semantic indexing) is a class of techniques in which documents are represented as vectors in term space. A prominent example is probabilistic latent semantic indexing (PLSI).
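The representation of documents as vectors in term space can be illustrated with a minimal sketch. The function names `term_vectors` and `cosine` and the three toy documents are assumptions made for this illustration. The sketch covers only the vector representation and a similarity comparison; latent semantic analysis additionally applies a truncated singular value decomposition to the resulting term-document matrix, which is omitted here.

```python
import math
from collections import Counter


def term_vectors(docs):
    """Map each document to a vector of raw term counts over a shared vocabulary."""
    vocab = sorted({term for doc in docs for term in doc.split()})
    vectors = []
    for doc in docs:
        counts = Counter(doc.split())
        # One coordinate per vocabulary term; absent terms count as 0.
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical toy corpus, whitespace-tokenized.
docs = ["cat sat mat", "cat sat hat", "stock market fell"]
vocab, vectors = term_vectors(docs)
sim_related = cosine(vectors[0], vectors[1])    # two shared terms out of three
sim_unrelated = cosine(vectors[0], vectors[2])  # no shared terms
```

In this toy corpus the first two documents share two of their three terms, so their similarity is about 0.67, while the first and third share none and have similarity 0.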

Latent Dirichlet allocation involves attributing document terms to topics.
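One common way to attribute document terms to topics is collapsed Gibbs sampling. The following is a minimal sketch under stated assumptions, not a reference implementation: the function name `lda_gibbs`, the hyperparameter values `alpha` and `beta`, the iteration count, and the toy documents are all chosen for illustration.

```python
import random
from collections import defaultdict


def lda_gibbs(docs, num_topics, iterations=100, alpha=0.1, beta=0.01, seed=0):
    """Assign a topic to every term occurrence using collapsed Gibbs sampling."""
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})
    doc_topic = [[0] * num_topics for _ in docs]              # terms in doc d assigned to topic k
    topic_word = [defaultdict(int) for _ in range(num_topics)]  # count of word w in topic k
    topic_total = [0] * num_topics                            # total terms assigned to topic k
    assignments = []

    # Start from a random topic for each term occurrence.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(num_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current assignment from the counts.
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Unnormalized conditional probability of each topic for this term.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    return assignments, doc_topic


# Hypothetical toy corpus: each document is a list of tokens.
docs = [
    ["cat", "dog", "pet", "cat"],
    ["stock", "market", "price", "stock"],
    ["dog", "pet", "market"],
]
assignments, doc_topic = lda_gibbs(docs, num_topics=2)
```

Here `assignments[d][i]` holds the topic attributed to the `i`-th term of document `d`, and `doc_topic[d]` counts how many terms of that document fall under each topic.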

N-grams and hidden Markov models represent the term stream as a Markov chain, in which each term is derived from the few terms before it.
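The simplest case of this idea is a bigram model, where each term depends only on the single term before it. The sketch below estimates the transition probabilities from counts; the function names `train_bigram` and `transition_prob` and the toy sentence are assumptions for illustration, and no smoothing is applied.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each term follows each preceding term."""
    counts = defaultdict(Counter)
    for prev, cur in zip(tokens, tokens[1:]):
        counts[prev][cur] += 1
    return counts


def transition_prob(counts, prev, cur):
    """Maximum-likelihood estimate of P(cur | prev); 0.0 if prev was never seen."""
    followers = counts.get(prev)
    if not followers:
        return 0.0
    return followers[cur] / sum(followers.values())


# Hypothetical toy term stream.
tokens = "the cat sat on the mat the cat ran".split()
counts = train_bigram(tokens)
p_cat = transition_prob(counts, "the", "cat")  # "the" is followed by "cat" 2 times out of 3
p_mat = transition_prob(counts, "the", "mat")  # and by "mat" 1 time out of 3
```

Longer n-grams follow the same pattern, with the conditioning context extended from one preceding term to the previous n-1 terms.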

References

  1. Nitin Indurkhya; Fred J. Damerau (22 February 2010). Handbook of Natural Language Processing. CRC Press. ISBN 978-1-4200-8593-8.