
Unstructured data

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by TreveX (talk | contribs) at 00:48, 25 September 2005. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Unstructured data refers to masses of (usually) computerized information that lack a data structure a machine can easily read. One example is unstructured text, such as the body of an email. Data mining techniques are used to find patterns in, or otherwise interpret, this information.
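As a rough illustration of finding patterns in free text, the following Python sketch pulls date-like and address-like strings out of an email body. The sample text, the name `extract_patterns`, and the two regular expressions are assumptions made for this example only; real data mining techniques are considerably more involved than simple pattern matching.

```python
import re

# Hypothetical email body: free text with no machine-readable fields.
EMAIL_BODY = (
    "Hi Sam, the meeting moved to 2005-09-27. "
    "Please send the report to alex@example.com before then."
)


def extract_patterns(text):
    """Pull date-like and address-like strings out of free text.

    The patterns are deliberately simple and will miss many real-world
    formats; they only show that structure has to be inferred from the text.
    """
    dates = re.findall(r"\b\d{4}-\d{2}-\d{2}\b", text)
    addresses = re.findall(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b", text)
    return {"dates": dates, "addresses": addresses}


if __name__ == "__main__":
    print(extract_patterns(EMAIL_BODY))
```

Nothing in the text marks which words are a date or an address, so the program has to guess from their shape. That guessing step is what makes such data harder to process than a database record.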

Data with some form of structure may still be referred to as unstructured data if that structure does not help with the desired processing task. For example, an HTML web page is highly structured, but its structure is oriented towards formatting rather than towards ordering the prose on the page.
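The HTML case can be sketched in Python with the standard library's `html.parser` module. The sample page and the names `TextExtractor` and `visible_text` are hypothetical and chosen only for illustration. The parser can easily recover the tags and the text between them, but the result is still plain prose that needs further interpretation.

```python
from html.parser import HTMLParser

# Hypothetical page: the tags describe headings, paragraphs and bold text,
# not what the prose is about.
PAGE = (
    "<html><body><h1>Report</h1>"
    "<p>Sales rose in <b>March</b> and fell in April.</p>"
    "</body></html>"
)


class TextExtractor(HTMLParser):
    """Collect the text found between tags, ignoring the markup itself."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append(text)


def visible_text(html):
    """Return the page's text content as a single string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


if __name__ == "__main__":
    print(visible_text(PAGE))
```

The markup shows that "March" is bold, but not that it is a month or that sales rose in it. For a task such as extracting sales trends, the page is therefore effectively unstructured.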

See also