Jump to content

XML Information Set

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Netjeff (talk | contribs) at 17:07, 9 October 2014 (Wikilink to Binary XML in section Serialization: , and removed ext link to Fast Infoset (just one of many binary). Also added a ref to XML format.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

XML Information Set (XML Infoset) is a W3C specification describing an abstract data model of an XML document in terms of a set of information items.[1] The definitions in the XML Information Set specification are meant to be used in other specifications that need to refer to the information in a well-formed XML document.

An XML document has an information set if it is well-formed and satisfies the namespace constraints. There is no requirement for an XML document to be valid in order to have an information set.

An information set can contain up to eleven different types of information items:

  1. The Document Information Item (always present)
  2. Element Information Items
  3. Attribute Information Items
  4. Processing Instruction Information Items
  5. Unexpanded Entity Reference Information Items
  6. Character Information Items
  7. Comment Information Items
  8. The Document Type Declaration Information Item
  9. Unparsed Entity Information Items
  10. Notation Information Items
  11. Namespace Information Items

Infoset recommendation Second Edition was adopted on February 4, 2004.

Infoset Augmentation

Infoset augmentation or infoset modification refers to the process of modifying the infoset during schema validation, for example by adding default attributes. The augmented infoset is called the post-schema-validation infoset, or PSVI. [2]

Infoset augmentation is somewhat controversial, with claims that it is a violation of modularity and tends to cause interoperability problems, since applications get different information depending on whether or not validation has been performed. [3]

Infoset augmentation is supported by XML Schema but not RELAX NG.

Serialization

Typically, XML Information Set is serialized as XML[4] . There are also serialization formats for Binary XML, CSV[5], and JSON.[6]

See also

XML Information Set instances:

References

  1. ^ W3C XML Infoset
  2. ^ XML Schema 1.1 Part 1: Structures
  3. ^ RELAX NG and W3C XML Schema, James Clark, 4 Jun 2002
  4. ^ "Extensible Markup Language (XML)". W3C. Retrieved 9 October 2014.
  5. ^ XmlCsvReader Implementation
  6. ^ Apache CXF JSON Support