Jump to content

Legacy encoding

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Plugwash (talk | contribs) at 02:20, 17 July 2005. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In unicode terminology a Legacy encoding is an encoding that can't represent all of unicode but is still used for compatibility or other reasons. Many of them pre-date unicode, others are slight modifications to older encodings to support important new characters such as the euro sign (€) or to satisfy countries that felt there were significant ommisions for thier language in older encodings. The best known such encoding is probablly ISO-8859-15.

Legacy encodings can be split into several groups

  • The ISO-8859-n group of single byte encodings
  • The IBM/DOS/Windows OEM series of single byte code pages (437,850 and others).
  • The single byte windows ANSI code pages (125x)
  • The windows multi byte code pages used by windows as both ansi and OEM code pages for CJK languages.
  • Various other multibyte CJK encodings such as ISO-2022 and EUC.