
Extended Unix Code

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Asamboi (talk | contribs) at 14:09, 23 March 2004 (more strange encodings). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Extended Unix Code (EUC) is a multibyte character encoding system built on 8-bit bytes, used primarily for Japanese and Korean.

In Japan, the encoding is heavily used on Unix-like operating systems such as Linux, but little elsewhere; it is consequently the least widely used of the three major Japanese encodings, behind ISO-2022-JP (JIS) and Shift-JIS.

EUC-JP

In the EUC-JP coding for Japanese, each 7-bit byte value used for a double-byte character in ISO-2022-JP is simply incremented by 128 (that is, its high bit is set). This allows 7-bit ASCII and 8-bit Japanese text to be mixed freely, without the need for the escape sequences employed by ISO-2022-JP.
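The add-128 rule can be illustrated with a minimal Python sketch. The helper name `jis_pair_to_euc` is hypothetical and introduced only for illustration. It assumes the input is a single two-byte JIS X 0208 code as it would appear inside an ISO-2022-JP stream, with the escape sequences already stripped.

```python
def jis_pair_to_euc(pair: bytes) -> bytes:
    """Convert one 7-bit double-byte code to its EUC-JP form.

    Hypothetical helper: assumes `pair` holds exactly two bytes in the
    printable 7-bit range 0x21-0x7E, with no escape sequences attached.
    """
    if len(pair) != 2 or not all(0x21 <= b <= 0x7E for b in pair):
        raise ValueError("expected two bytes in the range 0x21-0x7E")
    # Adding 128 (0x80) sets the high bit of each byte.
    return bytes(b + 0x80 for b in pair)


# Hiragana "a" (U+3042) is 0x24 0x22 in the 7-bit form, i.e. the ASCII text $"
euc = jis_pair_to_euc(b'$"')
```

Because every byte of a Japanese character now has its high bit set, plain ASCII bytes can sit next to it unambiguously: `b"A" + euc` decodes with Python's `euc_jp` codec as the letter A followed by the hiragana character.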

EUC-KR

EUC-KR is a similar 8-bit coding of the KS X 1001 character set used in ISO-2022-KR, built on the same principle of simply adding 128 to each byte.
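Read in the other direction, the same principle lets a decoder tell ASCII from Korean by the high bit alone. The following Python sketch is illustrative only: the function name `split_euc_kr` and its output format are assumptions, and it handles only ASCII bytes and two-byte codes in the range 0xA1-0xFE.

```python
def split_euc_kr(data: bytes) -> list:
    """Split EUC-KR bytes into ('ascii', byte) and ('ksx1001', 7-bit pair) items.

    Hypothetical helper for illustration; any byte sequence outside plain
    ASCII and well-formed two-byte codes raises ValueError.
    """
    out = []
    i = 0
    while i < len(data):
        if data[i] < 0x80:
            # High bit clear: an ordinary 7-bit ASCII byte.
            out.append(("ascii", data[i:i + 1]))
            i += 1
        else:
            # High bit set: first byte of a two-byte character.
            pair = data[i:i + 2]
            if len(pair) != 2 or not all(0xA1 <= b <= 0xFE for b in pair):
                raise ValueError("malformed two-byte sequence at offset %d" % i)
            # Subtracting 128 recovers the 7-bit code.
            out.append(("ksx1001", bytes(b - 0x80 for b in pair)))
            i += 2
    return out


# "A" followed by the Hangul syllable U+D55C, encoded with Python's euc_kr codec
parts = split_euc_kr("A\ud55c".encode("euc_kr"))
```

Here `parts` holds one ASCII item followed by one Korean item whose two bytes are the 7-bit values 0x47 0x51.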

Other encodings

EUC-CN and EUC-TW, for simplified and traditional Chinese respectively, also exist but are used very little.