Code page 950
Code page 950 is Microsoft's implementation of the de facto standard Big5. The code page is not registered with IANA,[1] and hence, is not a standard to communicate information over the internet. The major difference between code page 950 and Big5 is the incorporation of some ETEN characters at F9D6-F9FE (碁, 銹, 裏, 墻, 恒, 粧, and 嫺) and 34 box drawing characters and block elements.
It was updated in 2000 with the addition of the euro sign (€), at the double-byte code 0xA3E1. IBM did not change CCSID 950,[2] and refers to the euro sign update as CCSID 1370 (which includes both single-byte (0x80) and double-byte euro signs)[3] or CCSID 1373 (which includes only the double-byte euro sign),[4] but Microsoft Windows refers to the version with a double-byte euro sign as a Code page 950 update.
CCSID 950 comprises single byte code page 1114 (CCSID 1114) and double byte code page 947 (CCSID 947).[2][5][6]
Euro sign extended CCSID 1370 comprises single byte code page 1114 (CCSID 5210) and double byte code page 947 (CCSID 21427).[3][7][8]
Single byte codes
Letter Number Punctuation Symbol Other Undefined
The rest are parts of a double byte sequence.
Private Use Area usage
Big5 range | Unicode range | Formula[12] |
---|---|---|
81 40–8D FE | U+EEB8–U+F6B0 | 0xeeb8 + (157 * (H-0x81)) + (L<0x80)?(L-0x40):(L-0x62)
|
8E 40–A0 FE | U+E311–U+EEB7 | 0xe311 + (157 * (H-0x8e)) + (L<0x80)?(L-0x40):(L-0x62)
|
C6 A1–C8 FE | U+F6B1–U+F848 | 0xf672 + (157 * (H-0xc6)) + (L<0x80)?(L-0x40):(L-0x62)
|
FA 40–FE FE | U+E000–U+E310 | 0xe000 + (157 * (H-0xfa)) + (L<0x80)?(L-0x40):(L-0x62)
|
This mapping is also used in HKSCS where a given glyph is not yet found in the Unicode revision specified.[13]
See also
- LMBCS-18
- Code page 951, a Microsoft hack for replacing cp950 with an HKSCS-enabled version on Windows XP
References
- ^ "Character Sets". IANA — Protocol Registries.
- ^ a b "CCSID 950 information document". Archived from the original on 2014-12-02.
- ^ a b "CCSID 1370 information document". Archived from the original on 2016-03-27.
- ^ "Converter Explorer: ibm-1373_P100-2002". ICU Demonstration. International Components for Unicode.
- ^ "CCSID 1114 information document". Archived from the original on 2016-03-27.
- ^ "CCSID 947 information document". Archived from the original on 2014-12-01.
- ^ "CCSID 5210 information document". Archived from the original on 2014-11-29.
- ^ "CCSID 21427 information document". Archived from the original on 2016-03-27.
- ^ Code Page CPGID 01114 (pdf) (PDF), IBM
- ^ Code Page CPGID 01114 (txt), IBM
- ^ "Windows Best Fit Chart: CP950". unicode.org. Retrieved 13 September 2016.
- ^ "Big5". Kanji Database. Retrieved 13 September 2016.
- ^ "Big5-HKSCS:2008". Archived from the original on 2016-09-13.
External links
- Microsoft's Reference for Code Page 950
- Mapping of Code Page 950 to Unicode
- International Components for Unicode (ICU) mapping files: windows-950-2000.ucm, ibm-950_P110-1999.ucm, ibm-1373_P100-2002.ucm