Code page 950
Code page 950 is Microsoft's implementation of the de facto standard Big5. The code page is not registered with IANA,[1] and hence, is not a standard to communicate information over the internet. The major difference between code page 950 and Big5 is the incorporation of some ETEN characters at F9D6-F9FE (碁, 銹, 裏, 墻, 恒, 粧, and 嫺) and 34 box drawing characters and block elements. It was updated in 2000 with the addition of the euro sign (€), at 0xA3E1. This update is called CCSID 1370 by IBM[2] (as IBM did not change CCSID 950),[3] but Microsoft Windows refers to it as a Code page 950 update.
CCSID 950 is comprised of single byte code page 1114 (CCSID 1114) and double byte code page 947 (CCSID 947).[3][4][5]
Euro sign extended CCSID 1370 is comprised of single byte code page 1114 (CCSID 5210) and double byte code page 947 (CCSID 21427).[2][6][7]
Single byte codes
Letter Number Punctuation Symbol Other Undefined
The rest are parts of a double byte sequence.
Private Use Area usage
Big5 range | Unicode range | Formula[11] |
---|---|---|
81 40–8D FE | U+EEB8–U+F6B0 | 0xeeb8 + (157 * (H-0x81)) + (L<0x80)?(L-0x40):(L-0x62)
|
8E 40–A0 FE | U+E311–U+EEB7 | 0xe311 + (157 * (H-0x8e)) + (L<0x80)?(L-0x40):(L-0x62)
|
C6 A1–C8 FE | U+F6B1–U+F848 | 0xf672 + (157 * (H-0xc6)) + (L<0x80)?(L-0x40):(L-0x62)
|
FA 40–FE FE | U+E000–U+E310 | 0xe000 + (157 * (H-0xfa)) + (L<0x80)?(L-0x40):(L-0x62)
|
This mapping is also used in HKSCS where a given glyph is not yet found in the Unicode revision specified.[12]
See also
- LMBCS-18
- Code page 951, a Microsoft hack for replacing cp950 with an HKSCS-enabled version on Windows XP
References
- ^ "Character Sets". IANA — Protocol Registries.
- ^ a b "CCSID 1370 information document". Archived from the original on 2016-03-27.
- ^ a b "CCSID 950 information document". Archived from the original on 2014-12-02.
- ^ "CCSID 1114 information document". Archived from the original on 2016-03-27.
- ^ "CCSID 947 information document". Archived from the original on 2016-03-27.
{{cite web}}
:|archive-date=
/|archive-url=
timestamp mismatch; 2014-12-01 suggested (help) - ^ "CCSID 5210 information document". Archived from the original on 2016-03-27.
{{cite web}}
:|archive-date=
/|archive-url=
timestamp mismatch; 2014-11-29 suggested (help) - ^ "CCSID 21427 information document". Archived from the original on 2016-03-27.
- ^ Code Page CPGID 01114 (pdf) (PDF), IBM
- ^ Code Page CPGID 01114 (txt), IBM
- ^ "Windows Best Fit Chart: CP950". unicode.org. Retrieved 13 September 2016.
- ^ "Big5". Kanji Database. Retrieved 13 September 2016.
- ^ "Big5-HKSCS:2008". Archived from the original on 2016-09-13.
External links
- Microsoft's Reference for Code Page 950
- Mapping of Code Page 950 to Unicode
- International Components for Unicode (ICU) mapping files: windows-950-2000.ucm and ibm-950_P110-1999.ucm