Halfwidth and Fullwidth Forms (Unicode block)

Halfwidth and Fullwidth Forms
Halfwidth and Fullwidth Forms
Range	U+FF00..U+FFEF; (240 code points)
Plane	BMP
Scripts	Hangul (52 char.) ; Katakana (55 char.) ; Latin (52 char.) ; Common (66 char.)
Symbol sets	Variant width characters
Assigned	225 code points
Unused	15 reserved code points
Unicode version history
1.0.0 (1991)	216 (+216)
1.0.1 (1992)	223 (+7)
3.2 (2002)	225 (+2)
Unicode documentation
	Code chart ∣ Web page
	Note:

Halfwidth and Fullwidth Forms is the name of a Unicode block U+FF00–FFEF, provided so that older encodings containing both halfwidth and fullwidth characters can have lossless translation to/from Unicode. It is the last of the Basic Multilingual Plane excepting the short Specials block at U+FFF0–FFFF.

Range U+FF01–FF5E reproduces the characters of ASCII 21 to 7E as fullwidth forms. U+FF00 does not correspond to a fullwidth ASCII 20 (space character), since that role is already fulfilled by U+3000 "ideographic space".

Range U+FF65–FF9F encodes halfwidth forms of katakana in a transposition of their JIS X 0201 layout – see half-width kana.

The range U+FFA0–FFDC encodes halfwidth forms of compatibility jamo characters for Hangul, in a transposition of their 1974 standard layout. It is used in the mapping of some IBM encodings for Korean.^[4]

Range U+FFE0–FFEE includes fullwidth and halfwidth symbols.

Block

Halfwidth and Fullwidth Forms^[1]^[2] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+FF0x		！	＂	＃	＄	％	＆	＇	（	）	＊	＋	，	－	．	／
U+FF1x	０	１	２	３	４	５	６	７	８	９	：	；	＜	＝	＞	？
U+FF2x	＠	Ａ	Ｂ	Ｃ	Ｄ	Ｅ	Ｆ	Ｇ	Ｈ	Ｉ	Ｊ	Ｋ	Ｌ	Ｍ	Ｎ	Ｏ
U+FF3x	Ｐ	Ｑ	Ｒ	Ｓ	Ｔ	Ｕ	Ｖ	Ｗ	Ｘ	Ｙ	Ｚ	［	＼	］	＾	＿
U+FF4x	｀	ａ	ｂ	ｃ	ｄ	ｅ	ｆ	ｇ	ｈ	ｉ	ｊ	ｋ	ｌ	ｍ	ｎ	ｏ
U+FF5x	ｐ	ｑ	ｒ	ｓ	ｔ	ｕ	ｖ	ｗ	ｘ	ｙ	ｚ	｛	｜	｝	～	｟
U+FF6x	｠	｡	｢	｣	､	･	ｦ	ｧ	ｨ	ｩ	ｪ	ｫ	ｬ	ｭ	ｮ	ｯ
U+FF7x	ｰ	ｱ	ｲ	ｳ	ｴ	ｵ	ｶ	ｷ	ｸ	ｹ	ｺ	ｻ	ｼ	ｽ	ｾ	ｿ
U+FF8x	ﾀ	ﾁ	ﾂ	ﾃ	ﾄ	ﾅ	ﾆ	ﾇ	ﾈ	ﾉ	ﾊ	ﾋ	ﾌ	ﾍ	ﾎ	ﾏ
U+FF9x	ﾐ	ﾑ	ﾒ	ﾓ	ﾔ	ﾕ	ﾖ	ﾗ	ﾘ	ﾙ	ﾚ	ﾛ	ﾜ	ﾝ	ﾞ	ﾟ
U+FFAx	HW HF	ﾡ	ﾢ	ﾣ	ﾤ	ﾥ	ﾦ	ﾧ	ﾨ	ﾩ	ﾪ	ﾫ	ﾬ	ﾭ	ﾮ	ﾯ
U+FFBx	ﾰ	ﾱ	ﾲ	ﾳ	ﾴ	ﾵ	ﾶ	ﾷ	ﾸ	ﾹ	ﾺ	ﾻ	ﾼ	ﾽ	ﾾ
U+FFCx			ￂ	ￃ	ￄ	ￅ	ￆ	ￇ			ￊ	ￋ	ￌ	ￍ	ￎ	ￏ
U+FFDx			ￒ	ￓ	ￔ	ￕ	ￖ	ￗ			ￚ	ￛ	ￜ
U+FFEx	￠	￡	￢	￣	￤	￥	￦		￨	￩	￪	￫	￬	￭	￮
Notes 1.^ As of Unicode version 16.0 2.^ Grey areas indicate non-assigned code points

The block has variation sequences defined for East Asian punctuation positional variants.^[5]^[6] They use U+FE00 VARIATION SELECTOR-1 (VS01) and U+FE01 VARIATION SELECTOR-2 (VS02):

Variation sequences for punctuation alignment
U+	FF01	FF0C	FF0E	FF1A	FF1B	FF1F	Description
base code point	！	，	．	：	；	？
base + VS01	！︀	，︀	．︀	：︀	；︀	？︀	corner-justified form
base + VS02	！︁	，︁	．︁	：︁	；︁	？︁	centered form

An additional variant is defined for a fullwidth zero with a short diagonal stroke: U+FF10 FULLWIDTH DIGIT ZERO, U+FE00 VS1 (０︀).^[7]^[6]

History

The following Unicode-related documents record the purpose and process of defining specific characters in the Halfwidth and Fullwidth Forms block:

Version	Final code points^[a]	Count	L2 ID	WG2 ID	Document
1.0.0	U+FF01..FF5E, FF61..FFBE, FFC2..FFC7, FFCA..FFCF, FFD2..FFD7, FFDA..FFDC, FFE0..FFE6	216			(to be determined)
				N4403 (pdf, doc)	Umamaheswaran, V. S. (2014-01-28), "Resolution M61.01", Unconfirmed minutes of WG 2 meeting 61, Holiday Inn, Vilnius, Lithuania; 2013-06-10/14
			L2/17-056		Lunde, Ken (2017-02-13), Proposal to add standardized variation sequences
			L2/17-436		Lunde, Ken (2018-01-21), Proposal to add standardized variation sequences for fullwidth East Asian punctuation
			L2/18-039		Anderson, Deborah; Whistler, Ken; Pournader, Roozbeh; Moore, Lisa; Liang, Hai; Cook, Richard (2018-01-19), "24. Fullwidth East Asian Punctuation", Recommendations to UTC #154 January 2018 on Script Proposals
			L2/17-362		Moore, Lisa (2018-02-02), "B.4.1 New Proposal to add standardized variation sequence for U+FF10 FULL WIDTH DIGIT ZERO", UTC #153 Minutes
			L2/18-115		Moore, Lisa (2018-05-09), "Consensus 154-C17", UTC #155 Minutes, Add 16 standardized variation sequences based on L2/17-436R, for Unicode 12.0.
			L2/19-055		Iancu, Laurențiu (2019-01-14), Proposed Changes in the Segmentation Property Values for Fullwidth Digits
			L2/19-008		Moore, Lisa (2019-02-08), "B.11.11.1.2 Proposed changes in the segmentation property values for fullwidth digits", UTC #158 Minutes
1.0.1	U+FFE8..FFEE	7			(to be determined)
3.2	U+FF5F..FF60	2	L2/99-052		Freytag, Asmus (1999-02-05), The math pieces from the symbol font
			L2/01-033		Karlsson, Kent; Freytag, Asmus (2001-01-16), Disunify braces/brackets for math, computing science, and Z notation from similar-looking CJK braces/brackets
			L2/01-159	N2344	Ad-hoc report on Mathematical Symbols, 2001-04-03
			L2/01-157	N2345R	Karlsson, Kent (2001-04-04), Proposal to disunify certain fencing CJK punctuation marks from similar-looking Math fences
			L2/01-168		Whistler, Ken (2001-04-10), Bracket Disunification & Normalization Hell
			L2/01-012R		Moore, Lisa (2001-05-21), "Disunifying Braces and Brackets", Minutes UTC #86 in Mountain View, Jan 2001
			L2/01-223		Suignard, Michel (2001-05-23), Discussion of Issues Regarding Bracket Disunification
			L2/01-184R		Moore, Lisa (2001-06-18), "Motion 87-M21", Minutes from the UTC/L2 meeting, Reverse the decision made in motion 86-M6 not to disunify brackets.
			L2/01-317		Suignard, Michel (2001-08-14), Bracket Disunification & Normalization
			L2/01-295R		Moore, Lisa (2001-11-06), "Bracket Disunification and Normalization", Minutes from the UTC/L2 meeting #88
			L2/02-154	N2403	Umamaheswaran, V. S. (2002-04-22), "Resolution M41.1", Draft minutes of WG 2 meeting 41, Hotel Phoenix, Singapore, 2001-10-15/19
^ Proposed code points and characters names may differ from final code points and names

References

^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Archived (PDF) from the original on 2016-07-02. Retrieved 2016-07-09.
^ "Unicode character database". The Unicode Standard. Archived from the original on 2016-07-10. Retrieved 2016-07-09.
^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Archived from the original on 2016-06-29. Retrieved 2016-07-09.
^ "ICU Demonstration - Converter Explorer". demo.icu-project.org. Retrieved 7 May 2018.
^ Lunde, Ken (2018-01-21). "L2/17-436: Proposal to add standardized variation sequences for fullwidth East Asian punctuation" (PDF).
^ ^a ^b "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.
^ Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray (2015-10-30). "L2/15-268: Proposal to Represent the Slashed Zero Variant of Empty Set" (PDF).

[final-8] Proposed code points and characters names may differ from final code points and names

[1] "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Archived (PDF) from the original on 2016-07-02. Retrieved 2016-07-09.

[2] "Unicode character database". The Unicode Standard. Archived from the original on 2016-07-10. Retrieved 2016-07-09.

[3] "Enumerated Versions of The Unicode Standard". The Unicode Standard. Archived from the original on 2016-06-29. Retrieved 2016-07-09.

[ibm933-4] "ICU Demonstration - Converter Explorer". demo.icu-project.org. Retrieved 7 May 2018.

[5] Lunde, Ken (2018-01-21). "L2/17-436: Proposal to add standardized variation sequences for fullwidth East Asian punctuation" (PDF).

[stdvar-6] "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.

[7] Beeton, Barbara; Freytag, Asmus; Iancu, Laurențiu; Sargent, Murray (2015-10-30). "L2/15-268: Proposal to Represent the Slashed Zero Variant of Empty Set" (PDF).

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[a]

Block

History

See also

References