Template:Punctuation marks in Unicode/doc
Appearance
General Catagory Px, punctuation
- Listed are all code points in Genetral Category P:
P, Punctuation mark | |||||
Pc | Punctuation, connector | Graphic | Character | 10 | Includes "_" underscore |
Pd | Punctuation, dash | Graphic | Character | 25 | Includes several hyphen characters |
Ps | Punctuation, open | Graphic | Character | 75 | Opening bracket characters |
Pe | Punctuation, close | Graphic | Character | 77 | Closing bracket characters |
Pi | Punctuation, initial quote | Graphic | Character | 12 | Opening quotation mark. Does not include the ASCII "neutral" quotation mark. May behave like Ps or Pe depending on usage |
Pf | Punctuation, final quote | Graphic | Character | 10 | Closing quotation mark. May behave like Ps or Pe depending on usage |
Po | Punctuation, other | Graphic | Character | 593 | |
P | Punctuation | 798 | Total |
- todo: check numbers; big table has 798 cp entries
- Numbers as of version 14.0, by § BabelMap
Parameters
- {{Punctuation marks in Unicode/row}}
- {{Punctuation marks in Unicode/row-pair}} -- for pair Ps-Pe (brackets); Pi-Pf (quotes)
- cp = code point, hex. Determines Character as shown, Name, anchors. Use normalised "000A" (uppercase) notation.
- link = link to article, will link from (first) name
- gencat = Generic Category, Px by list definition
- script = Character script property
- style =
large
→ double cell height, for example § 104C: .၌ - technote = maintenance info: is pair, is unbalanced pair, is vertical pair
- For pairs (Pi-Pf quotes; Ps-Pe brackets)
- Use {{../row-pair}}
- cp2 = second code point
- gencat2 = second generic category (expected: Pf or Pe)
- script2, link2 are not present
{{Punctuation marks in Unicode/row | cp = | link = <!-- name wikilink --> | gencat = | script = | style = <!-- =large -- for Javanese etc.: double cell height --> | technote =<!-- maintenance info --> | cp2 = <!-- ../row-pair| --> | gencat2 = <!-- ../row-pair| --> }}
Note: |char=
is for checks only, might be removed later on.
Anchors
- All marks are anchored twice:
U+00B6
,¶
(= as entity value and as character)
- When possible, notation
U+00B6
is preferred (first code point of a pair). This resaults in nicer arrival (top of tablerow)
See also