Template:Punctuation marks in Unicode/doc
Appearance
General Catagory Px, punctuation
- Listed are all code points in Genetral Category P:
| P, Punctuation mark | |||||
| Pc | Punctuation, connector | Graphic | Character | 10 | Includes "_" underscore |
| Pd | Punctuation, dash | Graphic | Character | 25 | Includes several hyphen characters |
| Ps | Punctuation, open | Graphic | Character | 75 | Opening bracket characters |
| Pe | Punctuation, close | Graphic | Character | 77 | Closing bracket characters |
| Pi | Punctuation, initial quote | Graphic | Character | 12 | Opening quotation mark. Does not include the ASCII "neutral" quotation mark. May behave like Ps or Pe depending on usage |
| Pf | Punctuation, final quote | Graphic | Character | 10 | Closing quotation mark. May behave like Ps or Pe depending on usage |
| Po | Punctuation, other | Graphic | Character | 593 | |
| P | Punctuation | 798 | Total | ||
- todo: check numbers; big table has 798 cp entries
- Numbers as of version 14.0, by § BabelMap
TOC for this table
- {{Punctuation marks in Unicode/TOC}} -- local, no pagename provided
| By Unicode General Category 'P' ()
|
| § Pd, dash |
| § Ps-Pe, start–end (open–close brackets) |
| § Pi-Pf, initial–final quote |
| § Pc, connector |
| § Po, other |
Parameters
- {{Punctuation marks in Unicode/row}}
- {{Punctuation marks in Unicode/row-pair}} -- for pair Ps-Pe (brackets); Pi-Pf (quotes)
|cp=code point, hex. Determines Character as shown, Name, anchors. Use normalised "000A" (uppercase) notation.|link=link to article, will link from (first) name; optional|gencat=Generic Category, Px by list definition|script=character script property|style=large→ double cell height, for example § U+104C: ၌|technote=maintenance info: is pair, is unbalanced pair, is vertical pair|style, technote=are shown in {{/testcases}}
For pairs (Pi-Pf quotes; Ps-Pe brackets); use {{../row-pair}}:
|cp2=second code point|gencat2=second generic category (expected: Pf or Pe)|unused, do not existscript2, link2=|char=present for tests, could be removed when stable
{{Punctuation marks in Unicode/row
| cp =
| link =
| gencat =
| script =
| style =
| technote =
| cp2 =
| gencat2 =
}}
Note: |char= is for checks only, might be removed later on.
|TOC=yeswill show table-TOC
|gencat=
Anchors
- All marks are anchored twice:
U+00B6,¶(= as entity value and as character)
- When possible, notation
U+00B6is preferred (first code point of a pair). This resaults in nicer arrival (top of tablerow)