Jump to content

Template:Punctuation marks in Unicode/doc

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by DePiep (talk | contribs) at 09:36, 10 April 2022 (Parameters). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

General Catagory Px, punctuation

Listed are all code points in Genetral Category P:
P, Punctuation mark
Pc Punctuation, connector Graphic Character 10 Includes "_" underscore
Pd Punctuation, dash Graphic Character 25 Includes several hyphen characters
Ps Punctuation, open Graphic Character 75 Opening bracket characters
Pe Punctuation, close Graphic Character 77 Closing bracket characters
Pi Punctuation, initial quote Graphic Character 12 Opening quotation mark. Does not include the ASCII "neutral" quotation mark. May behave like Ps or Pe depending on usage
Pf Punctuation, final quote Graphic Character 10 Closing quotation mark. May behave like Ps or Pe depending on usage
Po Punctuation, other Graphic Character 593
P Punctuation 798 Total
todo: check numbers; big table has 798 cp entries
Numbers as of version 14.0, by § BabelMap

Parameters

{{Punctuation marks in Unicode/row}}
{{Punctuation marks in Unicode/row-pair}} -- for pair Ps-Pe (brackets); Pi-Pf (quotes)
|cp=        code point, hex. Determines Character as shown, Name, anchors. Use normalised "000A" (uppercase) notation.
|link=      link to article, will link from (first) name; optional
|gencat=    Generic Category, Px by list definition
|script=    Character script property
|style=     large → double cell height, for example § U+104C:
|technote=  maintenance info: is pair, is unbalanced pair, is vertical pair
For pairs (Pi-Pf quotes; Ps-Pe brackets), Use {{../row-pair}}:
|cp2=       second code point
|gencat2=   second generic category (expected: Pf or Pe)
|script2, link2= unused, do not exist
|char= present for tests, could be removed when stable
{{Punctuation marks in Unicode/row
| cp       =
| link     =
| gencat   =
| script   =
| style    =
| technote =
| cp2      =
| gencat2  =
}}

Note: |char= is for checks only, might be removed later on.

Anchors

  • All marks are anchored twice: U+00B6, ¶ (= as entity value and as character)
cp=00B6§ U+00B6, [entity ¶] § ¶, char [¶]: § ¶
Template:Punctuation marks in Unicode#U+00B6
cp=2995§ U+2995, entity: § ⦕, char: § ⦕
Template:Punctuation marks in Unicode#U+2995
cp=2996§ U+2996, entity: § ⦖, char: § ⦖
Template:Punctuation marks in Unicode#U+2996
When possible, notation U+00B6 is preferred (first code point of a pair). This resaults in nicer arrival (top of tablerow)

See also