Jump to content

Wikipedia:Naming conventions (Unicode) (draft)

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by MalnadachBot (talk | contribs) at 17:18, 19 August 2021 (Fixed Lint errors in signatures. (Task 2)). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Right now this page just contains info on special characters that are likely to be interesting/usefull for en. Hopefully it can be expanded into a policy on their use.

Unicode provides an international standard which has the goal of providing the means to encode the text of every document people want to store on computers. This includes all scripts in active use today, many scripts known only by scholars, and symbols which do not strictly represent scripts, like mathematical, linguistic and APL symbols.

ASCII

C0 Controls and Basic Latin[a]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+000x NUL SOH STX ETX EOT ENQ ACK BEL  BS   HT   LF   VT   FF   CR   SO   SI 
U+001x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN  EM  SUB ESC  FS   GS   RS   US 
U+002x  SP  ! " # $ % & ' ( ) * + , - . /
U+003x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
U+004x @ A B C D E F G H I J K L M N O
U+005x P Q R S T U V W X Y Z [ \ ] ^ _
U+006x ` a b c d e f g h i j k l m n o
U+007x p q r s t u v w x y z { | } ~ DEL
  1. ^ As of Unicode version 16.0

These are our bread and butter and have been dealt with exhaustively elsewhere. Plugwash 28 June 2005 20:37 (UTC)

Latin-1

C1 Controls and Latin-1 Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+008x XXX XXX BPH NBH  IND NEL SSA ESA HTS HTJ VTS PLD PLU  RI   SS2 SS3
U+009x DCS PU1 PU2 STS CCH  MW  SPA EPA SOS XXX SCI  CSI   ST  OSC  PM  APC
U+00Ax NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯
U+00Bx ° ± ² ³ ´ µ · ¸ ¹ º » ¼ ½ ¾ ¿
U+00Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
U+00Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
U+00Ex à á â ã ä å æ ç è é ê ë ì í î ï
U+00Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
1.^ As of Unicode version 16.0

Same as above pretty much Plugwash 28 June 2005 20:41 (UTC)

The degree sign and the masculine ordinal indicator are often conflated, and are sometimes misused in place of the raised "o" in approximations of the numero sign. The feminine ordinal indicator is also occasionally misused as a superscript "1". The passability of such approximations is predicated on the output media being visual (screen, print) and the characteristics of certain fonts at certain sizes. In other contexts, the semantics of the text are changed when the wrong characters are used. Therefore, these characters should only ever be used for their intended purposes: use the degree sign to mean degrees, and the ordinal indicators as appropriate for the languages that need them.

Latin extended A

Default font   Unicode font
Latin Extended-A[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+010x Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď
U+011x Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ
U+012x Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į
U+013x İ ı IJ ij Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ
U+014x ŀ Ł ł Ń ń Ņ ņ Ň ň ʼn Ŋ ŋ Ō ō Ŏ ŏ
U+015x Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş
U+016x Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů
U+017x Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ
Notes
1.^ As of Unicode version 16.0
2.^ Unicode code point U+0149 is deprecated as of Unicode version 5.2
 
Latin Extended-A[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+010x Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď
U+011x Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ
U+012x Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į
U+013x İ ı IJ ij Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ
U+014x ŀ Ł ł Ń ń Ņ ņ Ň ň ʼn Ŋ ŋ Ō ō Ŏ ŏ
U+015x Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş
U+016x Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů
U+017x Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ
Notes
1.^ As of Unicode version 16.0
2.^ Unicode code point U+0149 is deprecated as of Unicode version 5.2

This is mostly letters with less common diacritics. One or two of these are already in article titles due to conversions from windows-1252 (most browsers interpret iso-8859-1 as windows-1252 and so that stuff got into article names here). The extra diacritics are probably a good thing as long as redirects are in place from the diacritic-less names and these are likely to be widely supported so I think it's pretty safe to have them in article titles. Plugwash 28 June 2005 20:47 (UTC)

Example: Paul Erdős. Arbor 6 July 2005 12:59 (UTC)

It is erroneous to refer to these as "less common" diacritics. Many of these are diacritics used for Eastern European languages, including Polish, Czech, Slovakian, Croatian, Slovenian, Serbian in Latin script, Hungarian, Turkish, Romanian, Latvian, Lithuanian, etc. and since the introduction of MediaWiki 1.5 they are already extensively used in article titles. The fact that Latin 1 is limited to Western European languages is just an artifact of the Cold War. -- Curps 15:05, 2 August 2005 (UTC)[reply]

Latin extended B

Default font   Unicode font
Latin Extended-B[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+018x ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə
U+019x Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ
U+01Ax Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư
U+01Bx ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ
U+01Cx ǀ ǁ ǂ ǃ DŽ Dž dž LJ Lj lj NJ Nj nj Ǎ ǎ Ǐ
U+01Dx ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ
U+01Ex Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ
U+01Fx ǰ DZ Dz dz Ǵ ǵ Ƕ Ƿ Ǹ ǹ Ǻ ǻ Ǽ ǽ Ǿ ǿ
U+020x Ȁ ȁ Ȃ ȃ Ȅ ȅ Ȇ ȇ Ȉ ȉ Ȋ ȋ Ȍ ȍ Ȏ ȏ
U+021x Ȑ ȑ Ȓ ȓ Ȕ ȕ Ȗ ȗ Ș ș Ț ț Ȝ ȝ Ȟ ȟ
U+022x Ƞ ȡ Ȣ ȣ Ȥ ȥ Ȧ ȧ Ȩ ȩ Ȫ ȫ Ȭ ȭ Ȯ ȯ
U+023x Ȱ ȱ Ȳ ȳ ȴ ȵ ȶ ȷ ȸ ȹ Ⱥ Ȼ ȼ Ƚ Ⱦ ȿ
U+024x ɀ Ɂ ɂ Ƀ Ʉ Ʌ Ɇ ɇ Ɉ ɉ Ɋ ɋ Ɍ ɍ Ɏ ɏ
Notes
1.^ As of Unicode version 16.0
 
Latin Extended-B[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+018x ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə
U+019x Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ
U+01Ax Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư
U+01Bx ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ
U+01Cx ǀ ǁ ǂ ǃ DŽ Dž dž LJ Lj lj NJ Nj nj Ǎ ǎ Ǐ
U+01Dx ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ
U+01Ex Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ
U+01Fx ǰ DZ Dz dz Ǵ ǵ Ƕ Ƿ Ǹ ǹ Ǻ ǻ Ǽ ǽ Ǿ ǿ
U+020x Ȁ ȁ Ȃ ȃ Ȅ ȅ Ȇ ȇ Ȉ ȉ Ȋ ȋ Ȍ ȍ Ȏ ȏ
U+021x Ȑ ȑ Ȓ ȓ Ȕ ȕ Ȗ ȗ Ș ș Ț ț Ȝ ȝ Ȟ ȟ
U+022x Ƞ ȡ Ȣ ȣ Ȥ ȥ Ȧ ȧ Ȩ ȩ Ȫ ȫ Ȭ ȭ Ȯ ȯ
U+023x Ȱ ȱ Ȳ ȳ ȴ ȵ ȶ ȷ ȸ ȹ Ⱥ Ȼ ȼ Ƚ Ⱦ ȿ
U+024x ɀ Ɂ ɂ Ƀ Ʉ Ʌ Ɇ ɇ Ɉ ɉ Ɋ ɋ Ɍ ɍ Ɏ ɏ
Notes
1.^ As of Unicode version 16.0

Here, U+018F is uppercase schwa, used in Azerbaijani. Note the lowercase schwa is U+0259 in the IPA section; the character U+01DD is "Latin small letter turned e", whose uppercase is U+018E, which is used in pan-Nigerian alphabets.

Also, U+01A0, U+01A1 is "o with horn" and U+01AF, U+01B0 is "u with horn", used in Vietnamese.

Also, U+01CD through U+01DC are a, i, o, u, u-umlaut with caron, used in Chinese pinyin for the third tone. Notice that Ě/ě (e with caron) is U+011A/U+011B and is in Latin extended A, since it is also a letter of Czech alphabet.

Also, U+0218 through U+021B are "s with comma below" and "t with comma below", used in Romanian. Fonts are sometimes not available for these, so "s with cedilla" and "t with cedilla" from the Latin extended A section are sometimes used instead of these. See Special Romanian Unicode characters -- Curps 21:10, 24 August 2005 (UTC)[reply]

out of interest do you know if the romanian wp has a policy on using comma below or cedilla? Plugwash 22:46, 24 August 2005 (UTC)[reply]
I'm pretty sure they use the cedilla forms, simply because my browser can't display the comma versions at Special Romanian Unicode characters and when I go to the http://ro.wikipedia.org/ I can see the letters. The comma versions only display in the above table because the whole table is set up to use special Unicode fonts rather than the default fonts. I edited the above table to show the default-font version alongside the Unicode font version. -- Curps 03:39, 25 August 2005 (UTC)[reply]

Latin extended additional

Default font   Unicode font
Latin Extended Additional[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1E0x
U+1E1x
U+1E2x
U+1E3x ḿ
U+1E4x
U+1E5x
U+1E6x
U+1E7x ṿ
U+1E8x
U+1E9x
U+1EAx
U+1EBx ế
U+1ECx
U+1EDx
U+1EEx
U+1EFx ỿ
Notes
1.^ As of Unicode version 16.0
 
Latin Extended Additional[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1E0x
U+1E1x
U+1E2x
U+1E3x ḿ
U+1E4x
U+1E5x
U+1E6x
U+1E7x ṿ
U+1E8x
U+1E9x
U+1EAx
U+1EBx ế
U+1ECx
U+1EDx
U+1EEx
U+1EFx ỿ
Notes
1.^ As of Unicode version 16.0

The range U+01EA0 through U+01EF9 is used for Vietnamese. -- Curps 21:10, 24 August 2005 (UTC)[reply]

IPA extensions

Default font Unicode font IPA Font (for comparison purposes only)
IPA Extensions[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+025x ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+026x ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
U+027x ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+028x ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
U+029x ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02Ax ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
Notes
1.^ As of Unicode version 16.0
IPA Extensions[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+025x ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+026x ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
U+027x ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+028x ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
U+029x ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02Ax ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
Notes
1.^ As of Unicode version 16.0
IPA Extensions[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+025x ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+026x ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
U+027x ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+028x ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
U+029x ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02Ax ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
Notes
1.^ As of Unicode version 16.0

Used for IPA in body text (with a special template to persuade IE to render them right) probably not appropriate for article titles here. Plugwash 28 June 2005 20:48 (UTC)

Note however that schwa (U+0259) is also a letter in the Azerbaijani alphabet. Lowercase schwa is in the IPA section, but uppercase (U+018F) is in the Latin B section. -- Curps 15:05, 2 August 2005 (UTC)[reply]

Spacing modifier letters

Default font   Unicode font
Spacing Modifier Letters[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+02Bx ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ʹ ʺ ʻ ʼ ʽ ʾ ʿ
U+02Cx ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ
U+02Dx ː ˑ ˒ ˓ ˔ ˕ ˖ ˗ ˘ ˙ ˚ ˛ ˜ ˝ ˞ ˟
U+02Ex ˠ ˡ ˢ ˣ ˤ ˥ ˦ ˧ ˨ ˩ ˪ ˫ ˬ ˭ ˮ ˯
U+02Fx ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Notes
1.^ As of Unicode version 16.0
 
Spacing Modifier Letters[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+02Bx ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ʹ ʺ ʻ ʼ ʽ ʾ ʿ
U+02Cx ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ
U+02Dx ː ˑ ˒ ˓ ˔ ˕ ˖ ˗ ˘ ˙ ˚ ˛ ˜ ˝ ˞ ˟
U+02Ex ˠ ˡ ˢ ˣ ˤ ˥ ˦ ˧ ˨ ˩ ˪ ˫ ˬ ˭ ˮ ˯
U+02Fx ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Notes
1.^ As of Unicode version 16.0

I am not formatting this correctly, but there is at least one interesting item in this code block:

  • MODIFIER LETTER TURNED COMMA (U+02BB): used in Hawaiian, where it is called ʻOkina
Unfortunately, this character does not seem to be available in either the default or Unicode fonts on Windows. -- Curps 21:38, 4 September 2005 (UTC)[reply]
Works fine on mine, although I couldn't tell you precisely which font is being so obliging. —Phil | Talk 14:14, 14 October 2005 (UTC)[reply]

Combining diacritical marks

Default font   Unicode font
Combining Diacritical Marks[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+030x ◌̀ ◌́ ◌̂ ◌̃ ◌̄ ◌̅ ◌̆ ◌̇ ◌̈ ◌̉ ◌̊ ◌̋ ◌̌ ◌̍ ◌̎ ◌̏
U+031x ◌̐ ◌̑ ◌̒ ◌̓ ◌̔ ◌̕ ◌̖ ◌̗ ◌̘ ◌̙ ◌̚ ◌̛ ◌̜ ◌̝ ◌̞ ◌̟
U+032x ◌̠ ◌̡ ◌̢ ◌̣ ◌̤ ◌̥ ◌̦ ◌̧ ◌̨ ◌̩ ◌̪ ◌̫ ◌̬ ◌̭ ◌̮ ◌̯
U+033x ◌̰ ◌̱ ◌̲ ◌̳ ◌̴ ◌̵ ◌̶ ◌̷ ◌̸ ◌̹ ◌̺ ◌̻ ◌̼ ◌̽ ◌̾ ◌̿
U+034x ◌̀ ◌́ ◌͂ ◌̓ ◌̈́ ◌ͅ ◌͆ ◌͇ ◌͈ ◌͉ ◌͊ ◌͋ ◌͌ ◌͍ ◌͎  CGJ 
U+035x ◌͐ ◌͑ ◌͒ ◌͓ ◌͔ ◌͕ ◌͖ ◌͗ ◌͘ ◌͙ ◌͚ ◌͛ ◌͜◌ ◌͝◌ ◌͞◌ ◌͟◌
U+036x ◌͠◌ ◌͡◌ ◌͢◌ ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ
Notes
1.^ As of Unicode version 16.0
 
Combining Diacritical Marks[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+030x ◌̀ ◌́ ◌̂ ◌̃ ◌̄ ◌̅ ◌̆ ◌̇ ◌̈ ◌̉ ◌̊ ◌̋ ◌̌ ◌̍ ◌̎ ◌̏
U+031x ◌̐ ◌̑ ◌̒ ◌̓ ◌̔ ◌̕ ◌̖ ◌̗ ◌̘ ◌̙ ◌̚ ◌̛ ◌̜ ◌̝ ◌̞ ◌̟
U+032x ◌̠ ◌̡ ◌̢ ◌̣ ◌̤ ◌̥ ◌̦ ◌̧ ◌̨ ◌̩ ◌̪ ◌̫ ◌̬ ◌̭ ◌̮ ◌̯
U+033x ◌̰ ◌̱ ◌̲ ◌̳ ◌̴ ◌̵ ◌̶ ◌̷ ◌̸ ◌̹ ◌̺ ◌̻ ◌̼ ◌̽ ◌̾ ◌̿
U+034x ◌̀ ◌́ ◌͂ ◌̓ ◌̈́ ◌ͅ ◌͆ ◌͇ ◌͈ ◌͉ ◌͊ ◌͋ ◌͌ ◌͍ ◌͎  CGJ 
U+035x ◌͐ ◌͑ ◌͒ ◌͓ ◌͔ ◌͕ ◌͖ ◌͗ ◌͘ ◌͙ ◌͚ ◌͛ ◌͜◌ ◌͝◌ ◌͞◌ ◌͟◌
U+036x ◌͠◌ ◌͡◌ ◌͢◌ ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ
Notes
1.^ As of Unicode version 16.0

Notice how the marks show up now they have something to combine with… —Phil | Talk 10:49, 24 October 2005 (UTC)[reply]

Greek

Default font Unicode font Polytonic font (for comparison purposes)
Greek and Coptic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+037x Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͺ ͻ ͼ ͽ ; Ϳ
U+038x ΄ ΅ Ά · Έ Ή Ί Ό Ύ Ώ
U+039x ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03Ax Π Ρ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί
U+03Bx ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03Cx π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ
U+03Dx ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03Ex Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ
U+03Fx ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
Greek and Coptic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+037x Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͺ ͻ ͼ ͽ ; Ϳ
U+038x ΄ ΅ Ά · Έ Ή Ί Ό Ύ Ώ
U+039x ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03Ax Π Ρ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί
U+03Bx ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03Cx π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ
U+03Dx ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03Ex Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ
U+03Fx ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
Greek and Coptic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+037x Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͺ ͻ ͼ ͽ ; Ϳ
U+038x ΄ ΅ Ά · Έ Ή Ί Ό Ύ Ώ
U+039x ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03Ax Π Ρ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί
U+03Bx ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03Cx π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ
U+03Dx ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03Ex Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ
U+03Fx ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Already used heavily for math type stuff but probably not too appropriate for article titles in English. Plugwash 28 June 2005 20:38 (UTC)

Well, here are some English articles that could use those letters in the title. Some of these articles claim to have the “wrong title due to technical limitations”: Pi, C omega, Omega constant, Chi-squared distribution, Gamma function, Cronbach's alpha, Beta particle, Beta distribution, and many, many more. Arbor 6 July 2005 19:07 (UTC)

Also some star names such as α And as a redirect for Alpha Andromedae. But here we run into the "initial letter is capitalized" issue, which applies to Greek letters too. -- Curps 04:44, 25 August 2005 (UTC)[reply]

Greek extended

Default font Unicode font Polytonic font (for comparison purposes)
Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Cyrillic

Default font   Unicode font
Cyrillic[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+040x Ѐ Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ Ѝ Ў Џ
U+041x А Б В Г Д Е Ж З И Й К Л М Н О П
U+042x Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
U+043x а б в г д е ж з и й к л м н о п
U+044x р с т у ф х ц ч ш щ ъ ы ь э ю я
U+045x ѐ ё ђ ѓ є ѕ і ї ј љ њ ћ ќ ѝ ў џ
U+046x Ѡ ѡ Ѣ ѣ Ѥ ѥ Ѧ ѧ Ѩ ѩ Ѫ ѫ Ѭ ѭ Ѯ ѯ
U+047x Ѱ ѱ Ѳ ѳ Ѵ ѵ Ѷ ѷ Ѹ ѹ Ѻ ѻ Ѽ ѽ Ѿ ѿ
U+048x Ҁ ҁ ҂ ◌҃ ◌҄ ◌҅ ◌҆ ◌҇ ◌҈ ◌҉ Ҋ ҋ Ҍ ҍ Ҏ ҏ
U+049x Ґ ґ Ғ ғ Ҕ ҕ Җ җ Ҙ ҙ Қ қ Ҝ ҝ Ҟ ҟ
U+04Ax Ҡ ҡ Ң ң Ҥ ҥ Ҧ ҧ Ҩ ҩ Ҫ ҫ Ҭ ҭ Ү ү
U+04Bx Ұ ұ Ҳ ҳ Ҵ ҵ Ҷ ҷ Ҹ ҹ Һ һ Ҽ ҽ Ҿ ҿ
U+04Cx Ӏ Ӂ ӂ Ӄ ӄ Ӆ ӆ Ӈ ӈ Ӊ ӊ Ӌ ӌ Ӎ ӎ ӏ
U+04Dx Ӑ ӑ Ӓ ӓ Ӕ ӕ Ӗ ӗ Ә ә Ӛ ӛ Ӝ ӝ Ӟ ӟ
U+04Ex Ӡ ӡ Ӣ ӣ Ӥ ӥ Ӧ ӧ Ө ө Ӫ ӫ Ӭ ӭ Ӯ ӯ
U+04Fx Ӱ ӱ Ӳ ӳ Ӵ ӵ Ӷ ӷ Ӹ ӹ Ӻ ӻ Ӽ ӽ Ӿ ӿ
Notes
1.^ As of Unicode version 16.0
 
Cyrillic[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+040x Ѐ Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ Ѝ Ў Џ
U+041x А Б В Г Д Е Ж З И Й К Л М Н О П
U+042x Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
U+043x а б в г д е ж з и й к л м н о п
U+044x р с т у ф х ц ч ш щ ъ ы ь э ю я
U+045x ѐ ё ђ ѓ є ѕ і ї ј љ њ ћ ќ ѝ ў џ
U+046x Ѡ ѡ Ѣ ѣ Ѥ ѥ Ѧ ѧ Ѩ ѩ Ѫ ѫ Ѭ ѭ Ѯ ѯ
U+047x Ѱ ѱ Ѳ ѳ Ѵ ѵ Ѷ ѷ Ѹ ѹ Ѻ ѻ Ѽ ѽ Ѿ ѿ
U+048x Ҁ ҁ ҂ ◌҃ ◌҄ ◌҅ ◌҆ ◌҇ ◌҈ ◌҉ Ҋ ҋ Ҍ ҍ Ҏ ҏ
U+049x Ґ ґ Ғ ғ Ҕ ҕ Җ җ Ҙ ҙ Қ қ Ҝ ҝ Ҟ ҟ
U+04Ax Ҡ ҡ Ң ң Ҥ ҥ Ҧ ҧ Ҩ ҩ Ҫ ҫ Ҭ ҭ Ү ү
U+04Bx Ұ ұ Ҳ ҳ Ҵ ҵ Ҷ ҷ Ҹ ҹ Һ һ Ҽ ҽ Ҿ ҿ
U+04Cx Ӏ Ӂ ӂ Ӄ ӄ Ӆ ӆ Ӈ ӈ Ӊ ӊ Ӌ ӌ Ӎ ӎ ӏ
U+04Dx Ӑ ӑ Ӓ ӓ Ӕ ӕ Ӗ ӗ Ә ә Ӛ ӛ Ӝ ӝ Ӟ ӟ
U+04Ex Ӡ ӡ Ӣ ӣ Ӥ ӥ Ӧ ӧ Ө ө Ӫ ӫ Ӭ ӭ Ӯ ӯ
U+04Fx Ӱ ӱ Ӳ ӳ Ӵ ӵ Ӷ ӷ Ӹ ӹ Ӻ ӻ Ӽ ӽ Ӿ ӿ
Notes
1.^ As of Unicode version 16.0

Cyrillic supplement

Default font   Unicode font
Cyrillic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+050x Ԁ ԁ Ԃ ԃ Ԅ ԅ Ԇ ԇ Ԉ ԉ Ԋ ԋ Ԍ ԍ Ԏ ԏ
U+051x Ԑ ԑ Ԓ ԓ Ԕ ԕ Ԗ ԗ Ԙ ԙ Ԛ ԛ Ԝ ԝ Ԟ ԟ
U+052x Ԡ ԡ Ԣ ԣ Ԥ ԥ Ԧ ԧ Ԩ ԩ Ԫ ԫ Ԭ ԭ Ԯ ԯ
Notes
1.^ As of Unicode version 16.0
 
Cyrillic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+050x Ԁ ԁ Ԃ ԃ Ԅ ԅ Ԇ ԇ Ԉ ԉ Ԋ ԋ Ԍ ԍ Ԏ ԏ
U+051x Ԑ ԑ Ԓ ԓ Ԕ ԕ Ԗ ԗ Ԙ ԙ Ԛ ԛ Ԝ ԝ Ԟ ԟ
U+052x Ԡ ԡ Ԣ ԣ Ԥ ԥ Ԧ ԧ Ԩ ԩ Ԫ ԫ Ԭ ԭ Ԯ ԯ
Notes
1.^ As of Unicode version 16.0

Armenian

Default font   Unicode font
Armenian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+053x Ա Բ Գ Դ Ե Զ Է Ը Թ Ժ Ի Լ Խ Ծ Կ
U+054x Հ Ձ Ղ Ճ Մ Յ Ն Շ Ո Չ Պ Ջ Ռ Ս Վ Տ
U+055x Ր Ց Ւ Փ Ք Օ Ֆ ՙ ՚ ՛ ՜ ՝ ՞ ՟
U+056x ՠ ա բ գ դ ե զ է ը թ ժ ի լ խ ծ կ
U+057x հ ձ ղ ճ մ յ ն շ ո չ պ ջ ռ ս վ տ
U+058x ր ց ւ փ ք օ ֆ և ֈ ։ ֊ ֍ ֎ ֏
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Armenian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+053x Ա Բ Գ Դ Ե Զ Է Ը Թ Ժ Ի Լ Խ Ծ Կ
U+054x Հ Ձ Ղ Ճ Մ Յ Ն Շ Ո Չ Պ Ջ Ռ Ս Վ Տ
U+055x Ր Ց Ւ Փ Ք Օ Ֆ ՙ ՚ ՛ ՜ ՝ ՞ ՟
U+056x ՠ ա բ գ դ ե զ է ը թ ժ ի լ խ ծ կ
U+057x հ ձ ղ ճ մ յ ն շ ո չ պ ջ ռ ս վ տ
U+058x ր ց ւ փ ք օ ֆ և ֈ ։ ֊ ֍ ֎ ֏
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Hebrew

Default font   Unicode font
Hebrew[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+059x ֑  ֒  ֓  ֔  ֕  ֖  ֗  ֘  ֙  ֚  ֛  ֜  ֝  ֞  ֟ 
U+05Ax ֠  ֡  ֢  ֣  ֤  ֥  ֦  ֧  ֨  ֩  ֪  ֫  ֬  ֭  ֮  ֯ 
U+05Bx ְ  ֱ  ֲ  ֳ  ִ  ֵ  ֶ  ַ  ָ  ֹ  ֺ  ֻ  ּ  ֽ  ־ ֿ 
U+05Cx ׀ ׁ  ׂ  ׃ ׄ  ׅ  ׆ ׇ 
U+05Dx א ב ג ד ה ו ז ח ט י ך כ ל ם מ ן
U+05Ex נ ס ע ף פ ץ צ ק ר ש ת ׯ
U+05Fx װ ױ ײ ׳ ״
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Hebrew[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+059x ֑  ֒  ֓  ֔  ֕  ֖  ֗  ֘  ֙  ֚  ֛  ֜  ֝  ֞  ֟ 
U+05Ax ֠  ֡  ֢  ֣  ֤  ֥  ֦  ֧  ֨  ֩  ֪  ֫  ֬  ֭  ֮  ֯ 
U+05Bx ְ  ֱ  ֲ  ֳ  ִ  ֵ  ֶ  ַ  ָ  ֹ  ֺ  ֻ  ּ  ֽ  ־ ֿ 
U+05Cx ׀ ׁ  ׂ  ׃ ׄ  ׅ  ׆ ׇ 
U+05Dx א ב ג ד ה ו ז ח ט י ך כ ל ם מ ן
U+05Ex נ ס ע ף פ ץ צ ק ר ש ת ׯ
U+05Fx װ ױ ײ ׳ ״
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Note that the mathematical symbol for Aleph (eg Aleph-0) is in the math section at U+2135. -- Curps 21:10, 24 August 2005 (UTC)[reply]

See also #Alphabetical presentation forms for forms used by Yiddish. -- Curps 00:56, 11 September 2005 (UTC)[reply]

Arabic

Default font   Unicode font
Arabic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؝ ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 16.0
2.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0
 
Arabic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؝ ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 16.0
2.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0

Actually I'm getting better coverage with the default font here. Phil | Talk 12:30, 24 October 2005 (UTC)[reply]

Arabic supplement

Default font   Unicode font
Arabic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+075x ݐ ݑ ݒ ݓ ݔ ݕ ݖ ݗ ݘ ݙ ݚ ݛ ݜ ݝ ݞ ݟ
U+076x ݠ ݡ ݢ ݣ ݤ ݥ ݦ ݧ ݨ ݩ ݪ ݫ ݬ ݭ ݮ ݯ
U+077x ݰ ݱ ݲ ݳ ݴ ݵ ݶ ݷ ݸ ݹ ݺ ݻ ݼ ݽ ݾ ݿ
Notes
1.^ As of Unicode version 16.0
 
Arabic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+075x ݐ ݑ ݒ ݓ ݔ ݕ ݖ ݗ ݘ ݙ ݚ ݛ ݜ ݝ ݞ ݟ
U+076x ݠ ݡ ݢ ݣ ݤ ݥ ݦ ݧ ݨ ݩ ݪ ݫ ݬ ݭ ݮ ݯ
U+077x ݰ ݱ ݲ ݳ ݴ ݵ ݶ ݷ ݸ ݹ ݺ ݻ ݼ ݽ ݾ ݿ
Notes
1.^ As of Unicode version 16.0

Not getting anything here with either option: is there a better alternative for displaying Arabic? —Phil | Talk 12:45, 24 October 2005 (UTC)[reply]

Syriac

Default font   Unicode font
Syriac[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+070x ܀ ܁ ܂ ܃ ܄ ܅ ܆ ܇ ܈ ܉ ܊ ܋ ܌ ܍ SAM
U+071x ܐ ܑ ܒ ܓ ܔ ܕ ܖ ܗ ܘ ܙ ܚ ܛ ܜ ܝ ܞ ܟ
U+072x ܠ ܡ ܢ ܣ ܤ ܥ ܦ ܧ ܨ ܩ ܪ ܫ ܬ ܭ ܮ ܯ
U+073x ܰ ܱ ܲ ܳ ܴ ܵ ܶ ܷ ܸ ܹ ܺ ܻ ܼ ܽ ܾ ܿ
U+074x ݀ ݁ ݂ ݃ ݄ ݅ ݆ ݇ ݈ ݉ ݊ ݍ ݎ ݏ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Syriac[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+070x ܀ ܁ ܂ ܃ ܄ ܅ ܆ ܇ ܈ ܉ ܊ ܋ ܌ ܍ SAM
U+071x ܐ ܑ ܒ ܓ ܔ ܕ ܖ ܗ ܘ ܙ ܚ ܛ ܜ ܝ ܞ ܟ
U+072x ܠ ܡ ܢ ܣ ܤ ܥ ܦ ܧ ܨ ܩ ܪ ܫ ܬ ܭ ܮ ܯ
U+073x ܰ ܱ ܲ ܳ ܴ ܵ ܶ ܷ ܸ ܹ ܺ ܻ ܼ ܽ ܾ ܿ
U+074x ݀ ݁ ݂ ݃ ݄ ݅ ݆ ݇ ݈ ݉ ݊ ݍ ݎ ݏ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Thaana

Default font   Unicode font
Thaana[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+078x ހ ށ ނ ރ ބ ޅ ކ އ ވ މ ފ ދ ތ ލ ގ ޏ
U+079x ސ ޑ ޒ ޓ ޔ ޕ ޖ ޗ ޘ ޙ ޚ ޛ ޜ ޝ ޞ ޟ
U+07Ax ޠ ޡ ޢ ޣ ޤ ޥ ަ ާ ި ީ ު ޫ ެ ޭ ޮ ޯ
U+07Bx ް ޱ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Thaana[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+078x ހ ށ ނ ރ ބ ޅ ކ އ ވ މ ފ ދ ތ ލ ގ ޏ
U+079x ސ ޑ ޒ ޓ ޔ ޕ ޖ ޗ ޘ ޙ ޚ ޛ ޜ ޝ ޞ ޟ
U+07Ax ޠ ޡ ޢ ޣ ޤ ޥ ަ ާ ި ީ ު ޫ ެ ޭ ޮ ޯ
U+07Bx ް ޱ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Devanagari

Default font   Unicode font
Devanagari[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+090x
U+091x
U+092x
U+093x ि
U+094x
U+095x
U+096x
U+097x ॿ
Notes
1.^ As of Unicode version 16.0
 
Devanagari[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+090x
U+091x
U+092x
U+093x ि
U+094x
U+095x
U+096x
U+097x ॿ
Notes
1.^ As of Unicode version 16.0

Bengali

Default font   Unicode font
Bengali[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+098x
U+099x
U+09Ax
U+09Bx ি
U+09Cx
U+09Dx
U+09Ex
U+09Fx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Bengali[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+098x
U+099x
U+09Ax
U+09Bx ি
U+09Cx
U+09Dx
U+09Ex
U+09Fx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Gujarati

Default font   Unicode font
Gujarati[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0A8x
U+0A9x
U+0AAx
U+0ABx િ
U+0ACx
U+0ADx
U+0AEx
U+0AFx ૿
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Gujarati[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0A8x
U+0A9x
U+0AAx
U+0ABx િ
U+0ACx
U+0ADx
U+0AEx
U+0AFx ૿
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Tamil

Default font   Unicode font
Tamil[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0B8x
U+0B9x
U+0BAx
U+0BBx ி
U+0BCx
U+0BDx
U+0BEx
U+0BFx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Tamil[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0B8x
U+0B9x
U+0BAx
U+0BBx ி
U+0BCx
U+0BDx
U+0BEx
U+0BFx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Telugu

Default font   Unicode font
Telugu[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C0x
U+0C1x
U+0C2x
U+0C3x ి
U+0C4x
U+0C5x
U+0C6x
U+0C7x ౿
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Telugu[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C0x
U+0C1x
U+0C2x
U+0C3x ి
U+0C4x
U+0C5x
U+0C6x
U+0C7x ౿
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Kannada

Default font   Unicode font
Kannada[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C8x
U+0C9x
U+0CAx
U+0CBx ಿ
U+0CCx
U+0CDx
U+0CEx
U+0CFx  ೱ   ೲ 
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Kannada[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C8x
U+0C9x
U+0CAx
U+0CBx ಿ
U+0CCx
U+0CDx
U+0CEx
U+0CFx  ೱ   ೲ 
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Malayalam

Default font   Unicode font
Malayalam[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0D0x
U+0D1x
U+0D2x
U+0D3x ി
U+0D4x   ൎ  
U+0D5x
U+0D6x
U+0D7x ൿ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Malayalam[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0D0x
U+0D1x
U+0D2x
U+0D3x ി
U+0D4x   ൎ  
U+0D5x
U+0D6x
U+0D7x ൿ
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Thai

Default font   Unicode font
Thai[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0E0x
U+0E1x
U+0E2x
U+0E3x ฿
U+0E4x
U+0E5x
U+0E6x
U+0E7x
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Thai[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0E0x
U+0E1x
U+0E2x
U+0E3x ฿
U+0E4x
U+0E5x
U+0E6x
U+0E7x
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Georgian

Default font   Unicode font
Georgian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10Ax
U+10Bx
U+10Cx
U+10Dx
U+10Ex
U+10Fx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Georgian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10Ax
U+10Bx
U+10Cx
U+10Dx
U+10Ex
U+10Fx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

General punctuation

Default font   Unicode font
General Punctuation[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+200x NQ
 SP 
MQ
 SP 
EN
 SP 
EM
 SP 
 3/M 
SP
 4/M 
SP
 6/M 
SP
F
 SP 
P
 SP 
TH
 SP 
H
 SP 
ZW
 SP 
ZW
 NJ 
 ZW 
J
 LRM   RLM 
U+201x  NB 
U+202x L
 SEP 
P
 SEP 
 LRE   RLE   PDF   LRO   RLO   NNB 
SP
U+203x
U+204x
U+205x MM
  SP  
U+206x  WJ   ƒ()    ×     ,     +    LRI   RLI   FSI   PDI  I
 SS 
A
 SS 
I
 AFS 
A
 AFS 
NA
 DS 
NO
 DS 
Notes
1.^ As of Unicode version 16.0
2.^ Grey area indicates non-assigned code point
3.^ Unicode code points U+206A – U+206F are deprecated as of Unicode version 3.0
 
General Punctuation[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+200x NQ
 SP 
MQ
 SP 
EN
 SP 
EM
 SP 
 3/M 
SP
 4/M 
SP
 6/M 
SP
F
 SP 
P
 SP 
TH
 SP 
H
 SP 
ZW
 SP 
ZW
 NJ 
 ZW 
J
 LRM   RLM 
U+201x  NB 
U+202x L
 SEP 
P
 SEP 
 LRE   RLE   PDF   LRO   RLO   NNB 
SP
U+203x
U+204x
U+205x MM
  SP  
U+206x  WJ   ƒ()    ×     ,     +    LRI   RLI   FSI   PDI  I
 SS 
A
 SS 
I
 AFS 
A
 AFS 
NA
 DS 
NO
 DS 
Notes
1.^ As of Unicode version 16.0
2.^ Grey area indicates non-assigned code point
3.^ Unicode code points U+206A – U+206F are deprecated as of Unicode version 3.0

Pages that already use such letters:

Candidate pages for use:

  • RIGHT SINGLE QUOTATION MARK (’): Mother's day, St. John's Cathedral, and thousands more that use a possessive. Also T'Pol and many more
  • LEFT SINGLE QUOTATION MARK (‘): 'Okina, Hawaii, Ayin and others, but all of those are more correctly spelt with MODIFIER LETTER TURNED COMMA (U+02BB in the "Spacing modifier letters" section)
  • EN DASH (–): Hasse-Minkowski theorem and hundreds more
  • LEFT and RIGHT DOUBLE QUOTATION MARK (“ and ”): Knights who say Ni and others that use a quotation in the title

Superscripts and subscripts

Default font   Unicode font
Superscripts and Subscripts[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+207x
U+208x
U+209x
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
3.^ Refer to the Latin-1 Supplement Unicode block for characters ¹ (U+00B9), ² (U+00B2) and ³ (U+00B3)
 
Superscripts and Subscripts[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+207x
U+208x
U+209x
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
3.^ Refer to the Latin-1 Supplement Unicode block for characters ¹ (U+00B9), ² (U+00B2) and ³ (U+00B3)

Probably not relevant for titles and in body text its probably safer to use html subscript/superscript.

Besides requiring some heavy-duty fonts to be rendered correctly, the lesser-known of these characters (all but ² and ³, really) tend to appear too small to be legible, at least compared with <sub>. The policy in place for minor planets (whose designations often include subscripts) is 1) to use <sub> systematically in the article and in the wrongtitle template, 2) use non-subscripts in the title, and 3) put in place a redirect from the subscripted title. Urhixidur 14:51, 2005 August 2 (UTC)

Currency symbols

Default font   Unicode font
Currency Symbols[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+20Ax
U+20Bx
U+20Cx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Currency Symbols[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+20Ax
U+20Bx
U+20Cx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

The Euro (U+20AC) is heavily used within articles and should be OK to use in an article title (though it's hard to think of a plausible title that would use it... maybe if some book or movie title incorporates the symbol). -- Curps 21:08, 24 August 2005 (UTC)[reply]

How about €2 commemorative coins? ;)Nightstallion (?) 12:48, 28 July 2006 (UTC)[reply]

Letterlike symbols

Default font   Unicode font
Letterlike Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+210x
U+211x
U+212x
U+213x
U+214x
Notes
1.^ As of Unicode version 16.0
 
Letterlike Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+210x
U+211x
U+212x
U+213x
U+214x
Notes
1.^ As of Unicode version 16.0

Miscellaneous symbols

Default font   Unicode font
Miscellaneous Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+260x
U+261x
U+262x
U+263x
U+264x
U+265x
U+266x
U+267x
U+268x
U+269x
U+26Ax
U+26Bx
U+26Cx
U+26Dx
U+26Ex
U+26Fx
Notes
1.^ As of Unicode version 16.0
 
Miscellaneous Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+260x
U+261x
U+262x
U+263x
U+264x
U+265x
U+266x
U+267x
U+268x
U+269x
U+26Ax
U+26Bx
U+26Cx
U+26Dx
U+26Ex
U+26Fx
Notes
1.^ As of Unicode version 16.0
  • MUSICAL SHARP SIGN (Unicode 266F), ♯. Needed for Sharp-P and Sharp-P-complete. Some textbooks use NUMBER SIGN (#) instead, and refer to the complexity class as Number-P. I prefer that symbol myself, but that’s not the point here. Needed for C sharp. Arbor 7 July 2005 07:18 (UTC)

Alphabetical presentation forms

Default font   Unicode font
Alphabetic Presentation Forms[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FB0x
U+FB1x ﬞ 
U+FB2x
U+FB3x
U+FB4x
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Alphabetic Presentation Forms[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FB0x
U+FB1x ﬞ 
U+FB2x
U+FB3x
U+FB4x
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Arabic presentation forms-A

Default font   Unicode font
Arabic Presentation Forms-A[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FB5x
U+FB6x
U+FB7x ﭿ
U+FB8x
U+FB9x
U+FBAx
U+FBBx ﮿
U+FBCx
U+FBDx
U+FBEx
U+FBFx ﯿ
U+FC0x
U+FC1x
U+FC2x
U+FC3x ﰿ
U+FC4x
U+FC5x
U+FC6x
U+FC7x ﱿ
U+FC8x
U+FC9x
U+FCAx
U+FCBx ﲿ
U+FCCx
U+FCDx
U+FCEx
U+FCFx ﳿ
U+FD0x
U+FD1x
U+FD2x
U+FD3x ﴿
U+FD4x
U+FD5x
U+FD6x
U+FD7x ﵿ
U+FD8x
U+FD9x
U+FDAx
U+FDBx ﶿ
U+FDCx
U+FDDx
U+FDEx
U+FDFx ﷿
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
3.^ Black areas indicate noncharacters (code points that are guaranteed never to be assigned as encoded characters in the Unicode Standard)
 
Arabic Presentation Forms-A[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FB5x
U+FB6x
U+FB7x ﭿ
U+FB8x
U+FB9x
U+FBAx
U+FBBx ﮿
U+FBCx
U+FBDx
U+FBEx
U+FBFx ﯿ
U+FC0x
U+FC1x
U+FC2x
U+FC3x ﰿ
U+FC4x
U+FC5x
U+FC6x
U+FC7x ﱿ
U+FC8x
U+FC9x
U+FCAx
U+FCBx ﲿ
U+FCCx
U+FCDx
U+FCEx
U+FCFx ﳿ
U+FD0x
U+FD1x
U+FD2x
U+FD3x ﴿
U+FD4x
U+FD5x
U+FD6x
U+FD7x ﵿ
U+FD8x
U+FD9x
U+FDAx
U+FDBx ﶿ
U+FDCx
U+FDDx
U+FDEx
U+FDFx ﷿
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
3.^ Black areas indicate noncharacters (code points that are guaranteed never to be assigned as encoded characters in the Unicode Standard)

Arabic presentation forms-B

Default font   Unicode font
Arabic Presentation Forms-B[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FE7x ﹿ
U+FE8x
U+FE9x
U+FEAx
U+FEBx ﺿ
U+FECx
U+FEDx
U+FEEx
U+FEFx ZW
NBSP
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Arabic Presentation Forms-B[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FE7x ﹿ
U+FE8x
U+FE9x
U+FEAx
U+FEBx ﺿ
U+FECx
U+FEDx
U+FEEx
U+FEFx ZW
NBSP
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points

Halfwidth and fullwidth forms

Default font   Unicode font
Halfwidth and Fullwidth Forms[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FF0x
U+FF1x
U+FF2x
U+FF3x _
U+FF4x
U+FF5x
U+FF6x
U+FF7x ソ
U+FF8x
U+FF9x
U+FFAx  HW 
HF
U+FFBx
U+FFCx
U+FFDx
U+FFEx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points
 
Halfwidth and Fullwidth Forms[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FF0x
U+FF1x
U+FF2x
U+FF3x _
U+FF4x
U+FF5x
U+FF6x
U+FF7x ソ
U+FF8x
U+FF9x
U+FFAx  HW 
HF
U+FFBx
U+FFCx
U+FFDx
U+FFEx
Notes
1.^ As of Unicode version 16.0
2.^ Grey areas indicate non-assigned code points