
Arabic script in Unicode

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Dbachmann (talk | contribs) at 08:17, 25 July 2007 (to be expanded). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

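As of Unicode 5.0, the following ranges encode Arabic characters: Arabic (U+0600–U+06FF), Arabic Supplement (U+0750–U+077F), Arabic Presentation Forms-A (U+FB50–U+FDFF) and Arabic Presentation Forms-B (U+FE70–U+FEFF).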

The basic Arabic range encodes the standard letters, the most common diacritics and the Arabic-Indic digits, but not contextual forms; U+0621–U+0652 are based directly on ISO 8859-6. Within the same range, U+06D6 to U+06ED encode Qur'anic annotation signs such as "end of ayah" ۝ۖ‎ and "start of rub el hizb" ۞‎. The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics and further contextual letter forms.
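
The division into ranges described above can be illustrated with a short Python sketch. It is only an illustration: the table `ARABIC_BLOCKS` and the helper `arabic_block` are hypothetical names introduced here, and the block boundaries are taken from the Unicode block definitions rather than from any library. The sketch relies only on the standard `unicodedata` module and also shows how compatibility normalization (NFKC) maps a presentation form back to its base letter.

```python
import unicodedata

# Block boundaries as defined by the Unicode Standard (all present in Unicode 5.0).
# ARABIC_BLOCKS and arabic_block are illustrative names, not a library API.
ARABIC_BLOCKS = [
    (0x0600, 0x06FF, "Arabic"),
    (0x0750, 0x077F, "Arabic Supplement"),
    (0xFB50, 0xFDFF, "Arabic Presentation Forms-A"),
    (0xFE70, 0xFEFF, "Arabic Presentation Forms-B"),
]


def arabic_block(ch):
    """Return the name of the Arabic block containing ch, or None."""
    cp = ord(ch)
    for start, end, name in ARABIC_BLOCKS:
        if start <= cp <= end:
            return name
    return None


# One sample character per block, plus the "end of ayah" sign (U+06DD).
for ch in ("\u0627", "\u06DD", "\u0750", "\uFB50", "\uFE8D"):
    print(f"U+{ord(ch):04X}", arabic_block(ch), unicodedata.name(ch, "<unnamed>"))

# A presentation form carries a compatibility decomposition to its base letter,
# so NFKC normalization folds the isolated alef (U+FE8D) back to alef (U+0627).
base = unicodedata.normalize("NFKC", "\uFE8D")
print(f"U+{ord(base):04X}")
```

Text intended for storage or searching is normally kept in the basic range, with contextual shaping left to the rendering engine; the presentation forms exist mainly for compatibility with older encodings, which is why normalization folds them back to the base letters.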