Arabic script in Unicode
As of Unicode 5.0, the following ranges encode Arabic characters:
- Arabic (0600–06FF)
- Arabic Supplement (0750–077F)
- Arabic Presentation Forms-A (FB50–FDFF)
- Arabic Presentation Forms-B (FE70–FEFF)
The basic Arabic range encodes the standard letters and diacritics, but does not encode contextual forms (U+0621–U+0652 being directly based on ISO 8859-6); and also includes the most common diacritics and Arabic-Indic digits. U+06D6 to U+06ED encode qur'anic annotation signs such as "end of ayah" ۖ and "start of rub el hizb" ۞. The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics, and more contextual letter forms.